summaryrefslogtreecommitdiff
path: root/candle-pyo3/py_src/candle/utils
Commit message (Collapse)AuthorAgeFilesLines
* pyo3 update. (#2545)Laurent Mazare2024-10-061-7/+3
| | | | | * pyo3 update. * Stub fix.
* Quantized GGUF style (#1523)Nicolas Patry2024-01-171-2/+6
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Metal quantized modifications proposal. - Add a device param, wherever needed. - Create new QMetal storage thing that implements QuantizedType. - Update everywhere needed. Fix Python. Fixing examples. Fix: fmt + clippy + stub. Moving everything around. Only missing the actual implems. Fixing everything + adding dequantized kernels. More work. Fixing matmul. Fmt + Clippy Some clippy fixes. Working state. Q2K Metal -> Bugged (also present in GGML). Q4K CPU -> Bugged (present previously, new test catch it). Q5K CPU -> Bugged (present previously). Q8_1 Both -> Never really implemented it seems Q8K metal -> Never implemented in metal Fixing Q2K bug (present in ggml). * Cleanup. * Fix the rebase. * Removing the fences speeds everything up and *is* correct this time... * Cleanup the fence. * After rebase. * Bad code removal. * Rebase after phi2 merge + fix replit default to CPU. * Making the CI happy. * More happy tests. --------- Co-authored-by: Nicolas Patry <nicolas@Nicolass-MacBook-Pro.local>
* PyO3: Better shape handling (#1143)Lukas Kreussel2023-10-291-1/+1
| | | | | | | | | | | * Negative and `*args` shape handling * Rename to `PyShapeWithHole` + validate that only one hole exists * Regenerate stubs --------- Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>
* Extend `stub.py` to accept external typehinting (#1102)Lukas Kreussel2023-10-171-1/+1
|
* Add return types to `*.pyi` stubs (#880)Lukas Kreussel2023-09-172-9/+17
| | | | | | | | | * Start generating return types * Finish tensor type hinting * Add `save_gguf` to `utils` * Typehint `quant-llama.py`
* Generate `*.pyi` stubs for PyO3 wrapper (#870)Lukas Kreussel2023-09-162-0/+74
* Begin to generate typehints. * generate correct stubs * Correctly include stubs * Add comments and typhints to static functions * ensure candle-pyo3 directory * Make `llama.rope.freq_base` optional * `fmt`