path: root/candle-pyo3/quant-llama.py
Commit message | Author | Age | Files | Lines
* Make the Python Wrapper more Hackable and simplify Quantization (#1010) | Lukas Kreussel | 2023-10-06 | 1 | -159/+38
* Add return types to `*.pyi` stubs (#880) | Lukas Kreussel | 2023-09-17 | 1 | -15/+16
* Generate `*.pyi` stubs for PyO3 wrapper (#870) | Lukas Kreussel | 2023-09-16 | 1 | -3/+4
* Return the metadata in the gguf pyo3 bindings. (#729) | Laurent Mazare | 2023-09-04 | 1 | -4/+35
* Recommend using maturin. (#717) | Laurent Mazare | 2023-09-02 | 1 | -14/+0
* More quantized llama in python. (#716) | Laurent Mazare | 2023-09-02 | 1 | -6/+13
* Sketch a quantized llama using the pyo3 api. (#715) | Laurent Mazare | 2023-09-02 | 1 | -0/+171