summaryrefslogtreecommitdiff
path: root/candle-pyo3/test.py
Commit message (Collapse)AuthorAgeFilesLines
* Add support for accelerate in the pyo3 bindings. (#1167)Laurent Mazare2023-10-241-0/+5
|
* Make the Python Wrapper more Hackable and simplify Quantization (#1010)Lukas Kreussel2023-10-061-1/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | | * Some first `Module` implementations * Add `state_dict` and `load_state_dict` functionality * Move modules around and create `candle.nn.Linear` * Add `nn.Embedding` and `nn.LayerNorm` * Add BERT implementation * Batch q-matmul * Automatically dequantize `QTensors` if a `Tensor` is expected * Add Module `.to()`, `.cuda()`, `cpu()` and `.type()` functionality * Unittests for `Module`, `Tensor` and `candle.utils` * Add `pytorch` like slicing to `Tensor` * Cleanup and BERT fixes * `black` formatting + unit-test for `nn.Linear` * Refactor slicing implementation
* Add return types to `*.pyi` stubs (#880)Lukas Kreussel2023-09-171-5/+4
| | | | | | | | | * Start generating return types * Finish tensor type hinting * Add `save_gguf` to `utils` * Typehint `quant-llama.py`
* Generate `*.pyi` stubs for PyO3 wrapper (#870)Lukas Kreussel2023-09-161-4/+5
| | | | | | | | | | | | | | | * Begin to generate typehints. * generate correct stubs * Correctly include stubs * Add comments and typhints to static functions * ensure candle-pyo3 directory * Make `llama.rope.freq_base` optional * `fmt`
* Recommend using maturin. (#717)Laurent Mazare2023-09-021-15/+0
|
* Support for quantized tensors in the python api. (#706)Laurent Mazare2023-09-011-0/+6
| | | | | | | | | | | | | | | * Add more pyo3 support. * Add some support for quantized tensors in pyo3. * Add an arc layer on qmatmul. * Add the quantized matmul. * Quantization support. * More quantization support. * Test the python quantization.
* Pyo3 dtype (#327)Laurent Mazare2023-08-061-0/+17
| | | | | * Better handling of dtypes in pyo3. * More pyo3 dtype.
* Add some tensor creation functions to the pyo3 bindings. (#326)Laurent Mazare2023-08-061-0/+3
|
* Preliminary pyo3 support for device.laurent2023-07-021-1/+3
|
* Add dtype support.laurent2023-07-021-0/+1
|
* Add to the example.laurent2023-07-021-2/+3
|
* Handle more input types to create tensors.laurent2023-07-021-0/+5
|
* Add some binary operators.laurent2023-07-011-1/+1
|
* Add two methods.laurent2023-07-011-0/+1
|
* Very simple pyo3 bindings for candle.laurent2023-07-011-0/+5