author | Laurent Mazare <laurent.mazare@gmail.com> | 2024-05-04 10:14:57 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-05-04 10:14:57 +0200 |
commit | b13a82a4387a55df07bec4e2eb6f7a8ebd0b98a2 (patch) | |
tree | aed5a019e7e053900ffa5be57ddfd20bdfad8582 /test.onnx | |
parent | 59b18d974ec3cad6963b774aa245e23f8c80414f (diff) | |
Separate quantized phi-3 implementation. (#2157)
* Separate quantized phi-3 implementation.
* Integrate the quantized phi3 model.
* Small fixes, get the generation to work properly.
* Keep the old llama implementation around.
* Change the default.
Diffstat (limited to 'test.onnx')
0 files changed, 0 insertions, 0 deletions