summaryrefslogtreecommitdiff
path: root/.gitignore
diff options
context:
space:
mode:
authorLaurent Mazare <laurent.mazare@gmail.com>2023-08-22 19:41:10 +0100
committerGitHub <noreply@github.com>2023-08-22 19:41:10 +0100
commitf9ecc8447753d759e776e762ba9309bb90b76bb3 (patch)
tree311d0e2f4dad33ea8174225cc1bfa5bf429ba713 /.gitignore
parent07067b01dce3c63b45fe4bdeb8d972f279e88b45 (diff)
downloadcandle-f9ecc8447753d759e776e762ba9309bb90b76bb3.tar.gz
candle-f9ecc8447753d759e776e762ba9309bb90b76bb3.tar.bz2
candle-f9ecc8447753d759e776e762ba9309bb90b76bb3.zip
GQA support in the quantized model. (#555)
* GQA support in the quantized model. * Fix the reshaping. * Fix the main llama model. * Infer the proper gqa from the model kind.
Diffstat (limited to '.gitignore')
0 files changed, 0 insertions, 0 deletions