forks/candle.git (branch main)
path: root/candle-examples/examples/llama/main.rs
| Commit message | Author | Age | Files | Lines |
|---|---|---|---|---|
| Add the SmolLM2 models. (#2595) | Laurent Mazare | 2024-11-03 | 1 | -14/+43 |
| Fix the repo name for llama 3.1. (#2576) | Laurent Mazare | 2024-10-26 | 1 | -2/+2 |
| Add some llama-3.2 examples. (#2508) | Laurent Mazare | 2024-09-26 | 1 | -1/+13 |
| Add support for Llama 3.1 (#2359) | Eric Buehler | 2024-07-26 | 1 | -6/+24 |
| Support top-k in the llama example. (#2150) | Laurent Mazare | 2024-05-01 | 1 | -3/+21 |
| Better time measurement for the llama example. (#2106) | Laurent Mazare | 2024-04-22 | 1 | -2/+5 |
| Use llama v3 by default + add to readme. (#2094) | Laurent Mazare | 2024-04-20 | 1 | -1/+1 |
| Also enable llama-v3 8b instruct. (#2088) | Laurent Mazare | 2024-04-19 | 1 | -1/+3 |
| Llama v3. (#2085) | Laurent Mazare | 2024-04-18 | 1 | -9/+13 |
| Make the cache for the llama model explicit too. (#1745) | Laurent Mazare | 2024-02-22 | 1 | -3/+3 |
| Use the tokenizer-output-stream in the llama example. (#1715) | Laurent Mazare | 2024-02-15 | 1 | -11/+9 |
| fix index_pos bug when kv cache is disabled. (#1517) | optman | 2024-01-06 | 1 | -4/+4 |
| Add support for tiny-llama-1.1b. (#1512) | Laurent Mazare | 2023-12-31 | 1 | -2/+9 |
| Rework the llama example config, add the solar model. (#1485) | Laurent Mazare | 2023-12-26 | 1 | -72/+36 |
| Adapt more examples to the updated safetensor api. (#947) | Laurent Mazare | 2023-09-23 | 1 | -9/+1 |
| Implement top_p / nucleus sampling (#819) | Juarez Bochi | 2023-09-12 | 1 | -1/+5 |
| Move some models to candle-transformers so that it's easier to re-use. (#794) | Laurent Mazare | 2023-09-10 | 1 | -2/+1 |
| Add some optional repeat penalty. (#623) | Laurent Mazare | 2023-08-27 | 1 | -0/+18 |
| s/panic/bail/ | Nicolas Patry | 2023-08-25 | 1 | -2/+2 |
| Adding support for codellama in examples. | Nicolas Patry | 2023-08-25 | 1 | -5/+15 |
| Add some tracing to the quantized example. (#473) | Laurent Mazare | 2023-08-16 | 1 | -1/+0 |
| Using the real config from the hub when available. | Nicolas Patry | 2023-08-16 | 1 | -10/+18 |
| Tweak the llama example. (#450) | Laurent Mazare | 2023-08-15 | 1 | -63/+14 |
| Support local weights & dynamic outputs (#447) | Guoqing Bao | 2023-08-15 | 1 | -15/+39 |
| Add a cuda kernel for upsampling. (#441) | Laurent Mazare | 2023-08-14 | 1 | -2/+2 |
| Remove the checkpoint conversion script. (#405) | Laurent Mazare | 2023-08-11 | 1 | -3/+0 |
| Support the Accelerate BLAS on macOS. (#325) | Laurent Mazare | 2023-08-05 | 1 | -0/+3 |
| Add some tracing to llama. (#318) | Laurent Mazare | 2023-08-03 | 1 | -0/+14 |
| Support both llama v1 and llama v2. (#272) | Laurent Mazare | 2023-07-28 | 1 | -1/+5 |
| Upgrading hf-hub to `0.2.0` (Modified API to not pass the Repo around | Nicolas Patry | 2023-07-27 | 1 | -4/+4 |
| Switch to using llama-v2 by default. (#251) | Laurent Mazare | 2023-07-26 | 1 | -4/+4 |
| Better handling of dtypes in llama. (#243) | Laurent Mazare | 2023-07-26 | 1 | -1/+1 |
| Add flash attention (#241) | Laurent Mazare | 2023-07-26 | 1 | -1/+4 |
| Support for MQA for llama v2. (#205) | Laurent Mazare | 2023-07-20 | 1 | -29/+18 |
| Removing `candle-hub` internal to extract into `hf-hub` standalone. | Nicolas Patry | 2023-07-19 | 1 | -1/+1 |
| Add some 'cuda-if-available' helper function. (#172) | Laurent Mazare | 2023-07-15 | 1 | -14/+1 |
| Removing cuda default. | Nicolas Patry | 2023-07-14 | 1 | -1/+11 |
| Add a cli argument to easily switch the dtype. (#161) | Laurent Mazare | 2023-07-13 | 1 | -6/+7 |
| Sketch the candle-transformers crate. (#147) | Laurent Mazare | 2023-07-12 | 1 | -17/+3 |
| Use arange in the examples. (#146) | Laurent Mazare | 2023-07-12 | 1 | -4/+3 |
| Add from_iter and arange, use it in the doctests. (#145) | Laurent Mazare | 2023-07-12 | 1 | -1/+0 |
| Llama batch (#144) | Laurent Mazare | 2023-07-12 | 1 | -3/+2 |
| Allow for lazy loading of npz files, use it in llama to reduce memory usage i... | Laurent Mazare | 2023-07-11 | 1 | -5/+1 |
| Resurrect the llama npy support. (#140) | Laurent Mazare | 2023-07-11 | 1 | -2/+8 |
| Refactor the llama example to make it more in sync with the other ones. (#139) | Laurent Mazare | 2023-07-11 | 1 | -349/+19 |
| Add a KV cache to falcon. (#104) | Laurent Mazare | 2023-07-07 | 1 | -2/+1 |
| Creating new sync Api for `candle-hub`. | Nicolas Patry | 2023-07-06 | 1 | -5/+4 |
| MKL adjustments. (#87) | Laurent Mazare | 2023-07-06 | 1 | -0/+3 |
| Add mkl support for matrix multiply. (#86) | Laurent Mazare | 2023-07-06 | 1 | -1/+4 |
| Support dim indexes in cat. | laurent | 2023-07-05 | 1 | -11/+10 |