summaryrefslogtreecommitdiff
path: root/candle-core/examples
Commit message (Expand)AuthorAgeFilesLines
...
* Move some safetensors bits in the candle-core crate.laurent2023-07-031-31/+2
* Add a flag for custom prompt.laurent2023-07-011-2/+7
* Early conversion for the llama weights.laurent2023-06-302-45/+19
* Add a const to easily tweak the dtype used for llama internal computations.laurent2023-06-301-4/+8
* Tweak the kv-cache flag.laurent2023-06-291-4/+4
* Add a flag.laurent2023-06-291-6/+11
* Enable the KV cache after fixing the caching length and the rope bits.laurent2023-06-291-14/+21
* Only narrow when needed + deactivate the kv cache.laurent2023-06-291-2/+6
* Add some KV cache to llama.laurent2023-06-291-36/+72
* Typo.Nicolas Patry2023-06-291-1/+1
* Putting back main.Nicolas Patry2023-06-291-3/+2
* Putting back seed.Nicolas Patry2023-06-291-2/+6
* Revert some changes.Nicolas Patry2023-06-291-11/+6
* Remove unwrap.Nicolas Patry2023-06-291-132/+90
* Moving llama to f16.Nicolas Patry2023-06-291-78/+128
* Add a seed parameter to llama.laurent2023-06-291-13/+10
* Simple example fix.Ubuntu2023-06-292-48/+56
* Merge pull request #19 from LaurentMazare/llama_safetensorsNicolas Patry2023-06-293-97/+324
|\
| * Final updates -> moving to deterministic for easier comparison.Ubuntu2023-06-282-42/+120
| * Ok.Ubuntu2023-06-282-58/+81
| * Tmp.Ubuntu2023-06-282-82/+208
* | Simplify the pattern matching logic in the cuda backend.laurent2023-06-291-0/+2
|/
* Add some helper functions.laurent2023-06-271-6/+1
* Cache the causal mask in llama.laurent2023-06-271-17/+51
* Refactor the hierarchy.Nicolas Patry2023-06-275-0/+697