summaryrefslogtreecommitdiff
path: root/candle-examples/examples/falcon
Commit message (Expand)AuthorAgeFilesLines
* Use the new hub helper function. (#1484)Laurent Mazare2023-12-261-8/+1
* Adapt more examples to the updated safetensor api. (#947)Laurent Mazare2023-09-231-10/+1
* Add more example readmes. (#828)Laurent Mazare2023-09-121-0/+3
* Implement top_p / nucleus sampling (#819)Juarez Bochi2023-09-121-13/+24
* Move some models to candle-transformers so that it's easier to re-use. (#794)Laurent Mazare2023-09-102-487/+1
* Repeat-penalty in the falcon example. (#634)Laurent Mazare2023-08-281-1/+33
* Add a simple Module trait and implement it for the various nn layers (#500)Laurent Mazare2023-08-181-1/+1
* Add a cuda kernel for upsampling. (#441)Laurent Mazare2023-08-141-4/+2
* More accelerate optimizations (#427)Laurent Mazare2023-08-131-0/+3
* Use u8 tensors for masks. (#273)Laurent Mazare2023-07-291-1/+1
* Softmax numerical stability. (#267)Laurent Mazare2023-07-281-5/+7
* Upgrading hf-hub to `0.2.0` (Modified API to not pass the Repo aroundNicolas Patry2023-07-271-3/+7
* Rename the .r functions to .dims so as to be a bit more explicit. (#220)Laurent Mazare2023-07-221-4/+4
* Removing `candle-hub` internal to extract into `hf-hub` standalone.Nicolas Patry2023-07-191-1/+1
* Add some 'cuda-if-available' helper function. (#172)Laurent Mazare2023-07-151-14/+1
* Removing cuda default.Nicolas Patry2023-07-141-1/+10
* Add a cli argument to easily switch the dtype. (#161)Laurent Mazare2023-07-131-6/+10
* Tensor mutability (#154)Laurent Mazare2023-07-131-3/+3
* Sketch the candle-transformers crate. (#147)Laurent Mazare2023-07-121-21/+7
* Use arange in the examples. (#146)Laurent Mazare2023-07-121-2/+1
* Remove some dead-code pragmas. (#137)Laurent Mazare2023-07-112-20/+0
* VarBuilder path creation (#131)Laurent Mazare2023-07-102-50/+28
* Move the var-builder in a central place. (#130)Laurent Mazare2023-07-102-61/+4
* [nn] Move the Embedding and Activation parts. (#116)Laurent Mazare2023-07-101-29/+5
* Sketch the candle-nn crate. (#115)Laurent Mazare2023-07-101-79/+34
* Sketching the musicgen model. (#66)Laurent Mazare2023-07-091-1/+1
* Sample with temperature. (#106)Laurent Mazare2023-07-071-5/+15
* Use F32 for the reduce ops. (#105)Laurent Mazare2023-07-071-1/+6
* Add a KV cache to falcon. (#104)Laurent Mazare2023-07-072-41/+79
* Add some caching to the causal mask. (#103)Laurent Mazare2023-07-071-2/+10
* Clippy after rebase.Nicolas Patry2023-07-071-3/+1
* Fixing falcon example.Nicolas Patry2023-07-071-0/+1
* Convert the logits to f32 before extracting them. (#102)Laurent Mazare2023-07-071-1/+1
* Add some text generation pipeline for falcon. (#98)Laurent Mazare2023-07-072-16/+93
* Bugfixes. (#97)Laurent Mazare2023-07-062-6/+5
* Add the call to dense in the attention layer. (#96)Laurent Mazare2023-07-061-0/+1
* Fix some shape issues in falcon. (#95)Laurent Mazare2023-07-062-7/+21
* Sketch the Falcon model. (#93)Laurent Mazare2023-07-062-0/+678