index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-examples
/
examples
/
falcon
Commit message (
Expand
)
Author
Age
Files
Lines
*
Use the new hub helper function. (#1484)
Laurent Mazare
2023-12-26
1
-8
/
+1
*
Adapt more examples to the updated safetensor api. (#947)
Laurent Mazare
2023-09-23
1
-10
/
+1
*
Add more example readmes. (#828)
Laurent Mazare
2023-09-12
1
-0
/
+3
*
Implement top_p / nucleus sampling (#819)
Juarez Bochi
2023-09-12
1
-13
/
+24
*
Move some models to candle-transformers so that it's easier to re-use. (#794)
Laurent Mazare
2023-09-10
2
-487
/
+1
*
Repeat-penalty in the falcon example. (#634)
Laurent Mazare
2023-08-28
1
-1
/
+33
*
Add a simple Module trait and implement it for the various nn layers (#500)
Laurent Mazare
2023-08-18
1
-1
/
+1
*
Add a cuda kernel for upsampling. (#441)
Laurent Mazare
2023-08-14
1
-4
/
+2
*
More accelerate optimizations (#427)
Laurent Mazare
2023-08-13
1
-0
/
+3
*
Use u8 tensors for masks. (#273)
Laurent Mazare
2023-07-29
1
-1
/
+1
*
Softmax numerical stability. (#267)
Laurent Mazare
2023-07-28
1
-5
/
+7
*
Upgrading hf-hub to `0.2.0` (Modified API to not pass the Repo around
Nicolas Patry
2023-07-27
1
-3
/
+7
*
Rename the .r functions to .dims so as to be a bit more explicit. (#220)
Laurent Mazare
2023-07-22
1
-4
/
+4
*
Removing `candle-hub` internal to extract into `hf-hub` standalone.
Nicolas Patry
2023-07-19
1
-1
/
+1
*
Add some 'cuda-if-available' helper function. (#172)
Laurent Mazare
2023-07-15
1
-14
/
+1
*
Removing cuda default.
Nicolas Patry
2023-07-14
1
-1
/
+10
*
Add a cli argument to easily switch the dtype. (#161)
Laurent Mazare
2023-07-13
1
-6
/
+10
*
Tensor mutability (#154)
Laurent Mazare
2023-07-13
1
-3
/
+3
*
Sketch the candle-transformers crate. (#147)
Laurent Mazare
2023-07-12
1
-21
/
+7
*
Use arange in the examples. (#146)
Laurent Mazare
2023-07-12
1
-2
/
+1
*
Remove some dead-code pragmas. (#137)
Laurent Mazare
2023-07-11
2
-20
/
+0
*
VarBuilder path creation (#131)
Laurent Mazare
2023-07-10
2
-50
/
+28
*
Move the var-builder in a central place. (#130)
Laurent Mazare
2023-07-10
2
-61
/
+4
*
[nn] Move the Embedding and Activation parts. (#116)
Laurent Mazare
2023-07-10
1
-29
/
+5
*
Sketch the candle-nn crate. (#115)
Laurent Mazare
2023-07-10
1
-79
/
+34
*
Sketching the musicgen model. (#66)
Laurent Mazare
2023-07-09
1
-1
/
+1
*
Sample with temperature. (#106)
Laurent Mazare
2023-07-07
1
-5
/
+15
*
Use F32 for the reduce ops. (#105)
Laurent Mazare
2023-07-07
1
-1
/
+6
*
Add a KV cache to falcon. (#104)
Laurent Mazare
2023-07-07
2
-41
/
+79
*
Add some caching to the causal mask. (#103)
Laurent Mazare
2023-07-07
1
-2
/
+10
*
Clippy after rebase.
Nicolas Patry
2023-07-07
1
-3
/
+1
*
Fixing falcon example.
Nicolas Patry
2023-07-07
1
-0
/
+1
*
Convert the logits to f32 before extracting them. (#102)
Laurent Mazare
2023-07-07
1
-1
/
+1
*
Add some text generation pipeline for falcon. (#98)
Laurent Mazare
2023-07-07
2
-16
/
+93
*
Bugfixes. (#97)
Laurent Mazare
2023-07-06
2
-6
/
+5
*
Add the call to dense in the attention layer. (#96)
Laurent Mazare
2023-07-06
1
-0
/
+1
*
Fix some shape issues in falcon. (#95)
Laurent Mazare
2023-07-06
2
-7
/
+21
*
Sketch the Falcon model. (#93)
Laurent Mazare
2023-07-06
2
-0
/
+678