index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-nn
Commit message (
Expand
)
Author
Age
Files
Lines
...
*
feat: add silu activation function (#1706)
OlivierDehaene
2024-02-14
2
-4
/
+3
*
Detach the tensors on batch-norm eval. (#1702)
Laurent Mazare
2024-02-13
1
-2
/
+12
*
Fix clippy lints for 1.76. (#1682)
Laurent Mazare
2024-02-08
1
-1
/
+1
*
Enhance pickle to retrieve state_dict with a given key (#1671)
Dilshod Tadjibaev
2024-02-06
1
-1
/
+1
*
Add `VarBuilder::from_backend` (#1670)
Daniƫl de Kok
2024-02-06
1
-8
/
+17
*
Quantized GGUF style (#1523)
Nicolas Patry
2024-01-17
1
-1
/
+4
*
Update the Phi model to use the updated architecture. (#1580)
Laurent Mazare
2024-01-13
1
-0
/
+1
*
Simplifying our internal cargo dependencies. (#1529)
Nicolas Patry
2024-01-07
1
-2
/
+2
*
Simplify the one-hot implementation, support arbitrary rank. (#1514)
Laurent Mazare
2024-01-01
1
-181
/
+38
*
Add one-hot/cold encoding (#1489)
Ryan Tate
2024-01-01
3
-0
/
+414
*
Do not implement Module for BatchNorm. (#1513)
Laurent Mazare
2024-01-01
2
-15
/
+15
*
Small tweaks to batch-norm. (#1505)
Laurent Mazare
2023-12-30
1
-19
/
+16
*
[Breaking] Add training to batchnorm with exponential moving average (#1504)
nkoppel
2023-12-30
2
-50
/
+169
*
Bump the crate version to 0.3.3. (#1490)
Laurent Mazare
2023-12-28
1
-1
/
+1
*
Merge pull request #1318 from huggingface/metal4
Nicolas Patry
2023-12-20
2
-0
/
+44
|
\
|
*
Clippy pass.
Nicolas Patry
2023-12-18
1
-3
/
+3
|
*
Addressing a lot of comments.
Nicolas Patry
2023-12-15
1
-1
/
+2
|
*
Remove `unwrap()`.
Nicolas Patry
2023-12-15
1
-2
/
+2
|
*
Renamed all kernel names.
Nicolas Patry
2023-12-15
1
-3
/
+3
|
*
Fixing softmax.
Nicolas Patry
2023-12-15
1
-1
/
+1
|
*
Working with merging encoders and using fences.
Nicolas Patry
2023-12-14
1
-2
/
+0
|
*
Lots of updates including some stack of command buffers.
nicolas
2023-12-12
2
-2
/
+5
|
*
Starting to fix some tests.
Nicolas Patry
2023-11-30
2
-0
/
+42
*
|
Bump the crate version to 0.3.2. (#1452)
Laurent Mazare
2023-12-17
1
-1
/
+1
*
|
Fix a couple typos (#1451)
Laurent Mazare
2023-12-17
1
-1
/
+1
*
|
Expose AdamW parameters (#1449)
Dave Lage
2023-12-16
1
-0
/
+8
*
|
Speedup ShardedSafeTensors to load Tensors with default hints (#1384)
YiiSh
2023-12-14
1
-1
/
+7
*
|
Another prelu bugfix. (#1407)
Laurent Mazare
2023-12-06
1
-1
/
+1
*
|
Use the proper broadcasting for prelu. (#1406)
Laurent Mazare
2023-12-05
1
-5
/
+16
*
|
Add the prelu layer. (#1402)
Laurent Mazare
2023-12-03
3
-4
/
+51
|
/
*
Implement the module trait directly for QMatMul. (#1372)
Laurent Mazare
2023-11-25
1
-1
/
+1
*
Update for 0.3.1. (#1324)
Laurent Mazare
2023-11-11
1
-1
/
+1
*
Add support to UL2 model family (#1300)
Juarez Bochi
2023-11-09
1
-1
/
+0
*
Add weight and bias functions to LayerNorm (#1306)
jwnz
2023-11-09
1
-0
/
+8
*
Transposed conv1d in candle-nn. (#1252)
Laurent Mazare
2023-11-03
1
-0
/
+94
*
Add the swiglu activation from the chatglm PR. (#1246)
Laurent Mazare
2023-11-02
2
-0
/
+7
*
Add hard-sigmoid and hard-swish activations (#1244)
jamjamjon
2023-11-02
2
-0
/
+9
*
Add support for the marian base model. (#1221)
Laurent Mazare
2023-10-30
1
-0
/
+2
*
Allow for different behavior between training and eval (#1213)
Laurent Mazare
2023-10-29
3
-2
/
+43
*
Add the relu2 and relu6 activations. (#1201)
Laurent Mazare
2023-10-27
1
-0
/
+4
*
Add fuse-conv-bn method for Conv2d (#1196)
jamjamjon
2023-10-27
2
-0
/
+25
*
Expose the fields from batch-norm. (#1176)
Laurent Mazare
2023-10-25
1
-2
/
+12
*
Add Binary Cross Entropy With Logit Loss to nn crate (#1157)
Ogundepo Odunayo
2023-10-23
2
-0
/
+69
*
Make func cloneable. (#1137)
Laurent Mazare
2023-10-20
2
-6
/
+8
*
Add the sequential layer. (#1136)
Laurent Mazare
2023-10-20
2
-0
/
+64
*
Experiment with resnet (#1128)
Laurent Mazare
2023-10-19
1
-0
/
+9
*
feat: add pth varbuilder (#1108)
OlivierDehaene
2023-10-16
1
-0
/
+41
*
Add a matvec cpu benchmark. (#1076)
Laurent Mazare
2023-10-12
1
-3
/
+22
*
Convmixer (#1073)
Laurent Mazare
2023-10-11
1
-2
/
+2
*
Only optimize float tensors. (#1069)
Laurent Mazare
2023-10-10
1
-0
/
+5
[prev]
[next]