index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-nn
Commit message (
Expand
)
Author
Age
Files
Lines
*
Sync upstream MLX sdpa vector kernels with mask (#2718)
HEAD
main
Eric Buehler
2025-01-16
1
-21
/
+74
*
ModernBERT model (#2713)
Jani Monoses
2025-01-13
2
-1
/
+12
*
Clippy fixes for 1.84. (#2710)
Laurent Mazare
2025-01-10
1
-2
/
+2
*
Lint fixes introduced with Rust 1.83 (#2646)
Anubhab Bandyopadhyay
2024-11-28
2
-10
/
+10
*
Provide a method to allow PTH files with state maps to be loaded. (#2639)
zachcp
2024-11-26
1
-1
/
+11
*
Update docs (#2553)
zachcp
2024-11-11
9
-0
/
+34
*
Add some fast Metal MLX SDPA kernels (#2584)
Eric Buehler
2024-11-05
2
-0
/
+396
*
Improved launch config for layer-norm/rms-norm. (#2591)
Laurent Mazare
2024-11-04
2
-4
/
+66
*
Make the RNN configs accessible from the models. (#2541)
Laurent Mazare
2024-10-04
1
-72
/
+103
*
Add/lstm direction (#2455)
Justin Sing
2024-09-30
1
-8
/
+25
*
Add Pixtral. (#2521)
Laurent Mazare
2024-09-30
1
-14
/
+22
*
Add a RotatingKVCache. (#2493)
Laurent Mazare
2024-09-23
2
-1
/
+333
*
onnx: implement LSTM op (#2268)
shua
2024-08-19
1
-0
/
+4
*
update: LSTMState and GRUState fields to be public (#2384)
Justin Sing
2024-08-01
1
-3
/
+3
*
Add support for Llama 3.1 (#2359)
Eric Buehler
2024-07-26
2
-3
/
+4
*
Depth Anything v2 (#2279)
Jeroen Vlek
2024-06-24
1
-1
/
+22
*
add where_cond f32 for metal (#2236)
Lionel Touati
2024-06-02
1
-1
/
+1
*
Enable the new layer-norm. (#2213)
Laurent Mazare
2024-05-24
2
-5
/
+19
*
Add the layernorm specialized op. (#2212)
Laurent Mazare
2024-05-24
2
-5
/
+280
*
Simplify the KvCache api. (#2207)
Laurent Mazare
2024-05-23
1
-36
/
+53
*
Add a couple kv-cache helper functions. (#2206)
Laurent Mazare
2024-05-23
1
-0
/
+29
*
Add a slice_set op. (#2193)
Laurent Mazare
2024-05-18
2
-0
/
+102
*
Fix VarBuilder::from_slice_safetensors (#2180)
Harry Stern
2024-05-12
1
-4
/
+30
*
Add SliceSafetensors. (#2179)
Laurent Mazare
2024-05-11
1
-0
/
+6
*
Bump the version number to 0.5.1. (#2155)
Laurent Mazare
2024-05-03
1
-1
/
+1
*
Bug Fix: When converting a tensor to a variable, clone if the tensor is alrea...
Jeffrey Dallatezza
2024-04-29
1
-2
/
+44
*
Fix sigmoid gradient calculation and move sigmoid into a specialized op (#2114)
MilkFather
2024-04-29
2
-2
/
+197
*
Apply the cast before the scaling. (#2135)
Laurent Mazare
2024-04-28
1
-1
/
+1
*
Use the faster rms-norm kernel for llama. (#2107)
Laurent Mazare
2024-04-22
1
-4
/
+13
*
Add a convenient way to rename tensors accessed through a varbuilder. (#2052)
Laurent Mazare
2024-04-13
1
-0
/
+93
*
Add the rope THD kernel. (#2014)
Laurent Mazare
2024-04-05
2
-0
/
+262
*
Relax the contiguous check for cuda kernels. (#2000)
Laurent Mazare
2024-04-03
1
-1
/
+1
*
Add benchmarks for the candle-nn package (#1995)
Thomas Santerre
2024-04-03
5
-0
/
+175
*
Add fn 'get_with_hints_dtype' in VarBuilder (#1877) (#1897)
yinqiwen
2024-04-01
1
-4
/
+15
*
Fix detail in new RoPE implementation (#1935)
Hugo Abonizio
2024-03-25
1
-1
/
+1
*
Contiguous variant of the rope kernel. (#1929)
Laurent Mazare
2024-03-25
2
-2
/
+282
*
Fast kernels for rotary embeddings. (#1928)
Laurent Mazare
2024-03-24
4
-0
/
+277
*
RmsNorm kernel for metal. (#1895)
Laurent Mazare
2024-03-21
1
-1
/
+46
*
Custom op for RmsNorm (#1890)
Laurent Mazare
2024-03-21
2
-8
/
+197
*
Optimize the cat operation on contiguous tensors (#1855)
Laurent Mazare
2024-03-17
1
-0
/
+19
*
add clone to candle dropout (#1814)
Kirpal Grewal
2024-03-08
1
-1
/
+1
*
Improve metal buffer usage (#1807)
ivarflakstad
2024-03-07
1
-1
/
+2
*
Add the StarCoder2 model. (#1779)
Laurent Mazare
2024-02-28
1
-0
/
+4
*
Encodec model. (#1771)
Laurent Mazare
2024-02-27
1
-1
/
+1
*
Tweak the VarMap set type. (#1758)
Laurent Mazare
2024-02-25
2
-2
/
+39
*
Support for attention bias in gemma + refactor things a bit. (#1744)
Laurent Mazare
2024-02-22
2
-6
/
+19
*
Bugfix for applying the bias in conv1d-transpose. (#1732)
Laurent Mazare
2024-02-18
1
-1
/
+1
*
Support for groups in conv-transpose1d. (#1731)
Laurent Mazare
2024-02-18
1
-3
/
+13
*
Expose the weights and biases in transposed convolutions. (#1727)
Laurent Mazare
2024-02-18
1
-0
/
+16
*
Expose more conv1d functions/structs. (#1726)
Laurent Mazare
2024-02-17
2
-2
/
+19
[next]