index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Commit message (
Expand
)
Author
Age
Files
Lines
*
Sync upstream MLX sdpa vector kernels with mask (#2718)
HEAD
main
Eric Buehler
2025-01-16
3
-49
/
+486
*
Bump the ug dependency. (#2720)
Laurent Mazare
2025-01-16
2
-4
/
+4
*
Fix the helium weights download. (#2717)
Laurent Mazare
2025-01-13
1
-1
/
+1
*
Helium repo update. (#2716)
Laurent Mazare
2025-01-13
2
-2
/
+8
*
Add the helium model. (#2715)
Laurent Mazare
2025-01-13
4
-0
/
+699
*
Fixes for running Phi-4 quantized. (#2714)
Jani Monoses
2025-01-13
2
-2
/
+6
*
ModernBERT model (#2713)
Jani Monoses
2025-01-13
6
-1
/
+612
*
Clippy fixes for 1.84. (#2710)
Laurent Mazare
2025-01-10
2
-6
/
+3
*
Update cudarc. (#2708)
Laurent Mazare
2025-01-08
1
-1
/
+1
*
Bump the caret version to 0.8.2. (#2703)
Laurent Mazare
2025-01-07
5
-16
/
+16
*
add link to README (#2701)
Andrei Fajardo
2025-01-04
1
-0
/
+1
*
Fix mistral attention on Metal (#2699)
Luka Zakrajšek
2025-01-04
1
-1
/
+2
*
UniPC for diffusion sampling (#2684)
Nick Senger
2025-01-01
6
-5
/
+1011
*
Update the hf-hub dependency to 0.4.0. (#2691)
Laurent Mazare
2024-12-31
2
-5
/
+5
*
Actually remove the default hf-hub cache path for glm. (#2696)
Laurent Mazare
2024-12-31
1
-1
/
+1
*
Use the default hf-hub cache for glm. (#2695)
Laurent Mazare
2024-12-31
1
-7
/
+10
*
Flash-Attn upgrade / SoftCap Candle-FlashAttn [3/n] (#2690)
Michael Feil
2024-12-31
3
-4
/
+7
*
Flash-Attn upgrade / SoftCap Candle-FlashAttn [2/n] (#2689)
Michael Feil
2024-12-31
4
-3
/
+182
*
Flash-Attn upgrade / SoftCap Candle-FlashAttn [1/n] (#2688)
Michael Feil
2024-12-31
41
-82
/
+139
*
Streamline the glm4 example. (#2694)
Laurent Mazare
2024-12-31
3
-147
/
+99
*
Fix a cuda warning. (#2693)
Laurent Mazare
2024-12-31
1
-39
/
+44
*
Update README.org (#2670)
jetsung
2024-12-30
1
-1
/
+1
*
Added XLMRobertaModel for Reranking (#2686)
Akshay Ballal
2024-12-30
4
-0
/
+853
*
Fix bug in whisper transformer (#2681)
mert-kurttutan
2024-12-24
1
-0
/
+1
*
Fix Batcher iterator break when return_last_incomplete_batch and items.is_emp...
hhllhhyyds
2024-12-24
1
-4
/
+4
*
Fix position encodings for Pixtral (#2678)
Amélie Royer
2024-12-23
1
-13
/
+55
*
Add a Context trait similar to anyhow::Context. (#2676)
Laurent Mazare
2024-12-22
13
-41
/
+97
*
make DepthAnythingV2 more reusable (#2675)
Edgar Riba
2024-12-21
2
-23
/
+27
*
Bump the crate version to 0.8.1. (#2662)
Laurent Mazare
2024-12-07
5
-16
/
+16
*
Change/bert encoder public (#2658)
Justin Sing
2024-12-04
1
-21
/
+30
*
Add Nvembed v2 model (#2649)
cdoko
2024-12-03
6
-0
/
+803
*
add scatter add (#2656)
zachcp
2024-12-01
2
-0
/
+2
*
add u32 - U32 gather (#2653)
zachcp
2024-11-30
2
-79
/
+81
*
Clippy fixes for the cuda feature. (#2650)
Laurent Mazare
2024-11-29
2
-11
/
+11
*
Adds support for stella_en_v5 embedding model -400M variant (#2608)
iskng
2024-11-29
3
-112
/
+555
*
Lint fixes introduced with Rust 1.83 (#2646)
Anubhab Bandyopadhyay
2024-11-28
19
-55
/
+57
*
Fix for whisper-microphone example failure if audio isn't chunk aligned (#2645)
Adam Nelson
2024-11-27
1
-3
/
+17
*
Onnx Support for Sign operation #2641 (#2642)
Ionut Mihalcea
2024-11-26
2
-0
/
+47
*
Provide a method to allow PTH files with state maps to be loaded. (#2639)
zachcp
2024-11-26
1
-1
/
+11
*
fix typo (#2606)
Andrei Fajardo
2024-11-23
1
-1
/
+1
*
Tweak the CI to avoid running out of disk space. (#2630)
Laurent Mazare
2024-11-19
1
-0
/
+3
*
20241118 docs (#2629)
zachcp
2024-11-19
27
-12
/
+72
*
Import the ggml_cuda_dp4a function. (#2628)
Laurent Mazare
2024-11-19
1
-33
/
+44
*
Fix for clippy. (#2626)
Laurent Mazare
2024-11-18
1
-1
/
+1
*
Module Docs (#2624)
zachcp
2024-11-18
39
-115
/
+170
*
More Model Module Docs (#2623)
zachcp
2024-11-17
12
-72
/
+291
*
Module Docs (#2620)
zachcp
2024-11-16
5
-10
/
+126
*
Remove some unused macros. (#2618)
Laurent Mazare
2024-11-15
9
-14
/
+13
*
Documentation Pass for Models (#2617)
zachcp
2024-11-15
94
-51
/
+1001
*
Add max-all/min-all. (#2616)
Laurent Mazare
2024-11-14
1
-0
/
+36
[next]