index
:
forks/candle.git
main
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
candle-transformers
/
src
/
models
/
mistral.rs
Commit message (
Expand
)
Author
Age
Files
Lines
*
Add Pixtral. (#2521)
Laurent Mazare
2024-09-30
1
-4
/
+34
*
Support for mistral-nemo. (#2396)
Laurent Mazare
2024-08-04
1
-5
/
+12
*
Use cat for faster MQA computation. (#2043)
Laurent Mazare
2024-04-12
1
-14
/
+2
*
Use the new rope kernel in mistral. (#1937)
Laurent Mazare
2024-03-25
1
-14
/
+6
*
Support more mistral models. (#1927)
Laurent Mazare
2024-03-24
1
-20
/
+26
*
Avoid broadcasting on the batch dimension for the attention mask. (#1920)
Laurent Mazare
2024-03-23
1
-4
/
+3
*
Use a common with_tracing::RmsNorm in a few models. (#1871)
Jani Monoses
2024-03-18
1
-22
/
+1
*
add config_amazon_mistral_lite (#1493)
Daniel Clough
2023-12-28
1
-0
/
+18
*
feat: add clear_kv_cache to mistral and qmistral models (#1464)
drbh
2023-12-21
1
-0
/
+14
*
make fn name generic (#1459)
Daniel Clough
2023-12-21
1
-1
/
+2
*
add fn config_chat_ml (#1458)
Daniel Clough
2023-12-20
1
-0
/
+19
*
Make some model cloneable. (#1125)
Laurent Mazare
2023-10-18
1
-6
/
+6
*
Quantized version of mistral. (#1009)
Laurent Mazare
2023-09-30
1
-12
/
+12
*
Use flash-attn for mistral. (#1004)
Laurent Mazare
2023-09-30
1
-8
/
+37
*
Use a silu activation in mistral. (#991)
Laurent Mazare
2023-09-29
1
-1
/
+1
*
Add the sliding window. (#986)
Laurent Mazare
2023-09-28
1
-2
/
+9
*
Mistral gpu fix (#985)
Laurent Mazare
2023-09-28
1
-5
/
+8
*
Add the mistral example. (#984)
Laurent Mazare
2023-09-28
1
-11
/
+14
*
Add the Mistral 7b model (#983)
Laurent Mazare
2023-09-28
1
-0
/
+364