summaryrefslogtreecommitdiff
path: root/candle-transformers/src/models/t5.rs
Commit message (Expand)AuthorAgeFilesLines
* Module Docs (#2624)zachcp2024-11-181-3/+4
* More Model Module Docs (#2623)zachcp2024-11-171-0/+43
* Documentation Pass for Models (#2617)zachcp2024-11-151-2/+16
* Lazy upcasting for t5. (#2589)Laurent Mazare2024-10-301-3/+48
* Clippy fixes for 1.81.0. (#2461)Laurent Mazare2024-09-051-1/+1
* Add support for Llama 3.1 (#2359)Eric Buehler2024-07-261-1/+1
* Avoid crashes when running T5 models with F16 tensors on CPU (#2047)Victor-Mihaila2024-04-131-1/+1
* Change for the encoder-only ProstT5 model (#2045)Victor-Mihaila2024-04-131-1/+3
* Expose the t5 config fields + allow t5-large. (#1987)Laurent Mazare2024-04-011-16/+16
* Add min to buckets in relative_position_bucket (#1312)Andy Braga2023-11-101-1/+1
* Add support to UL2 model family (#1300)Juarez Bochi2023-11-091-6/+43
* Fix bug introduced in madlad PR (#1298)Juarez Bochi2023-11-081-2/+2
* Add support for MADLAD400 (#1285)Juarez Bochi2023-11-071-2/+15
* Make more models cloneable. (#1203)Laurent Mazare2023-10-281-11/+11
* Use softmax-last-dim where possible. (#1057)Laurent Mazare2023-10-081-1/+1
* Do not use the kv-cache on external key-value states. (#1054)Laurent Mazare2023-10-071-7/+7
* Delete invalid comment (#1038)Juarez Bochi2023-10-051-2/+0
* Tracing for the phi model (#936)Laurent Mazare2023-09-231-56/+15
* Add a quantized version of the t5 model. (#921)Laurent Mazare2023-09-211-1/+1
* Add a clear cache function to the t5 model. (#919)Laurent Mazare2023-09-211-0/+30
* Add more t5 tracing. (#915)Laurent Mazare2023-09-201-4/+17
* Add more t5 tracing. (#914)Laurent Mazare2023-09-201-5/+35
* Tracing mode for T5. (#913)Laurent Mazare2023-09-201-16/+74
* Flan T5: Read lm_head when word embeddings are not tied (#903)Juarez Bochi2023-09-191-7/+43
* Fix T5 kv cache (#899)Juarez Bochi2023-09-191-1/+6
* Avoid re-encoding the input in the T5 example. (#875)Laurent Mazare2023-09-171-3/+15
* Add a KV cache to T5. (#873)Laurent Mazare2023-09-171-27/+58
* Implement T5 decoding (#864)Juarez Bochi2023-09-151-28/+153
* Add support to flan-t5 (#840)Juarez Bochi2023-09-131-5/+49
* Add some sentence similarity part to the t5 example. (#835)Laurent Mazare2023-09-131-2/+5
* T5 tweaks (#831)Laurent Mazare2023-09-131-18/+33
* Clippy fix. (#830)Laurent Mazare2023-09-131-2/+0
* Extract T5 module and add main function to use it (#829)Juarez Bochi2023-09-131-0/+441