Graph | Commit message | Author | Age | Files | Lines |
---|---|---|---|---|---|
* | bert attention mask (#1934) | Zheng Li | 2024-08-01 | 1 | -2/+10 |
| | * bert attention mask * Allow for using None as a mask. * Revert part of the changes so that the proper default mask applies. * Cosmetic change. * Another cosmetic tweak. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com> | | | | |
* | Speed up bert with approx gelu (#1410) | Juarez Bochi | 2023-12-06 | 1 | -2/+9 |
* | Handle LongStorage in pytorch checkpoints. (#1152) | Laurent Mazare | 2023-10-22 | 1 | -27/+20 |
* | Adapt more examples to the updated safetensor api. (#947) | Laurent Mazare | 2023-09-23 | 1 | -3/+2 |
| | * Simplify the safetensor usage. * Convert more examples. * Move more examples. * Adapt stable-diffusion. | | | | |
* | Move some models to candle-transformers so that it's easier to re-use. (#794) | Laurent Mazare | 2023-09-10 | 1 | -2/+1 |
| | * Move some models to candle-transformers so that they can be shared. * Also move falcon. * Move Llama. * Move whisper (partial). | | | | |
* | Upgrading hf-hub. | Nicolas Patry | 2023-08-29 | 1 | -4/+4 |
* | Add a yolo-v3 example. (#528) | Laurent Mazare | 2023-08-20 | 1 | -0/+3 |
| | * Add a couple functions required for yolo. * Add the yolo-v3 example. * Add minimum and maximum. * Use the newly introduced maximum. * Cuda support for min/max + add some testing. * Allow for more tests to work with accelerate. * Fix a typo. | | | | |
* | Add a cuda kernel for upsampling. (#441) | Laurent Mazare | 2023-08-14 | 1 | -1/+4 |
| | * Add a cuda kernel for upsampling. * Update for the latest tokenizers version. | | | | |
* | Normalize embeddings in the bert example. (#390) | Laurent Mazare | 2023-08-10 | 1 | -0/+14 |
* | Upgrading hf-hub to `0.2.0` (Modified API to not pass the Repo around | Nicolas Patry | 2023-07-27 | 1 | -3/+4 |
| | all the time) | | | | |
* | Rename the .r functions to .dims so as to be a bit more explicit. (#220) | Laurent Mazare | 2023-07-22 | 1 | -1/+1 |
* | Removing `candle-hub` internal to extract into `hf-hub` standalone. | Nicolas Patry | 2023-07-19 | 1 | -1/+1 |
* | Use mkl to accelerate binary ops. (#190) | Laurent Mazare | 2023-07-18 | 1 | -2/+5 |
| | * Vectorized binary ops with mkl. * Improve the binary op mkl support. * Push the support for mkl binary ops. * Proper vectorization of binary ops. * Proper mkl'isation when broadcasting binary ops. | | | | |
* | Bert tracing (#184) | Laurent Mazare | 2023-07-17 | 1 | -461/+20 |
| | * Add some tracing to bert. * More tracing. * Add a flag for tracing. | | | | |
* | Add some 'cuda-if-available' helper function. (#172) | Laurent Mazare | 2023-07-15 | 1 | -14/+1 |
* | Removing cuda default. | Nicolas Patry | 2023-07-14 | 1 | -1/+10 |
| | Seems very important for a lot of exploring users usually on laptop without GPUs. Adding more README instructions in a follow up. | | | | |
* | Simplify the parameters used by sum and sum_keepdim. (#165) | Laurent Mazare | 2023-07-14 | 1 | -1/+1 |
* | Use the same default as pytorch for sum. (#164) | Laurent Mazare | 2023-07-13 | 1 | -4/+4 |
* | Tensor mutability (#154) | Laurent Mazare | 2023-07-13 | 1 | -1/+1 |
| | * Working towards tensor mutability. * Use a ref-cell to provide tensor mutability. | | | | |
* | Remove some dead-code pragmas. (#137) | Laurent Mazare | 2023-07-11 | 1 | -10/+2 |
* | VarBuilder path creation (#131) | Laurent Mazare | 2023-07-10 | 1 | -72/+42 |
| | * Use a struct for the safetensor+routing. * Group the path and the var-builder together. * Fix for the empty path case. | | | | |
* | Move the var-builder in a central place. (#130) | Laurent Mazare | 2023-07-10 | 1 | -61/+3 |
* | [nn] Move the Embedding and Activation parts. (#116) | Laurent Mazare | 2023-07-10 | 1 | -30/+7 |
| | * Share the Embedding and Activation parts. * Tweak some activations. | | | | |
* | Sketch the candle-nn crate. (#115) | Laurent Mazare | 2023-07-10 | 1 | -76/+33 |
| | * Sketch the candle-nn crate. * Tweak the cuda dependencies. * More cuda tweaks. | | | | |
* | Merge pull request #92 from LaurentMazare/sync_hub | Nicolas Patry | 2023-07-07 | 1 | -8/+7 |
|\ | Creating new sync Api for `candle-hub`. | | | | |
| * | Creating new sync Api for `candle-hub`. | Nicolas Patry | 2023-07-06 | 1 | -8/+7 |
| | | - `api::Api` -> `api::tokio::api` (And created new `api::sync::Api`). - Remove `tokio` from all our examples. - Using similar codebase for now instead of ureq (for simplicity). | | | | |
* | | Enabling `roberta` for the example (it's the same model as Bert, with | Nicolas Patry | 2023-07-06 | 1 | -8/+26 |
|/ | just different naming.) | | | | |
* | Add mkl support for matrix multiply. (#86) | Laurent Mazare | 2023-07-06 | 1 | -0/+4 |
| | * Fix some rebase issues. * Use mkl instead. * Use mkl in bert. * Add the optional mkl feature. * Conditional compilation based on the mkl feature. * Add more mkl support. | | | | |
* | Add a simpler way to specify the dim index for some ops. | laurent | 2023-07-05 | 1 | -2/+2 |
* | Use avg pooling before the cosine similarity. | laurent | 2023-07-05 | 1 | -3/+4 |
* | Add some sentence similarity comparision to the bert example. | laurent | 2023-07-05 | 1 | -63/+122 |
* | Some polish. | Nicolas Patry | 2023-07-05 | 1 | -7/+11 |
* | Adding offline mode. | Nicolas Patry | 2023-07-05 | 1 | -11/+41 |
* | Upgrading bert example to work with `bert-base-uncased`. | Nicolas Patry | 2023-07-04 | 1 | -16/+61 |
| | - Always take weights from the hub - Optional `model_id` + `revision` to use safetensors version potentially - Optional loading for `bert-base-uncased` (`weight` vs `gamma`). - Take the config from the hub. | | | | |
* | Add a batch dimension on the bert example. | laurent | 2023-07-04 | 1 | -11/+15 |
* | Allow for batch dimensions in the embedding layer. | laurent | 2023-07-03 | 1 | -6/+14 |
* | Minor tweaks. | laurent | 2023-07-03 | 1 | -1/+2 |
* | Remove the fixed length hack. | laurent | 2023-07-03 | 1 | -1/+1 |
* | Bugfix for transpose. | laurent | 2023-07-03 | 1 | -3/+4 |
* | Fix the layer norm to properly handle bias. | laurent | 2023-07-03 | 1 | -11/+30 |
* | Get some embeddings out. | laurent | 2023-07-03 | 1 | -14/+23 |
* | Get the tensors to be loaded properly. | laurent | 2023-07-03 | 1 | -11/+72 |
* | Add the varbuilder + check shapes. | laurent | 2023-07-03 | 1 | -62/+156 |
* | Model creation. | laurent | 2023-07-03 | 1 | -6/+134 |
* | Complete (?) the forward pass. | laurent | 2023-07-03 | 1 | -2/+28 |
* | Add more to the forward pass. | laurent | 2023-07-03 | 1 | -9/+48 |
* | Boilerplate code for Bert. | laurent | 2023-07-03 | 1 | -0/+239 |