path: root/candle-examples/examples/mnist-training/main.rs

Commit log (each entry: message, author, date, files changed, lines -/+)
* Allow for different behavior between training and eval (#1213)
  Laurent Mazare, 2023-10-29 (1 file, -2/+2)
  - Forward with training.
  - Do not use dropout on vgg evaluation.
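For context: this is the train/eval split pattern. A minimal sketch of the trait shape involved (the name ModuleT and the forward_t signature follow candle_nn; treat the exact details as assumptions):

```rust
use candle_core::{Result, Tensor};

// A forward pass parameterized by a train flag: layers such as dropout
// are active when `train` is true and become a no-op at evaluation,
// which is what the vgg fix above relies on.
trait ModuleT {
    fn forward_t(&self, xs: &Tensor, train: bool) -> Result<Tensor>;
}
```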
* Add the optimizer trait. (#702)
  Laurent Mazare, 2023-09-01 (1 file, -2/+2)
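The optimizer trait abstracts the update step so implementations such as SGD (and later AdamW) are interchangeable. A hedged sketch of the calling convention, assuming candle_nn's Optimizer/SGD API of this period:

```rust
use candle_core::{DType, Device, Result, Tensor};
use candle_nn::{loss, Module, Optimizer, VarBuilder, VarMap, SGD};

fn train_step() -> Result<()> {
    let dev = Device::Cpu;
    let varmap = VarMap::new();
    let vb = VarBuilder::from_varmap(&varmap, DType::F32, &dev);
    let model = candle_nn::linear(784, 10, vb)?;
    // The trait is generic over the optimizer; SGD is one implementation.
    let mut opt = SGD::new(varmap.all_vars(), 0.1)?;
    let xs = Tensor::zeros((8, 784), DType::F32, &dev)?;
    let ys = Tensor::zeros(8, DType::U32, &dev)?;
    let logits = model.forward(&xs)?;
    let loss = loss::cross_entropy(&logits, &ys)?;
    // backward_step computes gradients and applies the update in one call.
    opt.backward_step(&loss)?;
    Ok(())
}
```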
* Mnist training dropout (#677)
  Laurent Mazare, 2023-08-30 (1 file, -7/+11)
  - Use dropout in the mnist training.
  - Fix.
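A sketch of how dropout might be wired into the mnist MLP, with the train flag threaded through the forward pass (the layer sizes and field names here are illustrative, not necessarily the example's exact ones):

```rust
use candle_core::{Result, Tensor};
use candle_nn::{Dropout, Linear, Module, VarBuilder};

struct Mlp {
    ln1: Linear,
    ln2: Linear,
    dropout: Dropout,
}

impl Mlp {
    fn new(vb: VarBuilder) -> Result<Self> {
        Ok(Self {
            ln1: candle_nn::linear(784, 100, vb.pp("ln1"))?,
            ln2: candle_nn::linear(100, 10, vb.pp("ln2"))?,
            dropout: Dropout::new(0.5),
        })
    }

    fn forward(&self, xs: &Tensor, train: bool) -> Result<Tensor> {
        let xs = self.ln1.forward(xs)?.relu()?;
        // Only active when train is true; a no-op at evaluation time.
        let xs = self.dropout.forward(&xs, train)?;
        self.ln2.forward(&xs)
    }
}
```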
* Add some documentation. (#673)
  Laurent Mazare, 2023-08-30 (1 file, -1/+1)
  - Add some documentation.
  - Bump the crate version.
* Simplify usage of the pool functions. (#662)
  Laurent Mazare, 2023-08-29 (1 file, -7/+9)
  - Simplify usage of the pool functions.
  - Small tweak.
  - Attempt at using apply to simplify the convnet definition.
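On the apply simplification: Tensor::apply lets layer calls chain instead of nesting. A hedged sketch of what a simplified convnet forward can look like (the fields are hypothetical; the single-argument max_pool2d follows the simplified pooling API this commit mentions):

```rust
use candle_core::{Result, Tensor};
use candle_nn::{Conv2d, Module};

struct ConvNet {
    conv1: Conv2d,
    conv2: Conv2d,
}

impl ConvNet {
    fn forward(&self, xs: &Tensor) -> Result<Tensor> {
        // apply + method chaining instead of nested function calls.
        xs.apply(&self.conv1)?
            .max_pool2d(2)?
            .apply(&self.conv2)?
            .max_pool2d(2)
    }
}
```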
* Add a convnet training example. (#661)
  Laurent Mazare, 2023-08-29 (1 file, -1/+104)
  - Add a convnet example.
  - Dataset fix.
  - Randomize batches.
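On "randomize batches": one common scheme is to shuffle an index vector each epoch and gather rows with index_select. A sketch under that assumption (uses the rand crate; the helper names are made up for illustration):

```rust
use candle_core::{Device, Result, Tensor};
use rand::seq::SliceRandom;

// Shuffle 0..n once per epoch, then split into batch-sized chunks.
fn shuffled_batches(n: usize, batch_size: usize) -> Vec<Vec<u32>> {
    let mut idx: Vec<u32> = (0..n as u32).collect();
    idx.shuffle(&mut rand::thread_rng());
    idx.chunks(batch_size).map(|c| c.to_vec()).collect()
}

fn batch(images: &Tensor, idx: &[u32], dev: &Device) -> Result<Tensor> {
    let idx = Tensor::new(idx, dev)?;
    // Gather the rows for this batch in shuffled order.
    images.index_select(&idx, 0)
}
```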
* Re-enable local dir for mnist.
  Nicolas Patry, 2023-08-28 (1 file, -1/+9)
* Training:
  Nicolas Patry, 2023-08-28 (1 file, -1/+1)
  - Removed a lot of surface (SerializedFileReader ownership is really painful).
  - Moved example + vision to hf.co version.
  - Removed feature gate.
* VarBuilder cleanup (#627)
  Laurent Mazare, 2023-08-27 (1 file, -2/+2)
  - VarBuilder cleanup.
  - Implement the basic varbuilders.
  - Add the sharded code.
  - Proper support for tensor sharding.
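For orientation on the VarBuilder API: a VarBuilder hands out named, namespaced tensors backed by a VarMap, so the same model-building code serves both fresh initialization and checkpoint loading. A hedged sketch:

```rust
use candle_core::{DType, Device, Result};
use candle_nn::{VarBuilder, VarMap};

fn build() -> Result<()> {
    let dev = Device::Cpu;
    let varmap = VarMap::new();
    let vb = VarBuilder::from_varmap(&varmap, DType::F32, &dev);
    // pp ("push prefix") namespaces the variables, e.g. "ln1.weight".
    let _ln1 = candle_nn::linear(784, 100, vb.pp("ln1"))?;
    let _ln2 = candle_nn::linear(100, 10, vb.pp("ln2"))?;
    Ok(())
}
```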
* Add a yolo-v3 example. (#528)
  Laurent Mazare, 2023-08-20 (1 file, -0/+3)
  - Add a couple functions required for yolo.
  - Add the yolo-v3 example.
  - Add minimum and maximum.
  - Use the newly introduced maximum.
  - Cuda support for min/max + add some testing.
  - Allow for more tests to work with accelerate.
  - Fix a typo.
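The minimum/maximum ops referenced here are elementwise. A tiny sketch, assuming Tensor::maximum and Tensor::minimum accept scalars as well as tensors:

```rust
use candle_core::{Result, Tensor};

// Elementwise max then min, i.e. clamp every value into [0, 1].
fn clamp_like(xs: &Tensor) -> Result<Tensor> {
    xs.maximum(0.0)?.minimum(1.0)
}
```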
* Add a simple Module trait and implement it for the various nn layers (#500)
  Laurent Mazare, 2023-08-18 (1 file, -1/+1)
  - Start adding the module trait.
  - Use the module trait.
  - Implement module for qmatmul.
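The Module trait gives every layer one composable forward signature. A sketch of implementing it for a toy layer (trait path per candle_nn; the layer itself is invented for illustration):

```rust
use candle_core::{Result, Tensor};
use candle_nn::Module;

// A hypothetical layer that just rescales its input.
struct Scale(f64);

impl Module for Scale {
    fn forward(&self, xs: &Tensor) -> Result<Tensor> {
        // Any type with this method can be used via Tensor::apply.
        xs.affine(self.0, 0.0)
    }
}
```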
* Add the candle-datasets crate (#322)
  Laurent Mazare, 2023-08-05 (1 file, -2/+2)
  - Move the vision datasets to a separate crate.
  - Move the batcher bits.
  - Update the readme.
  - Move the tiny-stories bits.
  Co-authored-by: Jane Doe <jane.doe@example.org>
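After this split, the mnist example pulls its data from the new crate. A hedged sketch of the load call (I believe the entry point is candle_datasets::vision::mnist::load, but treat the exact path and field names as assumptions; requires a candle-datasets dependency):

```rust
use candle_core::Result;

fn main() -> Result<()> {
    // Downloads/loads MNIST; the struct bundles train and test splits.
    let m = candle_datasets::vision::mnist::load()?;
    println!("train images: {:?}", m.train_images.shape());
    println!("train labels: {:?}", m.train_labels.shape());
    Ok(())
}
```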
* Llama more training (#297)
  Laurent Mazare, 2023-08-01 (1 file, -124/+15)
  - Rework the var-builder to handle initializations.
  - Add some helper functions for layer creation.
  - Improve the layer initializations.
  - Get initialized variables.
  - Precompute the rot embeddings when training llamas.
* Make the nll op closer to the pytorch version + add a test. (#286)
  Laurent Mazare, 2023-07-31 (1 file, -4/+1)
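Matching PyTorch here means nll expects log-probabilities, so it pairs with log_softmax rather than taking raw logits. A sketch of the resulting call pattern (paths per candle_nn; hedged):

```rust
use candle_core::{Result, Tensor};
use candle_nn::{loss, ops};

fn mnist_loss(logits: &Tensor, labels: &Tensor) -> Result<Tensor> {
    // nll takes log-probabilities plus u32 class indices,
    // mirroring torch.nn.functional.nll_loss.
    let log_sm = ops::log_softmax(logits, 1)?;
    loss::nll(&log_sm, labels)
}
```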
* Add a flag to set the number of epochs in the mnist training (#283)
  Laurent Mazare, 2023-07-31 (1 file, -14/+28)
  - Add a flag to change the number of epochs for the mnist training.
  - Increase the learning rate for the MLP.
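A flag like this is typically a clap derive argument. A hedged sketch; the flag name and default are illustrative rather than the example's exact values:

```rust
use clap::Parser;

#[derive(Parser)]
struct Args {
    /// Number of training epochs (illustrative default).
    #[arg(long, default_value_t = 200)]
    epochs: usize,
}

fn main() {
    let args = Args::parse();
    for epoch in 1..=args.epochs {
        // ... run one training epoch ...
        let _ = epoch;
    }
}
```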
* Load a trained checkpoint in the mnist example. (#280)
  Laurent Mazare, 2023-07-30 (1 file, -3/+36)
  (See the save/load sketch after the next entry.)
* Add a flag to save the trained weights. (#279)
  Laurent Mazare, 2023-07-30 (1 file, -0/+228)
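These last two entries pair up: first a flag to save the trained weights, then loading them back as a checkpoint. With a VarMap both directions are a single call; a hedged sketch (the safetensors file name is illustrative):

```rust
use candle_core::Result;
use candle_nn::VarMap;

fn save_and_restore(varmap: &mut VarMap) -> Result<()> {
    // Persist all trainable variables to a safetensors file.
    varmap.save("mnist.safetensors")?;
    // Later (or in another run): overwrite the variables in place,
    // which is how the example can resume from a checkpoint.
    varmap.load("mnist.safetensors")?;
    Ok(())
}
```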