forks/candle.git

	Commit message	Author	Age	Files	Lines
*	Fix for backprop in ConvTranspose2D with stride of 2 (#2337)	Ivor Wanders	2024-07-17	1	-0/+97
	* Add gradient test for conv_transpose2d with stride of 2.
	* Swap dilation and stride in ConvTranspose2D backpropagation. Without this, a shape mismatch occurs with a stride of 2 and dilation of 1.
	* Add further tests of the ConvTranspose2D gradient. Values calculated with torch, minor numerical errors adjusted and commented.
*	Backwards for ConvTranspose2D (#1910)	Kirpal Grewal	2024-03-23	1	-7/+154
	* add documentation for backprop
	* add backwards for ConvTranspose2D
	* add test python code to test
*	Add support for conv_transpose2d on Metal backend (#1903)	Thomas Santerre	2024-03-21	1	-69/+55
	* add support for conv transpose 2d and add bench mark for float types
	* update bench calculation
	* enable testing all conv operations on metal
*	Add support for conv_transpose1d for metal backend (#1874)	Thomas Santerre	2024-03-19	1	-5/+0
	* first attempt
	* progress
	* integrate into metal backend
	* finish and get test passing
	* add other dtype support
	* update transpose1d dtypes supported
*	Optimize the cat operation on contiguous tensors (#1855)	Laurent Mazare	2024-03-17	1	-51/+77
	* Add a specialized kernel for copy2d.
	* Move the cat operations.
	* Avoid transpositions in cat.
	* Bugfix.
	* Bugfix for the cuda kernel.
	* Add a benchmark.
	* Add more testing.
	* Test fix.
	* Faster kernel.
	* Add the missing kernel.
	* Tweak the test.
	* Add a metal kernel.
	* Fix for the metal kernel.
	* Get the tests to pass on metal.
	* Also use this opportunity to fix the metal kernel for ELU.
	* Add some bf16 kernels.
	* Clippy fixes.
*	Fast CPU kernel for transposed 1d convolutions. (#1822)	Laurent Mazare	2024-03-08	1	-20/+24
	* Fast CPU kernel for transposed 1d convolutions.
	* Bugfix.
*	Bugfix for conv-transpose1d (#1734)	Laurent Mazare	2024-02-19	1	-0/+14
	* Add a currently broken test.
	* Bugfix + fix test.
*	Support for groups in conv-transpose1d. (#1731)	Laurent Mazare	2024-02-18	1	-1/+1
	* Groups support in conv-transpose-1d.
	* Remove dangling file.
*	ConvTranspose1d cuda support. (#1697)	Laurent Mazare	2024-02-12	1	-11/+9
	* ConvTranspose1d cuda support.
	* Add the conv-transpose1d kernel.
	* Remove some unused variables.
*	Adding the test scaffolding.	Nicolas Patry	2023-11-20	1	-7/+28
*	Test for the transposed conv1d. (#1254)	Laurent Mazare	2023-11-03	1	-0/+16
*	Fix the conv2d gradient computation. (#1214)	Laurent Mazare	2023-10-29	1	-0/+65
*	Support dilation in conv-transpose2d. (#671)	Laurent Mazare	2023-08-30	1	-0/+35
*	Fix the dilated convolutions. (#659)	Laurent Mazare	2023-08-29	1	-12/+12
*	Dilated convolutions (#657)	Laurent Mazare	2023-08-29	1	-13/+122
	* Add the dilation parameter.
	* Restore the basic optimizer example.
	* Dilation support in cudnn.
	* Use the dilation parameter in the cpu backend.
	* More dilation support.
	* No support for dilation in transposed convolutions.
	* Add dilation to a test.
	* Remove a print.
	* Helper function.
*	Cuda conv transpose (#645)	Laurent Mazare	2023-08-28	1	-69/+59
	* Cuda kernel for conv-transpose.
	* Fix the cuda kernel.
	* Fix the tests.
*	Fix the cpu kernel for conv-transpose. (#643)	Laurent Mazare	2023-08-28	1	-10/+44
*	Backprop for conv2d. (#638)	Laurent Mazare	2023-08-28	1	-0/+72
	* Start adding backprop for conv2d.
	* Backprop for conv2d.
	* Bugfix + start adding a conv2d test.
	* Conv2d backprop testing.
	* More conv fixes.
*	Add conv-transpose. (#635)	Laurent Mazare	2023-08-28	1	-0/+30
	* Add conv-transpose.
	* Return zeros for now.
	* Naive CPU implementation.
	* Add a conv-transpose test + fix the cpu implementation.
	* Add a second test.
*	Move the test-utils bits to a shared place. (#619)	Laurent Mazare	2023-08-27	1	-2/+1
*	Add to the cuda example a reproduction of the issue. (#579)	Laurent Mazare	2023-08-24	1	-0/+36
	* Add to the cuda example a reproduction of the issue.
	* Tweak.
	* Add a test using non-square matrixes.
	* Fix the conv2d kernel.
	* Display the error.
	* And tweak the comment.
*	Add a test for conv2d with padding + bugfix the random number generation on cuda. (#578)	Laurent Mazare	2023-08-24	1	-0/+12
	* Add a test for conv2d with padding.
	* Cosmetic changes.
	* Bugfix the rand function on the cuda backend.
*	Add some group parameter to convolutions. (#566)	Laurent Mazare	2023-08-23	1	-7/+7
	* Add some group parameter to convolutions.
	* Avoid some unnecessary groups checks.
	* Move the tensor convolution bits.
	* Proper handling of groups.
	* Bump the crate version.
	* And add a changelog.
*	Add a naive conv2d cuda kernel. (#438)	Laurent Mazare	2023-08-14	1	-15/+11
	* Add a naive conv2d cuda kernel.
	* Proper conv2d support on the rust side.
	* Conv1d testing on gpu.
	* Also use the test on gpus.
	* Fix the clean-ptx target.
*	Fix the conv tests. (#409)	Laurent Mazare	2023-08-11	1	-7/+5
*	Conv1d test with padding. (#356)	Laurent Mazare	2023-08-09	1	-0/+31
*	Add some conv1d test + bugfix using padding. (#349)	Laurent Mazare	2023-08-08	1	-0/+40
*	Add some conv2d tests. (#347)	Laurent Mazare	2023-08-08	1	-0/+113
	* Add some conv2d tests.
	* Add a simpler conv2d test.
	* More conv2d testing + bugfix.
	* Add a todo.