summaryrefslogtreecommitdiff
path: root/candle-core/examples/cuda_sum_benchmark.rs
Commit message (Collapse)AuthorAgeFilesLines
* More accelerate optimizations (#427)Laurent Mazare2023-08-131-0/+3
| | | | | | | | | | | * Add more tracing to the whisper example. * Support accelerate in more examples. * Use accelerate for pointwise functions. * Use accelerate for binary operations too. * Bugfix for binary operation: use the rhs before the lhs.
* Rename the candle crate to candle-core (#301)Laurent Mazare2023-08-021-1/+1
| | | | | * Rename to candle-core. * More candle-core renaming.
* Simplify the parameters used by sum and sum_keepdim. (#165)Laurent Mazare2023-07-141-4/+4
|
* Use the same default as pytorch for sum. (#164)Laurent Mazare2023-07-131-8/+8
|
* Add some very simple sum benchmark. (#108)Laurent Mazare2023-07-081-0/+51
* Add some very simple sum benchmark. * Rename the file.