path: root/candle-wasm-tests
Commit message | Author | Age | Files | Lines
* Add Stable Diffusion 3 Example (#2558) | Czxck001 | 2024-10-13 | 1 | -0/+1
  * Add stable diffusion 3 example
    Add get_qkv_linear to handle different dimensionality in linears
    Add stable diffusion 3 example
    Add use_quant_conv and use_post_quant_conv for vae in stable diffusion
    adapt existing AutoEncoderKLConfig to the change
    add forward_until_encoder_layer to ClipTextTransformer
    rename sd3 config to sd3_medium in mmdit; minor clean-up
    Enable flash-attn for mmdit impl when the feature is enabled.
    Add sd3 example codebase
    add document crediting references
    pass the cargo fmt test
    pass the clippy test
  * fix typos
  * expose cfg_scale and time_shift as options
  * Replace the sample image with JPG version. Change image output format accordingly.
  * make meaningful error messages
  * remove the tail-end assignment in sd3_vae_vb_rename
  * remove the CUDA requirement
  * use default_value in clap args
  * add use_flash_attn to turn on/off flash-attn for MMDiT at runtime
  * resolve clippy errors and warnings
  * use default_value_t
  * Pin the web-sys dependency.
  * Clippy fix.
  ---------
  Co-authored-by: Laurent <laurent.mazare@gmail.com>
* chore: fix some typos in comments (#2121) | hardlydearly | 2024-04-28 | 1 | -1/+1
  Signed-off-by: hardlydearly <799511800@qq.com>
* Quantized GGUF style (#1523) | Nicolas Patry | 2024-01-17 | 1 | -1/+1
  * Metal quantized modifications proposal.
    - Add a device param, wherever needed.
    - Create new QMetal storage thing that implements QuantizedType.
    - Update everywhere needed.
    Fix Python.
    Fixing examples.
    Fix: fmt + clippy + stub.
    Moving everything around.
    Only missing the actual implems.
    Fixing everything + adding dequantized kernels.
    More work.
    Fixing matmul.
    Fmt + Clippy
    Some clippy fixes.
    Working state.
    Q2K Metal -> Bugged (also present in GGML).
    Q4K CPU -> Bugged (present previously, new test catch it).
    Q5K CPU -> Bugged (present previously).
    Q8_1 Both -> Never really implemented it seems
    Q8K metal -> Never implemented in metal
    Fixing Q2K bug (present in ggml).
  * Cleanup.
  * Fix the rebase.
  * Removing the fences speeds everything up and *is* correct this time...
  * Cleanup the fence.
  * After rebase.
  * Bad code removal.
  * Rebase after phi2 merge + fix replit default to CPU.
  * Making the CI happy.
  * More happy tests.
  ---------
  Co-authored-by: Nicolas Patry <nicolas@Nicolass-MacBook-Pro.local>
* Simplifying our internal cargo dependencies. (#1529) | Nicolas Patry | 2024-01-07 | 1 | -1/+1
* Bump the crate version to 0.3.3. (#1490) | Laurent Mazare | 2023-12-28 | 1 | -1/+1
* Bump the crate version to 0.3.2. (#1452) | Laurent Mazare | 2023-12-17 | 1 | -1/+1
* Implement the module trait directly for QMatMul. (#1372) | Laurent Mazare | 2023-11-25 | 1 | -1/+1
* Update for 0.3.1. (#1324) | Laurent Mazare | 2023-11-11 | 1 | -1/+1
* AVX optimized q8k vecdot. (#1024) | Laurent Mazare | 2023-10-03 | 1 | -0/+9
* Improve the quantized whisper setup. (#1018) | Laurent Mazare | 2023-10-02 | 1 | -1/+1
  * Improve the quantized whisper setup.
  * Fix the config file paths.
  * Use the standard matmul where possible.
* Improve the testing of the optimized quantized vec-dot ops (#1016) | Laurent Mazare | 2023-10-02 | 1 | -0/+7
  * Expose the unopt functions for testing.
  * Better testing of the optimized quantized computations.
* Bump the version to 0.3.0. (#1014) | Laurent Mazare | 2023-10-01 | 1 | -1/+1
  * Bump the version to 0.3.0.
  * Changelog update.
* Simd128 version of the q2k-q8k vecdot product. (#1011) | Laurent Mazare | 2023-09-30 | 1 | -1/+1
  * Sketch the simd128 version of q2k vecdot.
  * Use a single accumulator.
  * Simdify the q2k-q8k vecdot product.
  * Cosmetic change.
* Add the remaining quantized tests to the wasm suite. (#980) | Laurent Mazare | 2023-09-28 | 1 | -2/+32
* Sketch a simd128 optimized q4k vecdot. (#977) | Laurent Mazare | 2023-09-27 | 1 | -0/+6
  * Sketch a simd128 optimized q4k vecdot.
  * Simdify.
  * More quantization optimizations.
  * Again more simdification.
  * Simdify the splitting loop.
* Simd128 vec-dot for q4_0. (#974) | Laurent Mazare | 2023-09-27 | 5 | -0/+197
  * Simd128 vec-dot for q4_0.
  * Bugfix.
  * Add wasm tests.
  * Bugfix for the q40 vecdot.
  * More quantization tests.