diff options
author | zachcp <zachcp@users.noreply.github.com> | 2024-11-17 14:27:24 -0500 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-11-17 20:27:24 +0100 |
commit | 12d7e7b1450f0c3f87c3cce3a2a1dd1674cb8fd7 (patch) | |
tree | 0d0b59c5538f1a61b6e9022b1bb7b39b6834db5d /candle-transformers/src/models/rwkv_v6.rs | |
parent | a3f200e36991418c25cddef0e09c426deea90606 (diff) | |
download | candle-12d7e7b1450f0c3f87c3cce3a2a1dd1674cb8fd7.tar.gz candle-12d7e7b1450f0c3f87c3cce3a2a1dd1674cb8fd7.tar.bz2 candle-12d7e7b1450f0c3f87c3cce3a2a1dd1674cb8fd7.zip |
More Model Module Docs (#2623)
* dinov2
* add another example
* ad dinov2reg4
* eva2
* efficientvit
* moondream
* update t5
* update t5
* rwkv
* stable diffusion docs
* add wasm link
* add segment_anything
* adjsut for clippy
* ignore bertdoc
* dinov2 ignore
* update block to be text
* remove the rust blocks for the moment
* bump python to 3.11
* add a setup-python step
* add py311 to test as well
Diffstat (limited to 'candle-transformers/src/models/rwkv_v6.rs')
-rw-r--r-- | candle-transformers/src/models/rwkv_v6.rs | 21 |
1 files changed, 17 insertions, 4 deletions
diff --git a/candle-transformers/src/models/rwkv_v6.rs b/candle-transformers/src/models/rwkv_v6.rs index c75aa885..5da1c5ce 100644 --- a/candle-transformers/src/models/rwkv_v6.rs +++ b/candle-transformers/src/models/rwkv_v6.rs @@ -1,7 +1,9 @@ //! RWKV v6 model implementation. //! -//! RWKV is an RNN with transformer-like performance. -//! Version 6 introduces refinements to the architecture. +//! The [RWKV model](https://wiki.rwkv.com/) is a recurrent neural network model +//! with performance on par with transformer architectures. Several variants are +//! available, candle implements the v5 and v6 versions and can be used with +//! Eagle 7B([blog post](https://blog.rwkv.com/p/eagle-7b-soaring-past-transformers)). //! //! Key characteristics: //! - Linear attention mechanism @@ -10,9 +12,20 @@ //! - Feed forward gating //! - State recycling for efficient inference //! -//! References: -//! - [RWKV Model](https://github.com/BlinkDL/RWKV-LM) +//! # Example //! +//! ```bash +//! cargo run --example rwkv --release -- \ +//! --prompt "The smallest prime is " +//! +//! > avx: true, neon: false, simd128: false, f16c: true +//! > temp: 0.00 repeat-penalty: 1.10 repeat-last-n: 64 +//! > The smallest prime is ϕ(2) = 2. +//! > The smallest composite is ϕ(3) = 3. +//! > The smallest perfect number is ϕ(5) = 5. +//! > The smallest perfect square is ϕ(4) = 4. +//! > The smallest perfect cube is ϕ(6) = 6. +//! ``` use super::with_tracing::{layer_norm, linear_no_bias as linear, LayerNorm, Linear}; use candle::{IndexOp, Result, Tensor}; |