summaryrefslogtreecommitdiff
path: root/candle-transformers/src/models/rwkv_v6.rs
diff options
context:
space:
mode:
authorzachcp <zachcp@users.noreply.github.com>2024-11-17 14:27:24 -0500
committerGitHub <noreply@github.com>2024-11-17 20:27:24 +0100
commit12d7e7b1450f0c3f87c3cce3a2a1dd1674cb8fd7 (patch)
tree0d0b59c5538f1a61b6e9022b1bb7b39b6834db5d /candle-transformers/src/models/rwkv_v6.rs
parenta3f200e36991418c25cddef0e09c426deea90606 (diff)
downloadcandle-12d7e7b1450f0c3f87c3cce3a2a1dd1674cb8fd7.tar.gz
candle-12d7e7b1450f0c3f87c3cce3a2a1dd1674cb8fd7.tar.bz2
candle-12d7e7b1450f0c3f87c3cce3a2a1dd1674cb8fd7.zip
More Model Module Docs (#2623)
* dinov2 * add another example * ad dinov2reg4 * eva2 * efficientvit * moondream * update t5 * update t5 * rwkv * stable diffusion docs * add wasm link * add segment_anything * adjsut for clippy * ignore bertdoc * dinov2 ignore * update block to be text * remove the rust blocks for the moment * bump python to 3.11 * add a setup-python step * add py311 to test as well
Diffstat (limited to 'candle-transformers/src/models/rwkv_v6.rs')
-rw-r--r--candle-transformers/src/models/rwkv_v6.rs21
1 files changed, 17 insertions, 4 deletions
diff --git a/candle-transformers/src/models/rwkv_v6.rs b/candle-transformers/src/models/rwkv_v6.rs
index c75aa885..5da1c5ce 100644
--- a/candle-transformers/src/models/rwkv_v6.rs
+++ b/candle-transformers/src/models/rwkv_v6.rs
@@ -1,7 +1,9 @@
//! RWKV v6 model implementation.
//!
-//! RWKV is an RNN with transformer-like performance.
-//! Version 6 introduces refinements to the architecture.
+//! The [RWKV model](https://wiki.rwkv.com/) is a recurrent neural network model
+//! with performance on par with transformer architectures. Several variants are
+//! available, candle implements the v5 and v6 versions and can be used with
+//! Eagle 7B([blog post](https://blog.rwkv.com/p/eagle-7b-soaring-past-transformers)).
//!
//! Key characteristics:
//! - Linear attention mechanism
@@ -10,9 +12,20 @@
//! - Feed forward gating
//! - State recycling for efficient inference
//!
-//! References:
-//! - [RWKV Model](https://github.com/BlinkDL/RWKV-LM)
+//! # Example
//!
+//! ```bash
+//! cargo run --example rwkv --release -- \
+//! --prompt "The smallest prime is "
+//!
+//! > avx: true, neon: false, simd128: false, f16c: true
+//! > temp: 0.00 repeat-penalty: 1.10 repeat-last-n: 64
+//! > The smallest prime is ϕ(2) = 2.
+//! > The smallest composite is ϕ(3) = 3.
+//! > The smallest perfect number is ϕ(5) = 5.
+//! > The smallest perfect square is ϕ(4) = 4.
+//! > The smallest perfect cube is ϕ(6) = 6.
+//! ```
use super::with_tracing::{layer_norm, linear_no_bias as linear, LayerNorm, Linear};
use candle::{IndexOp, Result, Tensor};