summaryrefslogtreecommitdiff
path: root/candle-nn
diff options
context:
space:
mode:
authorVictor-Mihaila <115141117+Victor-Mihaila@users.noreply.github.com>2024-04-13 11:07:28 +0200
committerGitHub <noreply@github.com>2024-04-13 11:07:28 +0200
commitfb805b8ca2c9413ad9227800328145434a08eaca (patch)
tree56b64200bf04fc6dd0dbe1bf021ff81eff0bb828 /candle-nn
parent79e3bec789bd7bdb0e331d628c8fe9d06c519f33 (diff)
downloadcandle-fb805b8ca2c9413ad9227800328145434a08eaca.tar.gz
candle-fb805b8ca2c9413ad9227800328145434a08eaca.tar.bz2
candle-fb805b8ca2c9413ad9227800328145434a08eaca.zip
Avoid crashes when running T5 models with F16 tensors on CPU (#2047)
* This change avoids crashes when running T5 models with F16 tensors on CPU. * This enables running ProstT5's (https://huggingface.co/Rostlab/ProstT5) encoder-only mode in Candle. This ProstT5 mode stores it's embed_tokens weights within the encoder, as its decoding stage was replaced with a CNN. You could write more, like: This alone is not sufficient to run ProstT5 within Candle examples. We will develop a ProstT5 runner outside candle for now, but would be willing to upstream it to candle-examples at a later point. * Revert "This enables running ProstT5's (https://huggingface.co/Rostlab/ProstT5) encoder-only mode in Candle. This ProstT5 mode stores it's embed_tokens weights within the encoder, as its decoding stage was replaced with a CNN. You could write more, like: This alone is not sufficient to run ProstT5 within Candle examples. We will develop a ProstT5 runner outside candle for now, but would be willing to upstream it to candle-examples at a later point." This reverts commit d886d3ce5e3f1504934f4f6f7cf86108b7efd191.
Diffstat (limited to 'candle-nn')
0 files changed, 0 insertions, 0 deletions