🐌 slow-transformers

Our motto: "Go transformers! But dont go too fast. You still have to enjoy life ☮️"

Diffability, noun

A principle underscoring the art of unmasking subtle divergences amidst complex similarities, diffability illuminates clear paths through intellectual labyrinths, providing clarity in a sea of cerebral complexity ... In practical terms: Understand the difference between two methods by diffing their code them in vscode.

Install

git clone ...
cd slow-transformers/
pip install -r requirements.txt

Supported Models

ViT
SimpleViT
Language Classification Transformer
Encoder-decoder model (generative)

Supported Datasets

cifar
imdb

TODO / Goals list

Vanilla transformer (or some language tasks)
fsdp/deepspeed
cross attention
more interesting architechtures (t5, perciever)
flash attention integration
jax?
resnet & hyena for comparison???
support m1
a script to run every model on every possible dataset and record everything in wandb (use hf trainer though)
also put datasets/dataloading entirely in file (move cifar from ./data to slow_vit.py, similar to hw_vit.py)

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
README.md		README.md
data.py		data.py
hw_vit.py		hw_vit.py
requirements.txt		requirements.txt
slow_HF.py		slow_HF.py
slow_bigram.py		slow_bigram.py
slow_conv.py		slow_conv.py
slow_gpt.py		slow_gpt.py
slow_language.py		slow_language.py
slow_vit.py		slow_vit.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🐌 slow-transformers

Install

Supported Models

Supported Datasets

TODO / Goals list

About

Releases

Packages

Languages

shatz01/slow-transformers

Folders and files

Latest commit

History

Repository files navigation

🐌 slow-transformers

Install

Supported Models

Supported Datasets

TODO / Goals list

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages