Skip to content

Commit

Permalink
Update README.md with Llama3.1 numbers (#10630)
Browse files Browse the repository at this point in the history
  • Loading branch information
cglagovichTT authored Jul 24, 2024
1 parent 116e3c0 commit dee91cf
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -70,6 +70,7 @@
| [Falcon7B](./models/demos/t3000/falcon7b) | Data Parallel | 129th | 256 | 7.4 t/s/u - 1901 t/s | 15.5 t/s/u - 3968 t/s | 26 t/s/u |
| [LLaMA-2-70B](./models/demos/t3000/llama2_70b) | Tensor Parallel | 129th | 32 | 10.4 t/s/u - 333 t/s | 16.6 t/s/u - 532 t/s | 20 t/s/u |
| [LLaMA-3-70B](./models/demos/t3000/llama3_70b) | Tensor Parallel | 129th | 32 | 10.4 t/s/u - 333 t/s | 15.8 t/s/u - 506 t/s | 20 t/s/u |
| [LLaMA-3.1-70B](./models/demos/t3000/llama3_70b) | Tensor Parallel | 129th | 32 | 10.4 t/s/u - 333 t/s | 15.8 t/s/u - 506 t/s | 20 t/s/u |
| [Falcon40B](./models/demos/t3000/falcon40b) | Tensor Parallel | 129th | 32 | work-in-progress | 10.0 t/s/u - 320 t/s | 36 t/s/u |
| [Mixtral7Bx8](./models/demos/t3000/mixtral8x7b) | Tensor Parallel | 129th | 32 | 15.1 t/s/u - 483 t/s | 27.1 t/s/u - 868 t/s | 33 t/s/u |
| ResNet50 | Data Parallel | coming soon | | | | |
Expand Down

0 comments on commit dee91cf

Please sign in to comment.