Add Batch=32 / paged attention / and new RoPE module support to all Llama3 demo and tests #15327

Draft: wants to merge 28 commits into base: main

Commits on Nov 26, 2024

  1. 722b71a
  2. fa7cd65
  3. 3fd83f8
  4. 0328db6
  5. #13368: Add page attention and batch=32 support to test model. TODO investigate PCC reduction with batch>1
    mtairum committed Nov 26, 2024
    a7d3d72
  6. 08fce34
  7. #0: Add llama rope
    mtairum committed Nov 26, 2024
    2e38fb8
  8. 4c13276
  9. 564fb20
  10. 135df39
  11. ba73d3c
  12. 0165ddb
  13. #0: Add support for batch sizes that are not divisible by tile size, and batch sizes that don't lead to a rectangular core grid. TODO: confirm if tracing works.
    avoraTT authored and mtairum committed Nov 26, 2024
    b4f52a3
  14. #0: Fix assert
    mtairum committed Nov 26, 2024
    85e0155
  15. 57f6fc9
  16. #0: [REVERT] Added mayo input
    mtairum committed Nov 26, 2024
    7d7536d
  17. 5cca54d
  18. #0: Update debug max seqlen
    mtairum committed Nov 26, 2024
    a967c69
  19. Add padding to position ids to support rope with batch < 32 in trace mode. TODO: Debug inconsistent outputs of batch 1 vs batch 16/32
    avoraTT authored and mtairum committed Nov 26, 2024
    03c2e79
  20. c71c207
  21. 0ec756b
  22. 4ff627a
  23. 29770af
  24. a1ad346
  25. Update rope to do padding internally. Add comments explaining inconsistency in output across batch sizes.
    avoraTT committed Nov 26, 2024
    9616567
  26. 834fecd
  27. a7f633c
  28. Merge branch 'mtairum/paged_attn_llama3' of https://github.com/tenstorrent/tt-metal into mtairum/paged_attn_llama3

    > Conflicts:
    >	models/demos/llama3/demo/demo.py
    mtairum committed Nov 26, 2024
    b15f45d