Llama3.2-Vision: Add reference submodule and tests #14051
Conversation
@tt-rkim could you check out …
Looks good. Where are we running the …
I'd like to add the multimodal tests to CI after I experiment locally to see if I can tighten up the PCC bounds. I'll make a separate PR to put the tests in CI.
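For context, the PCC check compares the TT-NN output against the reference model output and fails the test when the Pearson correlation coefficient drops below a threshold; tightening the bounds means raising that threshold. A minimal, self-contained sketch of the idea (the helper names below are hypothetical, not the repo's actual comparison utilities):

```python
import torch

def pcc(expected: torch.Tensor, actual: torch.Tensor) -> float:
    """Pearson correlation coefficient between two tensors, flattened to 1D."""
    x = expected.flatten().double()
    y = actual.flatten().double()
    x, y = x - x.mean(), y - y.mean()
    return float((x * y).sum() / (x.norm() * y.norm()))

def assert_pcc(expected: torch.Tensor, actual: torch.Tensor, threshold: float = 0.99) -> None:
    """Fail the test when the outputs correlate below `threshold`.

    Tightening the PCC bound means raising this value, e.g. 0.99 -> 0.999.
    """
    value = pcc(expected, actual)
    assert value >= threshold, f"PCC {value:.5f} is below required {threshold}"
```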
Sounds good. So these reference files are not used anywhere in CI currently? Or are they used for other parts of 3.2?
@cglagovichTT Shouldn't this PR also include the tests? This is the PR that fully includes multimodal Llama, so it's good practice to have it all good to go (tests included). But I'm ok with either approach and will help with the testing.
Due to popular demand, I will add the tests to this PR :)
THANK YOU SIR
@@ -208,7 +208,7 @@ def __init__(self, mesh_device, instruct=False, dummy_weights=False, max_batch_s
         self.compute_kernel_config_sdpa = ttnn.WormholeComputeKernelConfig(
             math_fidelity=ttnn.MathFidelity.HiFi4,
             math_approx_mode=False,
-            fp32_dest_acc_en=False,
+            fp32_dest_acc_en=True,
@yieldthought is it going to be an issue in the non-vision attention modules to have fp32 acc here?
love it!
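For reference, the change under discussion only flips the fp32 destination-accumulation flag on the SDPA compute kernel config. A sketch of the resulting config, using just the fields visible in the diff above (assuming the ttnn constructor shown there):

```python
import ttnn

# SDPA compute kernel config after this change: intermediate results are
# accumulated in fp32 for better numerical accuracy, at some cost in
# performance/register pressure.
compute_kernel_config_sdpa = ttnn.WormholeComputeKernelConfig(
    math_fidelity=ttnn.MathFidelity.HiFi4,
    math_approx_mode=False,
    fp32_dest_acc_en=True,  # was False before this PR
)
```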
…enstorrent#14051)

* tenstorrent#13368: Move repeat interleave to xattn cache generation.
* #0: Clean up demo, enable arbitrary padding for multimodal text sequence
* tenstorrent#13368: Add llama_models Meta reference for Llama3.2 as a submodule
* tenstorrent#13368: Change reference imports to use new submodule
* tenstorrent#13368: Clean up comments after pushing repeat_interleave into xattn_cache generation.
* tenstorrent#13368: Clean up vision tests. Unify assertions and pcc checks. Fix LM head splitting on T3k.
* tenstorrent#13368: Fix LM head splits calculation
* tenstorrent#13368: For all vision tests, get model-specific parameters from model_args rather than fixtures. This generalizes tests for base and finetuned 11B models.
* tenstorrent#13368: Add vision tests to unit, frequent, and demo
* tenstorrent#13368: Fixup mesh_device when not passed FAKE_DEVICE
* tenstorrent#13368: Remove llama_models as submodule. Move its install to llama3 requirements.txt.

---------

Co-authored-by: mtairum <mtairum@tenstorrent.com>
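One of the commits above moves the repeat-interleave of the vision keys/values into cross-attention (xattn) cache generation, so the expansion happens once when the cache is built rather than on every decode step. A hedged torch-level sketch of that idea (function and tensor names are illustrative, not the model's actual code):

```python
import torch

def build_xattn_cache(vision_k: torch.Tensor, vision_v: torch.Tensor, n_rep: int):
    """Expand grouped KV heads up-front while generating the cross-attention cache.

    vision_k / vision_v: [batch, n_kv_heads, vision_seq_len, head_dim]
    n_rep: query heads per KV head (n_heads // n_kv_heads)
    """
    # Repeating here means the per-token decode path can read the cache
    # directly instead of repeat-interleaving the KV heads on every step.
    k_cache = torch.repeat_interleave(vision_k, n_rep, dim=1)
    v_cache = torch.repeat_interleave(vision_v, n_rep, dim=1)
    return k_cache, v_cache
```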
Ticket
#13368
Problem description
The multimodal reference code was not tracked in git. We now have a fork of it, https://github.com/tenstorrent/llama-models/tree/main, which should be used as the reference code in tests and demos.
What's changed
models/demos/llama3
Checklist
llama_models as a submodule: https://github.com/tenstorrent/tt-metal/actions/runs/11574438253