Add support for resharding width-sharded tensors to/from DRAM #15526

Open · esmalTT wants to merge 1 commit into main from esmal/reshard-width-sharding
Conversation

@esmalTT (Contributor) commented Nov 28, 2024

Problem description

Resharding a width-sharded tensor from L1 → DRAM is currently not supported.

For context, this type of reshard is required for UNet Shallow's trace+2CQ implementation because the output tensor cannot be kept persistent in L1 due to size limitations.

The inverse operation (DRAM to L1) will be required for the UNet input tensor preprocessing once ttnn.convert_to_hwc is implemented.
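
For illustration only, here is a minimal sketch (not code from this PR) of what such an L1 → DRAM reshard looks like from Python. The shapes, core ranges, and exact constructor signatures (`ttnn.ShardSpec`, `ttnn.MemoryConfig`) are assumptions and may differ across ttnn versions:

```python
import torch
import ttnn

device = ttnn.open_device(device_id=0)

torch_input = torch.randn(1, 1, 32, 1024, dtype=torch.bfloat16)

# Width-sharded across 8 L1 cores: each core holds a [32, 128] shard (illustrative shapes).
l1_grid = ttnn.CoreRangeSet({ttnn.CoreRange(ttnn.CoreCoord(0, 0), ttnn.CoreCoord(7, 0))})
l1_config = ttnn.MemoryConfig(
    ttnn.TensorMemoryLayout.WIDTH_SHARDED,
    ttnn.BufferType.L1,
    ttnn.ShardSpec(l1_grid, [32, 128], ttnn.ShardOrientation.ROW_MAJOR, False),
)

# Width-sharded across 2 DRAM banks: each bank holds a [32, 512] shard.
dram_grid = ttnn.CoreRangeSet({ttnn.CoreRange(ttnn.CoreCoord(0, 0), ttnn.CoreCoord(1, 0))})
dram_config = ttnn.MemoryConfig(
    ttnn.TensorMemoryLayout.WIDTH_SHARDED,
    ttnn.BufferType.DRAM,
    ttnn.ShardSpec(dram_grid, [32, 512], ttnn.ShardOrientation.ROW_MAJOR, False),
)

# Row-major tensor resident in L1, then resharded into DRAM (the case this PR enables).
x = ttnn.from_torch(torch_input, layout=ttnn.ROW_MAJOR_LAYOUT, device=device, memory_config=l1_config)
x_dram = ttnn.to_memory_config(x, dram_config)

ttnn.close_device(device)
```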

What's changed

  • Implemented a new width shard → width shard reshard kernel that follows a similar design to the special case for height-sharded tensors.
    • Resharding from L1→L1, L1→DRAM, and DRAM→L1 is supported
    • Only row major tensors are supported for now. Future work should add support for TILE layout.
    • If these conditions aren't met, we fall back to the generalized reshard implementation (even though it produces bad PCC for most of the width-sharded cases I tested).
  • Added unit tests covering the new kernel (a round-trip sketch is shown after this list)
    • Added test cases for L1-to-L1 in test_reshard.py::test_reshard
    • Added test cases for L1/DRAM in test_reshard.py::test_dram_reshard
    • Added previously missing test for DRAM resharding + program cache in test_reshard.py::test_dram_reshard_with_program_cache
  • Re-formatted touched files (the new code is here: link; the rest is formatting or test changes)
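
For reference, the sketch below shows the rough shape of an L1 → DRAM → L1 round-trip check in the spirit of `test_dram_reshard`; the helper name and structure are assumed for illustration, not copied from the actual test:

```python
# Assumed structure of a DRAM reshard round-trip check: shard into L1,
# reshard to DRAM and back, then compare against the original torch tensor.
def dram_reshard_roundtrip(device, torch_input, l1_config, dram_config):
    x = ttnn.from_torch(
        torch_input, layout=ttnn.ROW_MAJOR_LAYOUT, device=device, memory_config=l1_config
    )
    x = ttnn.to_memory_config(x, dram_config)  # L1 -> DRAM
    x = ttnn.to_memory_config(x, l1_config)    # DRAM -> L1
    output = ttnn.to_torch(x)
    # Resharding is pure data movement, so the round trip should be exact.
    assert torch.equal(output, torch_input)
```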

Checklist

  • Post commit CI passes (link)
  • New/Existing tests provide coverage for changes (added in tests/tt_eager/python_api_testing/unit_testing/misc/test_reshard.py)

@esmalTT esmalTT self-assigned this Nov 28, 2024
@esmalTT esmalTT changed the title Add support for L1-to-DRAM reshard with width-sharded tensors Add support for resharding width-sharded tensors to/from DRAM Nov 28, 2024
@esmalTT esmalTT force-pushed the esmal/reshard-width-sharding branch 2 times, most recently from 0021bcd to 2440b50 on November 28, 2024 14:23
This commit introduces a new width-shard to width-shard reshard kernel,
modeled after the height-sharded tensor reshard special case implemented
in `reshard_multi_core_same_width`.

Supported operations include:

- L1 to L1
- L1 to DRAM
- DRAM to L1

Currently, only row-major tensors are supported. For unsupported cases,
we fall back to the generalized reshard implementation.

Unit tests have been added to validate the new kernel functionality.
@esmalTT esmalTT marked this pull request as ready for review November 28, 2024 16:41