Need support for ttnn.max_pool2d to accept block and width sharded input. #12810

punithsekar · 2024-09-18T07:37:21Z

Describe the bug
ttnn.max_pool2d currently supports only height_sharded input tensors. We need support for block_sharded and width_sharded inputs as well.

To Reproduce
Steps to reproduce the behavior:

Check out the branch punith/maxpool_issue
Run the command pytest tests/ttnn/integration_tests/yolov4/test_ttnn_neck.py

Expected behavior
ttnn.max_pool2d should accept block_sharded and width_sharded input tensors.

Screenshots

E       RuntimeError: TT_FATAL @ ../ttnn/cpp/ttnn/operations/pool/maxpool/max_pool2d.cpp:58: shard_scheme == TensorMemoryLayout::HEIGHT_SHARDED
E       info:
E       Only height sharded tensors are supported.
E       backtrace:
E        --- /home/ubuntu/punith/tt-metal/ttnn/ttnn/_ttnn.so(+0x458ec9) [0x7f090cd77ec9]

Please complete the following environment information:

Device - WH-n150

Additional context
The input shape we pass to maxpool is [1, 10, 10, 512] (NHWC). Since the channel count is high, the op should run with width or block sharding to improve performance.

Current values when we use height sharding:

pool_1 = ttnn.max_pool2d(
    input_tensor=output_tensor,
    batch_size=1,
    input_h=10,
    input_w=10,
    channels=512,
    kernel_size=[5, 5],
    stride=[1, 1],
    padding=[2, 2],
    dilation=[1, 1],
    device=device,
)

Attributes:
{'memory_config_':'MemoryConfig(memory_layout=TensorMemoryLayout::HEIGHT_SHARDED;buffer_type=BufferType::L1;shard_spec=ShardSpec(grid={[(x=0;y=0) - (x=3;y=0)]};shape={25; 0};orientation=ShardOrientation::ROW_MAJOR;halo=0))'; 'output_dtype_': 'DataType::BFLOAT16'; 'sliding_window_config_': 'SlidingWindowConfig(batch_size=1; input_hw=(10;10); window_hw=(5;5); stride_hw=(1;1); pad_hw=(2;2); dilation_hw=(1;1); num_cores_nhw=4; core_range_set_={[(x=0;y=0) - (x=3;y=0)]})'}

Core_count: 4

Kernel duration: 1077197 ns
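
For reference, the shapes above can be checked with a little arithmetic. This is only a sketch in plain Python (no ttnn calls): `pool_out_dim` and `shard_shape` are hypothetical helpers, the output-size formula is the standard floor-mode pooling formula, and the way rows and channels are split per scheme is a simplified assumption, not necessarily how ttnn chooses shard shapes or core grids. The 2x4 grid used for the block-sharded case is made up for illustration.

```python
import math


def pool_out_dim(size, kernel, stride, pad, dilation=1):
    """Standard floor-mode pooling output size along one spatial dimension."""
    return (size + 2 * pad - dilation * (kernel - 1) - 1) // stride + 1


def shard_shape(scheme, rows, channels, grid_rows, grid_cols):
    """Simplified per-core shard shape (rows, channels) for a 2D [N*H*W, C] view.

    Assumption for illustration only: height sharding splits rows over all
    cores, width sharding splits channels over all cores, and block sharding
    splits rows over grid rows and channels over grid columns.
    """
    cores = grid_rows * grid_cols
    if scheme == "height":
        return (math.ceil(rows / cores), channels)
    if scheme == "width":
        return (rows, math.ceil(channels / cores))
    if scheme == "block":
        return (math.ceil(rows / grid_rows), math.ceil(channels / grid_cols))
    raise ValueError(f"unknown scheme: {scheme}")


# Values from the call above: input 1x10x10x512 (NHWC), 5x5 window,
# stride 1, padding 2, dilation 1.
batch, in_h, in_w, channels = 1, 10, 10, 512
out_h = pool_out_dim(in_h, kernel=5, stride=1, pad=2)
out_w = pool_out_dim(in_w, kernel=5, stride=1, pad=2)
nhw_rows = batch * out_h * out_w  # rows of the flattened [N*H*W, C] tensor

print("output HxW:", (out_h, out_w))
print("height, 1x4 grid:", shard_shape("height", nhw_rows, channels, 1, 4))
print("width,  1x4 grid:", shard_shape("width", nhw_rows, channels, 1, 4))
print("block,  2x4 grid:", shard_shape("block", nhw_rows, channels, 2, 4))
```

With 4 cores, height sharding gives 25 rows per core, which is consistent with the 25 in the reported shard shape. Each core still holds all 512 channels for its rows, which is why splitting along the channel dimension looks attractive for this shape.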


punithsekar · 2024-09-18T07:39:25Z

fyi @saichandax

dvartaniansTT · 2024-10-22T00:43:54Z

@mywoodstock is there a plan to support this towards yolov4 optimization efforts? cc: @mbahnasTT

mywoodstock · 2024-10-22T00:44:56Z

> @mywoodstock is there a plan to support this towards yolov4 optimization efforts? cc: @mbahnasTT

Yes, the PR is nearly ready to be merged.

mywoodstock · 2024-10-22T18:55:52Z

This is now in main.

dvartaniansTT · 2024-10-23T21:22:44Z

Thanks for the update, @mywoodstock! Great news! We will test this on yolov4 and, once confirmed, we can close this issue.
@punithsekar, please test this ASAP and let's see how it improves perf for yolov4.

punithsekar · 2024-10-24T07:08:10Z

@mywoodstock @dvartaniansTT, I am able to pass block-sharded input to maxpool, and it executes without any issue. However, the PCC of the maxpool output is very low (~0.055). I have created a separate issue, #14206, for it.
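
For context on the number above, PCC is assumed here to mean the Pearson correlation coefficient between the flattened device output and a reference (golden) output. A minimal sketch of that check in plain Python follows. The `pcc` helper is hypothetical and is not the comparison utility used by the test suite, and the sample values are made up.

```python
import math


def pcc(expected, actual):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(expected) != len(actual):
        raise ValueError("inputs must have the same length")
    n = len(expected)
    mean_e = sum(expected) / n
    mean_a = sum(actual) / n
    cov = sum((e - mean_e) * (a - mean_a) for e, a in zip(expected, actual))
    var_e = sum((e - mean_e) ** 2 for e in expected)
    var_a = sum((a - mean_a) ** 2 for a in actual)
    if var_e == 0 or var_a == 0:
        # Correlation is undefined for constant data; fall back to exact match.
        return 1.0 if list(expected) == list(actual) else 0.0
    return cov / math.sqrt(var_e * var_a)


# Made-up flattened outputs: a golden reference, a close match, and a
# badly mismatched result.
golden = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0]
close = [0.51, 0.99, 1.52, 1.98, 2.49, 3.02]
wrong = [3.0, 2.5, 2.0, 1.5, 1.0, 0.5]

print("close match PCC:", pcc(golden, close))
print("mismatch PCC:", pcc(golden, wrong))
```

A correct op gives a value very close to 1.0, so a value near 0.055 means the output is essentially uncorrelated with the reference.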

dvartaniansTT · 2024-10-30T00:50:49Z

@mywoodstock is this on your radar?
@punithsekar does this mean we are running at almost 0 PCC end to end now?

mywoodstock · 2024-10-30T01:00:27Z

@dvartaniansTT Yes, it's being worked on: #14249

punithsekar · 2024-10-30T04:01:55Z

@dvartaniansTT, yes, we are getting almost 0 PCC. The bug is tracked in issue #14249, as Abhinav mentioned.

punithsekar added the bug, op_cat: maxpool2D, yolov4, and mcw_cst labels Sep 18, 2024

punithsekar assigned dvartaniansTT Sep 18, 2024

punithsekar changed the title ~~ttnn.max_pool2d only support height_sharded input tensor~~ Need support for ttnn.max_pool2d to accept block and width sharded input. Sep 18, 2024

dvartaniansTT added the Customer_Bug and P0 labels Sep 18, 2024

dvartaniansTT assigned mywoodstock and unassigned dvartaniansTT Sep 19, 2024

mywoodstock added the feature-request, Customer_Feature, and CNN_feature labels and removed the bug label Sep 19, 2024

punithsekar mentioned this issue in "Yolov4 bring up" #13053 (Open) Sep 24, 2024

mywoodstock closed this as completed Oct 22, 2024

mywoodstock assigned wransom-TT Oct 23, 2024
