
Pow operation needs support for Tensor exponent variant #13857

Open · Tracked by #13521 ...
KalaivaniMCW opened this issue Oct 16, 2024 · 13 comments

@KalaivaniMCW (Contributor) commented Oct 16, 2024

At present, ttnn.pow supports a Tensor input with a scalar exponent.
For PyTorch tracing #13373, we need support for ttnn.pow with a Tensor input and a Tensor exponent:
https://github.com/tenstorrent/pytorch2.0_ttnn/blob/main/docs/operations/aten.pow.Tensor_Tensor.md
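
For reference, a minimal sketch of the semantics being requested (the torch.pow call below is what the traced graph produces; the ttnn.pow lines are illustrative only, contrasting the current scalar-exponent form with the desired tensor-exponent form, and tt_x / tt_y are placeholder names):

```python
import torch

# aten.pow.Tensor_Tensor: elementwise x ** y with both operands as tensors.
x = torch.tensor([1.0, 2.0, 3.0])
y = torch.tensor([2.0, 0.5, 3.0])
print(torch.pow(x, y))  # tensor([ 1.0000,  1.4142, 27.0000])

# ttnn today (scalar exponent only), illustrative:
#   out = ttnn.pow(tt_x, 2.0)
# What this issue asks for (tensor exponent):
#   out = ttnn.pow(tt_x, tt_y)
```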

Is this support possible?

cc: @eyonland

@eyonland (Contributor)

@rtawfik01, we need an LLK for power that supports having the exponents in a tile. I think we need #13857, where we get another destination register, perhaps?

@rtawfik01 (Contributor) commented Oct 21, 2024

fyi @ttmtrajkovic, this is similar to issue #13582, where an SFPU function needs to support working on 2 inputs.
@eyonland, please assign a priority for this; also, can you please make a master issue linking all of these issues together?
So far I see these 3 ops:

  1. binary pow operation: Pow operation needs support for Tensor exponent variant #13857
  2. Binary bitwise operation: Binary Bitwise ops #13582
  3. Binary shift operations: Binary Shift operators #10034

These all share the issue of needing the SFPU to work with binary inputs.

@KalaivaniMCW (Contributor, Author)

This work is critical for models being developed from PyTorch 2, hence the P0 status.

@ttmtrajkovic (Contributor)

@KalaivaniMCW, @eyonland,

Please list explicitly which ops this is blocking. I don't mind the P0 status, but we need to differentiate between levels of blocking - is this blocking an entire project or several ops?

@eyonland (Contributor)

We are blocked specifically on this op here: https://docs.tenstorrent.com/tt-metalium/latest/tt_metal/apis/kernel_apis/compute/power_tile.html
Notice that it takes the second argument as a scalar. We instead need another dst register argument: we need to raise the values of the first tensor to the power of the values in the second tensor.

@umadevimcw added the Op Generalization label (Generalization and relaxations of requirements in Ops) Oct 29, 2024
@KalaivaniMCW (Contributor, Author)

Hi @ttmtrajkovic,
We are blocked on the ttnn.pow op for PyTorch sweep tracing and op generality:
https://github.com/tenstorrent/pytorch2.0_ttnn/blob/main/docs/operations/aten.pow.Scalar.md
https://github.com/tenstorrent/pytorch2.0_ttnn/blob/main/docs/operations/aten.pow.Tensor_Tensor.md
At present the pass rate is 0% since we have no support for a Tensor exponent.

@jvasilje (Collaborator) commented Nov 4, 2024

hey @ttmtrajkovic, we are reviewing eltwise blockers for generality. It's looking like this will be the top blocker, and we likely want to escalate the need for this feature to get it unblocked. Just a heads up. We will update here again in a few hours.

@ttmtrajkovic assigned rdjogoTT and unassigned rdjogoTT and ttmtrajkovic Nov 6, 2024
@rdjogoTT (Contributor) commented Nov 7, 2024

I've started my assessment of this issue. We know that this depends on a new op class - Binary SFPU OPs - and that we will also need to work on a new algorithm for implementing eltwise pow with a tensor base and exponent. More updates to follow shortly.

@rdjogoTT (Contributor)

With the latest 2 commits on my branch rd/binary_sfpu_pow I've added the supporting LLK code for general binary SFPU OPs and implemented the LLKs for a few OPs: Add, Sub, and Mul. I've also included the preliminary work for Pow (PCC ~= 0.998 for non-negative operands; the negative case is not handled yet). In the second commit, I include the changes to tt-metal that I made to be able to test my code, including a modified Python test, program factory file, and compute kernel. I basically hijacked the regular eltwise binary OPs we have to run them on the SFPU instead.

The LLK APIs for the basic binary OPs (Add, Sub, Mul) are ready in tt_metal/include/compute_kernel_api/eltwise_binary_sfpu.h, but more work is needed before Pow is ready. To get this completed faster, I think someone from the tt-metal side should help implement the necessary changes to support this new OP class while I focus on the Pow algorithm; they can test functionality using the basic OPs like Add, or even with the current Pow. @eyonland @jvasilje, do either of you know who I can ask to take over the higher-level changes required for Binary SFPU OP support?

@eyonland (Contributor)

@KalaivaniMCW, @umadevimcw & @VirdhatchaniKN, let's plan this out.

@KalaivaniMCW (Contributor, Author) commented Nov 22, 2024

Hi @rdjogoTT, @eyonland,
The current eltwise binary does not use the SFPU (I guess it uses the FPU?), but it does have additional features like pre- and post-activation scaling for input/output.

Should we modify the current implementation in ttnn/operations/eltwise/binary, or create a new implementation that uses the new binary SFPU OPs - something like ttnn/operations/eltwise/binary_sfpu - and test it out and see how it goes with the models?

Do we modify ttnn/cpp/ttnn/operations/eltwise/binary/device/kernels/compute/eltwise_binary_kernel.cpp as done in rd/binary_sfpu_pow (and include the additional implementations), or write a new kernel for binary SFPU OPs?

@rdjogoTT (Contributor)

Yes, the current eltwise binary OPs all run on the FPU. I don't know the answer to your questions; I think those decisions need to be made from the tt-metal side depending on the requirements for this new OP class.

One additional question that may be relevant to deciding how to implement this: when a user calls one of the eltwise binary OPs like ttnn.add with the FP32 dataformat, how will we decide between the binary SFPU implementation, which can support full FP32 accuracy but is slower, and the FPU implementation, which loses precision due to converting to TF32?
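
As a rough illustration of the size of that precision gap, here is a sketch that truncates FP32 inputs to TF32's 10 mantissa bits before computing pow (the truncation helper and the value ranges are assumptions made for the sketch, not how the hardware path is actually implemented):

```python
import torch

def truncate_to_tf32(t: torch.Tensor) -> torch.Tensor:
    # Zero out the 13 low mantissa bits of FP32 (TF32 keeps 10 of FP32's 23).
    bits = t.view(torch.int32)
    return (bits & ~0x1FFF).view(torch.float32)

x = torch.rand(1024) * 10 + 0.1   # bases in [0.1, 10.1)
y = torch.rand(1024) * 4 - 2      # exponents in [-2, 2)

exact = torch.pow(x, y)
tf32_like = torch.pow(truncate_to_tf32(x), truncate_to_tf32(y))
rel_err = ((exact - tf32_like).abs() / exact.abs()).max()
print(f"max relative error with TF32-truncated inputs: {rel_err:.2e}")  # roughly 1e-3
```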

@rdjogoTT (Contributor) commented Nov 26, 2024

Pow is moving along well; I get a PCC of >0.99998 for x in [0, 10] and y in [-5, 5] for x^y. I am now looking into the special cases that occur when x < 0, which sometimes cause NaN for torch.pow(). @KalaivaniMCW @eyonland, do either of you know which spec I should follow for these special cases of pow, i.e. when dealing with a negative base, or a NaN or Inf base/exponent? Also, do you have an ETA for the tt-metal side changes?
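
For reference, torch.pow follows IEEE-style pow semantics in these corner cases, which is presumably the spec to match (a small sketch of the reference behaviour):

```python
import torch

# Negative base with an integer exponent is well-defined:
print(torch.pow(torch.tensor(-2.0), torch.tensor(2.0)))   # tensor(4.)
# Negative base with a non-integer exponent is NaN:
print(torch.pow(torch.tensor(-2.0), torch.tensor(0.5)))   # tensor(nan)
# Zero base with a negative exponent gives Inf:
print(torch.pow(torch.tensor(0.0), torch.tensor(-1.0)))   # tensor(inf)
# Inf base with a negative exponent underflows to zero:
print(torch.pow(torch.tensor(float('inf')), torch.tensor(-1.0)))  # tensor(0.)
# NaN propagates through a non-zero exponent:
print(torch.pow(torch.tensor(float('nan')), torch.tensor(2.0)))   # tensor(nan)
```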

Also, @KalaivaniMCW, can I ask that as part of your change you also add the changes needed for the SFPU binary Pow OP? I think it would be called as ttnn.pow(x, y) but with a tensor for y.
