CustomCallOp unsupported target: TopK on Llama 3.1 with topk=2 #1025

steeve · 2024-11-04T17:01:18Z

Hi,

Our (@zml) Llama implementation fails to compile if we run with topk > 1.
We're not sure what triggers it, it looks like some pattern matching.

Please find attached our implementation with topk=2.

Thanks!!

error(pjrt): [PJRT_Client_Compile] RunNeuronCCImpl: error condition error != 0: <class 'subprocess.CalledProcessError'>: Command '['neuronx-cc', 'compile', '--framework=XLA', '--target=trn1', '--verbose=35', '--output=/tmp/tmpqle33ids/file.neff', '/tmp/tmpqle33ids/file.code', '--model-type=transformer', '--auto-cast=none']' returned non-zero exit status 70.

llama.mlir.txt

The text was updated successfully, but these errors were encountered:

nalwayaakshay · 2024-11-05T20:31:32Z

Hi,

For debugging it further, could you please share your source code and/or the HLO file generated which is causing compiler error?

steeve changed the title ~~CustomCallOp unsupported target: TopK~~ CustomCallOp unsupported target: TopK on Llama 3.1 with topk=2 Nov 4, 2024

aws-taylor added the bug Something isn't working label Nov 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CustomCallOp unsupported target: TopK on Llama 3.1 with topk=2 #1025

CustomCallOp unsupported target: TopK on Llama 3.1 with topk=2 #1025

steeve commented Nov 4, 2024

nalwayaakshay commented Nov 5, 2024

CustomCallOp unsupported target: TopK on Llama 3.1 with topk=2 #1025

CustomCallOp unsupported target: TopK on Llama 3.1 with topk=2 #1025

Comments

steeve commented Nov 4, 2024

nalwayaakshay commented Nov 5, 2024