Packet queue performance can be improved #15474

Open
nhuang-tt opened this issue Nov 26, 2024 · 2 comments

Comments

@nhuang-tt
Member

The base class of the routing kernels, packet_queue, has many branches and data member accesses at runtime that could instead be resolved at compile time. Running just test_tx_rx (the traffic generator to receiver bandwidth test), the baseline is 0.84 B/cycle with a wall-clock bandwidth of 22.86 MB/s. After changing the if statements to compile-time checks, this increased to 0.95 B/cycle and 26.45 MB/s.
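To illustrate the kind of change described above, here is a minimal sketch of moving a branch from run time to compile time. It is not the actual packet_queue code: `QueueKind`, `RuntimeQueue`, `StaticQueue`, and `advance` are hypothetical names chosen for the example, and it assumes a C++17 compiler for `if constexpr`.

```cpp
#include <cstdint>

// Hypothetical queue flavors; the real packet_queue has its own configuration.
enum class QueueKind : uint8_t { Input, Output };

// Runtime version: the kind is a data member, so every call has to load it
// and branch on its value.
struct RuntimeQueue {
    QueueKind kind;
    uint32_t words_sent = 0;
    uint32_t words_received = 0;

    void advance(uint32_t num_words) {
        if (kind == QueueKind::Input) {
            words_received += num_words;
        } else {
            words_sent += num_words;
        }
    }
};

// Compile-time version: the kind is a template parameter, so `if constexpr`
// discards the untaken branch when the template is instantiated.
template <QueueKind Kind>
struct StaticQueue {
    uint32_t words_sent = 0;
    uint32_t words_received = 0;

    void advance(uint32_t num_words) {
        if constexpr (Kind == QueueKind::Input) {
            words_received += num_words;
        } else {
            words_sent += num_words;
        }
    }
};
```

Both versions behave the same for a given kind. The difference is that `StaticQueue<QueueKind::Output>` contains no member load and no branch for the kind check, at the cost of one instantiation per configuration.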

@nhuang-tt nhuang-tt added the metal tt-metal issue label Nov 26, 2024
@nhuang-tt nhuang-tt self-assigned this Nov 26, 2024
@davorchap
Collaborator

@ubcheema sync with Nigel

@nhuang-tt
Member Author

Yes, I will make the updates once the dependency, the power-of-2 ring buffer size ticket, is done.
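The dependency ticket is not described in this thread. As background only, one common reason to require a power-of-2 ring buffer size is that index wrap-around can then use a bit mask instead of a modulo or a compare-and-branch. A minimal sketch, where `wrap_index` is a hypothetical helper and not a function from the codebase:

```cpp
#include <cstdint>

// Hypothetical helper: wraps a ring buffer index using a mask.
// This is only valid when Size is a power of 2, which is checked at compile time.
template <uint32_t Size>
constexpr uint32_t wrap_index(uint32_t index) {
    static_assert(Size != 0 && (Size & (Size - 1)) == 0,
                  "Size must be a power of 2");
    return index & (Size - 1);  // same result as index % Size for power-of-2 sizes
}
```

With the size fixed as a template parameter, the mask is a compile-time constant, which fits the same compile-time approach as the change proposed in this issue.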
