v0.51.0-rc1
Pre-release
Pre-release
github-actions
released this
11 Jul 02:01
·
2851 commits
to main
since this release
📦 Uncategorized
- Migrate Pad Device and All references
- PR: #9891
- #0: Multi-CQ support for R-Chip
- PR: #10002
- #10028: Remove skip and reduce test case for
moreh_groupnorm
test- PR: #10029
- #10005: Change input tensor parameter to optional in moreh_sum_backward
- PR: #10007
- #10004: Revise bias tensor usage in moreh_linear_backward
- PR: #10006
- #9663: support moreh_nll_loss_unreduced
- PR: #9804
- #8865: Switch ported ops from tt_lib to ttnn for host dispatch time m…
- PR: #10009
- #0: Update README.md grammar for idiomatic description of TT-NN
- PR: #9827
- #9767: removed more no longer needed manually specified attributes for reflection
- PR: #10023
- Add distributed layernorm kernel documentation
- PR: #9982
- #10031: Fix -Werror=return-type error in composite_ops
- PR: #10036
- #9492: update matmul path in CODEOWNERS
- PR: #10022
- #9450: change silicon fixtures to session scope
- PR: #10019
- Uplift UMD to grab support for configuring static TLBs and Hugepage for BH
- PR: #9934
- #9441: add all typecasts to unit test
- PR: #10046
- #9801: Add cb alignment fix for blackhole that was missed in rebase
- PR: #10051
- #9973: Fix addrmod for reduce scalar, port over missing narrow tile c…
- PR: #10047
- #10052: Add metal pack untilize test
- PR: #10057
- Add ttnn matmul tests to TG unit tests
- PR: #9477
- Add
ssm_prefix_scan
test coverage for N=16- PR: #10061
- Add PyBind to TTNN Slice (Formerly Referred to Unpad in TT Lib)
- PR: #10056
- #8450: Cleanup items pending from PR #9068
- PR: #10053
- #10030: fix moreh_nll_loss hang
- PR: #10040
- #7736: Remove unused reduce dim & type from reduce_init*
- PR: #10060
- #9871: Update backward files
- PR: #10037
- #9874: Move Unary Backward ops to TTNN
- PR: #9949
- Update op_perf_results
- PR: #10042
- #9962: Enable flags for profiler globals in jit build
- PR: #9964
- Added prefill mode for mamba modules
- PR: #10063
- Increase timeout for Mamba full model tests
- PR: #10064
- Support multiple user indices in paged_update_cache
- PR: #10050
- #10085: Make ttnn::Buffer deallocate execute without querying a potentially destroyed buffer instance
- PR: #10095
- Pack runtime arguments across brisc/ncrisc/trisc
- PR: #9781
- Llama Demo Refactor
- PR: #10018
- #5424: Delegated sfpu reciprocal calls to wh_b0 submodule functions
- PR: #10103
- #0: Move t3k demo tests to perf pipeline because it requires perf governor
- PR: #10106
- #5424: Delegated sfpu reciprocal calls to gs submodule functions
- PR: #10105
- Add trace and multi cq implementations/tests for WH Resnet
- PR: #10021
- #0: (MINOR) Update to v0.51.0
- PR: #10114