Skip to content

Commit

Permalink
[FEA] distributed autograd primitives (for tensors of variable size o…
Browse files Browse the repository at this point in the history
…n each rank) (#105)

* initial commit

* add tests and fix bugs

* format code

* proper pytest markups

* add a few docstrings

* add test remark

* update changelog

* remove usage of all_to_all_single, address feedback

* update docstrings to be more precise about bwd ops

* format code

* fix typo

---------

Co-authored-by: Maximilian Stadler <mstadler.nvidia.com>
  • Loading branch information
stadlmax authored Aug 22, 2023
1 parent 012abfc commit bdb10aa
Show file tree
Hide file tree
Showing 4 changed files with 1,111 additions and 2 deletions.
1 change: 1 addition & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0

- Added distributed utilities to create process groups and orthogonal process groups.
- Added distributed AFNO model implementation.
- Added distributed utilities for communication of buffers of varying size per rank.

### Changed

Expand Down
Loading

0 comments on commit bdb10aa

Please sign in to comment.