-
Notifications
You must be signed in to change notification settings - Fork 213
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
test csv schedule on different runtime
CLA Signed
This label is managed by the Meta Open Source bot.
Support 3rd-party distributed backend
CLA Signed
This label is managed by the Meta Open Source bot.
#706
opened Nov 27, 2024 by
qiongerfei
Loading…
[do not review]Enable optimizer in backward in TorchTitan
CLA Signed
This label is managed by the Meta Open Source bot.
[WIP] Allow benchmark between multiple configs
CLA Signed
This label is managed by the Meta Open Source bot.
#703
opened Nov 26, 2024 by
H-Huang
Loading…
W&B wandb support
CLA Signed
This label is managed by the Meta Open Source bot.
#699
opened Nov 25, 2024 by
msaroufim
Loading…
Configure RNGs appropriately for Pipeline + SPMD
CLA Signed
This label is managed by the Meta Open Source bot.
#689
opened Nov 22, 2024 by
wconstab
Loading…
necessary changes to unblock Sequence Parallel on odd length sequences
CLA Signed
This label is managed by the Meta Open Source bot.
#686
opened Nov 20, 2024 by
tianyu-l
Loading…
[cp] apply fsdp to model when CP is enabled without DP for correct loss and lower mem usage
CLA Signed
This label is managed by the Meta Open Source bot.
#685
opened Nov 20, 2024 by
XilunWu
Loading…
[cp] add option to choose kv shards rotation method
CLA Signed
This label is managed by the Meta Open Source bot.
#684
opened Nov 20, 2024 by
XilunWu
Loading…
[WIP] Adding OBELICS DataLoader
CLA Signed
This label is managed by the Meta Open Source bot.
#663
opened Oct 30, 2024 by
TJ-Solergibert
Loading…
[not for land] torch.compile individual linears
CLA Signed
This label is managed by the Meta Open Source bot.
#661
opened Oct 29, 2024 by
vkuzo
Loading…
empty_cache
before barrier
CLA Signed
#660
opened Oct 29, 2024 by
carmocca
Loading…
Init weights only if not loading a checkpoint
CLA Signed
This label is managed by the Meta Open Source bot.
[DO NOT REVIEW] gaps to enable FDSP2 cpu offloading
CLA Signed
This label is managed by the Meta Open Source bot.
#622
opened Oct 16, 2024 by
weifengpy
Loading…
[Not for land] Settings to make Llama3-8B on 8 GPUs faster
CLA Signed
This label is managed by the Meta Open Source bot.
[not for land] TE experiments, take 2
CLA Signed
This label is managed by the Meta Open Source bot.
#614
opened Oct 14, 2024 by
vkuzo
Loading…
[DO NOT REVIEW] --experimental.fsdp_sharding_on_largest_dim
CLA Signed
This label is managed by the Meta Open Source bot.
#607
opened Oct 9, 2024 by
weifengpy
Loading…
fix mixed precision for This label is managed by the Meta Open Source bot.
replicate
/ pure DDP
CLA Signed
#591
opened Sep 29, 2024 by
152334H
Loading…
[not for land yet] hack max and abs out of ops eligible for AC
CLA Signed
This label is managed by the Meta Open Source bot.
#580
opened Sep 17, 2024 by
vkuzo
Loading…
add pp validation for schedule
CLA Signed
This label is managed by the Meta Open Source bot.
#568
opened Sep 5, 2024 by
H-Huang
Loading…
[DO NOT REVIEW] Runtime estimation with FakeTensor + TorchDispatchMode
CLA Signed
This label is managed by the Meta Open Source bot.
#536
opened Aug 20, 2024 by
weifengpy
Loading…
[Not for land] Added changes for GPT-2 perf
CLA Signed
This label is managed by the Meta Open Source bot.
Previous Next
ProTip!
Follow long discussions with comments:>50.