[perf] feat: support ref/rm offload (#121)
- Force ref/rm to use CPUOffload. Fix root FSDP unit not reshard weights after forward - HSDP support is on hold and assert False right now.
Showing
This diff is collapsed.
Click to expand it.
Please
register
or
sign in
to comment