[docs] add fsdp_tips.rst #455
Conversation
#### Misc
should this include the new wrap/auto_wrap feature?
Yes. Leave the desired markdown in a comment; otherwise I can take a pass tomorrow.
#### The `enable_wrap` context
There are two cases where the `enable_wrap` context can be useful:
* When you'd like to apply the same parameters to all child modules that you wrap with FSDP. Calling the `wrap` function within that context saves you from passing the same set of FSDP parameters explicitly.
* When wrapping large models that do NOT fit in CPU memory. That is, instead of first creating the full model and then traversing it to wrap different parts with FSDP, you wrap modules incrementally as you build up the model, allowing large modules to be sharded in place (see the sketch after the example below).
Example:

```python
with enable_wrap(**fsdp_params):
    # Wraps the layer in FSDP by default if within the context
    self.l1 = wrap(torch.nn.Linear(5, 5))
    # Wraps child modules by default based on min_num_params
    self.l2 = auto_wrap(TransformerBlock(), min_num_params=1e8)
```
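To make the second bullet concrete, here is a hedged sketch of how the snippet above might sit inside a full module definition. The class name `MyModel` and the use of `torch.nn.TransformerEncoderLayer` as a stand-in for `TransformerBlock` are illustrative assumptions, the import path follows the one quoted later in this thread, and newer fairscale versions may require the `wrapper_cls=FSDP` syntax discussed further down:

```python
# Illustrative sketch only; the class name and the TransformerEncoderLayer
# stand-in are assumptions, not the actual contents of fsdp_tips.rst.
import torch
from fairscale.nn.auto_wrap import enable_wrap, wrap, auto_wrap


class MyModel(torch.nn.Module):
    def __init__(self, fsdp_params):
        super().__init__()
        # Every wrap()/auto_wrap() call inside this context reuses fsdp_params,
        # so the FSDP arguments don't have to be repeated per layer.
        with enable_wrap(**fsdp_params):
            # The layer is sharded in place as soon as it is created, so the
            # full unsharded model never has to fit in CPU memory.
            self.l1 = wrap(torch.nn.Linear(5, 5))
            # auto_wrap() recursively wraps children larger than min_num_params;
            # smaller children are left unwrapped.
            self.l2 = auto_wrap(
                torch.nn.TransformerEncoderLayer(d_model=512, nhead=8),
                min_num_params=1e8,
            )


# Constructing MyModel requires torch.distributed to be initialized first,
# since FSDP needs a process group, e.g. in each worker process:
#   torch.distributed.init_process_group("nccl", rank=rank, world_size=world_size)
#   model = MyModel(dict(mixed_precision=True, flatten_parameters=True))
```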
One more thing to be added to the doc:
@myleott @sshleifer Does the above look good? If so, I can also add it to the docstring of FSDP. We generate the doc from the docstring.
These are not at all perfect; please feel free to push changes to this branch or comment. I also copied the contents of the docstring of
fsdp_tips.html.zip: unzip it and put the path in the Chrome URL bar to open it.
Just a small suggestion: would it be possible to also add a small tutorial?
Yes, this is a great idea 😄 Just to cross-reference and not lose track, Min also suggested the tutorial should include custom weight init inside a
This looks great, thanks @sshleifer! I made some comments below, but I think we can ship this and iterate. What do you think @min-xu-ai?
```python
from fairscale.nn.auto_wrap import enable_wrap, auto_wrap
from fairscale.
fsdp_params = dict(mixed_precision=True, flatten_parameters=True)
with enable_wrap(**fsdp_params):
```
The new syntax is `enable_wrap(wrapper_cls=FSDP, **fsdp_params)`.
Yeah, totally.
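For reference, a minimal sketch of the newer call signature under the same assumptions about import paths (and assuming `torch.distributed` has already been initialized so FSDP can build a process group):

```python
# Minimal sketch of the newer enable_wrap signature; import paths are assumed
# and may differ between fairscale releases.
import torch
from fairscale.nn.auto_wrap import enable_wrap, wrap
from fairscale.nn.data_parallel import FullyShardedDataParallel as FSDP

fsdp_params = dict(mixed_precision=True, flatten_parameters=True)
with enable_wrap(wrapper_cls=FSDP, **fsdp_params):
    # wrap() picks up wrapper_cls and fsdp_params from the surrounding context.
    layer = wrap(torch.nn.Linear(5, 5))
```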
This is a dumping ground to collect things we want to document about FSDP.
Please comment with anything random that comes to mind.
Once there have been no changes to FSDP for 48 hrs (or some other proxy for stability), I will finalize this and set it to Ready For Review.