.. toctree::
:maxdepth: 1
:hidden:
<../advanced/gradient_accumulation>
<../advanced/distributed_communication>
<../advanced/multiple_setup>
<../advanced/model_parallel/fsdp>
###############
Advanced skills
###############
.. raw:: html
.. displayitem::
:header: Use efficient gradient accumulation
:description: Learn how to perform efficient gradient accumulation in distributed settings
:button_link: ../advanced/gradient_accumulation.html
:col_css: col-md-4
:height: 170
:tag: advanced
.. displayitem::
:header: Communicate between processes
:description: Learn about the communication primitives for distributed operation: gather, reduce, broadcast, and more
:button_link: ../advanced/distributed_communication.html
:col_css: col-md-4
:height: 170
:tag: advanced
.. displayitem::
:header: Use multiple models and optimizers
:description: See how Fabric works with multiple models and optimizers
:button_link: ../advanced/multiple_setup.html
:col_css: col-md-4
:height: 170
:tag: advanced
.. displayitem::
:header: Train models with billions of parameters
:description: Train the largest models with FSDP across multiple GPUs and machines
:button_link: ../advanced/model_parallel/fsdp.html
:col_css: col-md-4
:height: 170
:tag: advanced
.. raw:: html