:orphan:
##################################################
Level 20: Train models with billions of parameters
##################################################
Scale to billions of parameters with multiple distributed strategies.
----
.. raw:: html
.. Add callout items below this line
.. displayitem::
:header: Scale with distributed strategies
:description: Learn about different distributed strategies to reach bigger model parameter sizes.
:col_css: col-md-6
:button_link: ../accelerators/gpu_intermediate.html
:height: 150
:tag: intermediate
.. displayitem::
:header: Train models with billions of parameters
:description: Scale to billions of params on GPUs with FSDP or Deepspeed.
:col_css: col-md-6
:button_link: ../advanced/model_parallel.html
:height: 150
:tag: advanced
.. raw:: html