ea89133c65 | ||
---|---|---|
.. | ||
README.md | ||
train.py |
README.md
Transformers
This example contains a simple training loop for next-word prediction with a Transformer model on a subset of the WikiText2 dataset. It is a simplified version of the official PyTorch example.
Train with Fabric
# CPU
fabric run --accelerator=cpu train.py
# GPU (CUDA or M1 Mac)
fabric run --accelerator=gpu train.py
# Multiple GPUs
fabric run --accelerator=gpu --devices=4 train.py