This tutorial shows how to implement 1Cycle schedules for learning rate and momentum in PyTorch.
This tutorial will help you get started running DeepSpeed on Azure virtual machines. Looking forward, we will be integrating these techniques and additional ...
Train your first model with DeepSpeed!
First steps with DeepSpeed
This tutorial shows how to use to perform Learning Rate range tests in PyTorch.
If you haven’t already, we advise you to first read through the Getting Started guide before stepping through this tutorial.
This tutorial shows how to enable the DeepSpeed transformer kernel and set its different configuration parameters.