megatron.optimizer.grad_scaler.DynamicGradScaler

class megatron.optimizer.grad_scaler.DynamicGradScaler(initial_scale, min_scale, growth_factor, backoff_factor, growth_interval, hysteresis)

Bases: MegatronGradScaler
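
The signature above does not say how the six parameters interact, so the following is a minimal, framework-free sketch of one common dynamic loss-scaling scheme that matches the parameter names. It is an illustration, not the Megatron implementation. The class name `SketchDynamicScaler`, the `update(found_inf)` method, and the private counters are hypothetical. The assumed behavior is that the scale is multiplied by `backoff_factor` (never dropping below `min_scale`) once `hysteresis` overflow steps have accumulated, and is multiplied by `growth_factor` after `growth_interval` consecutive overflow-free steps.

```python
class SketchDynamicScaler:
    """Hypothetical stand-in for illustration; not the Megatron class."""

    def __init__(self, initial_scale, min_scale, growth_factor,
                 backoff_factor, growth_interval, hysteresis):
        # Sanity checks on the assumed meaning of each parameter.
        assert 0.0 < min_scale <= initial_scale
        assert growth_factor > 1.0
        assert 0.0 < backoff_factor < 1.0
        assert growth_interval > 0 and hysteresis > 0

        self.scale = float(initial_scale)
        self.min_scale = float(min_scale)
        self.growth_factor = growth_factor
        self.backoff_factor = backoff_factor
        self.growth_interval = growth_interval
        self.hysteresis = hysteresis

        self._good_steps = 0                 # consecutive steps without inf/nan
        self._hysteresis_left = hysteresis   # overflows tolerated before backoff

    def update(self, found_inf):
        """Adjust the scale after one step, given whether inf/nan was seen."""
        if found_inf:
            # An overflow resets the growth counter and uses up one
            # unit of hysteresis.
            self._good_steps = 0
            self._hysteresis_left -= 1
            if self._hysteresis_left <= 0:
                # Tolerance exhausted: shrink, clamped at min_scale.
                self.scale = max(self.scale * self.backoff_factor,
                                 self.min_scale)
        else:
            self._good_steps += 1
            if self._good_steps == self.growth_interval:
                # Enough clean steps in a row: grow and restore tolerance.
                self._good_steps = 0
                self._hysteresis_left = self.hysteresis
                self.scale *= self.growth_factor


# Usage under the assumptions above.
scaler = SketchDynamicScaler(initial_scale=2.0 ** 16, min_scale=1.0,
                             growth_factor=2.0, backoff_factor=0.5,
                             growth_interval=2, hysteresis=2)
scaler.update(found_inf=True)    # first overflow is tolerated
scaler.update(found_inf=True)    # second overflow triggers a backoff
scaler.update(found_inf=False)
scaler.update(found_inf=False)   # two clean steps trigger growth
```

With `hysteresis` greater than 1, a single isolated overflow does not shrink the scale, which avoids over-reacting to rare spikes. Setting it to 1 would back off on every overflow.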