megatron.optimizer.clip_grads
Description
Gradient clipping.
Functions
Clips gradient norm of an iterable of parameters whose gradients …
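As a rough illustration of what clipping a gradient norm involves, the following is a minimal pure-Python sketch, not this module's implementation. The name `clip_grad_norm`, its parameters (`grads`, `max_norm`, `norm_type`, `eps`), and the use of plain lists in place of tensors are all assumptions made for the example. It computes one global norm over every gradient and, if that norm exceeds `max_norm`, scales all gradients in place by the same factor.

```python
import math


def clip_grad_norm(grads, max_norm, norm_type=2.0, eps=1e-6):
    """Hypothetical helper: clip a list of gradient vectors by their global norm.

    grads     -- list of lists of floats, one inner list per parameter
                 (stand-ins for gradient tensors); modified in place
    max_norm  -- maximum allowed global norm
    norm_type -- p of the p-norm; math.inf selects the max-abs norm
    eps       -- small constant that avoids division by zero

    Returns the global norm measured before clipping.
    """
    if math.isinf(norm_type):
        # Infinity norm: largest absolute gradient entry.
        total_norm = max((abs(g) for grad in grads for g in grad), default=0.0)
    else:
        # p-norm over all entries of all gradients taken together.
        total = sum(abs(g) ** norm_type for grad in grads for g in grad)
        total_norm = total ** (1.0 / norm_type)

    # Scale only when the global norm exceeds the threshold.
    clip_coef = max_norm / (total_norm + eps)
    if clip_coef < 1.0:
        for grad in grads:
            for i in range(len(grad)):
                grad[i] *= clip_coef

    return total_norm


# Usage: two "parameters" whose gradients have a global L2 norm of 5.
example_grads = [[3.0], [4.0]]
norm_before = clip_grad_norm(example_grads, max_norm=1.0)
```

Because every gradient is multiplied by the same coefficient, the direction of the overall update is preserved and only its magnitude is reduced.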