megatron.model.fused_softmax
Description
Classes
| Fused operation: scaling + mask + softmax. |
| Fused operation that performs three operations in sequence. |
| Fused operation that performs two operations in sequence. |
| Fused operation that performs three operations in sequence. |
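The class summaries above describe scaling, masking, and softmax applied in sequence. As a rough illustration of what such a sequence computes, here is a minimal unfused sketch in plain Python. It is not the module's implementation: the function names, the `fill_value` of -10000.0, and the convention that `True` in the mask marks a position to suppress are all assumptions made for this example. Passing `mask=None` gives a two-step scale-then-softmax path, which is presumably what the two-operation variant refers to.

```python
import math


def scaled_masked_softmax(scores, mask=None, scale=1.0, fill_value=-10000.0):
    """Unfused reference sketch: scale, optionally mask, then softmax per row.

    scores: list of rows, each a list of floats.
    mask:   optional list of rows of booleans with the same shape;
            True marks a position to suppress (assumed convention).
    """
    out = []
    for i, row in enumerate(scores):
        # Step 1: scale every score.
        scaled = [x * scale for x in row]
        # Step 2: replace masked positions with a large negative value.
        if mask is not None:
            scaled = [fill_value if mask[i][j] else x
                      for j, x in enumerate(scaled)]
        # Step 3: softmax over the row, subtracting the max for stability.
        row_max = max(scaled)
        exps = [math.exp(x - row_max) for x in scaled]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def upper_triangular_mask(n):
    """Hypothetical helper: True where column index > row index (causal mask)."""
    return [[j > i for j in range(n)] for i in range(n)]


# Usage: three identical rows of scores with a causal mask.
scores = [[1.0, 2.0, 3.0] for _ in range(3)]
probs = scaled_masked_softmax(scores, mask=upper_triangular_mask(3), scale=0.5)
for row in probs:
    print([round(p, 4) for p in row])
```

A fused kernel would perform these three steps in a single pass over the data rather than materializing the intermediate scaled and masked values, which is the motivation for fusing them; the sketch only shows the arithmetic.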