| Function | Description |
|---|---|
| backward_step(optimizer, input_tensor, ...) | Backward step through passed-in output tensor. |
| custom_backward(output, grad_output) | Directly call C++ autograd engine. |
| deallocate_output_tensor(out) | Pseudo-deallocate (i.e., set to scalar) the output tensor's '.data' field. |
| dummy_handler() | |
| forward_backward_no_pipelining(...[, ...]) | Run forward and backward passes with no pipeline parallelism (no inter-stage communication). |
| forward_backward_pipelining_with_interleaving(...) | Run interleaved 1F1B schedule (model split into model chunks), with communication between pipeline stages as needed. |
| forward_backward_pipelining_without_interleaving(...) | Run non-interleaved 1F1B schedule, with communication between pipeline stages. |
| forward_step(forward_step_func, ...[, ...]) | Forward step for passed-in model. |
| get_forward_backward_func() | |
| get_tensor_shapes(rank, model_type) | |
| recv_backward(tensor_shapes, timers) | |
| recv_forward(tensor_shapes, timers) | |
| send_backward(input_tensor_grads, ...) | |
| send_backward_recv_forward(...) | |
| send_forward(output_tensors, tensor_shapes, ...) | |
| send_forward_recv_backward(output_tensors, ...) | |
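
The non-interleaved 1F1B schedule named above can be easier to picture as an ordering of forward (F) and backward (B) steps per pipeline stage. The sketch below is illustrative only and is not part of this module: the helper name `one_f_one_b_order` is hypothetical, and the warm-up count (`num_stages - stage - 1`, capped at the number of microbatches) is an assumption that follows the common description of 1F1B. It models step ordering only, with no tensors and no inter-stage communication.

```python
from typing import List, Tuple


def one_f_one_b_order(
    stage: int, num_stages: int, num_microbatches: int
) -> List[Tuple[str, int]]:
    """Return the ordered (op, microbatch_index) steps for one pipeline stage.

    Hypothetical helper for illustration; "F" is a forward step, "B" a backward step.
    """
    # Assumption: earlier stages run more warm-up forwards, because their
    # first gradient arrives only after the later stages have finished.
    warmup = min(num_stages - stage - 1, num_microbatches)
    steady = num_microbatches - warmup

    steps: List[Tuple[str, int]] = []
    fwd = 0  # index of the next microbatch to run forward
    bwd = 0  # index of the next microbatch to run backward

    # Warm-up phase: forward passes only.
    for _ in range(warmup):
        steps.append(("F", fwd))
        fwd += 1

    # Steady state: one forward followed by one backward (1F1B).
    for _ in range(steady):
        steps.append(("F", fwd))
        fwd += 1
        steps.append(("B", bwd))
        bwd += 1

    # Cool-down phase: drain the backward passes still outstanding.
    for _ in range(warmup):
        steps.append(("B", bwd))
        bwd += 1

    return steps


if __name__ == "__main__":
    # Print the step order for each stage of a 4-stage pipeline, 6 microbatches.
    for stage in range(4):
        order = one_f_one_b_order(stage, num_stages=4, num_microbatches=6)
        print(stage, " ".join(f"{op}{i}" for op, i in order))
```

Under these assumptions the last stage alternates forward and backward from the first microbatch, while the first stage holds the most activations in flight; that is the memory trade-off the 1F1B ordering is meant to bound.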