megatron.schedules.forward_backward_pipelining_without_interleaving#
- megatron.schedules.forward_backward_pipelining_without_interleaving(forward_step_func, data_iterator, model, optimizer, timers, forward_only, collect_non_loss_data=False)#
Run non-interleaved 1F1B schedule, with communication between pipeline stages.
Returns dictionary with losses if the last stage, empty dict otherwise.