megatron.data.t5_dataset.T5Dataset#

class megatron.data.t5_dataset.T5Dataset(name, indexed_dataset, data_prefix, num_epochs, max_num_samples, masked_lm_prob, max_seq_length, max_seq_length_dec, short_seq_prob, seed)#

Bases: Dataset