megatron.data.bert_dataset.BertDataset#

class megatron.data.bert_dataset.BertDataset(name, indexed_dataset, data_prefix, num_epochs, max_num_samples, masked_lm_prob, max_seq_length, short_seq_prob, seed, binary_head)#

Bases: Dataset