megatron.data.realm_dataset_utils#
Description
Classes
|
A struct for fully describing a fixed-size block of data as used in REALM |
|
Functions
|
Get samples mapping for a dataset over fixed size blocks. |
|
|
|
Specifically one epoch to be used in an indexing job. |
|
Join a list of strings, handling spaces appropriately |