User guide
- Getting started
- Instruction finetuning
- Frequently Asked Questions
- How to add special tokens?
- How to set TP and PP?
- How to launch training on multiple nodes?
- What are the basic hardware requirements?
- How to shard and merge models?
- What arguments are used to train LLaMa 2?
- How to convert a LLaMa or Falcon architecture from a non-official checkpoint?
- I’m getting a 17300 Bus error (core dumped) error!
- I’m getting an ImportError: cannot import name 'helpers' from 'megatron.data' error!
- How to tokenize a dataset?
- Weights conversion