megatron.model.llama_model.LlamaModel

class megatron.model.llama_model.LlamaModel(num_tokentypes: int = 0, parallel_output: bool = True, pre_process: bool = True, post_process: bool = True, model_type=None, version: int = 2)

Bases: GPTModel
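
The signature above gives no per-parameter descriptions. In Megatron-style models, `pre_process` and `post_process` conventionally mark whether the current pipeline-parallel stage is the first one (which holds the input embedding) or the last one (which holds the output head). Assuming `LlamaModel` follows the same convention as its `GPTModel` base, the keyword arguments for each stage could be derived roughly as sketched below. `stage_flags` is a hypothetical helper written for this illustration, not part of Megatron, and the sketch does not import or construct the model itself.

```python
from typing import Dict


def stage_flags(pipeline_rank: int, pipeline_world_size: int) -> Dict[str, bool]:
    """Hypothetical helper: derive pre_process/post_process for one pipeline stage.

    Assumes the usual Megatron convention: the first stage pre-processes
    (embeddings) and the last stage post-processes (output head).
    """
    if not 0 <= pipeline_rank < pipeline_world_size:
        raise ValueError("pipeline_rank must be in [0, pipeline_world_size)")
    return {
        "pre_process": pipeline_rank == 0,
        "post_process": pipeline_rank == pipeline_world_size - 1,
    }


# Keyword arguments one might pass on each stage of a 4-stage pipeline.
# The remaining values are the defaults from the documented signature.
for rank in range(4):
    kwargs = dict(
        num_tokentypes=0,
        parallel_output=True,
        version=2,
        **stage_flags(rank, 4),
    )
    print(rank, kwargs)
    # With Megatron installed and initialized, this could be followed by:
    # model = LlamaModel(**kwargs)
```

Without pipeline parallelism (a single stage), both flags are `True`, which matches the defaults in the signature.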