megatron.model.llama_model.LlamaModel
class megatron.model.llama_model.LlamaModel(num_tokentypes: int = 0, parallel_output: bool = True, pre_process: bool = True, post_process: bool = True, model_type=None, version: int = 2)
Bases: GPTModel
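The signature above does not say how `pre_process` and `post_process` are meant to be set. In Megatron-style codebases these flags conventionally mark whether a pipeline stage holds the input embedding (first stage) and the output layer (last stage). Assuming that convention applies here, which this page does not confirm, the following minimal sketch builds the keyword arguments for a given stage. The helper `stage_kwargs` is hypothetical and is not part of the library. It does not construct a `LlamaModel`, since doing so would require Megatron's global arguments and distributed state to be initialized first.

```python
def stage_kwargs(stage: int, num_stages: int) -> dict:
    """Hypothetical helper: keyword arguments for one pipeline stage.

    Assumes the usual Megatron convention (not confirmed by this page):
    pre_process is True only on the first stage, post_process only on the last.
    The other values simply repeat the defaults from the documented signature.
    """
    if num_stages < 1 or not 0 <= stage < num_stages:
        raise ValueError(f"stage {stage} is out of range for {num_stages} stages")
    return {
        "num_tokentypes": 0,
        "parallel_output": True,
        "pre_process": stage == 0,
        "post_process": stage == num_stages - 1,
        "version": 2,
    }


if __name__ == "__main__":
    # Show the flags for each stage of a 4-stage pipeline.
    for s in range(4):
        print(s, stage_kwargs(s, 4))
```

With an initialized Megatron environment, the resulting dictionary could in principle be passed as `LlamaModel(**stage_kwargs(stage, num_stages))`; `model_type` is left at its default of `None` because this page does not document its accepted values.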