qwen2_5_0_5b¶
- torchtune.models.qwen2_5.qwen2_5_0_5b() TransformerDecoder [source]¶
Builder for creating a Qwen2.5 model (base or instruct) initialized w/ the default 0.5B parameter values from https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct
- Returns:
Instantiation of Qwen2.5 0.5B model
- Return type:
Note
Qwen2.5 0.5B-3B model builders will enable
tie_word_embeddings
by default (seeqwen2()
)