qwen2_5_72b_base
- torchtune.models.qwen2_5.qwen2_5_72b_base() → TransformerDecoder
Builder for creating a Qwen2.5 base model initialized with the default 72B parameter values from https://huggingface.co/Qwen/Qwen2.5-72B
- Returns:
Instantiation of Qwen2.5 72B model
- Return type:
TransformerDecoder
Note
The base and instruct versions have slightly different architectures for all Qwen2.5 model sizes except 0.5B and 3B. Make sure to select the correct model builder for the weights.
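Because the base and instruct weights need different builders, it can help to make that choice explicit in code. The following is a minimal sketch, not part of torchtune: the helper names `builder_name_72b` and `get_builder_72b` are hypothetical, and the instruct builder name `qwen2_5_72b_instruct` is an assumption based on the naming of the builder documented here.

```python
import importlib

# Builder names keyed by checkpoint variant.
# "qwen2_5_72b_base" is the builder documented on this page;
# "qwen2_5_72b_instruct" is ASSUMED to follow the same naming pattern.
_BUILDERS_72B = {
    "base": "qwen2_5_72b_base",
    "instruct": "qwen2_5_72b_instruct",
}


def builder_name_72b(variant: str) -> str:
    """Return the builder name for a Qwen2.5 72B checkpoint variant."""
    try:
        return _BUILDERS_72B[variant]
    except KeyError:
        expected = sorted(_BUILDERS_72B)
        raise ValueError(
            f"unknown variant {variant!r}; expected one of {expected}"
        ) from None


def get_builder_72b(variant: str):
    """Look up the builder callable in torchtune.models.qwen2_5.

    torchtune is imported lazily, so only this function requires it
    to be installed.
    """
    module = importlib.import_module("torchtune.models.qwen2_5")
    return getattr(module, builder_name_72b(variant))
```

With torchtune installed, `get_builder_72b("base")()` would call `qwen2_5_72b_base()` and return a `TransformerDecoder`. Note that this instantiates the full 72B-parameter module, so it needs a correspondingly large amount of memory.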