mistral_reward_7b¶
- torchtune.models.mistral.mistral_reward_7b() TransformerDecoder [source]¶
Builder for creating a Mistral 7B model initialized w/ the default 7b parameter values from: https://huggingface.co/Ray2333/reward-model-Mistral-7B-instruct-Unified-Feedback where the output layer is a classification layer projecting to a single class for reward modelling.
- Returns:
Instantiation of Mistral 7B classifier model
- Return type: