alpaca_cleaned_dataset

torchtune.datasets.alpaca_cleaned_dataset(tokenizer: ModelTokenizer, *, source: str = 'yahma/alpaca-cleaned', train_on_input: bool = True, max_seq_len: int = 512, packed: bool = False) -> InstructDataset

Builder for a variant of the Alpaca-style dataset that uses yahma/alpaca-cleaned, the cleaned version of the original Alpaca dataset. See the dataset page and alpaca_dataset() for more details.
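
The signature above exposes train_on_input and max_seq_len without showing their effect on a sample. The following is a minimal, standard-library sketch of what such a builder conceptually does to one Alpaca-style record; it is not torchtune's implementation. The helpers format_prompt, toy_tokenize, and build_example are hypothetical names, the whitespace tokenizer is a stand-in for a real ModelTokenizer, and the plain truncation is a simplification. The sketch assumes the usual Alpaca columns (instruction, input, output), the standard Alpaca prompt wording, and the common PyTorch convention of using -100 as the label value ignored by the loss.

```python
# Illustrative sketch only: NOT torchtune's implementation.
# All helper names below are hypothetical.

IGNORE_INDEX = -100  # assumed: label value conventionally skipped by cross-entropy loss

PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:\n"
)
PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:\n"
)

_vocab = {}  # toy vocabulary shared across calls


def format_prompt(sample):
    """Render the prompt, choosing the template by whether `input` is non-empty."""
    if sample.get("input"):
        return PROMPT_WITH_INPUT.format(
            instruction=sample["instruction"], input=sample["input"]
        )
    return PROMPT_NO_INPUT.format(instruction=sample["instruction"])


def toy_tokenize(text):
    """Hypothetical stand-in tokenizer: one non-negative id per whitespace-separated word."""
    return [_vocab.setdefault(word, len(_vocab)) for word in text.split()]


def build_example(sample, train_on_input=True, max_seq_len=512):
    """Tokenize prompt + response and build labels, masking the prompt if requested."""
    prompt_ids = toy_tokenize(format_prompt(sample))
    response_ids = toy_tokenize(sample["output"])
    tokens = prompt_ids + response_ids
    if train_on_input:
        # The loss is computed on prompt and response tokens alike.
        labels = list(tokens)
    else:
        # Prompt positions are masked so only the response contributes to the loss.
        labels = [IGNORE_INDEX] * len(prompt_ids) + response_ids
    # Simplified truncation: keep the first max_seq_len positions.
    return {"tokens": tokens[:max_seq_len], "labels": labels[:max_seq_len]}
```

Calling build_example(sample, train_on_input=False) on a record yields labels whose prompt positions are all IGNORE_INDEX, while train_on_input=True leaves labels identical to tokens. With a real torchtune tokenizer, the builder documented above performs the corresponding steps itself.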
