Blog

Announcements

PyTorch Foundation Welcomes New Executive Director

The PyTorch Foundation is excited to welcome Matt White, our new executive director. The PyTorch…

PyTorch FoundationJune 11, 2024

Blog

INT4 Decoding GQA CUDA Optimizations for LLM Inference

An efficient decoding Grouped-Query Attention with low-precision KV cache Introduction Generative AI has taken the…

Sarunya Pumma, Jongsoo Park, Jianyu Huang, Amy Yang, Jaewon Lee, Daniel Haziza, Grigory Sizov, Jeremy Reizenstein, Jeff Johnson, Ying ZhangJune 6, 2024

Announcements

Ready, Set, Contribute: PyTorch Docathon Kickoff H1 2024

The PyTorch Docathon is now live! This event is dedicated to enhancing the quality of…

PyTorch FoundationJune 4, 2024

Case Studies

AI Helps Duolingo Personalize Language Learning

Learning a foreign language was probably one of your goals last year. And the year…

PyTorch FoundationMay 25, 2024

Blog

Maximizing Training Throughput Using PyTorch FSDP and Torch.compile

Recently, we demonstrated how FSDP and selective activation checkpointing can be used to achieve 57% MFU…

Team PyTorch at IBM and Team PyTorch at MetaMay 21, 2024

Blog

Achieving Sustainability Goals with PyTorch and Intel AI

This post was contributed by Intel AI in partnership with the PyTorch Foundation. In 2017,…

PyTorch FoundationMay 15, 2024

Blog

Speeding up ViTs using Block Sparsity

TLDR: We show promising results of up to a 1.46x speedup with <2% drop in accuracy on float32…

FAIR at Meta: Mostafa Elhoushi, Sensors and Systems at Meta Reality Labs Research: Syed Shakib Sarwar, Aaryan Kothapalli, Mia Kasperek, Barbara De Salvo, PyTorch at Meta: Christian Puhrsch, Jesse Cai, Joe Isaacson, Quantsight: Andrew James, Pearu Peterson, Nikita VedeneevMay 14, 2024

Community

Introducing depyf: mastering torch.compile with ease

We are thrilled to introduce depyf, a new project to the PyTorch ecosystem designed to help…

Kaichao YouMay 11, 2024

Community

Deep Learning Energy Measurement and Optimization

This post is authored by Jae-Won Chung, a PhD student at the University of Michigan and…

Jae-Won ChungMay 11, 2024

Announcements Community

Enhancing Deep Learning Workflows: PyTorch Ecosystem Tools

Welcome to the thriving PyTorch ecosystem, where a wealth of tools and libraries await, purpose-built…

PyTorch FoundationMay 11, 2024

Blog

A Hitchhiker’s Guide to Speculative Decoding

Speculative decoding is an optimization technique for inference that makes educated guesses about future tokens…

Team PyTorch at IBMMay 2, 2024

Announcements

Announcing PyTorch Docathon June, 2024

We are thrilled to announce the upcoming PyTorch Docathon in June! The Docathon, akin to…

PyTorch FoundationMay 2, 2024

Blog

Accelerating Llama3 FP8 Inference with Triton Kernels

1.0 Summary We present an optimized Triton FP8 GEMM (General Matrix-Matrix Multiply) kernel TK-GEMM, which…

Adnan Hoque, Less Wright, Chih Chieh YangMay 1, 2024

Blog

ExecuTorch Alpha: Taking LLMs and AI to the Edge with Our Community and Partners

We are excited to announce the release of ExecuTorch alpha, focused on deploying large language models…

PyTorch FoundationApril 30, 2024

Blog

PyTorch 2.3 Release Blog

We are excited to announce the release of PyTorch® 2.3 (release note)! PyTorch 2.3 offers…

PyTorch FoundationApril 24, 2024

Announcements

torchtune: Easily fine-tune LLMs using PyTorch

We’re pleased to announce the alpha release of torchtune, a PyTorch-native library for easily fine-tuning…

PyTorch FoundationApril 16, 2024

Blog

Accelerating MoE model inference with Locality-Aware Kernel Design

1.0 Summary We show that by implementing column-major scheduling to improve data locality, we can…

Adnan Hoque, Less Wright, Antoni Virós Martin, Chih-Chieh YangApril 4, 2024

Blog

Maximizing training throughput using PyTorch FSDP

In this blog, we demonstrate the scalability of FSDP with a pre-training exemplar, a 7B…

Team PyTorch at IBM and Team PyTorch at MetaMarch 13, 2024

Community

Exploring scientific machine learning pipelines through the SimulAI toolkit

SciML, short for Scientific Machine Learning, encompasses work that merges quantitative sciences with machine learning.…

Joao Lucas de Sousa AlmeidaFebruary 15, 2024

Announcements

PyTorch 2 paper and tutorial @ ASPLOS 2024

The PyTorch team is excited to share that our paper on PyTorch 2 has been…

PyTorch FoundationFebruary 6, 2024

PyTorch Foundation Welcomes New Executive Director

INT4 Decoding GQA CUDA Optimizations for LLM Inference

Ready, Set, Contribute: PyTorch Docathon Kickoff H1 2024

AI Helps Duolingo Personalize Language Learning

Maximizing Training Throughput Using PyTorch FSDP and Torch.compile

Achieving Sustainability Goals with PyTorch and Intel AI

Speeding up ViTs using Block Sparsity

Introducing depyf: mastering torch.compile with ease

Deep Learning Energy Measurement and Optimization

Enhancing Deep Learning Workflows: PyTorch Ecosystem Tools

A Hitchhiker’s Guide to Speculative Decoding

Announcing PyTorch Docathon June, 2024

Accelerating Llama3 FP8 Inference with Triton Kernels

ExecuTorch Alpha: Taking LLMs and AI to the Edge with Our Community and Partners

PyTorch 2.3 Release Blog

torchtune: Easily fine-tune LLMs using PyTorch

Accelerating MoE model inference with Locality-Aware Kernel Design

Maximizing training throughput using PyTorch FSDP

Exploring scientific machine learning pipelines through the SimulAI toolkit

PyTorch 2 paper and tutorial @ ASPLOS 2024

Docs

Tutorials

Resources

Stay in touch for updates, event info, and the latest news