Blog

Deploying PyTorch Models to the Micro-Edge with ExecuTorch and Arm

The world of AI is expanding beyond the cloud, reaching devices that fit in the…

Dominica Abena Oforiwaa Amanfo | March 5, 2026

Blog

Quantization-Aware Training in TorchAO (II)

In our previous Quantization-Aware Training (QAT) blog, we introduced the initial QAT flow in TorchAO…

Meta: Andrew Or, Lisa Jin, Scott Roy, Jerry Zhang, Mergen Nachin, Supriya Rao, Lin Xiao; Unsloth: Daniel Han; Axolotl: Salman Mohammadi | March 4, 2026

Announcements Blog

Kubetorch Joins the PyTorch Ecosystem Landscape: A Fast, Pythonic, Fault-Tolerant Interface into Kubernetes for ML

Kubetorch enables ML research and development on Kubernetes, across training, inference, RL, evals, data processing,…

Paul Yang, Donny Greenberg | February 27, 2026

Blog

Enhancing Multimodal Training and Memory Efficiency with DeepSpeed

Overview: This blog walks through two crucial DeepSpeed updates: (1) a PyTorch-identical backward API that…

Masahiro Tanaka (Anyscale) and Olatunji Ruwase (Snowflake) | February 24, 2026

Blog

Accelerating Autotuning in Helion with Bayesian Optimization

Introduction: As introduced in a previous blog post, Helion is a high-level DSL that empowers…

Ethan Che, Oguz Ulgen, Max Balandat, Jongsok Choi, Jason Ansel | February 24, 2026

Announcements Blog

PyTorch Foundation Announces New Members as Agentic AI Demand Grows

Foundation welcomes Clockwork.io, Emmi AI, NIPA, Nota AI, Yasp, CommonAI CIC, Carnegie Mellon University, Monash…

PyTorch Foundation | February 24, 2026

Announcements Blog

PyTorchCon Europe Schedule is Live

The schedule for PyTorch Conference Europe is officially live! Join us 7-8 April in Paris…

PyTorch Foundation | February 23, 2026

Announcements Ecosystem

Mooncake Joins PyTorch Ecosystem

We are thrilled to announce that Mooncake has officially joined the PyTorch Ecosystem! By integrating…

The Mooncake Team | February 12, 2026

Blog

Pyrefly Now Type Checks PyTorch

We’re excited to share that PyTorch now leverages Pyrefly to power type checking across our…

PyTorch and Pyrefly Teams at Meta | February 12, 2026

Announcements Blog

Why I’m Joining the PyTorch Foundation

I want to start by thanking Matt White for everything he has built over the…

Mark Collier, Executive Director, PyTorch Foundation | February 11, 2026

Announcements Blog

PyTorch Foundation: The Next Chapter, Together

Over the past nearly two years, I’ve had the privilege of serving as Executive Director…

Matt White | February 11, 2026

Announcements

PyTorch Day India 2026: A builder-focused milestone for open source AI in Bengaluru

On February…

PyTorch Foundation | February 10, 2026

Blog

Accelerating Mamba2 with Kernel Fusion

Summary: In this post, we discuss how we optimized the Mamba-2 State-Space Dual (SSD) module…

Rishi Astra, Tri Dao, Adnan Hoque | February 6, 2026

Blog

Some Matrix Multiplication Engines Are Not As Accurate As We Thought

What is an accumulator in an accelerator's GEMM engine and why does it matter? GPUs…

Chi-Chun (Charlie) Liu, Monodeep Kar, Naigang Wang, Raghu Kiran Ganti, Mudhakar Srivatsa | February 6, 2026

Blog

Building Highly Efficient Inference System for Recommenders Using PyTorch

Why Choose PyTorch for Recommendation System: PyTorch has emerged as the de facto framework in…

Lu Fang, Shiyan Deng, Hongyi Jia, Huamin Li, Ilina Mitra, Sheng Qin, Zhengkai Zhang, Zhuoran Zhao, Zinnia Zheng | February 5, 2026

Blog

Portable Paged Attention in Helion

Recently, the PyTorch team released Helion, a new domain-specific and PyTorch-based language to make the…

Burkhard Ringlein (IBM Research) and the vLLM Team at IBM Research | February 3, 2026

Blog Community

Unlock Reasoning in Llama 3.1-8B via Full Fine-Tuning on NVIDIA DGX Spark

What is the unsaid joy of local LLMs? The magic of downloading weights, running some…

Sanyam Bhutani (PyTorch Meta), Hamid Shojanazeri (PyTorch Meta), Clement Anthonioz Blanc (Meta) | February 2, 2026

Blog

Accelerating On-Device ML Inference with ExecuTorch and Arm SME2

Interactive image segmentation has become a defining mobile experience across the world’s most popular apps.…

Jason Zhu, Tyler Mullenbach, Damien Dooley, and Gian Marco Iodice, Arm | January 29, 2026

Announcements

Feast Joins the PyTorch Ecosystem: Bridging Feature Stores and Deep Learning

PyTorch revolutionized how we build and serve AI models, but getting them to production remains…

Francisco Javier Arceo, Hao Xu, Shuchu Han | January 22, 2026

Blog

PyTorch 2.10 Release Blog

We are excited to announce the release of PyTorch® 2.10 (release notes)! This release features…

PyTorch Foundation | January 21, 2026
