PyTorch at NVIDIA GTC 2026: Join Us in San Jose! | Blog | We're excited to announce that PyTorch will have a strong presence at NVIDIA GTC 2026,… | Clement Anthonioz Blanc, Chris Gottbrath, PyTorch Team at Meta | March 9, 2026
KernelAgent: Hardware-Guided GPU Kernel Optimization via Multi-Agent Orchestration | Blog | Summary: Recently, the PyTorch team released KernelAgent, an open agentic system achieving 100% correctness across… | Kaiming Cheng, Laura Wang, Jack Khuu, Mark Saroufim, Wenyuan Chi, Jiannan Wang, and Joe Isaacson | March 6, 2026
FlexAttention + FlashAttention-4: Fast and Flexible | Blog | TL;DR: On Hopper and Blackwell GPUs, FlexAttention now has a FlashAttention-4 backend. We added support… | Driss Guessous, Reuben Stern, Markus Hoehnerbach, Fung Xie, Ted Zadouri, Jay Shah, Tri Dao | March 5, 2026
Deploying PyTorch Models to the Micro-Edge with ExecuTorch and Arm | Blog | The world of AI is expanding beyond the cloud, reaching devices that fit in the… | Dominica Abena Oforiwaa Amanfo | March 5, 2026
Quantization-Aware Training in TorchAO (II) | Blog | In our previous Quantization-Aware Training (QAT) blog, we introduced the initial QAT flow in TorchAO… | Meta: Andrew Or, Lisa Jin, Scott Roy, Jerry Zhang, Mergen Nachin, Supriya Rao, Lin Xiao; Unsloth: Daniel Han; Axolotl: Salman Mohammadi | March 4, 2026
Kubetorch Joins the PyTorch Ecosystem Landscape: A Fast, Pythonic, Fault-Tolerant Interface into Kubernetes for ML | Announcements, Blog | Kubetorch enables ML research and development on Kubernetes, across training, inference, RL, evals, data processing,… | Paul Yang, Donny Greenberg | February 27, 2026
Enhancing Multimodal Training and Memory Efficiency with DeepSpeed | Blog | Overview: This blog walks through two crucial DeepSpeed updates: (1) a PyTorch-identical backward API that… | Masahiro Tanaka (Anyscale) and Olatunji Ruwase (Snowflake) | February 24, 2026
Accelerating Autotuning in Helion with Bayesian Optimization | Blog | Introduction: As introduced in a previous blog post, Helion is a high-level DSL that empowers… | Ethan Che, Oguz Ulgen, Max Balandat, Jongsok Choi, Jason Ansel | February 24, 2026
PyTorch Foundation Announces New Members as Agentic AI Demand Grows | Announcements, Blog | Foundation welcomes Clockwork.io, Emmi AI, NIPA, Nota AI, Yasp, CommonAI CIC, Carnegie Mellon University, Monash… | PyTorch Foundation | February 24, 2026
PyTorchCon Europe Schedule is Live | Announcements, Blog | The schedule for PyTorch Conference Europe is officially live! Join us 7-8 April in Paris… | PyTorch Foundation | February 23, 2026
Mooncake Joins PyTorch Ecosystem | Announcements, Ecosystem | We are thrilled to announce that Mooncake has officially joined the PyTorch Ecosystem! By integrating… | The Mooncake Team | February 12, 2026
Pyrefly Now Type Checks PyTorch | Blog | We’re excited to share that PyTorch now leverages Pyrefly to power type checking across our… | PyTorch and Pyrefly Teams at Meta | February 12, 2026
Why I’m Joining the PyTorch Foundation | Announcements, Blog | I want to start by thanking Matt White for everything he has built over the… | Mark Collier, Executive Director, PyTorch Foundation | February 11, 2026
PyTorch Foundation: The Next Chapter, Together | Announcements, Blog | Over the past nearly two years, I’ve had the privilege of serving as Executive Director… | Matt White | February 11, 2026
PyTorch Day India 2026: A builder-focused milestone for open source AI in Bengaluru | Announcements | On February… | PyTorch Foundation | February 10, 2026
Accelerating Mamba2 with Kernel Fusion | Blog | Summary: In this post, we discuss how we optimized the Mamba-2 State-Space Dual (SSD) module… | Rishi Astra, Tri Dao, Adnan Hoque | February 6, 2026
Some Matrix Multiplication Engines Are Not As Accurate As We Thought | Blog | What is an accumulator in an accelerator's GEMM engine and why does it matter? GPUs… | Chi-Chun (Charlie) Liu, Monodeep Kar, Naigang Wang, Raghu Kiran Ganti, Mudhakar Srivatsa | February 6, 2026
Building Highly Efficient Inference System for Recommenders Using PyTorch | Blog | Why Choose PyTorch for Recommendation System: PyTorch has emerged as the de facto framework in… | Lu Fang, Shiyan Deng, Hongyi Jia, Huamin Li, Ilina Mitra, Sheng Qin, Zhengkai Zhang, Zhuoran Zhao, Zinnia Zheng | February 5, 2026
Portable Paged Attention in Helion | Blog | Recently, the PyTorch team released Helion, a new domain-specific and PyTorch-based language to make the… | Burkhard Ringlein (IBM Research) and the vLLM Team at IBM Research | February 3, 2026
Unlock Reasoning in Llama 3.1-8B via Full Fine-Tuning on NVIDIA DGX Spark | Blog, Community | What is the unsaid joy of local LLMs? The magic of downloading weights, running some… | Sanyam Bhutani (PyTorch Meta), Hamid Shojanazeri (PyTorch Meta), Clement Anthonioz Blanc (Meta) | February 2, 2026