Torch-TensorRT¶

In-framework compilation of PyTorch inference code for NVIDIA GPUs¶

Torch-TensorRT is a inference compiler for PyTorch, targeting NVIDIA GPUs via NVIDIA’s TensorRT Deep Learning Optimizer and Runtime. It supports both just-in-time (JIT) compilation workflows via the torch.compile interface as well as ahead-of-time (AOT) workflows. Torch-TensorRT integrates seamlessly into the PyTorch ecosystem supporting hybrid execution of optimized TensorRT code with standard PyTorch code.

More Information / System Architecture:

Torch-TensorRT 2.0

Getting Started¶

Installation

User Guide¶

Dynamo Frontend¶

TorchScript Frontend¶

FX Frontend¶

Torch-TensorRT (FX Frontend) User Guide

Tutorials¶

Python API Documentation¶

C++ API Documentation¶

CLI Documentation¶

torchtrtc

Torch-TensorRT¶

In-framework compilation of PyTorch inference code for NVIDIA GPUs¶

Getting Started¶

User Guide¶

Dynamo Frontend¶

TorchScript Frontend¶

FX Frontend¶

Tutorials¶

Python API Documentation¶

C++ API Documentation¶

CLI Documentation¶

Contributor Documentation¶

Indices¶

Legacy Further Information (TorchScript)¶

Docs

Tutorials

Resources