TorchServe

⚠️ Notice: Limited Maintenance

This project is no longer actively maintained. While existing releases remain available, there are no planned updates, bug fixes, new features, or security patches. Users should be aware that vulnerabilities may not be addressed.

TorchServe is a performant, flexible, and easy-to-use tool for serving PyTorch models in production.

What’s going on in TorchServe?

TorchServe Quick Start

Topics: Quick Start

Learn how to install TorchServe and serve models.

Running TorchServe

Topics: Running TorchServe

In-depth explanation of how to run TorchServe

Why TorchServe

Topics: Examples

Various TorchServe use cases

TorchServe GenAI Use Cases

Topics: Use Cases

Showcasing GenAI deployment scenarios and use cases

Performance

Topics: Performance, Troubleshooting

Guides and best practices on how to improve performance when working with TorchServe

Metrics

Topics: Metrics, Performance, Troubleshooting

Collecting and viewing TorchServe metrics
