Shortcuts

Define C10_DEVICE_HOST_FUNCTION

Define Documentation

C10_DEVICE_HOST_FUNCTION

Defines the Half type (half-precision floating-point) including conversions to standard C types and basic arithmetic operations.

Note that arithmetic operations are implemented by converting to floating point and performing the operation in float32, instead of using CUDA half intrinsics. Most uses of this type within ATen are memory bound, including the element-wise kernels, and the half intrinsics aren’t efficient on all GPUs. If you are writing a compute bound kernel, you can use the CUDA half intrinsics directly on the Half type from device code.

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources