torch.cuda.comm.reduce_add
- torch.cuda.comm.reduce_add(inputs, destination=None)
Sums tensors from multiple GPUs.
All inputs must have the same shape, dtype, and layout. The output tensor has that same shape, dtype, and layout, and is placed on the destination device (the current device when destination is None).
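A minimal usage sketch, under stated assumptions: the helper name `sum_across_gpus` is hypothetical and not part of PyTorch, and the example cycles tensors over however many GPUs are visible so that it also runs on a single-GPU machine. It passes `destination=0` because, as far as I can tell from the implementation, the destination is expected to be a device that already holds one of the inputs; treat that as an assumption to verify against your PyTorch version.

```python
import torch
import torch.cuda.comm as comm


def sum_across_gpus(values, destination=None):
    """Hypothetical helper: put one tensor per entry of `values` on the
    visible GPUs (cycling if there are fewer GPUs than entries) and sum
    them elementwise with reduce_add."""
    n = torch.cuda.device_count()
    inputs = [
        torch.tensor(v, device=f"cuda:{i % n}")  # same shape and dtype for all
        for i, v in enumerate(values)
    ]
    return comm.reduce_add(inputs, destination=destination)


if torch.cuda.is_available():
    # The first input lives on cuda:0, so cuda:0 is used as the destination.
    total = sum_across_gpus([[1.0, 2.0], [3.0, 4.0]], destination=0)
    print(total)         # elementwise sum of the two inputs
    print(total.device)  # device the result was placed on
else:
    print("CUDA is not available; reduce_add requires GPU tensors.")
```

The inputs here share shape, dtype, and layout, as the description requires; passing tensors of different shapes is expected to raise an error rather than broadcast.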