Shortcuts

Function at::_scaled_dot_product_flash_attention_for_cpu

Function Documentation

inline ::std::tuple<at::Tensor, at::Tensor> at::_scaled_dot_product_flash_attention_for_cpu(const at::Tensor &query, const at::Tensor &key, const at::Tensor &value, double dropout_p = 0.0, bool is_causal = false, const ::std::optional<at::Tensor> &attn_mask = {}, ::std::optional<double> scale = ::std::nullopt)

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources