mirror of
https://github.com/NVIDIA/cutlass.git
synced 2026-05-20 04:48:57 +00:00
29 lines
931 B
ReStructuredText
29 lines
931 B
ReStructuredText
.. _talks_and_presentations:
|
|
.. |DSL| replace:: CuTe DSL
|
|
|
|
Talks and Presentations
|
|
=======================
|
|
|
|
This page collects talks, presentations, and other resources related to |DSL|
|
|
and CUTLASS Python infrastructure.
|
|
|
|
Conference Talks
|
|
----------------
|
|
|
|
**CuTeDSL: CUTLASS Python DSL Infrastructure** — *LLVM 2025*
|
|
|
|
An introduction to the |DSL| architecture, covering the hybrid AST-rewrite and
|
|
tracing approach, MLIR code generation, and integration with CUTLASS.
|
|
|
|
* `Video <https://www.youtube.com/watch?v=5NXd6MbKYNQ>`__
|
|
* `Slides (PDF) <https://llvm.org/devmtg/2025-10/slides/technical_talks/ozen.pdf>`_
|
|
|
|
----
|
|
|
|
**Enable Tensor Core Programming in Python with CUTLASS 4.0** — *GTC 2025*
|
|
|
|
Learn how to leverage Tensor Cores directly from Python using CUTLASS 4.0's
|
|
new DSL front-end, enabling rapid kernel development without writing CUDA C++.
|
|
|
|
* `Video <https://www.nvidia.com/en-us/on-demand/session/gtc25-s74639/>`__
|