Longsheng Du
08185b9c3e
Update blackwell tutorial to be compatible with 4.5-dev version ( #3130 )
...
* Update blackwell tutorial to be compatible with 4.5-dev version
* update example for reverted changes
* add more example fix
2026-04-09 14:40:33 +08:00
Junkai-Wu
a221da7ccf
v4.5 dev update. ( #3153 )
2026-04-07 12:16:05 -04:00
Junkai-Wu
1b741cabaa
v4.4.2 update. ( #3104 )
2026-03-17 00:58:19 -04:00
Blake Ledden
087c84df83
docs: Fix float16 documentation in elementwise_add notebook ( #2949 ) ( #3047 )
...
The notebook uses float16 tensors but the vectorized kernel documentation
incorrectly describes elements as 32-bit and uses 4-element vectorization.
Updated to correctly state 16-bit elements with 8-element vectorization
for proper 128-bit loads/stores.
Signed-off-by: Blake Ledden <bledden@users.noreply.github.com >
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com >
2026-03-12 10:29:46 +08:00
Junkai-Wu
d4bbf728ca
v4.4 tag release update. ( #3032 )
2026-02-13 23:27:58 -05:00
drazi
71aa7a0abc
Merge pull request #2919 from pbelevich/patch-1
...
Refactor binary_op functions to remove unused result parameter
2026-02-11 11:48:58 +08:00
Junkai-Wu
0d2b201e8c
v4.3.5 update. ( #2934 )
...
* v4.3.5 update.
* Update copyright to 2026
2026-01-08 15:02:56 -05:00
Pavel Belevich
b6d7703e02
Refactor binary_op functions to remove unused result parameter
2026-01-02 11:23:43 -05:00
Pavel Belevich
f9bedd9096
Fix print statement for floor division result
2026-01-02 11:15:15 -05:00
whatdhack
4a55379686
Update notebook title from 'Tour to' to 'Tour of'
...
Grammar check . LLM's can quickly clean up such issues.
2025-11-24 20:11:14 -08:00
Linfeng Zheng
406e078b29
add a notebook for tour to sol gemm ( #2780 )
...
* add tour to sol gemm notebook
* change some typos
* change some typos
2025-11-20 09:41:01 -05:00
Junkai-Wu
b1d6e2c9b3
v4.3 update. ( #2709 )
...
* v4.3 update.
* Update the cute_dsl_api changelog's doc link
* Update version to 4.3.0
* Update the example link
* Update doc to encourage user to install DSL from requirements.txt
---------
Co-authored-by: Larry Wu <larwu@nvidia.com >
2025-10-21 14:26:30 -04:00
Junkai-Wu
6a35b4d22f
v4.2 tag release. ( #2638 )
2025-09-15 12:21:53 -04:00
Junkai-Wu
fd6cfe1ed0
v4.1 release update v2. ( #2481 )
2025-07-21 22:03:55 -04:00
Junkai-Wu
a1aaf2300a
v4.1 release
2025-07-03 08:07:53 -04:00
Junkai-Wu
8bdbfca682
v4.0 update. ( #2371 )
2025-06-06 02:39:20 -04:00
Driss Guessous
f89cd95b16
Update elementwise_add.ipynb ( #2298 )
2025-05-15 09:38:27 -04:00
Kihiro Bando
f115c3f854
Release v4.0.0 ( #2294 )
2025-05-13 15:55:29 -04:00