composable_kernel/include
John Shumway 2d74123427 Shard several of the most costly targets.
Introduces filter_tuple_by_modulo, which breaks up large tuples so that
costly targets can be sharded.
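
The commit text does not show the helper itself. As an illustration only,
here is a minimal sketch of the idea written against std::tuple. The name
filter_tuple_by_modulo_sketch, its template parameters (Stride, Remainder),
and the use of std::tuple are all assumptions made for this example; the
real helper in composable_kernel may have a different signature and operate
on the library's own tuple type. The sketch keeps only the elements whose
index i satisfies i % Stride == Remainder.

```cpp
#include <cstddef>
#include <tuple>
#include <utility>

// Hypothetical helper (not the library's code): picks the elements at
// indices Remainder, Remainder + Stride, Remainder + 2 * Stride, ...
template <std::size_t Stride, std::size_t Remainder, typename Tuple, std::size_t... Is>
constexpr auto filter_by_modulo_impl(const Tuple& t, std::index_sequence<Is...>)
{
    return std::make_tuple(std::get<Remainder + Is * Stride>(t)...);
}

// Hypothetical sketch of a modulo filter over std::tuple.
// Returns a new tuple holding every element whose index i has
// i % Stride == Remainder.
template <std::size_t Stride, std::size_t Remainder, typename... Ts>
constexpr auto filter_tuple_by_modulo_sketch(const std::tuple<Ts...>& t)
{
    static_assert(Stride > 0, "Stride must be positive");
    static_assert(Remainder < Stride, "Remainder must be smaller than Stride");

    constexpr std::size_t n = sizeof...(Ts);
    // Number of indices in [0, n) that are congruent to Remainder mod Stride.
    constexpr std::size_t count =
        (n > Remainder) ? (n - Remainder + Stride - 1) / Stride : 0;

    return filter_by_modulo_impl<Stride, Remainder>(t, std::make_index_sequence<count>{});
}
```

Under these assumptions, instantiating the filter with the same Stride and
each Remainder from 0 to Stride - 1 partitions a tuple into Stride disjoint
pieces. Each piece could then be placed in its own translation unit, which
is one plausible way sharding gives the build system more independent work
to run in parallel.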

Drops the build time of the target below from 21 minutes to under 14
minutes with 64 build processes, or to 11 minutes with 128 build processes.

time ninja -j 64 device_grouped_conv3d_fwd_instance
2025-05-30 20:00:03 +00:00