composable_kernel/example
juuso-oskari e9cf036a81 Add medium-tier small-cache optimization (zero overhead for <100K blocks)
- Add medium tier small-cache variants to unified_attention.cpp dispatch
- Create instance files for medium tier with MaxNumBlocks=100000
- Add instances to optCompilerConfig.json
- Results: no measurable overhead for 50K blocks (6.062 ms vs. 6.067 ms baseline)
- Large cache (1.5M blocks) still works correctly with runtime rebasing
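
The tier selection described above could be sketched as follows. This is a minimal illustration, not the code in unified_attention.cpp: `CacheTier`, `kMediumTierMaxNumBlocks`, and `select_cache_tier` are hypothetical names, and the strict `<` comparison is an assumption taken from the "<100K blocks" wording in the commit subject.

```cpp
#include <cstdint>

// Hypothetical tier names for illustration only; the real dispatch
// in unified_attention.cpp may be structured differently.
enum class CacheTier { MediumSmallCache, LargeCache };

// Mirrors MaxNumBlocks=100000 from the commit message.
constexpr std::int64_t kMediumTierMaxNumBlocks = 100000;

// Assumption: block counts strictly below the bound take the small-cache
// variant; everything else falls back to the large-cache path, which the
// commit message says relies on runtime rebasing.
constexpr CacheTier select_cache_tier(std::int64_t num_blocks) {
    return num_blocks < kMediumTierMaxNumBlocks ? CacheTier::MediumSmallCache
                                                : CacheTier::LargeCache;
}

// The two cache sizes mentioned in the commit message.
static_assert(select_cache_tier(50000) == CacheTier::MediumSmallCache,
              "50K blocks should use the small-cache variant");
static_assert(select_cache_tier(1500000) == CacheTier::LargeCache,
              "1.5M blocks should use the large-cache path");
```

Making the bound a compile-time constant is what would let a small-cache instance skip the rebasing work entirely, which is consistent with the reported timings, though the actual mechanism is not stated in the commit.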
2026-05-07 09:27:55 +00:00