Shaojie WANG
40942b9098
Optimization for gridwise group norm (#453)
...
* use another instance to check the efficiency
* optimize group layer norm
* 1. coalesce load/store data for gridwise layer norm Welford. 2. move a sqrt and division into an outer static loop
* add more instances to layernorm
* add 2 more test cases
* remove ignore in generating tuple of vector
Co-authored-by: Chao Liu <chao.liu2@amd.com>
2022-10-06 21:24:13 -05:00
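
The commit message mentions a Welford-based layer norm and moving a sqrt and division out of an inner loop. The sketch below illustrates the general idea only, in plain Python rather than the repository's GPU kernel code. The function names `welford` and `normalize` and the `eps` default are hypothetical and do not come from the commit.

```python
import math


def welford(values):
    """Single-pass running mean and population variance (Welford's update)."""
    count, mean, m2 = 0, 0.0, 0.0
    for x in values:
        count += 1
        delta = x - mean
        mean += delta / count
        m2 += delta * (x - mean)  # uses the already-updated mean
    variance = m2 / count if count else 0.0
    return mean, variance


def normalize(values, eps=1e-5):
    """Normalize values to zero mean and (approximately) unit variance."""
    mean, variance = welford(values)
    # The sqrt and the division happen once here, outside the per-element
    # loop, so the loop body is only a subtract and a multiply.
    inv_std = 1.0 / math.sqrt(variance + eps)
    return [(x - mean) * inv_std for x in values]
```

Hoisting `inv_std` is the scalar analogue of the second item in the commit message; how the actual kernel arranges its static loops is not shown in this listing.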