Optimizing GPU occupancy and resource usage with
2017-5-24 · Register file usage is 40 VGPRs per thread, for a total of 40,960 VGPRs, or 160 KiB. Thus, 96 KiB (37.5%) of each CU's register file is wasted. As you can see, maximum-size thread groups can easily result in bad GPU resource utilization if only one group fits to a …
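
The arithmetic behind these figures can be checked with a short sketch. The excerpt does not state the group size, the register width, or the register file capacity, so the constants below are assumptions back-derived from its numbers: 1,024 threads per group (40,960 / 40), 4 bytes per VGPR (160 KiB / 40,960), and a 256 KiB register file per CU (160 KiB used plus 96 KiB wasted). The function name is hypothetical.

```python
# Assumed constants, inferred from the figures quoted in the excerpt
# (they are not stated there explicitly).
THREADS_PER_GROUP = 1024   # assumed maximum thread group size
VGPRS_PER_THREAD = 40      # from the excerpt
BYTES_PER_VGPR = 4         # assumed 32-bit registers
REGISTER_FILE_KIB = 256    # assumed per-CU register file capacity


def register_file_usage(groups_resident=1):
    """Return (vgprs, used_kib, wasted_kib, wasted_pct) for one CU."""
    vgprs = THREADS_PER_GROUP * VGPRS_PER_THREAD * groups_resident
    used_kib = vgprs * BYTES_PER_VGPR / 1024
    wasted_kib = REGISTER_FILE_KIB - used_kib
    wasted_pct = 100 * wasted_kib / REGISTER_FILE_KIB
    return vgprs, used_kib, wasted_kib, wasted_pct


def max_groups_by_registers():
    """How many whole groups fit in the register file, ignoring other limits."""
    bytes_per_group = THREADS_PER_GROUP * VGPRS_PER_THREAD * BYTES_PER_VGPR
    return (REGISTER_FILE_KIB * 1024) // bytes_per_group


vgprs, used_kib, wasted_kib, wasted_pct = register_file_usage()
print(f"{vgprs} VGPRs, {used_kib:.0f} KiB used, "
      f"{wasted_kib:.0f} KiB ({wasted_pct:.1f}%) wasted")
# prints: 40960 VGPRs, 160 KiB used, 96 KiB (37.5%) wasted
print(max_groups_by_registers())  # prints: 1
```

Under these assumptions a second 1,024-thread group would need another 160 KiB, which does not fit in the remaining 96 KiB, so the leftover registers sit idle.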