yxsamliu wrote:

Here is the size distribution of individual code object file (each code object 
file is for one GPU arch, and a fat binary contains a bunch of code object 
files, therefore the optimal compression parameter is mostly related to code 
object file size ).

| Bin Size      | Count | Percentage | Cumulative Percentage | Example File     
                                                                                
             |
|---------------|-------|------------|-----------------------|---------------------------------------------------------------------------------------------------------------|
| 0-16K         | 961   | 12.31%     | 12.31%                | 
`librocrand.so#offset=27172864&size=0`                                          
                              |
| 16K-32K       | 602   | 7.71%      | 20.03%                | 
`librocalution_hip.so#offset=35438592&size=27264`                               
                              |
| 32K-64K       | 1463  | 18.75%     | 38.77%                | 
`librocalution_hip.so#offset=32391168&size=37808`                               
                              |
| 64K-128K      | 1134  | 14.53%     | 53.31%                | 
`libMIOpen.so#offset=566800384&size=98984`                                      
                              |
| 128K-256K     | 897   | 11.49%     | 64.80%                | 
`libMIOpen.so#offset=562827264&size=141624`                                     
                              |
| 256K-512K     | 977   | 12.52%     | 77.32%                | 
`libMIOpen.so#offset=659791872&size=504120`                                     
                              |
| 512K-1M       | 482   | 6.18%      | 83.50%                | 
`libMIOpen.so#offset=567713792&size=545032`                                     
                              |
| 1M-2M         | 443   | 5.68%      | 89.17%                | 
`libMIOpen.so#offset=569909248&size=1134632`                                    
                              |
| 2M-4M         | 412   | 5.28%      | 94.45%                | 
`librocrand.so#offset=27172864&size=2650696`                                    
                              |
| 4M-8M         | 251   | 3.22%      | 97.67%                | 
`librocblas.so#offset=1671168&size=5344160`                                     
                              |
| 8M-16M        | 136   | 1.74%      | 99.41%                | 
`librocblas.so#offset=389632000&size=15117200`                                  
                              |
| 16M-32M       | 41    | 0.53%      | 99.94%                | 
`librccl.so#offset=135168&size=20252464`                                        
                              |
| 32M-64M       | 1     | 0.01%      | 99.95%                | 
`TensileLibrary_Type_HH_HPA_Contraction_l_Alik_Bljk_Cijk_Dijk_gfx90a.co`        
                             |
| 64M-128M      | 4     | 0.05%      | 100.00%               | 
`TensileLibrary_Type_HH_HPA_ExperimentalGrid_Contraction_l_Ailk_Bjlk_Cijk_Dijk_CU104_gfx90a.co`
              |

>From the table we can see 99.9% of code object files are below 32MB. Also all 
>code object files are below 128MB.


https://github.com/llvm/llvm-project/pull/83605
_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to