PPoPP 2024
Sat 2 - Wed 6 March 2024 Edinburgh, United Kingdom

Reducing floating-point (FP) precision is used to trade the quality degradation of a numerical program’s output for performance. Existing tools implement this optimization in a decoupled way by first generating all mixed-precision code variants and next searching the best-performing one. In this paper, we present a holistic approach called PrecTuner to consolidate the code generator and the autotuner via only one parameter r. Initialized by automatically sampled values, r is first used to generate several code variants in the polyhedral model, combining this optimization with various loop transformations. These code variants are then used to solve a performance model expressed in terms of r, possibly under a quality degradation budget. The best-performing value of r is finally predicted without evaluating all code variants. Experimental results of the PolyBench benchmarks on CPU demonstrate that PrecTuner outperforms LuIs by 3.28x while achieving smaller errors, and we also validate its effectiveness to optimize a real-life large-scale application. In addition, PrecTuner also obtains a mean speedup of 1.81x and 1.52x-1.73x over Pluto on single- and multi-core CPU, respectively, and 1.71x over PPCG on GPU.

Mon 4 Mar

Displayed time zone: London change

11:30 - 12:50
Compilers and Runtimes for Parallel SystemsMain Conference at Moorfoot
Chair(s): Mohamed Riyadh Baghdadi
11:30
20m
Talk
Liger: Interleaving Intra- and Inter-Operator Parallelism for Distributed Large Model Inference
Main Conference
Jiangsu Du Sun Yat-sen University, jinhui wei Sun Yat-sen University, Jiazhi Jiang Sun Yat-sen University, Shenggan Cheng National University of Singapore, Zhiguang Chen Sun Yat-sen University, Dan Huang , Yutong Lu Sun Yat-sen University
Link to publication DOI
11:50
20m
Talk
A Holistic Approach to Automatic Mixed-Precision Code Generation and Tuning for Affine Programs
Main Conference
Jinchen Xu Information Engineering University, Guanghui Song Li Auto Inc., Bei Zhou Information Engineering University, Fei Li Information Engineering University, Jiangwei Hao Information Engineering University, Jie Zhao State Key Laboratory of Mathematical Engineering and Advanced Computing
Link to publication DOI
12:10
20m
Talk
Language-Agnostic Static Deadlock Detection for Futures
Main Conference
Stefan K. Muller Illinois Institute of Technology
Link to publication DOI
12:30
20m
Talk
Recurrence Analysis for Automatic Parallelization of Subscripted Subscripts
Main Conference
Akshay Bhosale University of Delaware, USA, Rudolf Eigenmann University of Delaware
Link to publication DOI