"Mitigating Silent Data Corruptions in HPC Applications across Multiple Program Inputs," by Yafan Huang, Shengjian Guo, Sheng Di, and Guanpeng Li, was both a Best Paper Finalist and a Best Student Paper Finalist at SC22.
In this paper, the authors "propose MINPSID, an automated SID framework that automatically identifies and re-prioritizes incubative instructions in a given program to enhance SDC coverage. Evaluation shows MINPSID can effectively mitigate the loss of SDC coverage across multiple inputs." In short, the paper is about mitigating silent failures across multiple program inputs in HPC applications.
Zhengyang He, Hasan Rahman, and Yihua Wei, three fellow labmates, joined Huang, Li, and Peng at SC22.
The International Conference for High Performance Computing, Networking, Storage and Analysis is an annual conference established in 1988 by the Association for Computing Machinery and the IEEE Computer Society. This year, technical papers covered the following ten technical areas: Algorithms; Applications; Architecture and Networks; Clouds and Distributed Computing; Data Analytics, Visualization, and Storage; Machine Learning with HPC; Performance Measurement, Modeling, and Tools; Post-Moore Computing; Programming Frameworks and System Software; and State of the Practice. Of 320 submissions, 81 were accepted for publication after a rigorous peer-review process, an acceptance rate of 25.3%. Of these 81 papers, 7 were nominated for the Best Paper award and 5 for the Best Student Paper award.