Write a Blog >>
Tue 25 Feb 2020 11:20 - 11:45 - Program Analysis (Mediterranean Ballroom) Chair(s): Michael Garland

We present an approach to automatically detect and reproduce error code propagation bugs in MPI implementations. Specifically, we combine static analysis and program repair for bug detection, and apply fault injection to reproduce error propagation bugs found in MPI libraries written in C/C++. We demonstrate our approach on the MPICH library—one of the most popular implementations of MPI, and the MPICH-based implementation MVAPICH2, uncovering 447 previously unknown bugs. We discovered that 31 of these bugs result in program crashes, and 60% of the MPICH test suite is susceptible to crashing due to failures to propagate error codes. Moreover, 95 bugs produce undesirable behavior that has been confirmed dynamically, causing tests to fail, hanging processes, or simply dropping error codes before reaching user applications.

Tue 25 Feb
Times are displayed in time zone: (GMT-07:00) Tijuana, Baja California change

10:55 - 12:35: Main Conference - Program Analysis (Mediterranean Ballroom)
Chair(s): Michael GarlandNVIDIA
PPoPP-2020-papers10:55 - 11:20
Sonali SahaIIT Madras, V Krishna NandivadaIIT Madras
PPoPP-2020-papers11:20 - 11:45
Daniel DeFreezUniversity of California, Davis, Antara BhowmickUniversity of California, Davis, Ignacio LagunaLawrence Livermore National Laboratory, Cindy Rubio-GonzálezUniversity of California, Davis
PPoPP-2020-papers11:45 - 12:10
Omar InversoGran Sasso Science Institute, Catia TrubianiGran Sasso Science Institute
PPoPP-2020-papers12:10 - 12:35
Yifan XuWashington University in St. Louis, Kyle SingerWashington University in St. Louis, I-Ting Angelina LeeWashington University in St. Louis