
Aggregating job exit statuses of a plurality of compute nodes executing a parallel application

United States Patent

9,086,962
July 21, 2015
Aggregating job exit statuses of a plurality of compute nodes executing a parallel application, including: identifying a subset of compute nodes in the parallel computer to execute the parallel application; selecting one compute node in the subset of compute nodes in the parallel computer as a job leader compute node; initiating execution of the parallel application on the subset of compute nodes; receiving an exit status from each compute node in the subset of compute nodes, where the exit status for each compute node includes information describing execution of some portion of the parallel application by the compute node; aggregating each exit status from each compute node in the subset of compute nodes; and sending an aggregated exit status for the subset of compute nodes in the parallel computer.
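The steps in the abstract can be illustrated with a minimal single-process sketch. It is not the patented implementation: the abstract does not say how the job leader is chosen, what an exit status contains, or how statuses are combined. The names `ExitStatus`, `AggregatedExitStatus`, `select_job_leader`, `aggregate_exit_statuses`, and `run_job` are hypothetical, as are the policies assumed here (lowest node id becomes leader, exit code 0 means success, and the aggregate reports the highest exit code plus the ids of the failing nodes).

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass(frozen=True)
class ExitStatus:
    """Exit status reported by one compute node (hypothetical layout)."""
    node_id: int
    code: int          # assumed convention: 0 = success, > 0 = failure
    detail: str = ""   # free-form description of what the node executed


@dataclass(frozen=True)
class AggregatedExitStatus:
    """Single status summarizing the whole subset (hypothetical layout)."""
    leader_id: int
    node_count: int
    worst_code: int
    failed_nodes: Tuple[int, ...]


def select_job_leader(subset: Sequence[int]) -> int:
    # Assumed policy: the lowest node id acts as the job leader.
    return min(subset)


def aggregate_exit_statuses(
    leader_id: int, statuses: Sequence[ExitStatus]
) -> AggregatedExitStatus:
    # Assumed policy: keep the highest exit code and list the failing nodes.
    failed = tuple(sorted(s.node_id for s in statuses if s.code != 0))
    worst = max((s.code for s in statuses), default=0)
    return AggregatedExitStatus(leader_id, len(statuses), worst, failed)


def run_job(
    subset: Sequence[int], run_on_node: Callable[[int], ExitStatus]
) -> AggregatedExitStatus:
    # 1. The subset of compute nodes has already been identified by the caller.
    # 2. Select one node in the subset as the job leader.
    leader = select_job_leader(subset)
    # 3-4. Run the application on every node and collect each exit status.
    #      A real parallel computer would do this concurrently over a network;
    #      here the nodes are simulated by sequential function calls.
    statuses = [run_on_node(node) for node in subset]
    # 5-6. Aggregate the statuses and return the single aggregated status.
    return aggregate_exit_statuses(leader, statuses)


if __name__ == "__main__":
    def fake_node(node_id: int) -> ExitStatus:
        # Simulated workload: node 7 fails with code 3, all others succeed.
        return ExitStatus(node_id, 3 if node_id == 7 else 0, "simulated run")

    print(run_job([4, 5, 6, 7], fake_node))
```

Reporting one aggregated status instead of one message per node is the point of the technique: the recipient handles a single summary however many nodes ran the job.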
Aho; Michael E. (Rochester, MN), Attinella; John E. (Rochester, MN), Gooding; Thomas M. (Rochester, MN), Mundy; Michael B. (Rochester, MN)
International Business Machines Corporation (Armonk, NY)
13/524,602
20130339805
June 15, 2012
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

This invention was made with Government support under Contract No. B579040 awarded by the Department of Energy. The Government has certain rights in this invention.