Skip to Content
Find More Like This
Return to Search

Locating hardware faults in a data communications network of a parallel computer

United States Patent

January 12, 2010
View the Complete Patent at the US Patent & Trademark Office
Hardware faults location in a data communications network of a parallel computer. Such a parallel computer includes a plurality of compute nodes and a data communications network that couples the compute nodes for data communications and organizes the compute node as a tree. Locating hardware faults includes identifying a next compute node as a parent node and a root of a parent test tree, identifying for each child compute node of the parent node a child test tree having the child compute node as root, running a same test suite on the parent test tree and each child test tree, and identifying the parent compute node as having a defective link connected from the parent compute node to a child compute node if the test suite fails on the parent test tree and succeeds on all the child test trees.
Archer; Charles J. (Rochester, MN), Megerian; Mark G. (Rochester, MN), Ratterman; Joseph D. (Rochester, MN), Smith; Brian E. (Rochester, MN)
International Business Machines Corporation (Armonk, NY)
11/ 279,586
April 13, 2006
GOVERNMENT RIGHTS IN INVENTION The U.S. Government has a paid-up license in this invention and the right to limited circumstances to require the patent owner to license others on reasonable terms as provided for by the terms of Contract No. B519700 awarded by the Department of Energy.