Skip to Content
Find More Like This
Return to Search

MANAGING VARIATIONS AMONG NODES IN PARALLEL SYSTEM FRAMEWORKS

United States Patent Application

20170279703
A1
View the Complete Application at the US Patent & Trademark Office
Lawrence Livermore National Laboratory - Visit the Industrial Partnerships Office Website
Systems, apparatuses, and methods for managing variations among nodes in parallel system frameworks. Sensor and performance data associated with the nodes of a multi-node cluster may be monitored to detect variations among the nodes. A variability metric may be calculated for each node of the cluster based on the sensor and performance data associated with the node. The variability metrics may then be used by a mapper to efficiently map tasks of a parallel application to the nodes of the cluster. In one embodiment, the mapper may assign the critical tasks of the parallel application to the nodes with the lowest variability metrics. In another embodiment, the hardware of the nodes may be reconfigured so as to reduce the node-to-node variability.
Wasmundt, Samuel Lawrence (San Diego, CA), Piga, Leonardo (Austin, TX), Paul, Indrani (Round Rock, TX), Huang, Wei (Austin, TX), Arora, Manish (Dublin, CA)
15/ 081,558
March 25, 2016
[0001] The invention described herein was made with government support under contract number DE-AC52-07NA27344 awarded by the United States Department of Energy. The United States Government has certain rights in the invention.