Techniques for generating a dynamic treatment control policy for a cyber-physical system having one or more components, including a data collector for collecting data representative of the cyber-physical system, and adaptive stochastic controller including one or more models for generating a predicted value corresponding to available actions based on an objective function, and an approximate dynamic programming element configured to receive actual operation metrics corresponding to the available actions. The approximate dynamic programming element can learn a state-action map and generate a dynamic treatment control policy using the one or more models.
STATEMENT OF FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT
 This invention was made with government support under Grant no. OE-OE0000197, awarded by the Department of Energy. The government has certain rights in the invention.