12
ORNL is managed by UT-Battelle for the US Department of Energy Optimization of Sensor Networks for Climate Models (OSCM): Overview July 24 th , 2018 Daniel Ricciuto 1 , Khachik Sargsyan 2 , Youssef Marzouk 3 , Dan Lu 1 , Cosmin Safta 2 , Kate Evans 1 , Miroslav Stoyanov 1 , Jayanth Jagalur Mohan 3 , Fengyi Li 3 1 Oak Ridge National Laboratory 2 Sandia National Laboratories 3 Massachusetts Institute of Techology

Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

ORNL is managed by UT-Battelle for the US Department of Energy

Optimization of Sensor Networks for Climate Models (OSCM): Overview

July 24th, 2018

Daniel Ricciuto1, Khachik Sargsyan2, Youssef Marzouk3,

Dan Lu1, Cosmin Safta2, Kate Evans1, Miroslav Stoyanov1, Jayanth Jagalur Mohan3, Fengyi Li3

1Oak Ridge National Laboratory2Sandia National Laboratories3Massachusetts Institute of Techology

Page 2: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

2 Presentation name

• Earth System models like E3SM are computationally expensive to do even a single forward simulation, but contain many uncertain parameters/processes.

• Uncertainty quantification in or calibration of an ESM requires ensembles, which can become quite large given uncertain parameters

• Many of the relevant land and atmospheric observations are at “point” scale (1 model grid cell or smaller) – how can we best use these?

Key Challenges

Page 3: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

3 Presentation name

Project objectives• A new framework enabled by new UQ algorithms

and high performance computing to guide placement of new observations to maximize uncertainty reduction in climate model predictions.

• A network of single-column coupled land-atmosphere models co-located with existing and proposed measurement networks, calibrated with existing or synthetic observations.

• Propagation of uncertainty with mutli-fidelity approach: Multi-level Monte Carlo (MLMC) can be used to propagate uncertainties in fully coupled mode over a range of fidelity and resolution.

Land/atmosphere measurement networks

Page 4: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

4 Presentation name

Major project tasksUQ methods development:• Improve sensitivity analysis and surrogate modeling methods for E3SM

• Develop model calibration techniques, including representation of structural error (embedded approach)

• Implement methods to reconstruct global maps from multi-point simulations

• Develop algorithms for Network Optimization using E3SM single column models

• Develop and implement Multi-level Monte Carlo for uncertainty propagation in E3SM

Earth System Model Infrastructure:• Single column ensemble infrastructure and computational architecture

• Data synthesis for network calibration, process identification and validation

Page 5: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

5 Presentation name

Strategies for building E3SM surrogate models5

• Goal: Develop accurate surrogate models for E3SM-Land Model (65 input parameters)

3K simulations (approx. 50K CPU hours/grid cell)

§ 2.5K samples for training using cross-validation

to determine regularization knobs

§ 0.5K samples for testing

§ reshuffle and re-process data

§ The Neural Network Model (NNM) shows increased accuracy compared to Sparse Regression Polynomial

Chaos Expansions for the ELM

§ Sparse Regression PCE Model Error: 0.15…0.19§ Neural Network Model Error: 0.06…0.09

Gross Primary Production @ US-KS2

...

✓(i)1

✓(i)2

✓(i)3

✓(i)d

......

W [1], b[1]

�[1]

...W [2], b[2]

�[2] �[L�1]

W [L], b[L]

�[L]

y(i)

. . .

Inputlayer

Hiddenlayer 1

Hiddenlayer 2

Hiddenlayer(L-1)

Outputlayer

§ Next steps:• Explore additional regularization techniques (l1, adversarial

networks), embedding physical constraints, …

Page 6: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

6 Presentation name

Model Surrogates for sensitivity analysis6

• Goal: Determine key parameter sensitivities for dimensionality reduction

• Sensitivity Analysis selected a subset of 10 principal parameters (out of 65) that impact selected model outputs– The Neural Network Model captured key sensitivities that were missed by previous

methods

Model Output #1: Gross Primary Production (GPP) Model Output #2: Total Leaf Area Index (TLAI)

Land model simulation results for Kennedy Space Center Site (US-KS2)

Page 7: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

7 Presentation name

Calibration of the E3SM land model using surrogate based global optimization

Objective• Calibrate E3SM land model (ELM) to improve model projection

of carbon fluxes.

New science

• Advanced sparse grid interpolation to construct a fast-to-evaluate surrogate system of the ELM.

• Efficient global optimization algorithm to calibrate the ELM and find the optimal parameter values.

Significance

• An accurate surrogate model can be created for the ELM to reduce the model evaluation time.

• Application of the optimized parameters improves ELM performance and predictive capability of carbon fluxes.

Lu, Ricciuto, Stoyanov, Gu (2018). “Calibration of the E3SM land model using surrogate based global optimization.” Journal of Advances in Modeling Earth Systems. Accepted.

2006 2007 2008 2009 2010 2011 2012 2013 2014−800

−600

−400

−200

0

200

NE

E (

gCm−2

)

(a)

2006 2007 2008 2009 2010 2011 2012 2013 20140.5

1.5

2.5

3.5

4.5

5.5

TLA

I

(b)

2006 2007 2008 2009 2010 2011 2012 2013 201410

30

50

70

90

LH (

Wm−2

)

Years

(c)

Pred. period Obs. Optimized Default 95% CI of Obs.

The optimized parameters improve model fit to observation (Obs.) compared to the default parameter values.

Single gridcell model calibrationOSCM goal: Calibrate E3SM land and atmosphere models using surrogate modeling approaches

Page 8: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

8 Presentation name

Reconstruction and network design

0.5x0.5 GPP “truth”5000 points Bilinear (64 pt)

August 1988 GPP

Kriging (64pt)

Goal: Reproduce model spatial variations using a network of single column simulations• Simple ELM running at MIT and interface with OED

system being developed

• OED problem using Bayesian linear-Gaussian setting, with "greedy" selection strategy for some simple synthetic problems.

• Provide performance guarantees for the greedy strategy that bound how much worse it could be than the combinatorial optimum

Page 9: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

9 Presentation name

Atmosphere SCM ensemble infrastructureComputational challenges• Scaling from ~10s to ~10K+ SCM

simulations• I/O limitations (based on experience

with ELM)• How to optimize ensembles for pre-

exascale architectures Model development challenges (E3SM/CMDV)• Eulerian dycore à SE dycore• Development of relevant use cases

– Locations of interest– Length of simulations

HPC system interconnect

Rack interconnect

Chassis Node interconnect

Storag

e

Proc N

Ra

ck

N

No

de

N

... ... ...

DRAM NV-RAM

Page 10: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

10 Presentation name

Collaborations: Surrogate modeling and machine learningMachine learning

– Tuning of neural network hyperparameters– Bayesian Neural network approaches– Implementation on LCF facilities– Neural networks with physical constraints

(e.g. we can’t have negative leaf area)

Ensemble analysis/visualization– reducing the parameter space to explore– Calibration “on the fly”– Efficient calculation of ensemble statistics

EDEN visualization tool

Page 11: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

11 Presentation name

sELM – In Progress: Exploring Model Structure and Calibration Work11

Net Ecosystem Exchange

[gC/m2/day]

• Partnership with FASTMath Institute to explore low-rank functional tensor train representations

• discover model structure in the combined spatial-stochastic spaces• Use sensitivity analysis and model error embedding to calibrate model components that are

responsible the most for discrepancies between predictions and observations.• Optimal experimental Design (OED) formulations

August2004

Page 12: Optimization of Sensor Networks for Climate Models (OSCM ...€¦ · 5 Presentation name Strategies for building E3SM surrogate models 5 • Goal: Develop accurate surrogate models

12 Presentation name

Next steps and potential collaborations

• Create surrogate models for land-atmosphere QOIs using efficient techniques.

• Further develop our OED system to link surrogate models with sensor placement optimization

• Use both land and land-atm coupled SCM in this framework

• Demonstrate E3SM successful use case for multi-fidelity uncertainty quantification.

• Potential collaboration with other BER SciDAC projects– I/O issues for large network ensembles

– Coupling for ensemble simulations

– UQ algorithms development