Model Reduction And Neural Networks For Parametric PDEs

Bhattacharya, Kaushik; Hosseini, Bamdad; Kovachki, Nikola B.; Stuart, Andrew M.

doi:10.5802/smai-jcm.74

Bhattacharya, Kaushik ¹ ; Hosseini, Bamdad ² ; Kovachki, Nikola B. ² ; Stuart, Andrew M. ²

¹ Mechanical and Civil Engineering, California Institute of Technology, Pasadena, CA, USA
² Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA

The SMAI Journal of computational mathematics, Volume 7 (2021), pp. 121-157.

Abstract

We develop a general framework for data-driven approximation of input-output maps between infinite-dimensional spaces. The proposed approach is motivated by the recent successes of neural networks and deep learning, in combination with ideas from model reduction. This combination results in a neural network approximation which, in principle, is defined on infinite-dimensional spaces and, in practice, is robust to the dimension of finite-dimensional approximations of these spaces required for computation. For a class of input-output maps, and suitably chosen probability measures on the inputs, we prove convergence of the proposed approximation methodology. We also include numerical experiments which demonstrate the effectiveness of the method, showing convergence and robustness of the approximation scheme with respect to the size of the discretization, and compare it with existing algorithms from the literature. Our examples include the mapping from coefficient to solution in a divergence-form elliptic partial differential equation (PDE) problem, and the solution operator for the viscous Burgers’ equation.
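
The abstract does not spell out the architecture, so the following is only a rough sketch of the general idea of combining model reduction with a learned map. It assumes PCA as the reduction step on both the input and output function spaces, and it uses a random-feature regression (fixed random tanh hidden layer, least-squares output layer) as a stand-in for a trained neural network. The toy data, the grid size, and all names (`pca_fit`, `encode`, `decode`, `features`, `surrogate`) are hypothetical and chosen for illustration.

```python
import numpy as np

def pca_fit(X, d):
    """Return the mean and the first d principal directions of the rows of X."""
    mean = X.mean(axis=0)
    # Rows of Vt are orthonormal principal directions of the centered data.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:d].T  # basis has shape (n_grid, d)

def encode(X, mean, basis):
    """Project discretized functions onto the reduced coordinates."""
    return (X - mean) @ basis

def decode(Z, mean, basis):
    """Lift reduced coordinates back to functions on the grid."""
    return Z @ basis.T + mean

rng = np.random.default_rng(0)
n_grid, n_train, d_in, d_out, width = 64, 200, 5, 8, 128
x = np.linspace(0.0, 1.0, n_grid)

# Toy data (an assumption, not from the paper): smooth random inputs a(x)
# and a simple nonlinear input-output map producing u(x).
modes = np.sin(np.pi * np.outer(np.arange(1, 6), x))   # (5, n_grid)
A = rng.normal(size=(n_train, 5)) @ modes               # inputs
U = np.cumsum(np.tanh(A), axis=1) / n_grid              # outputs

# Model reduction on both sides.
a_mean, a_basis = pca_fit(A, d_in)
u_mean, u_basis = pca_fit(U, d_out)
Z_in = encode(A, a_mean, a_basis)
Z_out = encode(U, u_mean, u_basis)

# Stand-in for the neural network between reduced coordinates:
# fixed random hidden layer, output weights fitted by least squares.
W = rng.normal(size=(d_in, width))
b = rng.normal(size=width)

def features(Z):
    hidden = np.tanh(Z @ W + b)
    return np.hstack([hidden, np.ones((Z.shape[0], 1))])  # append bias column

C, *_ = np.linalg.lstsq(features(Z_in), Z_out, rcond=None)

def surrogate(a):
    """Approximate u from a: encode, map reduced coordinates, decode."""
    return decode(features(encode(a, a_mean, a_basis)) @ C, u_mean, u_basis)

U_pred = surrogate(A)
train_err = np.linalg.norm(U_pred - U)
base_err = np.linalg.norm(U - u_mean)  # error of always predicting the mean output
print(f"training error {train_err:.3e} vs. mean-prediction baseline {base_err:.3e}")
```

Because the learned map acts only on the reduced coordinates, its size depends on `d_in` and `d_out` rather than on `n_grid`; refining the grid changes only the PCA bases in this sketch.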

Published: 2021-07-07

Classification: 65N75, 62M45, 68T05, 60H30, 60H15
Keywords: approximation theory, deep learning, model reduction, neural networks, partial differential equations.

@article{SMAI-JCM_2021__7__121_0,
     author = {Bhattacharya, Kaushik and Hosseini, Bamdad and Kovachki, Nikola B. and Stuart, Andrew M.},
     title = {Model {Reduction} {And} {Neural} {Networks} {For} {Parametric} {PDEs}},
     journal = {The SMAI Journal of computational mathematics},
     pages = {121--157},
     publisher = {Soci\'et\'e de Math\'ematiques Appliqu\'ees et Industrielles},
     volume = {7},
     year = {2021},
     doi = {10.5802/smai-jcm.74},
     language = {en},
     url = {http://www.numdam.org/articles/10.5802/smai-jcm.74/}
}

Bhattacharya, Kaushik; Hosseini, Bamdad; Kovachki, Nikola B.; Stuart, Andrew M. Model Reduction And Neural Networks For Parametric PDEs. The SMAI Journal of computational mathematics, Volume 7 (2021), pp. 121-157. doi:10.5802/smai-jcm.74. http://www.numdam.org/articles/10.5802/smai-jcm.74/
