Gradient conjugate priors and multi-layer neural networks

Gurevich, P.; Stuke, H.

Gradient conjugate priors and multi-layer neural networks

The paper deals with learning probability distributions of observed data by artificial neural networks. We suggest a so-called gradient conjugate prior (GCP) update appropriate for neural networks, which is a modification of the classical Bayesian update for conjugate priors. We establish a connection between the gradient conjugate prior update and the maximization of the log-likelihood of the predictive distribution. Unlike for the Bayesian neural networks, we use deterministic weights of neural networks, but rather assume that the ground truth distribution is normal with unknown mean and variance and learn by the neural networks the parameters of a prior (normal-gamma distribution) for these unknown mean and variance. The update of the parameters is done, using the gradient that, at each step, directs towards minimizing the Kullback–Leibler divergence from the prior to the posterior distribution (both being normal-gamma). We obtain a corresponding dynamical system for the prior's parameters and analyze its properties. In particular, we study the limiting behavior of all the prior's parameters and show how it differs from the case of the classical full Bayesian update. The results are validated on synthetic and real world data sets. © 2019 Elsevier B.V.

Authors

Gurevich P. ^1, ² , Stuke H. ¹

Journal

Artificial Intelligence

Publisher

Elsevier B.V.

Language

English

State

Published

Link

External link

DOI

10.1016/J.ARTINT.2019.103184

Number

103184

Volume

278

Year

2020

Organizations

¹ Free University of Berlin, Arnimallee 3, Berlin, 14195, Germany
² RUDN University, Miklukho-Maklaya 6, Moscow, 117198, Russian Federation

Keywords

Asymptotics; Conjugate priors; Deep neural networks; Kullback–Leibler divergence; Latent variables; Outliers; Regression; Student's t-distribution; Uncertainty quantification

Cite

ГОСТ MLA RIS BibTex

TRANSFORMATION OF BUSINESS MODELS IN TERMS OF DIGITALIZATION

Article

Digilina O.B., Teslenko I.B.

Lecture Notes in Networks and Systems. Vol. 91. 2020. P.. 503-509

D-AMINO ACIDS IN NATURE, AGRICULTURE AND BIOMEDICINE

Article

Grishin D.V., Zhdanov D.D., Pokrovskaya M.V., Sokolov N.N.

Frontiers in Life Science. Vol. 13. 2020. P.. 11-22

Gradient conjugate priors and multi-layer neural networks

Other records

TRANSFORMATION OF BUSINESS MODELS IN TERMS OF DIGITALIZATION

D-AMINO ACIDS IN NATURE, AGRICULTURE AND BIOMEDICINE

Cite