Variational inference for Bayes Network

Created in April 14, 2020

2020 · elfsong.cn · Uncategorized

In general neural networks have a sort of loss like that:

However, The part of the denominator integral is intractable of finding an analytic solution solution in practice. Therefore, we are going to make a distribution approaching the original distribution. KL divergence can be used to indicate the difference between these two distributions.

Enjoy Reading This Article?

Here are some more articles you might like to read next:

Resillience

Multi-Head Attention

Preference Alignment 101

Challenges in Code Generation

PREDICTING AND OPTIMIZING LLVM COMPILER PASS ORDER