A short introction to Importance Sampling.
This one is a little (lot?) long. I hope I can condense it to a card.
First, recall we might want to solve:
\[ \begin{equation} \tag{1} E_f[h(x)] = \int_x h(x) f(x) dx \approx \frac{1}{N} \sum_{x_i \sim f} h(x) \end{equation} \]
This works if \(\int f(x) dx = 1\), ie \(f(x)\) is a proper distribution. If not, we can do:
\[ \begin{equation} \tag{2} E_f[h(x)] = \int_x h(x) \frac{c f(x)}{c} dx \approx \frac{c}{N} \sum_{x_i \sim f/c} h(x) \end{equation} \]
where \(\int f(x) dx = c \lt \infty\). So, what if we have a proposal function \(g(x)\) such that \(\Omega_f \le \Omega_g\). Well,
\[ \begin{equation} \tag{3} E_f[h(x)] = \int_x h(x) f(x) dx = \int_x h(x) \frac{f(x) g(x)}{g(x)} dx \approx \frac{1}{N} \sum_{x_i \sim g} h(x) \frac{f(x)}{g(x)} \end{equation} \]
\[ \begin{eqnarray} \tag{4} min\ E_g\left [ \left( h(x) \frac{f(x)}{g(x)}\right)^2\right] &=& \int_x h^2(x)\frac{f^2(x)}{g^2(x)}g(x) dx \\ &=& \int_x h^2(x)\frac{f(x)}{g(x)} f(x) dx \\ &=& E_f\left [ h^2(x) \frac{f(x)}{g(x)}\right] \end{eqnarray} \]
Note, if the importance rate, \[ \frac{f(x)}{g(x)} \le M \lt \infty \], then
\[ \begin{equation} \tag{5} E_f\left [ h^2(x) \frac{f(x)}{g(x)}\right] \le M\ast E[h^2(x)] \lt \infty \end{equation} \] defines tail dominance.
This is optimal:
\[ \begin{equation} \tag{6} g(x) = g^{\ast}(x) = \frac{\mid h(x) \mid f(x)}{\int \mid h(x) \mid f(x) dx} \end{equation} \]
\[ \begin{equation} \tag{7} Var_g\left(h(x)\frac{f(x)}{g(x)}\right) = E_g \left[ \left( h(x) \frac{f(x)}{g(x)}\right)^2 \right] - E_g\left [ h(x) \frac{f(x)}{g(x)}\right]^2 \end{equation} \]
Remembering Jensen’s Inequality:
\[ \begin{eqnarray} E_g\left [ h^2(x) \frac{f^2(x)}{g^2(x)}\right] &\ge & E_g\left [ \mid h(x)\mid \frac{f(x)}{g(x)}\right]^2 \\ &\ge & \left( \int_g \mid h(x)\mid \frac{f(x)}{g(x)} g(x) dx\right)^2 \\ \tag{8} &\ge & \left( \int_g \mid h(x)\mid f(x) dx\right)^2 \end{eqnarray} \] IS the lower bound.
Can we reach the lower bound?? Plug (6) into \[E_g \left[ \left( h(x) \frac{f(x)}{g(x)}\right)^2 \right]\]
YES, and optimal!!
To use (6), we need to evaluate the integral in the denominator. BUT, that is as hard as the original problem. However, recall:
\[ \begin{equation} \tag{9} E_f[h(x)] = \int h(x) f(x) dx \approx \sum_g h(x)\frac{f(x)}{g(x)} \equiv \sum_g h(x) \omega (x) \end{equation} \]
which is just a weighted average. Remembering \(E_g[w(x)]=1\), we could also use:
\[ \begin{equation} \tag{10} E_f[h(x)] = \approx \frac{\sum_g h(x)\omega (x)}{\sum_g \omega (x)} \end{equation} \]
Finally, don’t forget the effective sample size (ESS):
\[ \begin{equation} \tag{11} ESS = \frac{N}{1+Var_g(\omega (x))} \end{equation} \]
If you see mistakes or want to suggest changes, please create an issue on the source repository.
Text and figures are licensed under Creative Commons Attribution CC BY 4.0. Source code is available at https://github.com/rsettlage/rsettlage.github.io, unless otherwise noted. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".
For attribution, please cite this work as
Settlage (2022, July 12). Robert E. Settlage: Importance-sampling-tldr. Retrieved from https://rsettlage.github.io/mcmc/2022-07-12-importance-sampling-tldr/
BibTeX citation
@misc{settlage2022importance-sampling-tldr, author = {Settlage, Robert}, title = {Robert E. Settlage: Importance-sampling-tldr}, url = {https://rsettlage.github.io/mcmc/2022-07-12-importance-sampling-tldr/}, year = {2022} }