• Definition of Kullback-Leibler divergence(KL Distance): The

  • When symmetrized:.

  • KL distance of the probability distribution of a parameter θ and a parameter slightly (δ)-displaced:.

  • Now put .

  • By Taylor expansion of the logarithm and a first-order approximation

  • therefore

  • First-order approximation of the difference in function values:.

  • due to

  • By the derivative of the logarithm [$ (\log x)’ = x’ / x

  • Substituting Δ:.

To organize:.

  • To summarize so far

  • That is, the metric tensor (physics) of the space containing the KL distance is the Fisher matrix (


This page is auto-translated from /nishio/KL距離からのFisher行列の導出 using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I’m very happy to spread my thought to non-Japanese readers.