The “evidence” or “marginal likelihood” integral, $Z = p(D) = \int p(\theta)\, p(D|\theta)\, d\theta$, is often considered hard to calculate. The most general method I know of is Nested Sampling, although sometimes other methods can outperform it if the shape of the likelihood function is kind to the method.
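To make the integral concrete, here is a minimal sketch using a toy conjugate Gaussian model of my own choosing (not one from the post): a simple Monte Carlo average of the likelihood over prior draws, compared against the closed-form evidence that this particular model happens to have. Real problems are exactly the ones where no closed form exists and naive prior sampling fails, which is where Nested Sampling comes in.

```python
import numpy as np

# Toy model (an assumption for illustration): prior theta ~ N(0, 1),
# likelihood for a single datum d: p(d | theta) = N(d; theta, 1).
# For this conjugate pair the evidence is exactly Z = N(d; 0, sqrt(2)).
rng = np.random.default_rng(0)

def likelihood(d, theta):
    """Gaussian likelihood p(d | theta) with unit noise."""
    return np.exp(-0.5 * (d - theta) ** 2) / np.sqrt(2 * np.pi)

d = 1.0

# Simple Monte Carlo: Z = E_prior[ p(d | theta) ], estimated by averaging
# the likelihood over samples from the prior.
theta = rng.normal(0.0, 1.0, size=200_000)
Z_mc = likelihood(d, theta).mean()

# Exact answer for this model, for comparison.
Z_exact = np.exp(-0.5 * d**2 / 2.0) / np.sqrt(2 * np.pi * 2.0)
print(Z_mc, Z_exact)
```

With 200,000 prior draws the two numbers agree to about three decimal places; the method degrades rapidly once the posterior occupies a tiny fraction of the prior volume.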

However, there are other integrals that Bayesians need to do that are orders of magnitude harder than this. Suppose I want to measure some parameters $\theta$ from some data $D$, and there are also nuisance parameters $\eta$ in the problem. The amount of information learned, $\mathcal{H}$, is quantified by how compressed the posterior is relative to the prior:

$$\mathcal{H} = \int p(\theta | D) \log\left[\frac{p(\theta | D)}{p(\theta)}\right] \, d\theta.$$

Remembering that we have nuisance parameters, the posterior and prior in this expression may be written as marginalisations of the joint posterior/prior of $\theta$ and $\eta$:

$$p(\theta | D) = \int p(\theta, \eta | D) \, d\eta, \qquad p(\theta) = \int p(\theta, \eta) \, d\eta.$$

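A small sketch of this information calculation, again using an invented linear-Gaussian toy model where everything is tractable: $\theta$ is the parameter of interest, $\eta$ a nuisance parameter, and the datum is their sum plus noise. The marginal posterior of $\theta$ is Gaussian, so the KL divergence from prior to posterior has a closed form, which a Monte Carlo average of the log density ratio reproduces.

```python
import numpy as np

# Toy model (an assumption for illustration): theta ~ N(0, 1) is the
# parameter of interest, eta ~ N(0, 1) an independent nuisance parameter,
# and the datum is d | theta, eta ~ N(theta + eta, 1).  Standard conjugate
# algebra gives the marginal posterior theta | d ~ N(d/3, 2/3).
rng = np.random.default_rng(1)
d = 1.5
post_mean, post_var = d / 3.0, 2.0 / 3.0

# Closed-form KL divergence between Gaussians: KL(posterior || prior)
# with prior N(0, 1).
kl_exact = 0.5 * (post_var + post_mean**2 - 1.0 - np.log(post_var))

# The same integral done by Monte Carlo: average the log density ratio
# over draws from the marginal posterior.
theta = rng.normal(post_mean, np.sqrt(post_var), size=200_000)
log_post = (-0.5 * (theta - post_mean) ** 2 / post_var
            - 0.5 * np.log(2 * np.pi * post_var))
log_prior = -0.5 * theta**2 - 0.5 * np.log(2 * np.pi)
kl_mc = np.mean(log_post - log_prior)
print(kl_exact, kl_mc)
```

Note what made this easy: the marginalisations over $\eta$ were analytic. In general each evaluation of $p(\theta|D)$ or $p(\theta)$ hides an integral over $\eta$, and that is where the difficulty compounds.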
That’s how much information the data provided to us about the parameters of interest. Before we get the data, we might be interested in knowing the expected amount of information we will get, $\langle \mathcal{H} \rangle$, with the expectation being taken with respect to the prior for the data, $p(D)$:

$$\langle \mathcal{H} \rangle = \int p(D) \, \mathcal{H}(D) \, dD.$$

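One brute-force approach to this expectation is nested Monte Carlo, sketched below for a toy model of my own (prior $\theta \sim N(0,1)$, datum $d \mid \theta \sim N(\theta, 1)$, no nuisance parameters, so the exact answer is the Gaussian mutual information $\tfrac{1}{2}\log 2$). The nesting is the point: every simulated dataset in the outer loop needs its own evidence-like integral in the inner loop.

```python
import numpy as np

# Toy model (an assumption for illustration): theta ~ N(0, 1),
# d | theta ~ N(theta, 1).  The exact expected information is 0.5*log(2).
rng = np.random.default_rng(2)

def log_like(d, theta):
    return -0.5 * (d - theta) ** 2 - 0.5 * np.log(2 * np.pi)

n_outer, n_inner = 2000, 5000

# Outer loop: joint draws (theta, d) from the prior and sampling distribution.
theta_out = rng.normal(size=n_outer)
d_out = theta_out + rng.normal(size=n_outer)

# Inner loop: fresh prior draws, used to estimate log p(d) for EACH dataset.
theta_in = rng.normal(size=n_inner)
# Broadcasting gives an (n_outer, n_inner) matrix of likelihood values.
log_L = log_like(d_out[:, None], theta_in[None, :])
log_evidence = np.log(np.mean(np.exp(log_L), axis=1))

# <H> is the average of log p(d|theta) - log p(d) over the joint draws.
H_expected = np.mean(log_like(d_out, theta_out) - log_evidence)
exact = 0.5 * np.log(2.0)
print(H_expected, exact)
```

Even this trivial model needed ten million likelihood evaluations, and the inner estimate of $\log p(D)$ is biased for finite inner sample sizes. That is the sense in which these integrals are orders of magnitude harder than a single evidence calculation.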
We can use this to do experimental design: it makes sense to choose your experiment to optimise the expected amount of information you’ll get about the parameters of interest. This wraps an optimisation problem around the above expression too.
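For the design step, a minimal sketch under a strong simplifying assumption: if the model is linear-Gaussian with prior variance 1 and measurement noise $\sigma$, the expected information has the closed form $\langle \mathcal{H} \rangle = \tfrac{1}{2}\log(1 + 1/\sigma^2)$, so "optimising the design" reduces to comparing candidate noise levels. The instrument names and values below are invented.

```python
import numpy as np

# Hypothetical candidate instruments, each characterised by its noise sd
# sigma in the model d = theta + noise, with prior theta ~ N(0, 1).
candidate_sigmas = {"cheap_noisy": 2.0, "standard": 1.0, "precise": 0.25}

def expected_info(sigma):
    # Closed form for the linear-Gaussian case: 0.5 * log(1 + SNR),
    # where SNR = prior variance / noise variance = 1 / sigma^2.
    return 0.5 * np.log(1.0 + 1.0 / sigma**2)

scores = {name: expected_info(s) for name, s in candidate_sigmas.items()}
best = max(scores, key=scores.get)
print(scores, best)
```

In realistic problems no such closed form exists, so every candidate design evaluated by the optimiser costs a full nested Monte Carlo run, an optimisation wrapped around a double integral.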

In summary, the evidence is not a hard integral. We know how to do it. Other useful integrals, not so much.


## About Brendon J. Brewer

I am a senior lecturer in the Department of Statistics at The University of Auckland. Any opinions expressed here are mine and are not endorsed by my employer.
