Hyperprior information

In Bayesian statistics, a hyperprior is a prior distribution on a hyperparameter, that is, on a parameter of a prior distribution.

As with the term hyperparameter, the use of hyper is to distinguish it from a prior distribution of a parameter of the model for the underlying system. They arise particularly in the use of hierarchical models.^[1]^[2]

For example, if one is using a beta distribution to model the distribution of the parameter p of a Bernoulli distribution, then:

The Bernoulli distribution (with parameter p) is the model of the underlying system;
p is a parameter of the underlying system (Bernoulli distribution);
The beta distribution (with parameters α and β) is the prior distribution of p;
α and β are parameters of the prior distribution (beta distribution), hence hyperparameters;
A prior distribution of α and β is thus a hyperprior.

In principle, one can iterate the above: if the hyperprior itself has hyperparameters, these may be called hyperhyperparameters, and so forth.

One can analogously call the posterior distribution on the hyperparameter the hyperposterior, and, if these are in the same family, call them conjugate hyperdistributions or a conjugate hyperprior. However, this rapidly becomes very abstract and removed from the original problem.

^ Ntzoufras, Ioannis (2009). "Bayesian Hierarchical Models". Bayesian Modelling using WinBUGS. Wiley. pp. 305–340. ISBN 978-0-470-14114-4.
^ McElreath, Richard (2020). "Models With Memory". Statistical Rethinking : A Bayesian Course with Examples in R and Stan. CRC Press. ISBN 978-0-367-13991-9.

[1] Ntzoufras, Ioannis (2009). "Bayesian Hierarchical Models". Bayesian Modelling using WinBUGS. Wiley. pp. 305–340. ISBN 978-0-470-14114-4.

[2] McElreath, Richard (2020). "Models With Memory". Statistical Rethinking : A Bayesian Course with Examples in R and Stan. CRC Press. ISBN 978-0-367-13991-9.

Hyperprior information

and 13 Related for: Hyperprior information

Hyperprior

Hyperparameter

Bayesian hierarchical modeling

Empirical Bayes method

Gibbs sampling

Categorical distribution

Prior probability

Dirichlet distribution

List of statistics articles

Exponential family

Expected utility hypothesis

Information field theory

Comparison of Gaussian process software

Bayesian statistics
Part of a series on

Posterior = Likelihood × Prior ÷ Evidence
Background
Bayesian inference Bayesian probability Bayes' theorem Bernstein–von Mises theorem Coherence Cox's theorem Cromwell's rule Principle of indifference Principle of maximum entropy
Model building
Weak prior ... Strong prior Conjugate prior Linear regression Empirical Bayes Hierarchical model
Posterior approximation
Markov chain Monte Carlo Laplace's approximation Integrated nested Laplace approximations Variational inference Approximate Bayesian computation
Estimators
Bayesian estimator Credible interval Maximum a posteriori estimation
Evidence approximation
Evidence lower bound Nested sampling
Model evaluation
Bayes factor Model averaging Posterior predictive
Mathematics portal
v t e