Global Information Lookup Global Information

Hyperprior information


In Bayesian statistics, a hyperprior is a prior distribution on a hyperparameter, that is, on a parameter of a prior distribution.

As with the term hyperparameter, the use of hyper is to distinguish it from a prior distribution of a parameter of the model for the underlying system. They arise particularly in the use of hierarchical models.[1][2]

For example, if one is using a beta distribution to model the distribution of the parameter p of a Bernoulli distribution, then:

  • The Bernoulli distribution (with parameter p) is the model of the underlying system;
  • p is a parameter of the underlying system (Bernoulli distribution);
  • The beta distribution (with parameters α and β) is the prior distribution of p;
  • α and β are parameters of the prior distribution (beta distribution), hence hyperparameters;
  • A prior distribution of α and β is thus a hyperprior.

In principle, one can iterate the above: if the hyperprior itself has hyperparameters, these may be called hyperhyperparameters, and so forth.

One can analogously call the posterior distribution on the hyperparameter the hyperposterior, and, if these are in the same family, call them conjugate hyperdistributions or a conjugate hyperprior. However, this rapidly becomes very abstract and removed from the original problem.

  1. ^ Ntzoufras, Ioannis (2009). "Bayesian Hierarchical Models". Bayesian Modelling using WinBUGS. Wiley. pp. 305–340. ISBN 978-0-470-14114-4.
  2. ^ McElreath, Richard (2020). "Models With Memory". Statistical Rethinking : A Bayesian Course with Examples in R and Stan. CRC Press. ISBN 978-0-367-13991-9.

and 13 Related for: Hyperprior information

Request time (Page generated in 0.5352 seconds.)

Hyperprior

Last Update:

In Bayesian statistics, a hyperprior is a prior distribution on a hyperparameter, that is, on a parameter of a prior distribution. As with the term hyperparameter...

Word Count : 678

Hyperparameter

Last Update:

take a probability distribution on the hyperparameter itself, called a hyperprior. One often uses a prior which comes from a parametric family of probability...

Word Count : 489

Bayesian hierarchical modeling

Last Update:

distribution, namely: Hyperparameters: parameters of the prior distribution Hyperpriors: distributions of Hyperparameters Suppose a random variable Y follows...

Word Count : 3630

Empirical Bayes method

Last Update:

estimates of the variance). Bayes estimator Bayesian network Hyperparameter Hyperprior Best linear unbiased prediction Robbins lemma Spike-and-slab variable...

Word Count : 2483

Gibbs sampling

Last Update:

example, when there are multiple Dirichlet priors related by the same hyperprior. Each Dirichlet prior can be independently collapsed and affects only...

Word Count : 6140

Categorical distribution

Last Update:

distribution given a collection of N samples. Intuitively, we can view the hyperprior vector α as pseudocounts, i.e. as representing the number of observations...

Word Count : 4008

Prior probability

Last Update:

Uncertainty about these hyperparameters can, in turn, be expressed as hyperprior probability distributions. For example, if one uses a beta distribution...

Word Count : 6690

Dirichlet distribution

Last Update:

distribution given a collection of N samples. Intuitively, we can view the hyperprior vector α as pseudocounts, i.e. as representing the number of observations...

Word Count : 6539

List of statistics articles

Last Update:

Hyperbolic secant distribution Hypergeometric distribution Hyperparameter Hyperprior Hypoexponential distribution Idealised population Idempotent matrix Identifiability...

Word Count : 8280

Exponential family

Last Update:

prior, here a combination of two beta distributions; this is a form of hyperprior. An arbitrary likelihood will not belong to an exponential family, and...

Word Count : 11100

Expected utility hypothesis

Last Update:

whose parameters are themselves drawn from a higher-level distribution (hyperpriors). Starting with studies such as Lichtenstein & Slovic (1971), it was...

Word Count : 5645

Information field theory

Last Update:

inferred along with the field itself. This requires the specification of a hyperprior P ( S ) {\displaystyle {\mathcal {P}}(S)} . Often, statistical homogeneity...

Word Count : 5984

Comparison of Gaussian process software

Last Update:

parameters in the formula of the kernel. Prior: whether specifying arbitrary hyperpriors on the hyperparameters is supported. Posterior: whether estimating the...

Word Count : 1559

PDF Search Engine © AllGlobal.net