The sample mean (sample average) or empirical mean (empirical average), and the sample covariance or empirical covariance are statistics computed from a sample of data on one or more random variables.
The sample mean is the average value (or mean value) of a sample of numbers taken from a larger population of numbers, where "population" indicates not the number of people but the entirety of relevant data, whether collected or not. For example, a sample of 40 companies' sales from the Fortune 500 might be used for convenience instead of the population, the sales of all 500 companies. The sample mean is used as an estimator of the population mean, the average value in the entire population; the estimate is more likely to be close to the population mean if the sample is large and representative. The reliability of the sample mean is estimated using the standard error, which in turn is calculated from the variance of the sample. If the sample is random, the standard error falls as the sample size increases, and, provided the population variance is finite, the distribution of the sample mean approaches the normal distribution.
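As a rough illustration of the quantities described above, the following Python sketch computes a sample mean and its estimated standard error using only the standard library. The function name `sample_mean_and_standard_error` and the sales figures are hypothetical, chosen for illustration; the sketch assumes a simple random sample and uses the sample variance with an n − 1 denominator.

```python
import math
import statistics


def sample_mean_and_standard_error(values):
    """Return (sample mean, estimated standard error of the mean).

    Illustrative helper, not part of any library.
    """
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    mean = statistics.fmean(values)
    # statistics.variance divides by n - 1 (the sample variance).
    variance = statistics.variance(values)
    # Standard error of the mean: sqrt(sample variance / n).
    return mean, math.sqrt(variance / n)


# Hypothetical sales figures, made up for this example.
sales = [12.0, 15.0, 9.0, 14.0, 10.0]
mean, standard_error = sample_mean_and_standard_error(sales)
print(mean, standard_error)
```

Because the variance is divided by n before taking the square root, the standard error shrinks as more observations are added, all else being equal.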
The term "sample mean" can also refer to a vector of average values when the statistician is looking at the values of several variables in the sample, e.g. the sales, profits, and employees of a sample of Fortune 500 companies. In this case, there is not just a sample variance for each variable but a sample variance-covariance matrix (or simply covariance matrix) that also shows the relationship between each pair of variables. With three variables, this is a 3×3 matrix. The sample covariance is useful in judging the reliability of the sample means as estimators and is also useful as an estimate of the population covariance matrix.
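The multivariate case can be sketched in plain Python as follows. The function names `sample_mean_vector` and `sample_covariance_matrix` and the data are hypothetical. The sketch assumes each row is one observation and each column one variable, and it uses the n − 1 denominator for the covariance.

```python
def sample_mean_vector(rows):
    """Column-wise mean of a list of equal-length observation rows."""
    n = len(rows)
    k = len(rows[0])
    return [sum(row[j] for row in rows) / n for j in range(k)]


def sample_covariance_matrix(rows):
    """k x k sample covariance matrix with the n - 1 denominator."""
    n = len(rows)
    if n < 2:
        raise ValueError("need at least two observations")
    k = len(rows[0])
    means = sample_mean_vector(rows)
    cov = [[0.0] * k for _ in range(k)]
    for row in rows:
        # Deviation of this observation from the mean vector.
        dev = [row[j] - means[j] for j in range(k)]
        for a in range(k):
            for b in range(k):
                cov[a][b] += dev[a] * dev[b]
    return [[entry / (n - 1) for entry in cov_row] for cov_row in cov]


# Hypothetical data: 3 observations of 3 variables.
data = [
    [1.0, 2.0, 5.0],
    [2.0, 4.0, 3.0],
    [3.0, 6.0, 1.0],
]
print(sample_mean_vector(data))        # [2.0, 4.0, 3.0]
print(sample_covariance_matrix(data))  # [[1.0, 2.0, -2.0], [2.0, 4.0, -4.0], [-2.0, -4.0, 4.0]]
```

The diagonal entries are the sample variances of the individual variables, and the matrix is symmetric because the covariance of variables a and b equals that of b and a.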
Due to their ease of calculation and other desirable characteristics, the sample mean and sample covariance are widely used in statistics to represent the location and dispersion of the distribution of values in the sample, and to estimate the values for the population.