DFFITS


In statistics, DFFIT and DFFITS ("difference in fit(s)") are diagnostics meant to show how influential a point is in a linear regression, first proposed in 1980.[1]

DFFIT is the change in the predicted value for a point, obtained when that point is left out of the regression:

$$\mathrm{DFFIT} = \hat{y}_i - \hat{y}_{i(i)}$$

where $\hat{y}_i$ and $\hat{y}_{i(i)}$ are the predictions for point i with and without point i included in the regression.

DFFITS is the Studentized DFFIT, where Studentization is achieved by dividing by the estimated standard deviation of the fit at that point:

$$\mathrm{DFFITS} = \frac{\mathrm{DFFIT}}{s_{(i)}\sqrt{h_{ii}}}$$

where $s_{(i)}$ is the standard error estimated without the point in question, and $h_{ii}$ is the leverage for the point.
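As a rough illustration, DFFIT and DFFITS can be computed by brute force: refit the regression with each point removed, compare the two predictions for that point, and divide by the leave-one-out standard error times the square root of the leverage. The sketch below assumes simple linear regression with an intercept (so p = 2) and uses the textbook leverage formula for that case, h_ii = 1/n + (x_i - x̄)²/S_xx. The function names `fit_line` and `dffit_and_dffits` are hypothetical, chosen only for this example.

```python
import math

def fit_line(xs, ys):
    """Least-squares fit of y = a + b*x; returns (a, b)."""
    n = len(xs)
    xbar, ybar = sum(xs) / n, sum(ys) / n
    sxx = sum((x - xbar) ** 2 for x in xs)
    b = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) / sxx
    return ybar - b * xbar, b

def dffit_and_dffits(xs, ys):
    """Return a list of (DFFIT, DFFITS) pairs, one per observation."""
    n, p = len(xs), 2  # p = 2: intercept and slope (an assumption of this sketch)
    a, b = fit_line(xs, ys)
    xbar = sum(xs) / n
    sxx = sum((x - xbar) ** 2 for x in xs)
    out = []
    for i in range(n):
        # Drop point i and refit.
        xs_i, ys_i = xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:]
        a_i, b_i = fit_line(xs_i, ys_i)
        # Standard error estimated without point i (n - 1 points, p parameters).
        sse_i = sum((y - (a_i + b_i * x)) ** 2 for x, y in zip(xs_i, ys_i))
        s_i = math.sqrt(sse_i / (n - 1 - p))
        # Leverage of point i in the full fit.
        h_ii = 1.0 / n + (xs[i] - xbar) ** 2 / sxx
        # Prediction with point i minus prediction without it.
        dffit = (a + b * xs[i]) - (a_i + b_i * xs[i])
        out.append((dffit, dffit / (s_i * math.sqrt(h_ii))))
    return out
```

The inputs are expected to be plain Python lists with at least four points, so that the leave-one-out fit still has a positive number of residual degrees of freedom.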

DFFITS also equals the product of the externally Studentized residual ($t_{i(i)}$) and the leverage factor ($\sqrt{h_{ii}/(1-h_{ii})}$):[2]

$$\mathrm{DFFITS} = t_{i(i)}\sqrt{\frac{h_{ii}}{1-h_{ii}}}$$
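The product form means DFFITS can be obtained from a single fit, with no refitting. The sketch below, again assuming simple linear regression with an intercept (p = 2), computes the externally Studentized residual from the ordinary residual e_i using the standard deletion identity (n - p - 1) s_(i)² = SSE - e_i²/(1 - h_ii), then multiplies by the leverage factor. The function name `dffits_closed_form` is hypothetical.

```python
import math

def dffits_closed_form(xs, ys):
    """DFFITS for y = a + b*x computed from one fit (no leave-one-out refits)."""
    n, p = len(xs), 2  # p = 2: intercept and slope (an assumption of this sketch)
    xbar, ybar = sum(xs) / n, sum(ys) / n
    sxx = sum((x - xbar) ** 2 for x in xs)
    b = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) / sxx
    a = ybar - b * xbar
    resid = [y - (a + b * x) for x, y in zip(xs, ys)]
    sse = sum(e * e for e in resid)
    result = []
    for x, e in zip(xs, resid):
        h = 1.0 / n + (x - xbar) ** 2 / sxx          # leverage h_ii
        # Residual variance with this point deleted, via the deletion identity.
        s2_loo = (sse - e * e / (1.0 - h)) / (n - p - 1)
        # Externally Studentized residual.
        t = e / math.sqrt(s2_loo * (1.0 - h))
        # DFFITS = Studentized residual times the leverage factor.
        result.append(t * math.sqrt(h / (1.0 - h)))
    return result
```

For a general design matrix the same steps apply with h_ii taken from the diagonal of the hat matrix; the one-predictor case is used here only to keep the example dependency-free.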

Thus, for low-leverage points, DFFITS is expected to be small, whereas as the leverage approaches 1 the distribution of the DFFITS value widens without bound.

For a perfectly balanced experimental design (such as a factorial design or balanced partial factorial design), the leverage for each point is p/n, the number of parameters divided by the number of points. This means that the DFFITS values will be distributed (in the Gaussian case) as $\sqrt{p/(n-p)} \approx \sqrt{p/n}$ times a t variate. Therefore, the authors suggest investigating those points with DFFITS greater than $2\sqrt{p/n}$.
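A minimal sketch of applying the commonly quoted cutoff of 2·sqrt(p/n) is shown below. It assumes the DFFITS values have already been computed and compares their absolute values against the cutoff, which is the usual reading of "greater than" for a statistic that can be negative. The function name `flag_influential` is hypothetical.

```python
import math

def flag_influential(dffits_values, p):
    """Return (cutoff, indices) for points whose |DFFITS| exceeds 2*sqrt(p/n).

    dffits_values: one DFFITS value per observation (n = len(dffits_values)).
    p: number of fitted parameters, including the intercept if there is one.
    """
    n = len(dffits_values)
    cutoff = 2.0 * math.sqrt(p / n)
    # Use the absolute value (an assumption of this sketch): large negative
    # DFFITS values indicate influence just as large positive ones do.
    flagged = [i for i, d in enumerate(dffits_values) if abs(d) > cutoff]
    return cutoff, flagged
```

The cutoff is a rule of thumb for deciding which points to investigate, not a formal test; flagged points warrant a closer look rather than automatic removal.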

Although the raw values resulting from the equations are different, Cook's distance and DFFITS are conceptually identical and there is a closed-form formula to convert one value to the other.[3]
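One way to write such a conversion follows from the standard definitions rather than from the cited text: with Cook's distance D_i = e_i² h_ii / (p s² (1 - h_ii)²) and DFFITS_i² = e_i² h_ii / (s_(i)² (1 - h_ii)²), the ratio gives D_i = DFFITS_i² · s_(i)² / (p s²), where s is the residual standard error of the full fit and s_(i) the one estimated without point i. The sketch below encodes that relation; the function name `cooks_distance_from_dffits` and its parameter names are hypothetical.

```python
def cooks_distance_from_dffits(dffits_i, s_loo, s_full, p):
    """Convert one DFFITS value to Cook's distance.

    dffits_i: DFFITS for observation i.
    s_loo:    residual standard error estimated without observation i.
    s_full:   residual standard error of the fit on all observations.
    p:        number of fitted parameters.

    Uses D_i = DFFITS_i^2 * s_loo^2 / (p * s_full^2), derived from the
    standard definitions of both statistics.
    """
    return (dffits_i ** 2) * (s_loo ** 2) / (p * s_full ** 2)
```

Because the conversion squares DFFITS, the sign of the change in fit is lost when moving to Cook's distance.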

  1. ^ Belsley, David A.; Kuh, Edwin; Welsch, Roy E. (1980). Regression Diagnostics: Identifying Influential Data and Sources of Collinearity. Wiley Series in Probability and Mathematical Statistics. New York: John Wiley & Sons. pp. 11–16. ISBN 0-471-05856-4.
  2. ^ Montgomery, Douglas C.; Peck, Elizabeth A.; Vining, G. Geoffrey (2012). Introduction to Linear Regression Analysis (5th ed.). Wiley. p. 218. ISBN 978-0-470-54281-1. Retrieved 22 February 2013. "Thus, $\mathrm{DFFITS}_i$ is the value of R-student multiplied by the leverage of the ith observation $[h_{ii}/(1 - h_{ii})]^{1/2}$."
  3. ^ Cohen, Jacob; Cohen, Patricia; West, Stephen G.; Aiken, Leona S. (2003). Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences. ISBN 0-8058-2223-2.
