In statistics, exploratory data analysis (EDA) is an approach of analyzing data sets to summarize their main characteristics, often using statistical graphics and other data visualization methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling and thereby contrasts traditional hypothesis testing. Exploratory data analysis has been promoted by John Tukey since 1970 to encourage statisticians to explore the data, and possibly formulate hypotheses that could lead to new data collection and experiments. EDA is different from initial data analysis (IDA),[1][2] which focuses more narrowly on checking assumptions required for model fitting and hypothesis testing, and handling missing values and making transformations of variables as needed. EDA encompasses IDA.
^Chatfield, C. (1995). Problem Solving: A Statistician's Guide (2nd ed.). Chapman and Hall. ISBN 978-0412606304.
^Baillie, Mark; Le Cessie, Saskia; Schmidt, Carsten Oliver; Lusa, Lara; Huebner, Marianne; Topic Group "Initial Data Analysis" of the STRATOS Initiative (2022). "Ten simple rules for initial data analysis". PLOS Computational Biology. 18 (2): e1009819. Bibcode:2022PLSCB..18E9819B. doi:10.1371/journal.pcbi.1009819. PMC 8870512. PMID 35202399.
and 27 Related for: Exploratory data analysis information
In statistics, exploratorydataanalysis (EDA) is an approach of analyzing data sets to summarize their main characteristics, often using statistical graphics...
descriptive statistics, exploratorydataanalysis (EDA), and confirmatory dataanalysis (CDA). EDA focuses on discovering new features in the data while CDA focuses...
In multivariate statistics, exploratory factor analysis (EFA) is a statistical method used to uncover the underlying structure of a relatively large set...
Boolean analysis – a method to find deterministic dependencies between variables in a sample, mostly used in exploratorydataanalysis Cluster analysis – techniques...
Causal analysis is the field of experimental design and statistical analysis pertaining to establishing cause and effect. Exploratory causal analysis (ECA)...
can involve tasks such as data cleaning, data visualization, and exploratorydataanalysis to gain insights into the data and develop hypotheses about...
is now called data science. Tukey articulated the important distinction between exploratorydataanalysis and confirmatory dataanalysis, believing that...
component analysis (PCA) is a linear dimensionality reduction technique with applications in exploratorydataanalysis, visualization and data preprocessing...
follow the Spirtes-Glymour approach to verification. Exploratory causal analysis, also known as "data causality" or "causal discovery" is the use of statistical...
by John Tukey, who later published on the subject in his book "ExploratoryDataAnalysis" in 1977. A boxplot is a standardized way of displaying the dataset...
the two given data sets are similar enough move_random_points() is a function that randomly moves data points Exploratorydataanalysis Goodness of fit...
(mathematical programming) methods. Data mining is a related (parallel) field of study, focusing on exploratorydataanalysis (EDA) through unsupervised learning...
multi-armed bandit problem. Exploratoryanalysis of Bayesian models is an adaptation or extension of the exploratorydataanalysis approach to the needs and...
dimensionality increases, data becomes increasingly sparse while density and distance between points, critical to clustering and outlier analysis, becomes less meaningful...
correspondence analysis (MCA) is a dataanalysis technique for nominal categorical data, used to detect and represent underlying structures in a data set. It...
elected office Exploratorydataanalysis, an approach to analyzing data for the purpose of formulating hypotheses worth testing Exploratory engineering,...
scientific study, post hoc analysis (from Latin post hoc, "after this") consists of statistical analyses that were specified after the data were seen. They are...
factors. In Peirce's three modes of inference, exploratorydataanalysis corresponds to abduction. Meta-analysis is the technique research psychologists use...
summarisation techniques has been formulated under the heading of exploratorydataanalysis: an example of such a technique is the box plot. In the business...
interactive data visualization software Exploratorydataanalysis Machine learning Data profiling Data visualization FOSTER Open Science, Overview of Data Exploration...