The process of analyzing data to discover useful information and support decision-making
Part of a series on Statistics
Data and information visualization
Major dimensions
Exploratory data analysis
Information design
Interactive data visualization
Descriptive statistics
Inferential statistics
Statistical graphics
Plot
Data analysis
Infographic
Data science
Important figures
Tamara Munzner
Ben Shneiderman
John Tukey
Edward Tufte
Simon Wardley
Hans Rosling
David McCandless
Kim Albrecht
Alexander Osterwalder
Ed Hawkins
Hadley Wickham
Leland Wilkinson
Mike Bostock
Jeffrey Heer
Ihab Ilyas
Information graphic types
Line chart
Bar chart
Histogram
Scatter plot
Box plot
Pareto chart
Pie chart
Area chart
Tree map
Bubble chart
Stripe graphic
Control chart
Run chart
Stem-and-leaf display
Cartogram
Small multiple
Sparkline
Table
Marimekko chart
Related topics
Data
Information
Big data
Database
Chartjunk
Visual perception
Regression analysis
Statistical model
Misleading graph
v
t
e
Computational physics
Mechanics
Electromagnetics
Multiphysics
Particle physics
Thermodynamics
Simulation
Potentials
Morse/Long-range potential
Lennard-Jones potential
Yukawa potential
Morse potential
Fluid dynamics
Finite difference
Finite volume
Finite element
Boundary element
Lattice Boltzmann
Riemann solver
Dissipative particle dynamics
Smoothed particle hydrodynamics
Turbulence models
Monte Carlo methods
Integration
Gibbs sampling
Metropolis algorithm
Particle
N-body
Particle-in-cell
Molecular dynamics
Scientists
Godunov
Ulam
von Neumann
Galerkin
Lorenz
Wilson
Alder
Richtmyer
v
t
e
Data analysis is the process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.[1] Data analysis has multiple facets and approaches, encompassing diverse techniques under a variety of names, and is used in different business, science, and social science domains.[2] In today's business world, data analysis plays a role in making decisions more scientific and helping businesses operate more effectively.[3]
Data mining is a particular data analysis technique that focuses on statistical modeling and knowledge discovery for predictive rather than purely descriptive purposes, while business intelligence covers data analysis that relies heavily on aggregation, focusing mainly on business information.[4] In statistical applications, data analysis can be divided into descriptive statistics, exploratory data analysis (EDA), and confirmatory data analysis (CDA).[5] EDA focuses on discovering new features in the data while CDA focuses on confirming or falsifying existing hypotheses.[6][7] Predictive analytics focuses on the application of statistical models for predictive forecasting or classification, while text analytics applies statistical, linguistic, and structural techniques to extract and classify information from textual sources, a species of unstructured data. All of the above are varieties of data analysis.[8]
Data integration is a precursor to data analysis, and data analysis is closely linked to data visualization and data dissemination.[9]
^"Transforming Unstructured Data into Useful Information", Big Data, Mining, and Analytics, Auerbach Publications, pp. 227–246, 2014-03-12, doi:10.1201/b16666-14, ISBN 978-0-429-09529-0, retrieved 2021-05-29
^"The Multiple Facets of Correlation Functions", Data Analysis Techniques for Physical Scientists, Cambridge University Press, pp. 526–576, 2017, doi:10.1017/9781108241922.013, ISBN 978-1-108-41678-8, retrieved 2021-05-29
^Xia, B. S., & Gong, P. (2015). Review of business intelligence through data analysis. Benchmarking, 21(2), 300-311. doi:10.1108/BIJ-08-2012-0050
^Exploring Data Analysis
^"Data Coding and Exploratory Analysis (EDA) Rules for Data Coding Exploratory Data Analysis (EDA) Statistical Assumptions", SPSS for Intermediate Statistics, Routledge, pp. 42–67, 2004-08-16, doi:10.4324/9781410611420-6, ISBN 978-1-4106-1142-0, retrieved 2021-05-29
^Spie (2014-10-01). "New European ICT call focuses on PICs, lasers, data transfer". SPIE Professional. doi:10.1117/2.4201410.10. ISSN 1994-4403.
^Samandar, Petersson; Svantesson, Sofia (2017). Skapandet av förtroende inom eWOM : En studie av profilbildens effekt ur ett könsperspektiv. Högskolan i Gävle, Företagsekonomi. OCLC 1233454128.
^Goodnight, James (2011-01-13). "The forecast for predictive analytics: hot and getting hotter". Statistical Analysis and Data Mining: The ASA Data Science Journal. 4 (1): 9–10. doi:10.1002/sam.10106. ISSN 1932-1864. S2CID 38571193.
^Sherman, Rick (4 November 2014). Business intelligence guidebook: from data integration to analytics. Amsterdam. ISBN 978-0-12-411528-6. OCLC 894555128.{{cite book}}: CS1 maint: location missing publisher (link)
Dataanalysis is the process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions...
exploratory dataanalysis (EDA) is an approach of analyzing data sets to summarize their main characteristics, often using statistical graphics and other data visualization...
In applied mathematics, topological dataanalysis (TDA) is an approach to the analysis of datasets using techniques from topology. Extraction of information...
when used in small amounts only. Big dataanalysis challenges include capturing data, data storage, dataanalysis, search, sharing, transfer, visualization...
Functional dataanalysis (FDA) is a branch of statistics that analyses data providing information about curves, surfaces or anything else varying over...
observed data; how they can be used as part of statistical inference, particularly where several different quantities are of interest to the same analysis. Certain...
Forensic dataanalysis (FDA) is a branch of digital forensics. It examines structured data with regard to incidents of financial crime. The aim is to...
Geometric dataanalysis comprises geometric aspects of image analysis, pattern analysis, and shape analysis, and the approach of multivariate statistics...
Dark dataData (computer science) Data acquisition DataanalysisData bank Data cable Data curation Data domain Data element Data farming Data governance...
exploratory dataanalysis, and a common technique for statistical dataanalysis, used in many fields, including pattern recognition, image analysis, information...
Data envelopment analysis (DEA) is a nonparametric method in operations research and economics for the estimation of production frontiers. DEA has been...
DataAnalysis Expressions (DAX) is the native formula and query language for Microsoft PowerPivot, Power BI Desktop and SQL Server Analysis Services (SSAS)...
component analysis (PCA) is a linear dimensionality reduction technique with applications in exploratory dataanalysis, visualization and data preprocessing...
research. Secondary dataanalysis can save time that would otherwise be spent collecting data and, particularly in the case of quantitative data, can provide...
a discipline, a workflow, and a profession. Data science is "a concept to unify statistics, dataanalysis, informatics, and their related methods" to...
experiments. From the perspective of the scientist, data collection, dataanalysis, discussion of the data in the context of the research literature, and drawing...
interpreting patterns of meaning (or "themes") within qualitative data. Thematic analysis is often understood as a method or technique in contrast to most...
methods) from a data set and transforming the information into a comprehensible structure for further use. Data mining is the analysis step of the "knowledge...
statistics, oversampling and undersampling in dataanalysis are techniques used to adjust the class distribution of a data set (i.e. the ratio between the different...
a research strategy that focuses on quantifying the collection and analysis of data. It is formed from a deductive approach where emphasis is placed on...
series analysis comprises methods for analyzing time series data in order to extract meaningful statistics and other characteristics of the data. Time...
survival analysis involves the modelling of time to event data; in this context, death or failure is considered an "event" in the survival analysis literature...
quality Path quality analysis Fourier analysis In statistics, the term analysis may refer to any method used for dataanalysis. Among the many such methods...
spatial analysis is geospatial analysis, the technique applied to structures at the human scale, most notably in the analysis of geographic data. It may...
The LTPP International DataAnalysis Contest or the LTPP DataAnalysis Contest is an annual international dataanalysis contest held by the American Society...
In statistics, combinatorial dataanalysis (CDA) is the study of data sets where the order in which objects are arranged is important. CDA can be used...