"Scientific data" redirects here. Not to be confused with Scientific Data (journal).
For data in computer science, see Data (computer science). For other uses, see Data (disambiguation) and Datum (disambiguation).
In common usage, data (/ˈdeɪtə/, also US: /ˈdætə/) is a collection of discrete or continuous values that convey information, describing quantities, qualities, facts, statistics, or other basic units of meaning, or simply sequences of symbols that may be further interpreted formally. A datum is an individual value in a collection of data. Data is usually organized into structures such as tables that provide additional context and meaning, and which may themselves be used as data in larger structures. Data may be used as variables in a computational process.[1][2] Data may represent abstract ideas or concrete measurements.[3]
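The relationship between a datum, a table, and a larger structure can be sketched in a few lines of Python. This is only an illustration: the field names (`year`, `index`), the container name `collection`, and all numeric values are invented for the example and do not come from any real data set.

```python
# A table: each row gives a value its context (which year, which measure).
# All values here are invented for illustration.
table = [
    {"year": 2020, "index": 100.0},
    {"year": 2021, "index": 104.5},
]

# A datum is one individual value in the collection.
datum = table[1]["index"]

# The table can itself serve as data inside a larger structure.
collection = {"price_indices": table}

# Data used as variables in a computational process:
# the change in the index between the two years.
change = table[1]["index"] - table[0]["index"]

print(datum, change)
```

On its own the value 104.5 says little; the row and column it sits in supply the context that makes it interpretable.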
Data is commonly used in scientific research, in economics, and in virtually every other form of human organizational activity. Examples of data sets include price indices (such as the consumer price index), unemployment rates, literacy rates, and census data. In this context, data represents the raw facts and figures from which useful information can be extracted.
Data is collected using techniques such as measurement, observation, query, or analysis, and is typically represented as numbers or characters that may be further processed. Field data is data collected in an uncontrolled, in-situ environment. Experimental data is data generated in the course of a controlled scientific experiment. Data is analyzed using techniques such as calculation, reasoning, discussion, presentation, visualization, or other forms of post-analysis. Prior to analysis, raw data (or unprocessed data) is typically cleaned: outliers are removed, and obvious instrument or data entry errors are corrected.
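As a minimal sketch of the cleaning step, the Python snippet below drops entries that cannot be read as numbers and then removes outliers. The text does not say how outliers are identified; the 1.5 × IQR rule used here is one common convention and is an assumption of this example, as are the function name `clean_readings` and the sample readings.

```python
from statistics import quantiles

def clean_readings(raw):
    """Drop unparseable entries, then remove outliers (1.5 * IQR rule).

    The IQR rule is an assumed choice; other criteria are equally valid.
    """
    # Step 1: discard obvious data entry errors (non-numeric entries).
    values = []
    for item in raw:
        try:
            values.append(float(item))
        except (TypeError, ValueError):
            continue  # entry cannot be interpreted as a number

    # Step 2: compute the first and third quartiles and the IQR.
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr

    # Step 3: keep only values inside the accepted range.
    return [v for v in values if low <= v <= high]

# Hypothetical raw readings: one entry error ("n/a") and one outlier.
raw = ["20.1", "19.8", "n/a", "20.4", "20.0", "250.0", "19.9"]
cleaned = clean_readings(raw)
print(cleaned)
```

In practice, whether a value counts as an outlier or as a genuine extreme observation depends on the domain, so a rule like this is usually a starting point, not a final decision.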
Data can be seen as the smallest units of factual information that can be used as a basis for calculation, reasoning, or discussion. Data can range from abstract ideas to concrete measurements, including, but not limited to, statistics. Thematically connected data presented in some relevant context can be viewed as information. Contextually connected pieces of information can then be described as data insights or intelligence. The stock of insights and intelligence that accumulates over time from the synthesis of data into information can then be described as knowledge. Data has been described as "the new oil of the digital economy".[4][5] Data, as a general concept, refers to the fact that some existing information or knowledge is represented or coded in a form suitable for better usage or processing.
Advances in computing technologies have led to the advent of big data, which usually refers to very large quantities of data, typically at the petabyte scale. Working with such large (and growing) datasets using traditional data analysis methods and computing is difficult, and sometimes impossible. (Theoretically speaking, infinite data would yield infinite information, which would render extracting insights or intelligence impossible.) In response, the relatively new field of data science uses machine learning and other artificial intelligence (AI) methods that allow analytic methods to be applied efficiently to big data.
[1] OECD Glossary of Statistical Terms. OECD. 2008. p. 119. ISBN 978-92-64-025561.
[2] "Statistical Language - What are Data?". Australian Bureau of Statistics. 2013-07-13. Archived from the original on 2019-04-19. Retrieved 2020-03-09.
[3] "Data vs Information - Difference and Comparison | Diffen". www.diffen.com. Retrieved 2018-12-11.
[4] Yonego, Joris Toonders (2014-07-23). "Data Is the New Oil of the Digital Economy". Wired – via www.wired.com.
[5] "Data is the new oil". 2018-07-16. Archived from the original on 2018-07-16.