Global Information Lookup Global Information

Data mining information


Data mining is the process of extracting and discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems.[1] Data mining is an interdisciplinary subfield of computer science and statistics with an overall goal of extracting information (with intelligent methods) from a data set and transforming the information into a comprehensible structure for further use.[1][2][3][4] Data mining is the analysis step of the "knowledge discovery in databases" process, or KDD.[5] Aside from the raw analysis step, it also involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating.[1]

The term "data mining" is a misnomer because the goal is the extraction of patterns and knowledge from large amounts of data, not the extraction (mining) of data itself.[6] It also is a buzzword[7] and is frequently applied to any form of large-scale data or information processing (collection, extraction, warehousing, analysis, and statistics) as well as any application of computer decision support system, including artificial intelligence (e.g., machine learning) and business intelligence. Often the more general terms (large scale) data analysis and analytics—or, when referring to actual methods, artificial intelligence and machine learning—are more appropriate.

The actual data mining task is the semi-automatic or automatic analysis of large quantities of data to extract previously unknown, interesting patterns such as groups of data records (cluster analysis), unusual records (anomaly detection), and dependencies (association rule mining, sequential pattern mining). This usually involves using database techniques such as spatial indices. These patterns can then be seen as a kind of summary of the input data, and may be used in further analysis or, for example, in machine learning and predictive analytics. For example, the data mining step might identify multiple groups in the data, which can then be used to obtain more accurate prediction results by a decision support system. Neither the data collection, data preparation, nor result interpretation and reporting is part of the data mining step, although they do belong to the overall KDD process as additional steps.

The difference between data analysis and data mining is that data analysis is used to test models and hypotheses on the dataset, e.g., analyzing the effectiveness of a marketing campaign, regardless of the amount of data. In contrast, data mining uses machine learning and statistical models to uncover clandestine or hidden patterns in a large volume of data.[8]

The related terms data dredging, data fishing, and data snooping refer to the use of data mining methods to sample parts of a larger population data set that are (or may be) too small for reliable statistical inferences to be made about the validity of any patterns discovered. These methods can, however, be used in creating new hypotheses to test against the larger data populations.

  1. ^ a b c "Data Mining Curriculum". ACM SIGKDD. 2006-04-30. Archived from the original on 2013-10-14. Retrieved 2014-01-27.
  2. ^ Clifton, Christopher (2010). "Encyclopædia Britannica: Definition of Data Mining". Archived from the original on 2011-02-05. Retrieved 2010-12-09.
  3. ^ Hastie, Trevor; Tibshirani, Robert; Friedman, Jerome (2009). "The Elements of Statistical Learning: Data Mining, Inference, and Prediction". Archived from the original on 2009-11-10. Retrieved 2012-08-07.
  4. ^ Han, Jaiwei; Kamber, Micheline; Pei, Jian (2011). Data Mining: Concepts and Techniques (3rd ed.). Morgan Kaufmann. ISBN 978-0-12-381479-1.
  5. ^ Cite error: The named reference Fayyad was invoked but never defined (see the help page).
  6. ^ Han, Jiawei; Kamber, Micheline (2001). Data mining: concepts and techniques. Morgan Kaufmann. p. 5. ISBN 978-1-55860-489-6. Thus, data mining should have been more appropriately named "knowledge mining from data," which is unfortunately somewhat long
  7. ^ OKAIRP 2005 Fall Conference, Arizona State University Archived 2014-02-01 at the Wayback Machine
  8. ^ Olson, D. L. (2007). Data mining in business services. Service Business, 1(3), 181–193. doi:10.1007/s11628-006-0014-7

and 26 Related for: Data mining information

Request time (Page generated in 0.8364 seconds.)

Data mining

Last Update:

Data mining is the process of extracting and discovering patterns in large data sets involving methods at the intersection of machine learning, statistics...

Word Count : 5009

Examples of data mining

Last Update:

Data mining, the process of discovering patterns in large data sets, has been used in many applications. In business, data mining is the analysis of historical...

Word Count : 4817

Educational data mining

Last Update:

Educational data mining (EDM) is a research field concerned with the application of data mining, machine learning and statistics to information generated...

Word Count : 3425

Java Data Mining

Last Update:

Data Mining (JDM) is a standard Java API for developing data mining applications and tools. JDM defines an object model and Java API for data mining objects...

Word Count : 152

Data Mining Extensions

Last Update:

Data Mining Extensions (DMX) is a query language for data mining models supported by Microsoft's SQL Server Analysis Services product. Like SQL, it supports...

Word Count : 309

Text mining

Last Update:

Text mining, text data mining (TDM) or text analytics is the process of deriving high-quality information from text. It involves "the discovery by computer...

Word Count : 4493

Data stream mining

Last Update:

Data Stream Mining (also known as stream learning) is the process of extracting knowledge structures from continuous, rapid data records. A data stream...

Word Count : 1363

Relational data mining

Last Update:

Relational data mining is the data mining technique for relational databases. Unlike traditional data mining algorithms, which look for patterns in a...

Word Count : 297

Oracle Data Mining

Last Update:

Oracle Data Mining (ODM) is an option of Oracle Database Enterprise Edition. It contains several data mining and data analysis algorithms for classification...

Word Count : 1874

Evolutionary data mining

Last Update:

Evolutionary data mining, or genetic data mining is an umbrella term for any data mining using evolutionary algorithms. While it can be used for mining data from...

Word Count : 525

Data science

Last Update:

included "knowledge discovery" and "data mining". In 2012, technologists Thomas H. Davenport and DJ Patil declared "Data Scientist: The Sexiest Job of the...

Word Count : 2827

Process mining

Last Update:

Process mining is a family of techniques used to analyze event data in order to understand and improve operational processes. Part of the fields of data science...

Word Count : 2670

Data analysis

Last Update:

world, data analysis plays a role in making decisions more scientific and helping businesses operate more effectively. Data mining is a particular data analysis...

Word Count : 9552

Data wrangling

Last Update:

that data mining does not use it, there are many use cases for data wrangling in data mining. Data wrangling can benefit data mining by removing data that...

Word Count : 1808

Data preprocessing

Last Update:

step in the data mining process. Data collection methods are often loosely controlled, resulting in out-of-range values, impossible data combinations, and...

Word Count : 1755

Data sanitization

Last Update:

Preserving Data Mining (PPDM) is the process of data mining while maintaining privacy of sensitive material. Data mining involves analyzing large datasets to gain...

Word Count : 5292

Social media mining

Last Update:

Social media mining is the process of obtaining big data from user-generated content on social media sites and mobile apps in order to extract actionable...

Word Count : 4200

Cyborg data mining

Last Update:

Cyborg data mining is the practice of collecting data produced by an implantable device that monitors bodily processes for commercial interests. As an...

Word Count : 2096

Data

Last Update:

governance Data integrity Data maintenance Data management Data mining Data modeling Data point Data preservation Data protection Data publication Data remanence...

Word Count : 2522

Data set

Last Update:

authors. The Bupa liver data – Used in several papers in the machine learning (data mining) literature. Anscombe's quartet – Small data set illustrating the...

Word Count : 893

Domain driven data mining

Last Update:

Domain driven data mining is a data mining methodology for discovering actionable knowledge and deliver actionable insights from complex data and behaviors...

Word Count : 706

Data mining in agriculture

Last Update:

Data mining in agriculture is a research topic consisting of the application of data mining and data science techniques to agriculture. Recent technologies...

Word Count : 1539

Machine learning

Last Update:

(mathematical programming) methods. Data mining is a related (parallel) field of study, focusing on exploratory data analysis (EDA) through unsupervised...

Word Count : 14304

Data integration

Last Update:

coherent data store that provides synchronous data across a network of files for clients. A common use of data integration is in data mining when analyzing...

Word Count : 3745

National Center for Data Mining

Last Update:

The National Center for Data Mining (NCDM) is a center of the University of Illinois at Chicago (UIC), established in 1998 to serve as a resource for...

Word Count : 99

Pentaho

Last Update:

intelligence (BI) software that provides data integration, OLAP services, reporting, information dashboards, data mining and extract, transform, load (ETL)...

Word Count : 979

PDF Search Engine © AllGlobal.net