Global Information Lookup Global Information

Data stream mining information


Data Stream Mining (also known as stream learning) is the process of extracting knowledge structures from continuous, rapid data records. A data stream is an ordered sequence of instances that in many applications of data stream mining can be read only once or a small number of times using limited computing and storage capabilities.[1]

In many data stream mining applications, the goal is to predict the class or value of new instances in the data stream given some knowledge about the class membership or values of previous instances in the data stream.[2] Machine learning techniques can be used to learn this prediction task from labeled examples in an automated fashion. Often, concepts from the field of incremental learning are applied to cope with structural changes, on-line learning and real-time demands. In many applications, especially operating within non-stationary environments, the distribution underlying the instances or the rules underlying their labeling may change over time, i.e. the goal of the prediction, the class to be predicted or the target value to be predicted, may change over time.[3] This problem is referred to as concept drift. Detecting concept drift is a central issue to data stream mining.[4][5] Other challenges[6] that arise when applying machine learning to streaming data include: partially and delayed labeled data,[7][8] recovery from concept drifts,[1] and temporal dependencies.[9]

Examples of data streams include computer network traffic, phone conversations, ATM transactions, web searches, and sensor data. Data stream mining can be considered a subfield of data mining, machine learning, and knowledge discovery.

  1. ^ a b Gomes, Heitor M.; Bifet, Albert; Read, Jesse; Barddal, Jean Paul; Enembreck, Fabrício; Pfharinger, Bernhard; Holmes, Geoff; Abdessalem, Talel (2017-10-01). "Adaptive random forests for evolving data stream classification". Machine Learning. 106 (9): 1469–1495. doi:10.1007/s10994-017-5642-8. hdl:10289/11231. ISSN 1573-0565.
  2. ^ Medhat, Mohamed; Zaslavsky; Krishnaswamy (2005-06-01). "Mining data streams". ACM SIGMOD Record. 34 (2): 18–26. doi:10.1145/1083784.1083789. S2CID 705946.
  3. ^ Lemaire, Vincent; Salperwyck, Christophe; Bondu, Alexis (2015), Zimányi, Esteban; Kutsche, Ralf-Detlef (eds.), "A Survey on Supervised Classification on Data Streams", Business Intelligence: 4th European Summer School, eBISS 2014, Berlin, Germany, July 6–11, 2014, Tutorial Lectures, Lecture Notes in Business Information Processing, Springer International Publishing, pp. 88–125, doi:10.1007/978-3-319-17551-5_4, ISBN 978-3-319-17551-5
  4. ^ Webb, Geoffrey I.; Lee, Loong Kuan; Petitjean, François; Goethals, Bart (2017-04-02). "Understanding Concept Drift". arXiv:1704.00362 [cs.LG].
  5. ^ Gama, João; Žliobaitė; Bifet; Pechenizkiy; Bouchachia (2014-03-01). "A survey on concept drift adaptation" (PDF). ACM Computing Surveys. 46 (4): 1–37. doi:10.1145/2523813. S2CID 207208264.
  6. ^ Gomes, Heitor Murilo; Read; Bifet; Barddal; Gama (2019-11-26). "Machine learning for streaming data". ACM SIGKDD Explorations Newsletter. 21 (2): 6–22. doi:10.1145/3373464.3373470. S2CID 208607941.
  7. ^ Gomes, Heitor Murilo; Grzenda, Maciej; Mello, Rodrigo; Read, Jesse; Le Nguyen, Minh Huong; Bifet, Albert (2022-02-28). "A Survey on Semi-Supervised Learning for Delayed Partially Labelled Data Streams". ACM Computing Surveys. 55 (4): 1–42. arXiv:2106.09170. doi:10.1145/3523055. ISSN 0360-0300.
  8. ^ Grzenda, Maciej; Gomes, Heitor Murilo; Bifet, Albert (2019-11-16). "Delayed labelling evaluation for data streams". Data Mining and Knowledge Discovery. 34 (5): 1237–1266. doi:10.1007/s10618-019-00654-y. ISSN 1573-756X.
  9. ^ Žliobaitė, Indrė; Bifet, Albert; Read, Jesse; Pfahringer, Bernhard; Holmes, Geoff (2015-03-01). "Evaluation methods and decision theory for classification of streaming data with temporal dependence". Machine Learning. 98 (3): 455–482. doi:10.1007/s10994-014-5441-4. hdl:10289/8954. ISSN 1573-0565.

and 25 Related for: Data stream mining information

Request time (Page generated in 0.8363 seconds.)

Data stream mining

Last Update:

Data Stream Mining (also known as stream learning) is the process of extracting knowledge structures from continuous, rapid data records. A data stream...

Word Count : 1363

Data mining

Last Update:

Data mining is the process of extracting and discovering patterns in large data sets involving methods at the intersection of machine learning, statistics...

Word Count : 5009

Stream processing

Last Update:

In computer science, stream processing (also known as event stream processing, data stream processing, or distributed stream processing) is a programming...

Word Count : 4575

Concept drift

Last Update:

retraining, also known as refreshing, of any model is necessary. Data stream mining Data mining Persuasions of the Witch's Craft Snyk, a company whose portfolio...

Word Count : 2927

Streaming algorithm

Last Update:

In computer science, streaming algorithms are algorithms for processing data streams in which the input is presented as a sequence of items and can be...

Word Count : 3578

Process mining

Last Update:

Process mining is a family of techniques used to analyze event data in order to understand and improve operational processes. Part of the fields of data science...

Word Count : 2702

Examples of data mining

Last Update:

Data mining, the process of discovering patterns in large data sets, has been used in many applications. In business, data mining is the analysis of historical...

Word Count : 4817

Massive Online Analysis

Last Update:

Analysis (MOA) is a free open-source software project specific for data stream mining with concept drift. It is written in Java and developed at the University...

Word Count : 807

Evolving classification function

Last Update:

typically employed for data stream mining tasks in dynamic and changing environments. Supervised Classification on Data Streams Evolving fuzzy rule-based...

Word Count : 209

Smart meter

Last Update:

Advanced Metering Infrastructure Using Intrusion Detection System with Data Stream Mining" (PDF). Archived from the original (PDF) on 2016-09-10. Retrieved...

Word Count : 7646

Symmetric hash join

Last Update:

other inputs. If so, output the records. Data stream management system Data stream mining "Issues in Data Stream Management" (PDF). "University of Waterloo...

Word Count : 73

Knowledge extraction

Last Update:

Graphs Molecule mining Sequences Data stream mining Learning from time-varying data streams under concept drift Web Data model Metadata Metamodels Ontology...

Word Count : 4398

Data

Last Update:

governance Data integrity Data maintenance Data management Data mining Data modeling Data point Data preservation Data protection Data publication Data remanence...

Word Count : 2522

Mountaintop removal mining

Last Update:

Mountaintop removal mining (MTR), also known as mountaintop mining (MTM), is a form of surface mining at the summit or summit ridge of a mountain. Coal...

Word Count : 8440

Gold mining

Last Update:

Gold mining is the extraction of gold by mining. Historically, mining gold from alluvial deposits used manual separation processes, such as gold panning...

Word Count : 10228

Glossary of artificial intelligence

Last Update:

machine learning and artificial intelligence, typically employed for data stream mining tasks in dynamic and changing environments. existential risk The hypothesis...

Word Count : 27506

Mining

Last Update:

Mining is the extraction of valuable geological materials and minerals from the surface of the Earth. Mining is required to obtain most materials that...

Word Count : 12941

Task Force on Process Mining

Last Update:

session on Process Mining. Process mining is a type of research that is a mix of computational intelligence and data mining, as well as process modeling and...

Word Count : 624

IEEE 1849

Last Update:

event data (e.g., for process mining)". In 2023, the standard has been revised in and superseded by the IEEE Standard 1849-2023. Process mining aims to...

Word Count : 394

Auroop Ratan Ganguly

Last Update:

Policy Sustainability and Data Sciences Laboratory, Northeastern University risQ Climate as complex networks Data stream mining Nonlinear Processes in Geophysics:...

Word Count : 2272

Special Interest Group on Knowledge Discovery and Data Mining

Last Update:

Discovery and Data Mining, hosts an influential annual conference. The KDD Conference grew from KDD (Knowledge Discovery and Data Mining) workshops at...

Word Count : 1637

Data extraction

Last Update:

connector (e.g. USB) through which 'raw data' can be streamed into a personal computer. Typical unstructured data sources include web pages, emails, documents...

Word Count : 390

Coal mining

Last Update:

Coal mining is the process of extracting coal from the ground or from a mine. Coal is valued for its energy content and since the 1880s has been widely...

Word Count : 11774

Surface mining

Last Update:

Surface mining, including strip mining, open-pit mining and mountaintop removal mining, is a broad category of mining in which soil and rock overlying...

Word Count : 3206

Cryptocurrency

Last Update:

offerings and shut down mining. Many Chinese miners have since relocated to Canada and Texas. One company is operating data centers for mining operations at Canadian...

Word Count : 19343

PDF Search Engine © AllGlobal.net