Global Information Lookup Global Information

SMART Information Retrieval System information


The SMART (System for the Mechanical Analysis and Retrieval of Text) Information Retrieval System is an information retrieval system developed at Cornell University in the 1960s.[1] Many important concepts in information retrieval were developed as part of research on the SMART system, including the vector space model, relevance feedback, and Rocchio classification.

Gerard Salton led the group that developed SMART. Other contributors included Mike Lesk.

The SMART system also provides a set of corpora, queries and reference rankings, taken from different subjects, notably

  • ADI: publications from information science reviews
  • Computer science
  • Cranfield collection: publications from aeronautic reviews
  • Forensic science: library science
  • MEDLARS collection: publications from medical reviews
  • Time magazine collection: archives of the generalist review Time in 1963

To the legacy of the SMART system belongs the so-called SMART triple notation, a mnemonic scheme for denoting tf-idf weighting variants in the vector space model. The mnemonic for representing a combination of weights takes the form ddd.qqq, where the first three letters represents the term weighting of the collection document vector and the second three letters represents the term weighting for the query document vector. For example, ltc.lnn represents the ltc weighting applied to a collection document and the lnn weighting applied to a query document.

The following tables establish the SMART notation:[2]

Symbols and notation
represents a document vector, where is the weight of the term in and is the number of unique terms in . Positive features characterize terms that are present in a document, and the weight of zero is used for terms that are absent from a document.
Occurrence frequency of term in document Number of unique terms in document
Number of collection documents Average number of unique terms in a document
Number of documents with term present Number of characters in document
Occurrence frequency of the most common term in document Average number of characters in a document
Average occurrence frequency of a term in document Global collection statistics
The slope in the context of pivoted document length normalization[3]
Smart term-weighting triple notation
Term frequency Document frequency Document length normalization
b Binary weight x n Disregards the collection frequency x n No document length normalization
t n Raw term frequency f Inverse collection frequency c Cosine normalization
a Augmented normalized term frequency t Inverse collection frequency u Pivoted unique normalization[3]
l Logarithm p Probabilistic inverse collection frequency b Pivoted characted length normalization[3]
L Average-term-frequency-based normalization[3]
d Double logarithm

The gray letters in the first, fifth, and ninth columns are the scheme used by Salton and Buckley in their 1988 paper.[4] The bold letters in the second, sixth, and tenth columns are the scheme used in experiments reported thereafter.

  1. ^ Salton, G, Lesk, M.E. (June 1965). "The SMART automatic document retrieval systems—an illustration". Communications of the ACM. 8 (6): 391–398. doi:10.1145/364955.364990.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  2. ^ Palchowdhury, Sauparna (2016). "On The Provenance of tf-idf". sauparna.sdf.org. Retrieved 2019-07-29.
  3. ^ a b c d Singhal, A., Buckley, C., & Mitra, M. (1996). Pivoted Document Length Normalization. SIGIR Forum, 51, 176-184.
  4. ^ Salton, G., & Buckley, C. (1988). Term-Weighting Approaches in Automatic Text Retrieval. Inf. Process. Manage., 24, 513-523.

and 26 Related for: SMART Information Retrieval System information

Request time (Page generated in 0.8512 seconds.)

SMART Information Retrieval System

Last Update:

The SMART (System for the Mechanical Analysis and Retrieval of Text) Information Retrieval System is an information retrieval system developed at Cornell...

Word Count : 359

Information retrieval

Last Update:

Information retrieval (IR) in computing and information science is the task of identifying and retrieving information system resources that are relevant...

Word Count : 3387

Smart

Last Update:

Technology (S.M.A.R.T.), a standard used in computer storage devices SMART Information Retrieval System, an information retrieval system developed at...

Word Count : 573

Vector space model

Last Update:

used in information filtering, information retrieval, indexing and relevancy rankings. Its first use was in the SMART Information Retrieval System[citation...

Word Count : 1390

Rocchio algorithm

Last Update:

in information retrieval systems which stemmed from the SMART Information Retrieval System developed between 1960 and 1964. Like many other retrieval systems...

Word Count : 795

Pick operating system

Last Update:

and retrieval efficiency for specific kinds of datasets. Pick was originally implemented as the Generalized Information Retrieval Language System (GIRLS)...

Word Count : 3386

Gerard Salton

Last Update:

of information retrieval during his time, and "the father of Information Retrieval". His group at Cornell developed the SMART Information Retrieval System...

Word Count : 838

Management information system

Last Update:

MIS systems: Retrieval and dissemination are dependent on technology hardware and software. Potential for inaccurate information. Enterprise systems—also...

Word Count : 1889

Automated storage and retrieval system

Last Update:

An automated storage and retrieval system (ASRS or AS/RS) consists of a variety of computer-controlled systems for automatically placing and retrieving...

Word Count : 3618

List of search engine software

Last Update:

Openbook OpenSearchServer Pubget Q-go Quixey Sci-Hub SinglePoint SMART Information Retrieval System Sparrho Sphinx Svensk mediedatabas Swiftype Thunderstone Software...

Word Count : 116

National Center for Biotechnology Information

Last Update:

different sources, databases, and formats into a uniform information model and retrieval system which can efficiently retrieve that relevant references...

Word Count : 1236

Mike Lesk

Last Update:

Michael Lesk worked for the SMART Information Retrieval System project, wrote much of its retrieval code and did many of the retrieval experiments, as well as...

Word Count : 455

Cognitive city

Last Update:

cities and smart cities in the fact that it is steadily learning through constant interaction with its citizens through advanced information and communications...

Word Count : 1504

Query language

Last Update:

to factual questions, while an information retrieval query language attempts to find documents containing information that is relevant to an area of inquiry...

Word Count : 928

Recommender system

Last Update:

roots in information retrieval and information filtering research. To create a user profile, the system mostly focuses on two types of information: A model...

Word Count : 9789

Relevance feedback

Last Update:

Relevance feedback is a feature of some information retrieval systems. The idea behind relevance feedback is to take the results that are initially returned...

Word Count : 1130

Temporal information retrieval

Last Update:

Temporal information retrieval (T-IR) is an emerging area of research related to the field of information retrieval (IR) and a considerable number of sub-areas...

Word Count : 281

Database

Last Update:

that allow entry, storage and retrieval of large quantities of information and provides ways to manage how that information is organized. Because of the...

Word Count : 9539

Decision support system

Last Update:

A decision support system (DSS) is an information system that supports business or organizational decision-making activities. DSSs serve the management...

Word Count : 3290

Learning to rank

Last Update:

reinforcement learning, in the construction of ranking models for information retrieval systems. Training data may, for example, consist of lists of items with...

Word Count : 3789

Query expansion

Last Update:

Relevance feedback in information retrieval. In The SMART Retrieval System, p. 313-323. 1971. C. Buckley. Automatic query expansion using SMART: TREC 3. In Proceedings...

Word Count : 1416

Visual Word

Last Update:

Visual words, as used in image retrieval systems, refer to small parts of an image that carry some kind of information related to the features (such as...

Word Count : 837

Semantic Scholar

Last Update:

and information retrieval. Semantic Scholar began as a database for the topics of computer science, geoscience, and neuroscience. In 2017, the system began...

Word Count : 1341

M270 Multiple Launch Rocket System

Last Update:

Rocket System/Guided Multiple Launch Rocket System Alternative Warhead (GMLRS/GMLRS AW)" (PDF). Defense Acquisition Management Information Retrieval. p. 15...

Word Count : 8294

List of CBIR engines

Last Update:

This is a list of publicly available content-based image retrieval (CBIR) engines. These image search engines look at the content (pixels) of images in...

Word Count : 34

Query understanding

Last Update:

normalisation by and large did not help retrieval performance. Once the attention of the information retrieval field moved to languages other than English...

Word Count : 964

PDF Search Engine © AllGlobal.net