Inverted index


In computer science, an inverted index (also referred to as a postings list, postings file, or inverted file) is a database index that stores a mapping from content, such as words or numbers, to its locations in a table, a document, or a set of documents. It is named in contrast to a forward index, which maps from documents to content.[1] The purpose of an inverted index is to allow fast full-text searches, at the cost of increased processing when a document is added to the database.[2] The inverted file may be the database file itself, rather than its index. It is the most popular data structure used in document retrieval systems,[3] and is used on a large scale in, for example, search engines. Additionally, several significant general-purpose mainframe-based database management systems have used inverted list architectures, including ADABAS, DATACOM/DB, and Model 204.
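The contrast between a forward index and an inverted index can be illustrated with a minimal Python sketch. The sample documents, the whitespace tokenizer, and the function names (`build_forward_index`, `invert`, `search_all`) are assumptions made for illustration; production systems typically add normalization, compression, and on-disk storage.

```python
from collections import defaultdict

# Hypothetical sample documents, keyed by document ID.
documents = {
    1: "it is what it is",
    2: "what is it",
    3: "it is a banana",
}

def build_forward_index(docs):
    """Map each document ID to the list of words it contains."""
    return {doc_id: text.lower().split() for doc_id, text in docs.items()}

def invert(forward_index):
    """Map each word to the set of document IDs that contain it."""
    inverted = defaultdict(set)
    for doc_id, words in forward_index.items():
        for word in words:
            inverted[word].add(doc_id)
    return dict(inverted)

def search_all(inverted, query):
    """Return the IDs of documents containing every word in the query."""
    postings = [inverted.get(word, set()) for word in query.lower().split()]
    if not postings:
        return set()
    return set.intersection(*postings)

forward_index = build_forward_index(documents)
inverted_index = invert(forward_index)
```

Here the cost of adding a document shows up in `invert`, which must touch one posting set per word, while a query such as `search_all(inverted_index, "what is it")` only intersects a few precomputed sets instead of scanning every document.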

There are two main variants of inverted indexes. A record-level inverted index (or inverted file index, or just inverted file) contains, for each word, a list of references to the documents that contain it. A word-level inverted index (or full inverted index, or inverted list) additionally contains the positions of each word within a document.[4] The latter form offers more functionality (such as phrase searches), but requires more processing power and storage space to build.
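The difference between the two variants can be sketched in Python as follows. This is an illustrative sketch rather than a reference implementation: the sample documents, the whitespace tokenizer, and the names `build_positional_index`, `record_level_view`, and `phrase_search` are all assumptions.

```python
from collections import defaultdict

def build_positional_index(docs):
    """Build a word-level index: word -> {doc_id: [positions]}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for position, word in enumerate(text.lower().split()):
            index[word].setdefault(doc_id, []).append(position)
    return dict(index)

def record_level_view(positional_index):
    """Drop the positions, leaving a record-level index: word -> doc IDs."""
    return {word: set(postings) for word, postings in positional_index.items()}

def phrase_search(positional_index, phrase):
    """Return IDs of documents where the phrase's words appear consecutively."""
    words = phrase.lower().split()
    if not words or any(w not in positional_index for w in words):
        return set()
    matches = set()
    for doc_id, starts in positional_index[words[0]].items():
        for start in starts:
            # Every following word must sit exactly `offset` positions later.
            if all(start + offset in positional_index[w].get(doc_id, ())
                   for offset, w in enumerate(words)):
                matches.add(doc_id)
                break
    return matches

# Hypothetical sample documents, keyed by document ID.
sample_docs = {
    1: "it is what it is",
    2: "what is it",
    3: "it is a banana",
}
word_level = build_positional_index(sample_docs)
record_level = record_level_view(word_level)
```

With these sample documents, `phrase_search(word_level, "what is it")` matches only document 2. The record-level view cannot make that distinction: documents 1 and 2 both contain all three words, and without positions there is no way to tell whether they are adjacent. The extra position lists are the additional space the word-level form pays for that capability.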

  1. Knuth, D. E. (1997) [1973]. "6.5. Retrieval on Secondary Keys". The Art of Computer Programming (Third ed.). Reading, Massachusetts: Addison-Wesley. ISBN 0-201-89685-0.
  2. Salton, Gerard; Fox, Edward A.; Wu, Harry (November 1983). "Extended Boolean information retrieval". Communications of the ACM. 26 (11): 1022–1036. doi:10.1145/182.358466. hdl:1813/6351.
  3. Zobel, Justin; Moffat, Alistair; Ramamohanarao, Kotagiri (December 1998). "Inverted files versus signature files for text indexing". ACM Transactions on Database Systems. 23 (4). New York: Association for Computing Machinery: 453–490. doi:10.1145/296854.277632. S2CID 7293918.
  4. Baeza-Yates, Ricardo; Ribeiro-Neto, Berthier (1999). Modern information retrieval. Reading, Massachusetts: Addison-Wesley Longman. p. 192. ISBN 0-201-39829-X.
