A database in which data is stored across different physical locations.
This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages)
This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed. Find sources: "Distributed database" – news · newspapers · books · scholar · JSTOR(August 2010) (Learn how and when to remove this message)
This article includes a list of general references, but it lacks sufficient corresponding inline citations. Please help to improve this article by introducing more precise citations.(April 2013) (Learn how and when to remove this message)
(Learn how and when to remove this message)
A distributed database is a database in which data is stored across different physical locations.[1] It may be stored in multiple computers located in the same physical location (e.g. a data centre); or maybe dispersed over a network of interconnected computers. Unlike parallel systems, in which the processors are tightly coupled and constitute a single database system, a distributed database system consists of loosely coupled sites that share no physical components.
System administrators can distribute collections of data (e.g. in a database) across multiple physical locations. A distributed database can reside on organised network servers or decentralised independent computers on the Internet, on corporate intranets or extranets, or on other organisation networks. Because distributed databases store data across multiple computers, distributed databases may improve performance at end-user worksites by allowing transactions to be processed on many machines, instead of being limited to one.[2]
Two processes ensure that the distributed databases remain up-to-date and current: replication[3] and duplication.
Replication involves using specialized software that looks for changes in the distributive database. Once the changes have been identified, the replication process makes all the databases look the same. The replication process can be complex and time-consuming, depending on the size and number of the distributed databases. This process can also require much time and computer resources.
Duplication, on the other hand, has less complexity. It identifies one database as a master and then duplicates that database. The duplication process is normally done at a set time after hours. This is to ensure that each distributed location has the same data. In the duplication process, users may change only the master database. This ensures that local data will not be overwritten.
Both replication and duplication can keep the data current in all distributive locations.[2]
Besides distributed database replication and fragmentation, there are many other distributed database design technologies. For example, local autonomy, synchronous, and asynchronous distributed database technologies. The implementation of these technologies can and do depend on the needs of the business and the sensitivity/confidentiality of the data stored in the database and the price the business is willing to spend on ensuring data security, consistency and integrity.
When discussing access to distributed databases, Microsoft favors the term distributed query, which it defines in protocol-specific manner as "[a]ny SELECT, INSERT, UPDATE, or DELETE statement that references tables and rowsets from one or more external OLE DB data sources".[4]
Oracle provides a more language-centric view in which distributed queries and distributed transactions form part of distributed SQL.[5]
^ ab
O'Brien, J. & Marakas, G.M.(2008) Management Information Systems (pp. 185-189). New York, NY: McGraw-Hill Irwin
^Ozsu, M.T.; Valduriez, P. (1991). "Distributed database systems: where are we now?". Computer. 24 (8): 68–78. doi:10.1109/2.84879. ISSN 1558-0814. S2CID 5898169.
^
"TechNet Glossary". Microsoft. 28 January 2010. Retrieved 2013-07-16. distributed query[:] Any SELECT, INSERT, UPDATE, or DELETE statement that references tables and rowsets from one or more external OLE DB data sources.
^
Ashdown, Lance; Kyte, Tom (September 2011). "Oracle Database Concepts, 11g Release 2 (11.2)". Oracle Corporation. Archived from the original on 2013-07-15. Retrieved 2013-07-17. Distributed SQL synchronously accesses and updates data distributed among multiple databases. [...] Distributed SQL includes distributed queries and distributed transactions.
and 22 Related for: Distributed database information
administrators can distribute collections of data (e.g. in a database) across multiple physical locations. A distributeddatabase can reside on organised...
data, and distributed computing issues, including supporting concurrent access and fault tolerance. Computer scientists may classify database management...
Distributed computing is a field of computer science that studies distributed systems, defined as computer systems whose inter-communicating components...
A distributed SQL database is a single relational database which replicates data across multiple servers. Distributed SQL databases are strongly consistent...
(link) "NoSQL databases eat into the relational database market". 4 March 2015. Retrieved 2018-03-14. Reinsch, R. (1988). "Distributeddatabase for SAA"....
nodes. Distributeddatabases are usually non-relational databases that enable a quick access to data over a large number of nodes. Some distributed databases...
Designing a centralized database is generally much less complex than designing a distributeddatabase, as distributeddatabase systems are based on a hierarchical...
inconsistency. Database systems implement distributed transactions as transactions accessing data over multiple nodes. A distributed transaction enforces...
Oracle Database (commonly referred to as Oracle DBMS, Oracle Autonomous Database, or simply as Oracle) is a proprietary multi-model database management...
Apache Cassandra is a free and open-source, distributed, wide-column store, NoSQL database management system designed to handle large amounts of data across...
DRDA describes the architecture for distributed relational databases. It defines the rules for accessing the distributed data, but it does not provide the...
In database theory, the PACELC theorem is an extension to the CAP theorem. It states that in case of network partitioning (P) in a distributed computer...
CockroachDB is a commercial distributed SQL database management system developed by Cockroach Labs. CockroachDB is a distributed SQL database built on top of a...
In the fields of databases and transaction processing (transaction management), a schedule (or history) of a system is an abstract model to describe the...
"open-source distributed, non-relational databases". The name attempted to label the emergence of an increasing number of non-relational, distributed data stores...
servers. This mechanism provides distributed and fault-tolerant service and was designed to avoid a single large central database. In addition, the DNS specifies...
graphics or Digidesign Database File, in the Alphabetical list of filename extensions DDB, distributeddatabase, is a database in which storage devices...
In computing, a distributed cache is an extension of the traditional concept of cache used in a single locale. A distributed cache may span multiple servers...
isolation property. Guaranteeing ACID properties in a distributed transaction across a distributeddatabase, where no single node is responsible for all data...
A distributed transaction is a database transaction in which two or more network hosts are involved. Usually, hosts provide transactional resources, while...
A distributed ledger (also called a shared ledger or distributed ledger technology or DLT) is the consensus of replicated, shared, and synchronized digital...
Directory service Distributeddatabase management system Hierarchical model Navigational database Network model Object model Object database (OODBMS) Object–relational...