Data publishing (also data publication) is the act of releasing research data in published form for use by others. It consists of preparing data or data sets for public use so that they are available for anyone to use as they wish.
This practice is an integral part of the open science movement.
There is a large and multidisciplinary consensus on the benefits resulting from this practice.[1][2][3]
The main goal is to elevate data to first-class research outputs.[4] A number of initiatives are underway, and there are both points of consensus and issues still in contention.[5]
There are several distinct ways to make research data available, including:
publishing data as supplemental material associated with a research article, typically with the data files hosted by the publisher of the article
hosting data on a publicly available website, with files available for download
hosting data in a repository that has been developed to support data publication, e.g. figshare, Dryad, Dataverse, Zenodo. A large number of general and specialty (such as by research topic) data repositories exist.[6] For example, the UK Data Service enables users to deposit data collections and re-share these for research purposes.
publishing a data paper about the dataset, which may be published as a preprint, in a regular journal, or in a data journal that is dedicated to supporting data papers. The data may be hosted by the journal or hosted separately in a data repository.
Publishing data allows researchers both to make their data available for others to use and to have datasets cited in the same way as other research publication types (such as articles or books), so that producers of datasets can gain academic credit for their work.
The motivations for publishing data range from a desire to make research more accessible, to enabling citability of datasets, to research funder or publisher mandates that require open data publishing. The UK Data Service is one key organisation working with others to raise awareness of the importance of citing data correctly[7] and helping researchers to do so.
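As a rough illustration of what citing a dataset like an article can look like in practice, the following Python sketch assembles a citation string from basic metadata. The field names, the ordering (creators, year, title, version, publisher, identifier) and the example values, including the DOI, are assumptions made for illustration; they are not a format prescribed by any of the organisations mentioned above.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DatasetRecord:
    """Minimal, hypothetical metadata for a published dataset."""
    creators: List[str]
    year: int
    title: str
    publisher: str
    doi: str
    version: Optional[str] = None


def format_citation(record: DatasetRecord) -> str:
    """Build a human-readable citation string from the record."""
    authors = "; ".join(record.creators)
    parts = [f"{authors} ({record.year}). {record.title}."]
    if record.version:
        parts.append(f"Version {record.version}.")
    parts.append(f"{record.publisher}.")
    # A persistent identifier lets readers resolve the exact dataset cited.
    parts.append(f"https://doi.org/{record.doi}")
    return " ".join(parts)


# Placeholder values only; this DOI does not refer to a real dataset.
example = DatasetRecord(
    creators=["Doe, J.", "Roe, R."],
    year=2020,
    title="Example River Temperature Measurements",
    publisher="Example Repository",
    doi="10.1234/example.5678",
    version="1.0",
)

print(format_citation(example))
```

Keeping the metadata in a structured record rather than a free-text string means the same information could be rendered in whichever citation style a journal or repository asks for.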
Solutions to preserve privacy within data publishing have been proposed, including privacy protection algorithms, data "masking" methods, and a regional privacy level calculation algorithm.[8]
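To give a sense of what data masking can involve before a dataset is released, here is a minimal Python sketch. It is a generic illustration and not the method described in the cited work: the record fields (`name`, `age`, `measurement`), the salt value and the helper names are all hypothetical. It replaces a direct identifier with a salted hash and coarsens an exact age into a ten-year band.

```python
import hashlib


def pseudonymize(value: str, salt: str) -> str:
    """Replace an identifier with a truncated salted SHA-256 digest."""
    digest = hashlib.sha256((salt + value).encode("utf-8")).hexdigest()
    return digest[:12]


def generalize_age(age: int, band: int = 10) -> str:
    """Coarsen an exact age into a band such as '30-39'."""
    low = (age // band) * band
    return f"{low}-{low + band - 1}"


def mask_record(record: dict, salt: str) -> dict:
    """Return a copy of the record that is safer to publish.

    The direct identifier is pseudonymized, the age is generalized,
    and the measurement is passed through unchanged.
    """
    return {
        "participant": pseudonymize(record["name"], salt),
        "age_band": generalize_age(record["age"]),
        "measurement": record["measurement"],
    }


# Hypothetical input record and salt, for illustration only.
raw = {"name": "Jane Doe", "age": 37, "measurement": 12.5}
masked = mask_record(raw, salt="project-secret")
print(masked)
```

Masking of this kind reduces the risk of re-identification but does not by itself guarantee anonymity, since combinations of the remaining fields may still single out individuals.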
Smith VS (2009). "Data publication: towards a database of everything". BMC Research Notes. 2 (113): 113. doi:10.1186/1756-0500-2-113. PMC 2702265. PMID 19552813.
Lawrence, B; Jones, C.; Matthews, B.; Pepler, S.; Callaghan, S. (2011). "Citation and Peer Review of Data: Moving Towards Formal Data Publication". International Journal of Digital Curation. 6 (2): 4–37. doi:10.2218/ijdc.v6i2.205.
Callaghan S, Donegan S, Pepler S, Thorley M, Cunningham N, Kirsch P, Ault L, Bell P, Bowie R, Leadbetter A, Lowry R, Moncoiffé G, Harrison K, Smith-Haddon B, Weatherby A, Wright D (2012). "Making data a first class scientific output: Data citation and publication by NERCs environmental data centres". International Journal of Digital Curation. 7 (1): 107–113. doi:10.2218/ijdc.v7i1.218.
Kratz J, Strasser C (2014). "Data publication consensus and controversies". F1000Research. 3 (94): 94. doi:10.12688/f1000research.4518. PMC 4097345. PMID 25075301.
Assante, M.; Candela, L.; Castelli, D.; Tani, A. (2016). "Are Scientific Data Repositories Coping with Research Data Publishing?". Data Science Journal. 15. doi:10.5334/dsj-2016-006.
Service, UK Data. "New to using data". UK Data Service.
Zhang, Longbin; Wang, Yuxiang; Xu, Xiaoliang (August 2017). "Logic-Partition Based Gaussian Sampling for Online Aggregation". 2017 Fifth International Conference on Advanced Cloud and Big Data (CBD). IEEE. pp. 182–187. doi:10.1109/cbd.2017.39. ISBN 978-1-5386-1072-5. S2CID 40025084.