Application for data cleanup and data transformation
OpenRefine
Developer(s)
Freebase, then Google, now open source community
Initial release
November 10, 2010; 13 years ago (2010-11-10)
Stable release
3.7.9[1]
/ 10 February 2024; 3 months ago (10 February 2024)
Repository
github.com/OpenRefine/OpenRefine
Written in
Java[2]
Platform
Microsoft Windows, Linux, macOS
Available in
English, Italian, Chinese, Japanese, French, German
Type
Data management
Data visualization
License
BSD License
Website
openrefine.org
OpenRefine is an open-source desktop application for data cleanup and transformation to other formats, an activity commonly known as data wrangling.[3] It is similar to spreadsheet applications, and can handle spreadsheet file formats such as CSV, but it behaves more like a database.
It operates on rows of data which have cells under columns, similar to the manner in which relational database tables operate. OpenRefine projects consist of one table, whose rows can be filtered using facets that define criteria (for example, showing rows where a given column is not empty).
Unlike spreadsheets, most operations in OpenRefine are done on all visible rows, for example, the transformation of all cells in all rows under one column,[4] or the creation of a new column based on existing data. Actions performed on a dataset are stored the project and can be 'replayed' on other datasets. Formulas are not stored in cells, but are used to transform the data. Transformation is done only once.[5] Formula expressions can be written in General Refine Expression Language (GREL),[6] in Jython (i.e., Python), and in Clojure.[7]
The program operates as a local web app: it starts a web server and opens the default browser to 127.0.0.1:3333.
^"Release 3.7.9". 10 February 2024. Retrieved 20 February 2024.
^"OpenRefine/OpenRefine - GitHub". GitHub. Retrieved 25 June 2017.
^"openrefine.github.com". openrefine.org.
^"Editing by transforming: Cell Editing wiki page from Refine documentation". Retrieved 18 April 2012.
^"Comparison with spreadsheet software: Cell Editing wiki page in Refine documentation". Retrieved 18 April 2012.
^General Refine expression language OpenRefine/OpenRefine Wiki GitHub. Github.com (2013-04-03). Retrieved on 2013-08-16.
^"Expressions: Refine documentation". Retrieved 18 April 2012.
4, has been made available under the terms of the BSD License via the OpenRefine project. The Double Metaphone phonetic encoding algorithm is the second...
dataflow code. Early prototypes of visual data wrangling tools include OpenRefine and the Stanford/Berkeley Wrangler research system; the latter evolved...
tag-soup." jsoup is used in a number of current projects, including Google's OpenRefine data-wrangling tool. Comparison of HTML parsers Web scraping Data wrangling...
Retrieved December 12, 2023. Grant, Nico (September 15, 2022). "YouTube Opens More Pathways for Creators to Make Money on the Platform". The New York...
December 2006, it has provided links to both published versions and major open access repositories, including all those posted on individual faculty web...
to a user's local computer in a variety of formats such as PDF and Office Open XML. Sheets supports tagging for archival and organizational purposes. Launched...
users. Google Docs supports opening and saving documents in the standard OpenDocument format as well as in Rich text format, plain Unicode text, zipped...
applications such as Gmail and Google Maps. In 2010, Pichai also announced the open-sourcing of the new video codec VP8 by Google and introduced the new video...
appeared for email messages (from specific senders) that the user had not opened for a month. A few popular Inbox by Gmail features were subsequently added...
operating system developed by the Open Handset Alliance led by Google. It was released to the public and the Android Open Source Project (AOSP) on October...