On this page you will find a list of OpenRefine distributions and extensions available for download. Are we missing something? Want to fix a typo? You can submit changes.
Official Distribution
Read the installation instructions.
You can also download all official releases and their source code from our GitHub releases page.
OpenRefine 3.5.0
OpenRefine 3.5.0 is the latest stable release, published on November 7, 2021. Please back up your workspace directory before installing, and report any problems that you encounter. A change log is provided on the release page.
- Windows kit: requires Java to be installed on your computer. Download, unzip, and double-click openrefine.exe, or refine.bat if the former does not work.
- Windows kit with embedded Java: includes OpenJDK Java, available under the GPLv2+CE license. Download, unzip, and double-click openrefine.exe, or refine.bat if the former does not work.
- Mac kit: download, open, drag the icon into the Applications folder, and double-click it. You do not need to install Java separately.
- Linux kit: requires Java to be installed on your computer. Download, extract, then type ./refine to start.
OpenRefine 3.4.1
OpenRefine 3.4.1 is the previous stable release, published on September 24, 2020. Please back up your workspace directory before installing, and report any problems that you encounter. A change log is provided on the release page.
- Windows kit: requires Java to be installed on your computer. Download, unzip, and double-click openrefine.exe, or refine.bat if the former does not work.
- Windows kit with embedded Java: includes OpenJDK Java, available under the GPLv2+CE license. Download, unzip, and double-click openrefine.exe, or refine.bat if the former does not work.
- Mac kit: download, open, drag the icon into the Applications folder, and double-click it. You do not need to install Java separately.
- Linux kit: requires Java to be installed on your computer. Download, extract, then type ./refine to start.
Other Distributions
The following distributions have been customized for a specific use or for integration with other technologies.
Distribution | Description | Authors |
---|---|---|
LODRefine | LODRefine is OpenRefine with integrated extensions that make the transition from tabular data to Linked Data a bit easier. The integrated extensions are: RDF extension, DBpedia extension, CrowdFlower crowdsourcing extension, and Stats extension. | Sparkica |
OpenDataRise | Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine. | Open Data in Trentino |
p3-batchrefine | BatchRefine adds batch processing capabilities to OpenRefine and supports multiple back ends, including Spark. | SpazioDati |
RefineOnSpark | RefineOnSpark is a driver program to run OpenRefine jobs on a Spark cluster. | SpazioDati |
Reconciliation-and-Matching-Framework | A framework to allow the matching of string entities using customised sets of transformations and matchers, plus a tool to produce the necessary configurations and another to expose them as OpenRefine reconciliation services. | RBGKew |
OntoRefine | GraphDB OntoRefine is a data transformation tool, based on OpenRefine and integrated in the GraphDB Workbench. It can be used for converting tabular data into RDF and importing it into a GraphDB repository, using simple SPARQL queries and a virtual endpoint. | Ontotext |
List of Extensions
You can install the following extensions to add functionality to OpenRefine. We do not vouch for the function of any of these extensions. Some only work with earlier versions of OpenRefine, or with third-party distributions. See how to install an extension.
Extension Name | Description | Authors | Latest compatible version |
---|---|---|---|
OSM Extractor | Adds a new import option that allows you to import OpenStreetMap data into OpenRefine using the Overpass API. Also adds a new GREL function called getInteriorPoint() which extracts the interior point of any Geometry (represented using Well-Known Text). | Labian Gashi | 3.5.0 |
GeoJSON Export | Adds a Graphical User Interface (GUI) that allows you to export OpenRefine data to the GeoJSON format. Supports latitude/longitude coordinates and Well-known Text (WKT). | Labian Gashi | 3.5.0 |
FAIR metadata | Supports FAIR metadata by integrating with FAIR Data Point to store your data and export to FAIR. It is a successor of the legacy FAIRifier OpenRefine 2.6 fork. Manual. | FAIRDataTeam | 3.4.1 |
VIB-Bits | Adds history tools, pivot tool, cross() graphical user interface (GUI), and scatterplot tool. Manual PDF. | VIB-Bits | 3.4.1 |
Named-Entity Recognition | Extracts identifiable or reconcilable entities from text fields. It supports AlchemyAPI, DBpedia Lookup, and Zemanta. Manual. | Ruben Verborgh and Stuart Kenny | 3.4.1 |
String-Transformers | Adds a collection of Java string transformers to Jython functions. Some generic, some aimed at botany and taxonomy. Download this and this and follow their installation instructions. | RBGKew | 3.4.1 |
RDF extension | RDF Refine - an improved fork for exporting RDF. Adds a graphical user interface (GUI) for exporting data in RDF format. | stkenny & DRI | 3.4.1 |
GOKb Utilities | Adds three new features: prepend rows (add new blank rows); trim all data (remove preceding/trailing whitespace from across all cells); extractHost() (GREL function that extracts a host name from a URL). | GOKb & Owen Stephens | 3.1 |
Stats extension for Google Refine 2.5+ | Computes elementary statistics on column data. | sparkica | 3.1 |
CKAN Storage Extension | Exports data to CKAN Storage 2.2+ using the CKAN storage API. | Ontodia | 3.1 |
GeoRefine | Adds geospatial processing GREL functions. | Ryan Baumann | 2.8 |
D2Refine | A clinical study metadata harmonization and validation workbench to 1) convert restrictions (specified in a spreadsheet-like interface) into a clinical model; and 2) provide reconciliation services leveraging the Common Terminology Services 2 (CTS2). Manual. | Deepak K. Sharma | 2.7 |
Legacy Extensions
These extensions were produced in the past and are good examples of desired uses of OpenRefine. They are maintained here for historical reference, and in case a contributor would like to renew them.
Extension Name | Description | Authors | Suggested compatible version |
---|---|---|---|
VIB-BITS Diff plugin | Adds a “diff” tool which compares two text values. Manual PDF. | VIB-Bits | 3.4.1. The VIB website states “We are investigating how to provide [the diff plugin] again with version 3.4 of OpenRefine.” |
VIB-BITS Save facets plugin | Adds support for saving facet settings and re-applying the same facets at a later time. Manual PDF. | VIB-Bits | Unknown. The VIB website states “We are investigating how to provide [the diff plugin] again with version 3.4 of OpenRefine.” |
geoXtension | geoXtension is based on the GDAL, Proj, and GEOS libraries. Installing this extension takes a lot of time; using the ready-made Docker image is recommended. | giTorto | (circa 2.6) |
extraCTU-plugin | Extracts e-mail addresses, telephone numbers, URLs, and identification numbers from text. | giTorto | (circa 2.7) |
Opentree | Displays phylogenetic trees from the Open Tree of Life. Manual. | nickynicolson | 2.5 |
BioVeL | Adds functions for cleaning & clustering biodiversity data. The entire package can be installed and run locally as well as on a dedicated server. | Botanic Garden and Botanical Museum Berlin-Dahlem | 2.5 |
DBpedia extension | Adds columns from DBpedia to reconciled data. The data must first be reconciled with DBpedia or with a SPARQL endpoint that returns DBpedia resource types. | Zemanta | 2.5 |
Stats extension for Google Refine 2.1 | Computes elementary statistics on column data. | The Chicago Tribune | 2.1 |
Reconciliation Services
OpenRefine can connect to several reconciliation services. Please read our wiki for a list of them.
Client Libraries
You can automate some OpenRefine operations using one of the existing libraries. These libraries use the OpenRefine API.
- Python
- Another Python library
- Ruby
- JavaScript - Node.js
- R
- PHP
- Java
- Bash
- C# - .NET Core
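
To give a feel for what these client libraries do, here is a minimal Python sketch that talks to the OpenRefine API directly, using only the standard library. It is not taken from any of the libraries listed above. It assumes a local OpenRefine instance on the default address http://127.0.0.1:3333/ and assumes that the `get-all-project-metadata` command returns a JSON object with a `projects` map keyed by project id. The helper names (`command_url`, `project_names`, `list_projects`) are made up for this illustration, so check the endpoint and the response shape against the API documentation for your OpenRefine version.

```python
import json
from urllib.parse import urljoin
from urllib.request import urlopen

# Assumption: OpenRefine is running locally on its default address.
DEFAULT_BASE_URL = "http://127.0.0.1:3333/"


def command_url(base_url, command):
    """Build the URL of a core OpenRefine command (hypothetical helper)."""
    return urljoin(base_url, "command/core/" + command)


def project_names(metadata):
    """Map project id -> project name.

    Assumes a response shaped like {"projects": {"<id>": {"name": ...}}}.
    """
    projects = metadata.get("projects", {})
    return {pid: info.get("name", "") for pid, info in projects.items()}


def list_projects(base_url=DEFAULT_BASE_URL, timeout=10):
    """Ask a running OpenRefine instance for its projects (sends an HTTP GET)."""
    url = command_url(base_url, "get-all-project-metadata")
    with urlopen(url, timeout=timeout) as response:
        return project_names(json.load(response))
```

With OpenRefine running, calling `list_projects()` should return a dictionary of project ids and names. The parsing is kept in a separate function so it can be checked without a live server.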