Overview and features

Ontotext Refine (“Refine”) is a version of the open-source OpenRefine data transformation tool adapted to work with Ontotext GraphDB.

Refine allows fast cleaning, mapping and transformation of any structured data to RDF and loading it to GraphDB.

Refine supports input from:

  • Tabular formats (TSV, CSV, *SV)
  • Fixed-width text files
  • Excel (XLS, XLSX)
  • JSON, JSON-LD, XML
  • RDF: XML, Turtle/N3
  • Google sheets (public or access controlled with OAuth)
  • Databases (PostgreSQL, MySQL, MariaDB, SQLite)

You can input data from local files, remote URLs, and clipboard snippets.

Refine enables you to:

  • Create projects and upload your data file(s)

  • Clean and transform the data using powerful row and column manipulations, faceting, clustering

  • Implement complex transformations using:

  • Create a visual RDF mapping of the cleaned data

    • The RDF mapping visual UI is optimized to guide you in defining URLs, choosing the right predicates and types, defining datatypes, etc.
    • Export a respective SPARQL query
    • Export the RDF data
  • Expose a virtual SPARQL endpoint that allows you to write complex SPARQL queries

    • Export RDF data using a SPARQL Construct query
    • Load RDF data to a GraphDB repository using Federated SPARQL Update query

You can generate Refine queries from semantic models using Ontotext’s open source rdf2rml toolkit