probonas.net

Pipeline ETL

An ETL (Extract, Transform, Load) pipeline developed for the Getty Research Institute. Processes legacy art provenance data and enriches it with semantic linked open data (LOD).

Features:

  • Extracts artwork and provenance records from legacy database systems
  • Transforms data into the CIDOC-CRM ontology for cultural heritage
  • Produces semantically enriched Linked Open Data output
  • Built with the Bonobo ETL framework for Python

Built with: Python, Bonobo ETL, CIDOC-CRM, Linked Open Data standards.

View on GitHub