Pipeline ETL
An ETL (Extract, Transform, Load) pipeline developed for the Getty Research Institute. Processes legacy art provenance data and enriches it with semantic linked open data (LOD).
Features:
- Extracts artwork and provenance records from legacy database systems
- Transforms data into the CIDOC-CRM ontology for cultural heritage
- Produces semantically enriched Linked Open Data output
- Built with the Bonobo ETL framework for Python
Built with: Python, Bonobo ETL, CIDOC-CRM, Linked Open Data standards.
View on GitHub