Senior Data Engineer
The J. Paul Getty Trust is looking for an enthusiastic Senior Data Engineer with the experience and passion to lead the execution of technical projects that support, enrich, and ensure the persistence of the institution's cultural heritage knowledge bases. Our aim is to provide a deeply connected, consistent experience for scholars, researchers, and art enthusiasts as they explore the complex information held across the organization, and your participation is crucial to that success.
You will report to the Enterprise Semantic Architect, and interact equally with both software engineers and content specialists. Your work will improve the quality, reliability, connectedness, and consistency of our data by engineering data pipelines and validation tools, configuring Linked Open Usable Data (LOUD) platforms, and assisting with the design and implementation of our, and the community's, overall data model.
The Getty is among the most prestigious cultural heritage organizations in the world, dedicated to furthering the study of the history of art. You will work on an amazing campus amongst fabulous art, architecture, and information systems, collaborating with world-class scientists, curators, librarians, archivists, and academics. We offer 4 weeks of vacation a year plus every other Friday off, excellent benefits, and a very strong commitment to balancing work and personal life.
Major Job Responsibilities
- Work with software engineering, systems and content stakeholders to understand data-oriented technical requirements
- Design and implement robust data transformation pipelines to migrate legacy datasets into Linked Open Usable Data (LOUD)
- Design and implement validation services for LOUD models
- Design and implement agile data management practices, with testing, continuous integration and deployment paradigms
- Design and implement LOUD data cleaning processes and pipelines, in collaboration with content stakeholders
- Assist with configuration of institutional LOUD platform instances
- Assist software engineering teams by translating stakeholder requirements into feature requests for platform development
- Assist the Semantic Architect in data model and API design and documentation
- Integrate external content services to enrich and reconcile our data, as time allows
Qualifications
- Bachelor's degree in Computer Science, Information Science, or equivalent experience; Master's or PhD preferred
- 5+ years of software development experience; data engineering experience preferred
Knowledge, Skills and Abilities
- Interest in the digital work of cultural heritage organizations
- Attention to detail combined with a focus on data and usability
- Excellent verbal and written communication skills, especially when interacting with non-technical stakeholders and senior management
- Expertise in Linked Open Data standards and technologies
- Proficiency in Python, or willingness to translate experience from an equivalent language
- Proficiency in SQL and document-oriented databases
- Familiarity with cultural heritage data standards, such as IIIF and CIDOC-CRM
- Familiarity with tools such as Git, Docker, AWS, and Elastic
- Familiarity with test-driven and agile software development methodologies
- Familiarity with machine learning techniques and parallel processing
Last updated: Thursday, February 7, 2019 21:25 UTC