Full time Remote / Telecommute

Research Data Infrastructure Developer

Princeton, NJ
0 other recent jobs
Salary: Depends on experience
Created: July 26, 2022


Reporting to the Director of Advanced Data and Storage Management, this 3-year term position will provide critical development work on the configuration of custom integrations, workflow, and data extraction processes to assist with the implementation of TigerData, a data storage and management system that supports the advancement of research at Princeton University. Understanding the needs of faculty researchers and collaborating with developers in the Princeton University Library (PUL), you will develop processes and tools for metadata entry and management, automated metadata harvesting from common file types and other campus systems, and automation workflows for ensuring smooth transitions between storage stages, including movement of data to publication and long-term archive. 

This is a 3-year benefits-eligible term position. A remote work arrangement may be considered for candidates with the appropriate background and experience.


  • Build, test, debug, and document software designed to support research data management, including discovery, metadata extraction from common file types and other campus systems, and data movement workflows 
  • Develop interfaces for the TigerData presentation layer using API  
  • Integrate with data sources such as TigerData, ORCID, Globus, cloud storage, and existing data repositories.  
  • Analyze, transform, migrate, and process data and metadata  
  • Build tools and workflows to validate research data submissions 
  • Automate and streamline manual or inefficient tasks 
  • Help plan and estimate work on software projects  
  • Embedded with a team of PUL developers working in an agile environment  
  • Maintain a strong partnership with the PUL software development group to enhance collaboration in the implementation of TigerData

Essential Qualifications

  • Minimum of 3-5 years’ experience as a developer working in an environment that includes complex software systems, object-oriented programming, web-based applications and services, and distributed architecture 
  • Demonstrated experience with an object oriented language, preferably Ruby or Java 
  • Experience building complex web forms that drive workflows and meet contemporary usability guidelines using modern HTML, CSS, and Javascript frameworks 
  • Demonstrated experience with test-driven development, preferably using RSpec or JUnit 
  • Ability to work in a Linux-based environment 
  • Willingness to learn new technologies and data/metadata formats 
  • Experience with Agile software development practices 
  • Strong oral and written communication skills 
  • Education: A bachelor’s degree or equivalent experience. 

Preferred Qualifications

  • Experience with relevant cultural heritage metadata formats (such as MARC, MODS, and Dublin Core); ideally with archival and/or geospatial metadata formats 
  • Experience with Python or R 
  • Experience with research data management and institutional repositories 
  • Experience with data harvesting APIs such as OAI-PMH or ResourceSync 
  • Familiarity with best practices for data modeling and data management 
  • Experience working on and contributing to open source software projects 
  • Proficiency with common tools for source code version control, collaboration, and deployment; such as Git, GitHub, Capistrano and Ansible 
  • Experience with DevOps and deployment automation 
  • Advanced degree in Library Science, Computer Science, Geographic Information Systems, or another research field.

Last updated: Wednesday, July 27, 2022 18:36 UTC