In this role as R\&D Data Management Reference Data Tech (all genders), you will act as a key link between R\&D Data Strategy \& Governance, scientific data users, and IT teams, driving the development and management of reference and master data within a global data environment, based in Hyderabad, India
STAKEHOLDERS
Data producers, consumers, owners and stewards in Research Biology and Translational Science (RED CVRI, RED Onc), Drug Discovery (DDS) Preclinical Research (PCD) while CMC (Pharmaceutical Sciences), Clinical Development, Regulatory, Pharmacovigilance etc. These could include lab and computational scientists, clinicians, statisticians, and business operations professionals
Data architects, engineers, UI/UX engineers, product/platform/technology teams
Vendors and external contract staff
YOUR TASKS AND RESPONSIBILITIES
Support the technical establishment of reference and master data assets in OneEDG, including technical, business\-defined mappings (e.g., industry classification, management hierarchies)
Ensure the technical foundation and reference data tools are aligned with business objectives on a continuous basis (e.g. semantic modeling and AI solutions)
Collaborate with standards teams, scientific data users (scientists/ data scientists), business data stewards and IT to assess reference and master data needs, usage expectations
Create new data structures or update/change existing data structures in OneEDG and/or MARLAY or lifecycle management of reference data
Perform limited technical adaptations in OneEDG, e.g., new columns, via the user interface of OneEDG. There is no expectation of coding
Support the timely and effective delivery of reference data to consuming workflows while maintaining traceability
Create and maintain documentation and act as an expert guide on data definitions, data flows, data models etc. for applicable reference data
Provide training and ongoing support to R\&D teams on tools and technologies used in reference and master data management
Contribute to the awareness and adoption of data standards, governance principles, and best practices in Reference Data Management across R\&D
WHO YOU ARE
Master's degree in Biological Sciences, Informatics or Computer Science in a biomedical context
Extensive years in data modeling (relational, object\-oriented and graph based), architecture, and metadata management and controlled vocabularies for the biomedical domain
Proven experience managing life sciences or medical data, including data modeling and high\-level ETL pipeline design and working with TopBraid EDG or related ontology management systems; basic programming skills SQL, Python, and SPARQL, and standards like OWL, SKOS, and SHACL
Deep understanding of ontologies, knowledge graphs, semantic data representation, common industry reference data standards such as NCIT, NCBI, SPOR, CDISC, IDMP, and USDM etc. and technologies such as AWS, Databricks etc.
Strong grasp of data governance including access rights and user management in digital and business workflows
Self\-starter, independent yet collaborative, detail oriented, structured, process\-oriented practitioner with reliable project management skills, able to influence a variety of stakeholders and manage ambiguity in a complex and changing environment
Effective communicator with a high learning agility, capable of translating across multiple domains and stakeholder archetypes, between business and technical disciplines
Excellent written and verbal communication skills in English
Pay: ₹759,757\.02 \- ₹2,369,506\.89 per year
Work Location: Hybrid remote in Mejedihi, West Bengal
AI-Assisted Full Stack Developer
peak group · Remote
IT Analyst Applications
Caterpillar · Bengaluru
AI-Assisted Full Stack Developer
peak group · Remote