Required Skills

pythonrawssql

Job Description

In this role as R\&D Data Management Reference Data Tech (all genders), you will act as a key link between R\&D Data Strategy \& Governance, scientific data users, and IT teams, driving the development and management of reference and master data within a global data environment, based in Hyderabad, India

STAKEHOLDERS

The role will interact and establish strong partnerships with

Data producers, consumers, owners and stewards in Research Biology and Translational Science (RED CVRI, RED Onc), Drug Discovery (DDS) Preclinical Research (PCD) while CMC (Pharmaceutical Sciences), Clinical Development, Regulatory, Pharmacovigilance etc. These could include lab and computational scientists, clinicians, statisticians, and business operations professionals

Data architects, engineers, UI/UX engineers, product/platform/technology teams

Vendors and external contract staff

YOUR TASKS AND RESPONSIBILITIES

Support the technical establishment of reference and master data assets in OneEDG, including technical, business\-defined mappings (e.g., industry classification, management hierarchies)

Ensure the technical foundation and reference data tools are aligned with business objectives on a continuous basis (e.g. semantic modeling and AI solutions)

Collaborate with standards teams, scientific data users (scientists/ data scientists), business data stewards and IT to assess reference and master data needs, usage expectations

Create new data structures or update/change existing data structures in OneEDG and/or MARLAY or lifecycle management of reference data

Perform limited technical adaptations in OneEDG, e.g., new columns, via the user interface of OneEDG. There is no expectation of coding

Support the timely and effective delivery of reference data to consuming workflows while maintaining traceability

Create and maintain documentation and act as an expert guide on data definitions, data flows, data models etc. for applicable reference data

Provide training and ongoing support to R\&D teams on tools and technologies used in reference and master data management

Contribute to the awareness and adoption of data standards, governance principles, and best practices in Reference Data Management across R\&D

WHO YOU ARE

Master's degree in Biological Sciences, Informatics or Computer Science in a biomedical context

Extensive years in data modeling (relational, object\-oriented and graph based), architecture, and metadata management and controlled vocabularies for the biomedical domain

Proven experience managing life sciences or medical data, including data modeling and high\-level ETL pipeline design and working with TopBraid EDG or related ontology management systems; basic programming skills SQL, Python, and SPARQL, and standards like OWL, SKOS, and SHACL

Deep understanding of ontologies, knowledge graphs, semantic data representation, common industry reference data standards such as NCIT, NCBI, SPOR, CDISC, IDMP, and USDM etc. and technologies such as AWS, Databricks etc.

Strong grasp of data governance including access rights and user management in digital and business workflows

Self\-starter, independent yet collaborative, detail oriented, structured, process\-oriented practitioner with reliable project management skills, able to influence a variety of stakeholders and manage ambiguity in a complex and changing environment

Effective communicator with a high learning agility, capable of translating across multiple domains and stakeholder archetypes, between business and technical disciplines

Excellent written and verbal communication skills in English

Pay: ₹759,757\.02 \- ₹2,369,506\.89 per year

Work Location: Hybrid remote in Mejedihi, West Bengal