AI DATA SCIENTIST

FOR HEALTHCARE AND PHARMACEUTICAL INDUSTRY

I am Corina Roca. Software developer with Bioinformatics background and + 8 years of experience in the pharmaceutical industry.

SERVICES

Services That I Provide

AI Tech Stack Consultancy

Providing guidance about newest AI tools and technologies that could help your business to automate/simplify pipelines.

Learn More
Full-stack development

Locally or in the cloud. I work with JS, Python, R and SQL or non-sql databases to organize and bring your data from raw to customer. From monolitic architectues to containerized software.

Learn More
Data Science ad-hoc projects

With a focus in healthcare and pharmaceutical industry needs, I collaborate/support data science teams and epidemiologist to bring the most of the value from the data to enable medical discoveries.

Learn More
WORK

My Recent Projects

Data Science | RWE insights for ES-SCLC

2024 New Publication under review to showcase insights after statistical analysis in RWE set.

See our previous publication (2023)

WebApp | Clinical trials reporting with R and Teal

R shiny dashboards using Teal packages (v.2024) or others from Pharmaverse

See Here

AI Agents and Agentic Workflows (2025)

Local or in cloud Agentic Workflows to integrate with structured and non-structured data. Model refinement and Evaluations in place to stablish guardrails and certain levels of accuracy.

DEMO of the 🏆 Winner project at OT Hackathon '25

PHUSE volunteer (2024-2025)

Collaborative project to bring best practices and innovation around the clinical trials area across several organizations

See Here

Data curation/Harmonization (2016-2017)

My background in pharmacology and bioinformatics allows me to perform some data curation over certain datasets. Currently is useful to me when creating "evals" for Scientific Agentic workflows.

Example of the model of the month (previous work at EBI)
Experience

My Education + Work History

Education

CDISC course (2024)

SDTM and JSON-dataset

Introduction to SDTM format and JSON-dataset structure to store clinical trial data

MSc Computational Biology(2014-2016)

Instituto Carlos III (UCM)

Bioinformatic tools, analysis and learning about multi-omics data and computational approaches to get insights for drug discovery (Gene Enrichment analysis, Protein structural homology, Machine Learning, ). MSc thesis. Thesis within a biotech industry with focus in diagnosis project linking cancer blood clinical samples with drug resistance predictions.

Pharmaceutical industry degree (2008-2014)

Universidad de Alcala + 1 Year Erasmus at University of Warsaw

Deep knowledge in Biochemistry, Pharmakokinetics, Pharmakodynamics, Biology, Statistics and Pharmaceutical technology. Final laboratory year within the Immunology research laboratory in the Medical university of Warsaw. Topic: Macrophage behaviour when Ligand receptor interaction of the protein CD220

Work Experience

AI software engineer

Astex Pharmaceuticals (2024-Currently)

Mantaining and developing scientific web apps. Developing AI stack technnology in-house.

RWE Data scientist Consultant

AstraZeneca (2022-2023)

Data architect - Consultant

Roche (2018-2022)

Data Governance team. Full stack developer, creating apps upon request to address specific company issues in a more user-friendly way.

EBI-EMBL

Data curator (2016-2017)

Curator of cuantitative and qualitative databases (Biomodels and Reactome). Also Researcher for an auto-immune orphan disease, using bioinformatic tools. EBIEMBL Heilderberg conference 2016