Roche / Genentech

Exploration of Large language model-based frameworks to develop R codes for output delivery in pharma

Over thousands of outputs (tables, graphs and listings) may need to be generated each year for filing, external publications, internal read outs and other activities in a pharmaceutical company. Although most of these outputs could be produced …

Why we Need to Improve Software Engineering in Biostatistics - A Call to Action

Programming is ubiquitous in applied biostatistics, and most statisticians know a programming language such as R - yet software engineering is still neglected as a skill and undervalued as a profession in pharmaceutical statistics. Why is this a …

Learnings from deploying a Python package to orchestrate clinical trials outputs generation

Streamlining clinical trial output workflows is a key challenge for clinical studies. Our project leverages Python to link the planned analysis stored in a google sheet LoPO (List of clinical study Planned Outputs) to the study scripts that generates …

The importance of the SCE in enabling our shift from proprietary programming to open-source data science

Historically building a great SCE for clinical reporting involved selecting a vendor, integrating their product, and supporting a single proprietary language. The shift to report clinical trials using R has had a much broader impact than just …

Trends and Current Landscape of ML/AI Publications with Open Source Tools

Pharmaverse Friday: Generating TLGs using NEST / Falcon - a Masterclass

Pharmaverse Friday: Revolutionize Clinical Trial Data Exploration - teal

Data Science in the Pharmaceutical Industry: A Coursera Specialisation

The Pharmaceutical industry is adopting new tools and technologies, putting pressure on individuals to learn many new skills in a short period of time. In order to both promote these new ways of working, and to assist those adopting it, at Genentech …

Assuring SDTM data quality with the sdtmchecks package

The job of a data scientist working on a clinical trial team in the pharmaceutical industry is to provide the most accurate analysis possible in order to enable valid insights from the data. Ensuring data quality is extremely hard work and there are …

Pharmaverse: Breaking boundaries through open source collaboration!

Roche/Genentech, GSK, Atorus and J&J/Janssen have initiated a collaboration called pharmaverse to bring together a curated subset of open-source R packages to enable clinical reporting (from CRF to eSubmission). Where gaps are identified, new …