2019 Conference

Joe Rickert

Image credit: iStock

Details

Major themes addressed at the conference were Shiny, reproducible research, package administration, scaling R for production, and using R in a regulatory environment.

Date

Aug 20, 2019 1:00 PM — Aug 22, 2019 3:00 PM

Event

2019 Conference

Location

Harvard University

1737 Cambridge St, Cambridge, MA 02138

This post was originally shared on R/Views.

It’s no secret that there are few industries more competitive than the pharmaceutical industry. Big money placed on long-shot bets for block-buster drugs where being first makes all the difference means a constant struggle to gain a competitive edge. So, you might find it surprising that the inaugural R / Pharma Conference held this past August on the Harvard campus in a very classy auditorium was all about collaboration.

Some might also find it surprising that data scientists from competitive companies would gather to share information, but this is quite common. I have seen it before in other competitive industries, for example in IEEE-led standards initiatives, where engineers gather to forge a common technology. Not only is there the human need to share and learn from peers (and also brag a little), there is a larger force at play: a kind of market clearing operation where experts gather to gain as much of an advantage as they can by ensuring that no easily exploitable arbitrage opportunities remain.

It was a surprise, though (and I think a source of general amusement as the conference proceeded), that nearly every talk seemed to be about Shiny. Looking back, it is clear that it should not have been: 49% of the abstracts explicitly mention Shiny. This word cloud was built from the abstract submissions.

Abstract wordcloud.

Shiny is basically a technology for sharing complex information across multiple organizations and stakeholders with different skill sets. Shiny, too, is all about collaboration. For a look into the large, production-grade Shiny app, bioWARP, see Sebastian Wolf’s recent post.

Other major themes addressed at the conference were: reproducible research, package administration, scaling R for production, and using R in a regulatory environment. This last theme was underscored by a strong FDA presence. Lilliam Rosario from the FDA Center for Drug Evaluation & Research delivered the opening keynote, in which she addressed the regulatory role of CDER and the use of R. FDA speaker Mat Souktup spoke about the need to transcend the compartmentalized culture common in medical research, and how open-source tools are helpful in working towards this goal. He explicitly noted along the way that the FDA does not specify what software may be used. The third FDA speaker, Paul Schuette, filled in some details associated with topics raised by Rosario and talked about the use of R and Shiny at CDER. Along these same lines, Andy Nicholls from GSK conducted a well-attended and very informative workshop on The Challenges of Validating R. You can find Andy’s slides here.

Other keynote speakers were Max Kuhn, who talked about Modeling in the tidyverse (slides here); Joe Cheng, who described how to use Shiny responsibly in pharma (slides here); and Michael Lawrence, who spoke about enabling open-source analytics in the enterprise.

My very biased impression was that R / Pharma was an unqualified success at accomplishing the major objectives of bringing together data scientists and statisticians working in the Pharmaceutical industry, and of presenting a high quality program that explored several issues relating to the production use of R in a regulatory environment.

The following chart shows that representatives from quite a few pharmaceutical companies attended in spite of organization problems that artificially limited the overall number of attendees to about 140.

Attendees.

R/Pharma 2019 schedule

All times below in US ET.

8:30 AM

Shiny Reproducibility

Joe Cheng, Rstudio
8:40 AM

R Validation Hub (past, current and future state)

Andy Nicholls, Glaxosmithkline
8:40 AM

Artificial neural networks in R with Keras and TensorFlow

Leon Eyrich Jessen, Technical University of Denmark
1:30 PM

Machine learning

Max Kuhn, Rstudio
1:30 PM

plotly

Carson Sievert, Rstudio
1:30 PM

Machine learning workflow management with drake

Will Landau, Eli Lilly

8:15 AM

Package Management (Coffee session)
8:15 AM

R Education in Pharma (Coffee session)
8:15 AM

R and Python Interoperability (Coffee session)
8:45 AM

Coffee
9:00 AM

Opening Remarks
9:15 AM

Reproducibility and the role of code in reproducible data science

Garrett Grolemund, RStudio
10:00 AM

Using R for Generic Drug Evaluation and SABE R-package for Assessing Bioequivalence of Topical Dermatological Products

Elena Rantou, FDA
10:20 AM

How to win friends and influence people: Efficiency, Reproducibility, and Scalability with R Project templates and parameterized R Markdown

Leigh Alexander, SomaLogic
10:30 AM

Teaching an old dog new tricks: modernizing gsDesign

Keaven Anderson, Merck
10:50 AM

Coffee
11:10 AM

nlmixr: an R package for population PKPD modeling

Mirjam Trame, Novartis
11:30 AM

Creating and reviving Shiny apps with golem

Eric Nantz, Eli Lilly
11:40 AM

Exploratory Graphics (xGx): Promoting the purposeful exploration of PKPD data

Alison Margolskee, Novartis
12:00 PM

Interactive Visualization of Standardized CDISC-SEND-Formatted Toxicology Study Data Using R Shiny

Kevin Snyder, FDA
12:10 PM

Using RStudio.Cloud to advance R proficiency: a crowdsourcing training experience

Paulo Bargo, Janssen
12:30 PM

Lunch
1:30 PM

Updates on Analyzing Clinical Trials Data with R

Adrian Waddell, Roche / Genentech
1:50 PM

R Packages for Analyzing Clinical Trials Data with R Focusing on Safety And Early Efficacy

Nina Qi, Roche / Genentech
2:00 PM

Package management

Devin Pastoor, Metrum Research Group
2:20 PM

Shinytized R Markdown: A Potent OTC Alternative to 1,3,7-Trimethylxanthine & Currently Indicated for NDA Document Generation, Among Others

Mark Rothe, Sanofi
2:30 PM

Collaborating at scale: managing an enterprise analytical computing ecosystem

Rena Yang, Roche / Genentech
2:50 PM

Embrace R in Pharma - building internal R community and establishing fit-for-purpose R pilots

Ning Leng, Roche / Genentech
3:00 PM

Re-envisioning Clinical Content Delivery in the Open Source World

Doug Kelkhoff, Roche / Genentech
3:20 PM

The use of R for improved reproducibility of biomarker detection in liquid biopsies

Vivian Zhuang, FDA
3:30 PM

Coffee
3:50 PM

Breaking the Speed Limit: How R Gets Faster

Marianna Foos, Bluebird Bio
4:35 PM

Leveraging multiple R tools to make effective pediatric dosing decisions

Jeannine Fisher, Metrum Research Group
4:45 PM

Using Machine Learning and Interactive Graphics to Find New Cancer Targets

David Cooper, Glaxosmithkline
5:05 PM

Machine learning workflow management with drake

Will Landau, Eli Lilly
5:15 PM

An R package for Data Science and Deep Visualization of a complex clinical database

David James, Novartis

8:15 AM

Shiny for Early Drug Discovery Research (Coffee session)
8:15 AM

**** (Coffee session)
8:15 AM

Shiny in Production (Coffee session)
8:45 AM

Break
9:00 AM

Opening Remarks
9:05 AM

Multi-modal data integration

Aedin Culhane, Dana-Farber
9:45 AM

Your Missing Step in Reproducible R Programming: Continuous Deployment

Chase Clark, University of Illinois
10:00 AM

From playing in the backyard to designing one: Shiny transforms study designs, data analyses and statistical thinking of oncology in vivo group at Janssen

Volha Tryputsen, Janssen
10:20 AM

ModViz POP: R-Shiny Based PK/PD Interface for Empowering Teams to Perform Real-Time Simulations

Pavan Vaddady, Merck
10:30 AM

Break
10:50 AM

Building Open Source Tools for Safety Monitoring: Advancing Research Through Community Collaboration

Becca Krouse, Rho
11:10 AM

Its Not Whats on the Outside, but Its Whats on the Back-end That Matters: The World Beyond CSV Files

Marcus Adams, Merck
11:30 AM

Improve installation sequences for R package cohorts

Juliane Manitz, EDM Serono
11:40 AM

Evaluating the performance of advanced causal inference methods applied to healthcare claims data

Jessica Myers Franklin, Harvard University
12:00 PM

Tidysq for Working with Biological Sequence Data in ML Driven Epitope Prediction in Cancer Immunotherapy

Leon Eyrich Jessen, Technical University of Denmark
12:10 PM

Lunch
12:30 PM

This one is not like the others: Applicability Domain methods in R

Max Kuhn, RStudio
1:30 PM

Accelerating Chemistry Research through the Integration of Data Science with High-Throughput Experimentation

Jason Stevens, Bristol Myers Squibb
1:50 PM

Reproducible shiny apps with shinymeta

Carson Sievert, RStudio
2:00 PM

Shiny apps for accelerating early drug discovery research

Gordon Turner, Novartis
2:20 PM

Coffee
2:30 PM

Using R to foster the communication with non-statisticians on Bayesian dose escalation models

Marianna Grinberg, Merck
2:40 PM

Prediction of maternal-fetal exposures of CYP450-metabolized drugs using physiologic pharmacokinetic modeling implemented in R and mrgsolve

Madeleine S. Gastonguay, Metrum Research Group
3:00 PM

Making Better Decisions

Andy Nicholls, Glaxosmithkline
3:10 PM

From CDISC to TLFs, using R to support Pharmacokinetic Analyses

Jessica Higgins, Nuventra Pharma Sciences
3:30 PM

Coffee
3:40 PM

Simulations, and Complex Innovative Trial Designs

Paul Schuette, FDA
4:00 PM

Validation Framework for Assay Processing Pipelines

Ellis Hughes, Fred Hutch
4:45 PM

Shiny in Production: Building bridges from data science to IT

Kelly O’Briant, RStudio
4:55 PM

Identifying progression-free survival in Veterans with Diffuse Large B-Cell Lymphoma using electronic healthcare records

Debbie Morreall, University of Utah
5:15 PM

Democratizing Natural Language Processing with I2E and R Shiny

Abhik Seal, Abbvie