Trial Design Intern

Internship, Software Development

United States - NY, New York

Medidata: Power Smarter Treatments and Healthier People

Medidata is leading the digital transformation of life sciences, creating hope for millions of patients. Medidata helps generate the evidence and insights that enable pharmaceutical, biotech, medical device and diagnostics companies, and academic researchers to accelerate value, minimize risk, and optimize outcomes. More than one million registered users across 1,900+ customers and partners access the world's most trusted platform for clinical development, commercial, and real-world data. Medidata, a Dassault Systèmes company, is headquartered in New York City and has offices around the world to meet the needs of its customers. Discover more and follow us @medidata.

Program Overview:

The Medidata AI internship program is a competitive and comprehensive 12-week rotational program. We provide pioneering data and analytics products to most of the $100B pharmaceutical development industry, and our team is made up of data scientists, statisticians, computer scientists, implementation designers, engineers, and business experts.

We are looking for interns to be an integral part of this dynamic team, where you will drive innovative research and client-facing deliverables and collaborate to build new data science solutions. Our team leverages industry-leading data assets and analytical models to transform the clinical development industry, driving clinical and operational success for our clients and partners.

Participation in the internship program requires that you are located in the United States for the duration of the internship program. Roles are based out of either New York City or Boston. This internship is intended for students who are currently pursuing a Master’s degree program in a quantitative discipline with an anticipated graduation date on or before June 2024, depending on their program. 

Project Description: 

This project will focus on developing data modeling techniques for irregularly sampled longitudinal clinical data. The goal is to research methods for modeling sparse longitudinal data and adapt them to create feature representations of patients enrolled in clinical trials.

The intern will work alongside the Clinical Data Science team, which works with CAR T cell and other immunotherapy data, building pipelines for data quality assessment and for model training, tuning, and evaluation.

This project will require adapting existing statistical techniques for sparse longitudinal data to a specific domain. To succeed, the intern will bring to the role a strong foundation in statistical learning, mathematical aptitude, excellent skills in R/Python, and copious amounts of tenacious creativity. They will leave the program with the knowledge of having used their skills to contribute substantially to an important problem in clinical development, hands-on experience working with a one-of-a-kind dataset, and very likely co-authorship on a high-impact scientific publication.

This internship also offers a unique opportunity to participate in an Innovation Lab. This allows interns to partner with industry leaders and cross-functional teams to work on a real-world business problem that Medidata currently faces. All participants present their solutions to the leadership of the AI team, and the winner presents to the SVPs and CEO of Medidata and other key leadership.

Role Responsibilities: 

  • Conducting exploratory data analysis from complex, disparate data sources.

  • Designing, developing and validating statistical models for novel medical applications.

  • Implementing and evaluating algorithms based on research literature.

  • Creating, documenting, and maintaining code.

  • Reporting findings to internal teams.

Qualifications / Competencies:

  • Proficiency in Python, including pandas, NumPy, scikit-learn, and data visualization toolkits (or the equivalent in R), at a level that allows you to be self-sufficient in analyzing data; Understanding of machine learning techniques (classification, regression, and feature selection); Ability to apply statistical analysis techniques; Evidence of earlier work with longitudinal data is a plus.

  • Ability to think creatively, access and analyze data independently, and evaluate both the big picture and the key details effectively; Excellent interpersonal, verbal, and written communication; Strong time management and problem-solving skills; Ability to multi-task in a fast-paced environment and prioritize deliverables to achieve results.

Education and Experience:

  • Strong performance in an MS/PhD program in Data Science, Mathematics, Statistics, Biostatistics, or Computer Science; Experience with statistical analysis

  • Familiarity with data science techniques and the theory behind them; Knowledge of clinical trial data and/or large healthcare datasets is a plus; Rising or Graduating Senior

The salary range posted below refers only to positions that will be physically based in New York City. As with all roles, Medidata sets ranges based on a number of factors including function, level, candidate expertise and experience, and geographic location. Pay ranges for candidates in locations other than New York City may differ based on the local market data in that region. The base pay range for this position is $32.00 to $37.00 per hour, with a $3,500 sign-on bonus.


Equal Employment Opportunity

In order to provide equal employment and advancement opportunities to all individuals, employment decisions at Medidata are based on merit, qualifications and abilities. Medidata is committed to a policy of non-discrimination and equal opportunity for all employees and qualified applicants without regard to race, color, religion, gender, sex (including pregnancy, childbirth or medical or common conditions related to pregnancy or childbirth), sexual orientation, gender identity, gender expression, marital status, familial status, national origin, ancestry, age, disability, veteran status, military service, application for military service, genetic information, receipt of free medical care, or any other characteristic protected under applicable law. Medidata will make reasonable accommodations for qualified individuals with known disabilities, in accordance with applicable law.

Covid Statement

Our Company requires all U.S. employees to be fully vaccinated against COVID-19 and to provide documentation of full vaccination, unless qualified for a medical, religious or state-required accommodation or otherwise exempt consistent with applicable law. Although accommodation requests will be considered (and granted where appropriate/possible), it may be determined that a candidate is unable to adequately perform the essential functions of the position without imposing an undue hardship due to customer requirements, staffing needs, or other business reasons. Definition of full-vaccination: Employees are considered to be fully vaccinated two weeks after their second dose in a 2-dose series or two weeks after a single-dose vaccine.