Data Management Intern

Internship, Software Development

United States - NY, New York

Requisition ID



Medidata: Power Smarter Treatments and Healthier People

Medidata is leading the digital transformation of life sciences, creating hope for millions of patients. Medidata helps generate the evidence and insights to help pharmaceutical, biotech, medical device and diagnostics companies, and academic researchers accelerate value, minimize risk, and optimize outcomes. More than one million registered users across 1,900+ customers and partners access the world's most trusted platform for clinical development, commercial, and real-world data. Medidata, a Dassault Systèmes company, is headquartered in New York City and has offices around the world to meet the needs of its customers. Discover more at and follow us @medidata.

Job Description:

Program Overview:

The Medidata AI internship program is a competitive and comprehensive 12-week rotational program. We provide pioneering data and analytics products to most of the $100B pharmaceutical development industry and our team is made up of data scientists, statisticians, computer scientists, implementation designers, engineers, and business experts. 

We are looking for interns to be an integral part of this dynamic team, where you would drive innovative research, client-facing deliverables to collaboratively build new data science solutions. Our team leverages industry-leading data assets and analytical models to transform the clinical development industry, driving clinical and operational success for our clients and partners. 

Participation in the internship program requires that you are located in the United States for the duration of the internship program. Roles are based out of either New York City or Boston. This internship is intended for students who are currently pursuing a Master’s degree program in a quantitative discipline with an anticipated graduation date on or before June 2024, depending on their program. 

Position Overview/ Project Description

  • This project will be focused on interpreting and analyzing multiple clinical and real world data sources to assist in the improvement of several Medidata AI data processes. A particular focus will be on data quality assessments and comparisons between datasets in clinical research and real-world medical data (charts, literature, claims data, EHR). This analysis will cover the breadth of clinical data management including data wrangling and cleaning, database querying and creation, and statistical programming and analysis.

  • This role will work alongside the data management team who specializes in harmonizing / standardizing diverse datasets and prototyping analytical tools to support several products and teams within Medidata AI. 

  • This project will require knowledge in randomized clinical trial and real world evidence datasets, data harmonization and linkage, biostatistical programming and analysis.  

  • This internship also offers a unique opportunity to participate in an Innovation Lab. This allows interns to partner with industry leaders and cross-functional teams to work on a real-world business problem that Medidata currently faces. All participants present their solutions to the leadership of the AI team, and the winner presents to the SVPs and CEO of Medidata and other key leadership.

Role responsibilities: 

  • Conducting exploratory data analysis to assess data quality and density across complex, disparate data sources

  • Designing, developing and validating methods of data aggregation across data platforms

  • Extracting and manipulating relevant data from unique data sources

  • Creating, improving, and documenting analytical methods

  • Reporting findings to internal teams

Qualifications / Competencies:

  • Proficiency in Python / R and SQL that allows you to be self-sufficient in analyzing data

  • Familiarity with drug development life cycle and experience with the manipulation, analysis and reporting of clinical trial data

  • A passion for understanding complex issues with creative, data-driven approaches

  • Excellent interpersonal, verbal, and written communication

  • Ability to multi-task in a fast-paced environment and prioritize deliverables to achieve results

  • Nice to have: Familiarity with oncology / cancer biology

Education and Experience:

  • Strong performance in a Master’s program in Data Science, Data Engineering, Mathematics / Biostatistics, Epidemiology, or related clinical field

  • Familiarity with clinical data management programming and techniques

  • Experience with data mining, cleaning, and/or querying clinical trial data and/or large healthcare datasets

The salary range posted below refers only to positions that will be physically based in New York City.  As with all roles, Medidata sets ranges based on a number of factors including function, level, candidate expertise and experience, and geographic location.  Pay ranges for candidates in locations other than New York City, may differ based on the local market data in that region. The base salary pay range for this position is $32.00 to $37.00 per hr with a $3500 sign on bonus. with a $3500 sign on bonus. 


Equal Employment Opportunity

In order to provide equal employment and advancement opportunities to all individuals, employment decisions at Medidata are based on merit, qualifications and abilities. Medidata is committed to a policy of non-discrimination and equal opportunity for all employees and qualified applicants without regard to race, color, religion, gender, sex (including pregnancy, childbirth or medical or common conditions related to pregnancy or childbirth), sexual orientation, gender identity, gender expression, marital status, familial status, national origin, ancestry, age, disability, veteran status, military service, application for military service, genetic information, receipt of free medical care, or any other characteristic protected under applicable law. Medidata will make reasonable accommodations for qualified individuals with known disabilities, in accordance with applicable law.

Covid Statement

Our Company requires all U.S. employees to be fully vaccinated against COVID-19 and to provide documentation of full vaccination, unless qualified for a medical, religious or state-required accommodation or otherwise exempt consistent with applicable law. Although accommodation requests will be considered (and granted where appropriate/possible), it may be determined that a candidate is unable to adequately perform the essential functions of the position without imposing an undue hardship due to customer requirements, staffing needs, or other business reasons. Definition of full-vaccination: Employees are considered to be fully vaccinated two weeks after their second dose in a 2-dose series or two weeks after a single-dose vaccine.