Data Science Intern
Internship, Software Development
United States - NY, New York
Medidata: Power Smarter Treatments and Healthier People
Medidata is leading the digital transformation of life sciences, creating hope for millions of patients. Medidata helps generate the evidence and insights to help pharmaceutical, biotech, medical device and diagnostics companies, and academic researchers accelerate value, minimize risk, and optimize outcomes. More than one million registered users across 1,900+ customers and partners access the world's most trusted platform for clinical development, commercial, and real-world data. Medidata, a Dassault Systèmes company, is headquartered in New York City and has offices around the world to meet the needs of its customers. Discover more at www.medidata.com and follow us @medidata.
The Medidata AI internship program is a competitive and comprehensive 12-week rotational program. We provide pioneering data and analytics products to most of the $100B pharmaceutical development industry and our team is made up of data scientists, statisticians, computer scientists, implementation designers, engineers, and business experts.
We are looking for interns to be an integral part of this dynamic team, where you would drive innovative research, client-facing deliverables to collaboratively build new data science solutions. Our team leverages industry-leading data assets and analytical models to transform the clinical development industry, driving clinical and operational success for our clients and partners.
Participation in the internship program requires that you are located in the United States for the duration of the internship program. Roles are based out of either New York City or Boston. This internship is intended for students who are currently pursuing a Master’s degree program in a quantitative discipline with an anticipated graduation date on or before June 2024, depending on their program.
Position Overview / Project Description:
This position supports the Intelligent Trials product area that provides operational analytics to accelerate enrollment and increase agility in execution in clinical trials. The role specifically will work alongside the modeling team who specializes in building ML models supporting multiple products under Intelligent Trials. This project will leverage machine learning and engineering skills to provide different levels of support to model productization, which includes creating and streamlining pipeline for data preprocessing, model development, and model testing; it will also provide opportunities to explore various approaches and data sources to improve ML models, as well as opportunities to understand datasets in the clinical trial domain. This role will require expertise in machine learning, model testing, pipeline automation, and R/Python programming.
This internship also offers a unique opportunity to participate in an Innovation Lab. This allows interns to partner with industry leaders and cross-functional teams to work on a real-world business problem that Medidata currently faces. All participants present their solutions to the leadership of the AI team, and the winner presents to the SVPs and CEO of Medidata and other key leadership.
Designing and developing machine learning systems using state-of-the-art technologies.
Building and validating statistical models for novel clinical trial applications.
Supporting productization of developed methods and code for integration with existing/new products
Creating, documenting, and maintaining code.
Reporting findings to internal teams.
Qualifications / Competencies:
Proficiency in Python or R, and SQL that allows you to be self-sufficient in analyzing data.
Good understanding of Machine Learning techniques (classification, regressions, and feature selection, etc)
Familiarity with software development tools, including Github, docker, AWS toolkit, etc
Strong time management and facility in prioritizing activities to achieve results.
Excellent interpersonal, verbal, and written communication.
Education and Experience:
Strong performance in an MS program in Data Science, Computer Science, Mathematics, Statistics, or Biostatistics
Experience with AutoML is a plus
The salary range posted below refers only to positions that will be physically based in New York City. As with all roles, Medidata sets ranges based on a number of factors including function, level, candidate expertise and experience, and geographic location. Pay ranges for candidates in locations other than New York City, may differ based on the local market data in that region. The base salary pay range for this position is $32.00 to $37.00 per hr with a $3500 sign on bonus.