Trial Design Intern
Internship, Software Development
United States - NY, New York
Medidata: Power Smarter Treatments and Healthier People
Medidata is leading the digital transformation of life sciences, creating hope for millions of patients. Medidata helps generate the evidence and insights to help pharmaceutical, biotech, medical device and diagnostics companies, and academic researchers accelerate value, minimize risk, and optimize outcomes. More than one million registered users across 1,900+ customers and partners access the world's most trusted platform for clinical development, commercial, and real-world data. Medidata, a Dassault Systèmes company, is headquartered in New York City and has offices around the world to meet the needs of its customers. Discover more at www.medidata.com and follow us @medidata.
The Medidata AI internship program is a competitive and comprehensive 12-week rotational program. We provide pioneering data and analytics products to most of the $100B pharmaceutical development industry and our team is made up of data scientists, statisticians, computer scientists, implementation designers, engineers, and business experts.
We are looking for interns to be an integral part of this dynamic team, where you would drive innovative research, client-facing deliverables to collaboratively build new data science solutions. Our team leverages industry-leading data assets and analytical models to transform the clinical development industry, driving clinical and operational success for our clients and partners.
Participation in the internship program requires that you are located in the United States for the duration of the internship program. Roles are based out of either New York City or Boston. This internship is intended for students who are currently pursuing a Master’s degree program in a quantitative discipline with an anticipated graduation date on or before June 2024, depending on their program.
This project will be focused on developing data modeling techniques for irregularly sampled longitudinal clinical data. The goal will be to research methods of modeling sparse longitudinal data into and adapt these for creating featured representations of patients enrolled in clinical trials.
This role will work alongside Clinical Data Science team working with CAR T cell and other immunotherapy data, building pipelines for data quality assessments, model training, tuning and evaluations.
This project will require adapting existing statistical techniques for sparse longitudinal data to a specific domain. To succeed, the intern will bring to the role a strong foundation in statistical learning, a mathematical aptitude, excellent skills in R/Python and copious amounts of tenacious creativity. They will leave the program with the knowledge of having used their skills to contribute substantially to an important problem in clinical development, hands on experience of working with a one-of-its-kind dataset and very likely co-authorship on a high-impact scientific publication.
This internship also offers a unique opportunity to participate in an Innovation Lab. This allows interns to partner with industry leaders and cross-functional teams to work on a real-world business problem that Medidata currently faces. All participants present their solutions to the leadership of the AI team, and the winner presents to the SVPs and CEO of Medidata and other key leadership.
Conducting exploratory data analysis from complex, disparate data sources.
Designing, developing and validating statistical models for novel medical applications.
Implementing and evaluating algorithms based on research literature.
Creating, documenting, and maintaining code.
Reporting findings to internal teams.
Qualifications / Competencies:
Proficiency in Python. pandas, numpy, sklearn data visualization toolkits (or the equivalent in R) that allows you to be self-sufficient in analyzing data.; Understanding of Machine Learning techniques (classification, regressions, and feature selection); Ability to apply statistical analysis techniques. Evidence of earlier work with longitudinal data will be a plus.
Ability to think creatively, access and analyze data independently, and evaluate both the big picture and the key details effectively; Excellent interpersonal, verbal, and written communication; Strong time management and problem-solving skills; Ability to multi-task in a fast-paced environment and prioritize deliverables to achieve results.
Education and Experience:
Strong performance in an MS/PhD program in Data Science, Mathematics, Statistics, Biostatistics, or Computer Science; Experience with statistical analysis
Familiarity with data science techniques and the theory behind them; Knowledge of clinical trial data and/or large healthcare datasets is a plus; Rising or Graduating Senior
The salary range posted below refers only to positions that will be physically based in New York City. As with all roles, Medidata sets ranges based on a number of factors including function, level, candidate expertise and experience, and geographic location. Pay ranges for candidates in locations other than New York City, may differ based on the local market data in that region. The base salary pay range for this position is $32.00 to $37.00 per hour with a $3500 sign on bonus.