Career

Data Science Engineer – Summer Intern

Internship, Research & Development

United States - NY, New York

Requisition ID

538663

Apply

About our Company:

Medidata: Powering Smarter Treatments and Healthier People

Medidata, a Dassault Systèmes company, is leading the digital transformation of life sciences, creating hope for millions of people. Medidata helps generate the evidence and insights to help pharmaceutical, biotech, medical device and diagnostics companies, and academic researchers accelerate value, minimize risk, and optimize outcomes. More than one million registered users across 2,000+ customers and partners access the world's most trusted platform for clinical development, commercial, and real-world data. Known for its groundbreaking technological innovations, Medidata has supported more than 30,000 clinical trials and 9 million study participants. Medidata is headquartered in New York City and has offices around the world to meet the needs of its customers. Discover more at www.medidata.comand follow us on LinkedIn, Instagram, and X.

The Program:

At Medidata, interns will have the opportunity to accelerate their careers by working closely with experienced professionals and gain valuable, hands-on, full-time work experience. By being a part of our global organization, interns have the opportunity to work alongside our talented and committed professionals helping them to build a strong foundation for achieving their career goals. For 12 weeks, beginning May 20, 2024, interns will have an opportunity to gain a deep understanding of what it means to be a Medidatian. United around a single goal of empowering smarter treatments and healthier people. Medidatians work in a culture of curiosity, innovation and fun. You will be contributing to the line of business with sustainable and meaningful work.

Our Summer Internship program also includes instructor led training, guided mentorship, exposure to senior leadership and community service. In addition to individual and specific related responsibilities, each intern will participate in our Intern Innovation Lab. Assigned to cross-functional teams, interns will work closely to develop an innovative solution to a business problem currently facing Medidata. As they work diligently to present their final solutions to a panel of top Medidata leaders, we are confident that our interns will make a significant impact on our business.

About the Team:

Medidata is looking for individuals who will help us tackle some of the most complex questions facing the industry today using our proprietary platform and advanced analytics. At Medidata, we never work alone. This role will partner heavily with all of the key stakeholder functions including product, delivery, data science, engineering, partnerships, and biostatistics. Successful Medidata AI candidates will be skilled in analytical/quantitative thinking, structured communication, and excited about building the next horizon of Medidata's mission to power smarter treatments and healthier.

Responsibilities:

  • Collaborate with the Data Science engineering team to build ETL data pipelines.
  • Develop and optimize complex Snowflake SQL queries for efficient data extraction and transformation
  • Utilize Python data libraries for data extraction , analysis, and visualization.
  • Gain familiarity with cloud data engineering services like AWS S3, Redshift, Glue, and EMR
  • Work with project leadership to define data collection requirements from upstream and downstream systems
  • Gain exposure to data warehousing concepts and tools.
  • Perform troubleshooting of data issues and data pipelines.

Qualifications:

  • Pursuing a Master's degree in computer science/statistics field.
  • Proficient in writing complex SQL queries.
  • Proficient in Python data-related libraries (pandas, NumPy).
  • Experience with Git version control.
  • Familiarity with AWS data engineering services.
  • Experience using Docker for containerization and orchestration.
  • Familiarity with data warehousing concepts.
  • Ability to work independently, in a fast-paced and agile environment.

The salary range posted below refers only to positions that will be physically based in New York. As with all roles, Medidata sets ranges based on a number of factors including function, level, candidate expertise and experience, and geographic location. Pay ranges for candidates in locations other than New York, may differ based on the local market data in that region. The base hourly pay range for this position is $32.00 - $37.00 an hour and a $3,500 sign on bonus.

Applications will be accepted on an ongoing basis until the position is filled.

#LI-SB1

#LI-Hybrid

Note: Please be on the lookout for job scams. Medidata recruiters will never ask applicants for monetary compensation, credit card, or banking details.

Diversity

As a game-changer in sustainable technology and innovation, Medidata, Dassault Systèmes company, is striving to build more inclusive and diverse teams across the globe. We believe that our people are our number one asset and we want all employees to feel empowered to bring their whole selves to work every day. It is our goal that our people feel a sense of pride and a passion for belonging. As a company leading change, it’s our responsibility to foster opportunities for all people to participate in a harmonized Workforce of the Future.

Apply