Senior Data Engineer

Research & Development

United States - NY, New York

Medidata: Powering Smarter Treatments and Healthier People

Medidata, a Dassault Systèmes company, is leading the digital transformation of life sciences, creating hope for millions of people. Medidata generates the evidence and insights that help pharmaceutical, biotech, medical device and diagnostics companies, and academic researchers accelerate value, minimize risk, and optimize outcomes. More than one million registered users across 2,000+ customers and partners access the world's most trusted platform for clinical development, commercial, and real-world data. Known for its ground-breaking technological innovations, Medidata has supported more than 30,000 clinical trials and 9 million study participants. Medidata's ongoing commitment to infusing the patient voice into trial designs and solutions is creating a better and more inclusive experience for all participants in clinical studies. Medidata is involved in nearly 40% of company-initiated trial starts globally, with studies conducted in more than 140 countries. More than 70% of novel drugs approved by the Food and Drug Administration (FDA) in 2022 were developed with Medidata software. Medidata is headquartered in New York City and has offices around the world to meet the needs of its customers. Follow us @medidata.

Our Team:

Medidata is looking for individuals who will help us solve some of the most complex questions facing the industry today using our proprietary platform and advanced analytics. Our users depend on our products to participate in any clinical trial using web, mobile and sensor-based solutions. We focus on the patient experience and aim to provide engaging solutions that fit into people's everyday lives. We make clinical trials faster by reducing burdens for both patients and study personnel. You will partner with all of the key stakeholder functions including product, delivery, data science, engineering, partnerships, and operational teams.

What we're looking for:

As a Senior Data Engineer, you are an important part of our team building strategic data pipelines into the Patient Cloud Data Platform. You will report to the Manager of Engineering and Testing and work with multiple teams daily. You are a motivated, technical data engineer with experience transforming data sets and will individually contribute in the following ways:

  • Promote the data vision and roadmap to meet team objectives.
  • Identify upcoming dependencies and propose solutions.
  • Implement data engineering strategies that align with our goals.
  • Ensure data engineering processes are efficient and compliant with regulatory requirements.
  • Identify opportunities for process improvement and new technology adoption.
  • Create new data pipelines and transform data into a structure that fits the product need.
  • Analyze complex data structures, elements and systems to create performant pipelines.
  • Develop logical and physical data models that are efficient and best suited for the intended use.
  • Manage data reliability and engineering planning from requirements to deployment, including assessing end-user and company needs, promoting engagement, and keeping processes running smoothly.
  • Enforce best practices for data auditing, scalability, reliability, high availability and application performance. Apply data extraction, transformation and loading techniques to connect large datasets from a variety of sources.
  • Be a mentor for junior and senior team members.
  • Perform data analysis and testing to identify data gaps and ensure data quality.
  • Work with application, data platform, and data engineering teams to reconfigure data ingestion pipelines so they are more reliable and better monitored.
  • Manage data incidents and lead blameless postmortems.
  • Help build monitoring, alerting, and reporting on the reliability of data pipelines and other big data processing systems.
  • Partner with product teams to develop a data management strategy that follows our goals.
  • Stay current with new technologies and data engineering concepts.
  • Follow Medidata's Standard Operating Procedures to ensure all software meets regulatory and company requirements.

Requirements (Education & Experience):

  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field with at least 5 years of data engineering experience in an enterprise environment.
  • Experience managing high-volume data processing streams.
  • Familiarity with building data visualizations and dashboards.
  • Experience building complex, automated data processes in Python, Spark, AWS, Airflow, Snowflake and other technologies.
  • Excellent knowledge of databases such as Postgres, Elasticsearch, Redshift, and Aurora, including distributed database design, SQL vs. NoSQL, and database optimizations.
  • Experience with data engineering, database management, business intelligence and data transformation tools.
  • Experience developing web applications in frameworks like Shiny, Vue, or React.
  • Along with programming proficiency, must have a capacity for.
  • Must have experience leading complex scientific data management and integration initiatives in a research environment.

The salary range posted below refers only to positions that will be physically based in New York City. As with all roles, Medidata sets ranges based on a number of factors including function, level, candidate expertise and experience, and geographic location. Pay ranges for candidates in locations other than New York City may differ based on the local market data in that region. The base salary pay range for this position is $114,750 to $153,000.

Base pay is one part of the Total Rewards that Medidata provides to compensate and recognize employees for their work. Most sales positions are eligible for a commission on the terms of applicable plan documents, and many of Medidata's non-sales positions are eligible for annual bonuses. Medidata believes that benefits should connect you to the support you need when it matters most and provides best-in-class benefits, including medical, dental, life and disability insurance; 401(k) matching; unlimited paid time off; and 10 paid holidays per year.



As a game-changer in sustainable technology and innovation, Medidata, a Dassault Systèmes company, is striving to build more inclusive and diverse teams across the globe. We believe that our people are our number one asset and we want all employees to feel empowered to bring their whole selves to work every day. It is our goal that our people feel a sense of pride and a passion for belonging. As a company leading change, it’s our responsibility to foster opportunities for all people to participate in a harmonized Workforce of the Future.