Senior Data Engineer automating cleaning and analysis of data for clinical trials at Statistics & Data Corporation. Engaging in AI initiatives to enhance data processing and insights.
Responsibilities
**Job Summary**
The Senior Data Engineer will design and automate the cleaning, processing, and analysis of both clinical and nonclinical data, with the goal of transforming data into information that supports a better understanding of the safety and efficacy of new clinical therapies. The role contributes to the organization’s strong drive to be at the forefront of using Artificial Intelligence (AI) in clinical trials to simplify data processing and discover otherwise imperceptible correlations.
**Primary Responsibilities**
Designs and develops data architecture for new and existing applications and data sources
Establishes Data Quality, Data Governance and Master Data Management Best Practices to enable the business to maintain clean and accurate data
Designs ETL frameworks and features to allow for robust and scalable data pipelines
Incorporates new pipelines into the existing data model, augmenting as needed
Manages and maintains several production systems
Engages with various internal cross-functional departments to strategically design, develop and implement data pipelines while understanding the underlying data
Increases team productivity by developing, identifying, and implementing better tools and processes
Exemplifies good documentation, coding, and testing best practices
Develops standard operating procedures for the use of artificial intelligence and data engineering principles within clinical trials
Prototypes new ideas/technologies to create proof of concept and demos
Provides mentorship for other data engineers
Assists in executing responsibilities of Data Engineers including the following:
o Maintains and develops ETL pipelines
o Maintains an Enterprise Data Warehouse by updating translation logic and supporting warehouse servers
o Follows Data Quality, Data Governance and Master Data Management Best Practices
o Delivers high-quality software design documentation
o Prototypes new ideas/technologies to create proof of concept and demos
o Contributes to the development of standard operating procedures
o Performs other related duties incidental to the work described herein
Adheres to all essential systems and processes required at SDC to maintain compliance with business and regulatory requirements
Acts as a resource for other team members for debugging, code reviews, and other software development lifecycle activities
Contract Research Organization experience and familiarity with its operations
The above statements describe the general nature and level of work being performed by individuals assigned to this classification. This document is not intended to be an exhaustive list of all responsibilities and duties required of personnel so classified.
Requirements
**Required Skills**
Fluency in Python with experience parsing, manipulating, and converting data to and from a wide range of formats (CSV, JSON, XML, HTML, SQL tables, etc.)
Deep understanding of modern RDBMS concepts (triggers, indexes, views, stored procedures) and SQL syntax, including experience with at least one modern RDBMS
Ability to design an efficient data warehouse following dimensional modeling principles for scalable reporting
Solid understanding of multiple database systems (NoSQL, SQL)
System design capabilities to improve operational efficiency and costs.
Experience in the software development lifecycle.
Ability to mentor other data engineers and increase team productivity
Ability to work with stakeholders to translate business requirements into clear technical specifications
Ability to communicate effectively in writing and verbally.
Ability to identify issues, present problems, and implement solutions
Capability of communicating technical concepts clearly, concisely, and understandably to non-technical colleagues
Good leadership, organizational and time management skills, with the ability to multi-task
Strong interpersonal communication and presentation skills
**Education or Equivalent Experience**
Bachelor’s degree in a technical field with 7 years of technical experience, with the last ~4 years in a Data Engineering-centric role, OR 10+ years in a technical role, with the last ~5 years in a Data Engineering-centric role
Benefits
**Why SDC**
SDC is a team of diversified professionals who deliver exceptional Biometric Services, Consulting, and Technology Solutions to pharmaceutical, biologic, and medical device/diagnostic companies. Since 2005 our purpose has been to partner with sponsors, providing high-quality, experienced team members to develop great medicines that save lives and cure diseases in the most efficient manner possible. Our global team operates as a value partner to our clients by treating their needs as our own and delivering exceptional results. We are a specialty CRO in that we provide scalable service offerings, focused service-area specialists, efficient project timelines, optimal technology solutions, and proven success and experience. We bring the same commitment to our employees that we bring to our clients. By offering strong benefits, including competitive pay, generous time off, attainable career advancement, and a positive work/life balance, we are able to attract some of the most talented people in the industry.
We are committed to developing our employees. We recognize achievements, provide growth opportunities and career advancement, and offer a flexible work schedule, an engaging work culture, and employee benefits.
We are passionate about our company culture. Our recognition program is directly tied to our core values of Energy, Integrity, Engagement, Innovation, Ownership, and Commitment.
We strive to provide a place of belonging to our employees with fun and engaging activities from SDC’s culture club.
We are constantly growing and innovating to support our client and employee needs. Global in nature, we bring diverse perspectives enabling our growth in this ever-evolving industry.
With a proven track record, SDC has been successfully executing client clinical programs since 2005.