Assess and inventory existing legacy data in shared folders and shared drives.
Assess the quality, completeness, and relevance of data to be migrated.
Identify and document data dependencies, redundancies, and inconsistencies.
Collaborate with stakeholders to understand data structures, relationships, and required formats for successful migration.
Develop a comprehensive data migration plan that outlines steps, timelines, and resource requirements for the migration process.
Identify potential challenges and propose solutions to mitigate risks associated with data migration.
Migrate legacy data to AWS S3 buckets, ensuring data integrity and security throughout the process.
Implement indexing strategies for the migrated data, enabling seamless access and retrieval.
Conduct thorough testing of the migrated data to ensure accuracy, completeness, and successful integration.
Create validation reports and document any discrepancies, working closely with stakeholders to resolve issues.
Collaborate with software engineers, database administrators, and development teams to ensure compatibility and alignment with the overall architecture.
Maintain effective communication with project stakeholders to provide updates on migration progress and gather feedback.
Create and maintain technical documentation related to data migration processes, procedures, and system configurations.
Generate reports on migration progress, data quality metrics, and issue resolution.
Provide user guides and training materials to assist end-users with accessing and utilizing the newly migrated data.
Requirements
Bachelor’s degree in Computer Science, Information Technology, Data Science, or a related field.
Previous experience as a Database Engineer and proven experience in data migration, particularly with legacy systems and cloud environments.
Proficiency in data migration tools and ETL processes (e.g., Talend, Informatica, SSIS, or similar).
Strong knowledge of database management systems (e.g., SQL Server, Oracle, MySQL, PostgreSQL).
Experience with scripting languages (e.g., Python, SQL, PowerShell) for data transformation and automation.
Strong knowledge of AWS, specifically S3, and experience with data storage and retrieval solutions.
Familiarity with indexing and data accessibility in software applications.
U.S. Citizen with active TS/SCI clearance (with CI Polygraph).
Senior Data Engineer at Clorox developing cloud - based data solutions. Leading data engineering projects and collaborating with business stakeholders to optimize data flows.
Data Engineer building solutions on AWS for high - performance data processing. Leading initiatives in data architecture and analytics for operational support.
Senior Data Engineer overseeing Databricks platform integrity, optimizing data practices for efficient usage. Leading teams on compliance while mentoring a junior Data Engineer.
Associate Data Engineer contributing to software applications development and maintenance using Python. Collaborating with teams for clean coding and debugging practices in Pune, India.
Lead Data Engineer responsible for delivering scalable cloud - based data solutions and managing cross - functional teams. Collaborating with global stakeholders and ensuring high - quality project execution in a fast - paced environment.
Data Engineer focusing on development and optimization of data pipelines in an insurance context. Ensuring data integrity and supporting data - driven decision - making processes.
Full Stack Data Engineer on a Central Engineering Portfolio Team in Chennai delivering curated data products and collaborating with data engineers and product owners.
Data Engineer designing and implementing data pipelines and services for Ford Pro analytics. Working with diverse teams and technologies to drive data - driven solutions.
Data Engineer developing best - in - class data platforms for ClearBank with a focus on data insights and automation. Collaborating closely with stakeholders and supporting data science initiatives.
Data Engineer operating cloud - based data platform for Business Intelligence and Data Science. Collaborating on data architectures and ETL processes for Sparkassen - Finanzgruppe.