Data Engineer designing and maintaining data pipelines for RebelDot, specializing in cloud-based solutions and data quality. Collaborating with cross-functional teams to deliver scalable data solutions.
Responsibilities
Designing, developing, and maintaining data pipelines, including both batch ETL processes and real-time streaming solutions, to support our product teams
Implementing and managing Snowplow-based event tracking pipelines to collect, validate, and process user behavioral data in real time for analytics and product insights
Collaborating with cross-functional teams (product managers, analysts, data scientists, etc.) to understand data needs and deliver insightful, scalable data solutions
Replicating and generalizing successful data pipeline patterns to accelerate new pipeline development and ensure consistency and reliability across projects
Developing reusable data processing utilities and tooling (leveraging common data-centric libraries and frameworks in Python) to streamline ETL/ELT workflows
Optimizing database performance and ensuring high reliability of our data stores through query optimization, indexing, and SQL tuning
Monitoring and enhancing Snowplow pipeline performance and data quality: troubleshooting pipeline issues, optimizing event collection, and implementing improvements to maximize uptime and data accuracy
Supporting data governance and quality initiatives to maintain data integrity, privacy compliance, and consistency across all data pipelines
Providing technical guidance on data integration, transformation, and analytics best practices to team members and stakeholders
Building reports and dashboards (in collaboration with BI/Analytics teams) to empower product and business teams with actionable insights from collected data
Acting as a data solutions expert for the organization: advising and assisting product teams in selecting the right data architectures, tools, and approaches (e.g., choosing the appropriate data storage, streaming service, or analytics tool for a given need)
Translating business and user requirements into technical specifications by working closely with product managers and engineers to ensure data solutions meet real-world needs
Requirements
5+ years of experience in data engineering, database development, and cloud-based data solutions (especially on AWS)
Strong proficiency in SQL (T-SQL, PL/SQL) and experience with database technologies (e.g., Oracle, SQL Server, Snowflake/Redshift)
Hands-on experience with ETL/ELT tools and frameworks, including modern cloud integration and orchestration services (e.g., AWS Glue, Apache Airflow, or Azure Data Factory) and dbt
Experience with Snowplow or similar event data tracking pipelines, including their implementation, maintenance, and optimization for behavioral data collection and analytics
Experience with data modeling, data integration, and data warehousing concepts
Strong programming skills in Python (e.g., Pandas, automation scripting)
Knowledge of data governance and data quality frameworks, as well as data security best practices and privacy regulations (e.g., GDPR)
Experience working in Agile development environments
Data Engineer at Booz Allen creating advanced technology solutions and managing data engineering activities for mission-driven projects. Collaborating with a multi-disciplinary team in a fast-paced environment.
Lead Platform Data Engineer focusing on data architecture and integration at Allegion, enhancing security solutions. Collaborating on data strategy and mentoring engineering teams for data standardization and quality.
Data Engineer building and optimising the Azure-based data platform at Castle Trust Bank. Collaborating to deliver scalable, reliable solutions empowering decision-making across the Bank.
Lead Data Engineer at Castle Trust Bank, owning the Azure-Databricks platform and SQL infrastructure. Delivering scalable and reliable solutions that drive strategic goals across the organization.
Data Engineer responsible for building and maintaining cloud environments and data pipelines at Boeing. Collaborating with teams to ensure system performance and deliver value to customers.
Data Engineer at Fellowmind creating data pipelines and platforms leveraging Microsoft Fabric. Collaborating with teams for end-to-end data solutions and ensuring quality and reliability.
Data Engineer developing data solutions for Noda’s smart building technology initiatives. Focused on scalable, high-performance analytics and business intelligence in a hybrid work environment.
Senior Consultant Data Engineering in Data Analytics team at BearingPoint Netherlands. Guiding clients in building reliable, scalable data pipelines and production-ready environments.
Senior Data Architect in Data Analytics team at BearingPoint Netherlands. Leading scalable data architectures and advising clients on data ecosystem development.
Senior Data Engineer leading the development of a high-load Data Platform at TENTENS Tech. Collaborating with analytics and infrastructure teams for scalable data access.