Data Analyst developing and maintaining data pipelines using SQL and PySpark at Minsait. Collaborating with teams to enhance data quality and reporting capabilities.
Responsibilities
We are looking for a qualified Mid-level Data Analyst with a strong foundation in SQL-based development.
This role will focus on building and maintaining data pipelines using PySpark, with SQL as the primary coding language.
The candidate should also have a solid understanding of data modeling frameworks (such as Kimball dimensional modeling) and experience supporting data warehouses and data marts.
Develop and maintain batch data pipelines using PySpark (SQL-focused)
Write and optimize complex SQL queries to support business logic and reporting needs
Independently understand requirements and translate them into code
Transform and integrate data from various sources into Iceberg tables and Snowflake
Contribute to the development of data marts and curated datasets for business consumption
Collaborate with business analysts to understand data needs
Monitor and manage data jobs running on AWS EMR orchestrated by Airflow, leveraging S3, Glue, and other AWS services
Ensure data quality, consistency, and performance across the pipeline
Requirements
Fluent in English
Proven experience in SQL, including joins, aggregations, window functions, and performance tuning
Hands-on experience with PySpark, particularly Spark SQL
Familiarity with AWS data services (e.g., EMR, S3, Glue)
Understanding of data modeling frameworks, including the Kimball methodology
Experience working with Snowflake or similar cloud data warehouses
Knowledge of Apache Iceberg or similar table formats (e.g., Delta Lake, Hudi)
Experience building and managing data marts (preferred)
Exposure to Airflow or other orchestration tools (preferred)
Benefits
Company-subsidized health insurance for the employee.
Option to include dependents in the health plan with payroll-deducted premium.
Optional dental care plan.
Option to include dependents in the dental plan with payroll-deducted premium.
Meal allowance or grocery voucher.
Optional transportation voucher.
Impact & Care - Personal Support Program offering confidential emotional support and counseling in psychological, legal, financial, social and pet-related matters at no cost for the employee and legal dependents.
Gympass - Wellhub (Access to over 700 gyms across Brazil with plans starting at R$ 29,90 deducted from payroll).
Option to include dependents in Gympass - Wellhub (up to 3 dependents - paid via credit card).
Access to Udemy through our intranet.
Partnerships with major consumer brands for discounts.
Agreement with SESC for the employee and dependents.
Discount agreements with educational institutions (undergraduate and postgraduate) and language/certification schools.
Data Analyst joining Oscar's Clinical Data Analytics team in Tempe, Arizona. Analyzing performance metrics and insights to support business goals with a hybrid work model.
Data Analyst supporting business units with data - driven insights and performance metrics at Oscar Health. Collaborating across departments and delivering analysis to improve healthcare efficiencies.
Data Analyst intern at Ledger, supporting AI initiatives and analytics engineering. Collaborating with teams to transform data into actionable insights for business growth.
Data Analyst supporting supply chain risk management for federal agencies. Focusing on data analysis and mitigation strategies in a collaborative team environment.
Data Analyst supporting threat analysis operations within supply chain risk management program. Focus on identifying vulnerabilities and assessing geopolitical exposure.
Senior Quantitative Analyst validating the Standard Initial Margin Model in investment banking. Involves quantitative validation and risk analysis in a regulated environment.
HR Coordinator & People Analytics Specialist supporting HR operations and managing people analytics reporting. Ensuring data accuracy and handling sensitive employee information with confidentiality.
Data Analyst developing dashboards and reports using Power BI and SQL for strategic insights. Collaborating with business areas to map analytical needs and ensure data quality.
Business Data Analyst II supporting SAS/SQL - based CCAR Y14 reporting solutions at Truist. Ensuring accuracy and regulatory alignment of CCAR Y14 data through reporting enhancements and analysis.
ERP Data Analytics Governance Manager leading data governance and compliance initiatives within SAP S/4HANA transformation project. Collaborating across teams to ensure data integrity and regulatory adherence.