Language Data Analyst improving MT/LLM performance through data-driven frameworks for Romanian and CEE markets. Collaborating with cross-functional teams to optimize language data and automation.
Responsibilities
Monitor MT/LLM quality, corpus health, and workflow KPIs for Romanian and other priority locales; identify trends, anomalies, and root causes.
Design, build, and maintain reporting and dashboards for language data (quality signals, error rates, throughput, coverage, cost, time saved) using SQL and BI tooling.
Own data pipeline health across key sources (e.g., Crowdin/CTMS/CX feedback, translation evaluations, operational request data); drive data harmonization and documentation.
Run linguistic and statistical analysis on large datasets to improve data quality, coverage, and consistency, including terminology and entity consistency in high-impact surfaces (titles, attributes, navigation, support).
Develop and maintain automation to eliminate manual and repetitive work, including workflow orchestration, validation checks, and quality-alert mechanisms.
Partner with engineering, NLP, and Quality teams to validate AI/MT outputs, propose corrective interventions, and support model training and evaluation cycles.
Design and execute experiments and POCs (e.g., provider comparisons, prompt versions, context-aware translation, glossary enforcement); translate findings into recommendations and measurable impact.
Support operational decision-making by sizing opportunities and trade-offs (quality, cost, latency), and by maintaining a clear record of provider selection decisions and configurations.
Requirements
Romanian native or near-native proficiency; strong command of English required. Additional languages (e.g., Bulgarian, Greek, Turkish, Croatian) are a plus.
Extensive experience in data analysis / data science, computational linguistics, NLP, or language-technology roles with demonstrable ownership of end-to-end analytics and automation.
Strong SQL and coding (e.g. Python/Javascript) skills; experience building data pipelines and automations (e.g., Airflow/n8n/Make or similar).
Experience working with MT and LLM outputs, evaluation methodologies, corpus validation, and multilingual datasets; comfortable with prompt-based workflows and quality measurement.
Proficient in data visualization and reporting (Tableau, Power BI, Looker, or similar) with the ability to translate insights into clear recommendations for stakeholders.
Comfortable operating in cross-functional, international environments; strong communication and stakeholder management skills.
E-commerce, marketplace, search/discovery, or customer experience domain familiarity preferred.
Benefits
Hybrid working model with flexibility: a schedule that helps you find the right balance between flexibility and team bonding, including work-from-abroad opportunities and a summer working model.
Personalised training allowance and learning opportunities: Use your annual budget for any training or conference of your choice, explore our Learning Management System (LMS) anytime, and join in-person learning sessions offered throughout the year.
Responsibility from day one: Take full ownership from the start in a culture where every voice is heard and valued.
A diverse, international team: Collaborate with global peers across our offices in Berlin, Amsterdam, Dubai, and beyond, in a startup-spirited and collaborative environment.
Opportunities to grow with the best: Tackle meaningful challenges, develop through hands-on experience, and grow with the support of expert guidance and global mentoring.
Meaningful connections beyond tasks: Be part of team rituals, events, and social activities that help us stay connected and inspired.
Data Analyst developing dashboards and applications using Power BI and ETL processes at Instituto Aquila. Focused on data integration, quality, and supporting business insights.
Intern working on data analysis for V8/V12 engine development at BMW Group. Involved in data interpretation and tool development for efficient data processing.
Operations Data Analyst improving efficiency and operations using data analytics in a tech upskilling platform, collaborating with various stakeholders for insights and metrics.
Data analyst specialist driving data quality and availability for CEVA Logistics. Collaborating with internal teams and managing IT solution processes for operational excellence.
Responsible for collecting and analyzing multilingual conversational data focusing on diplomatic terminology for AI models development. Ensuring linguistic accuracy and terminology consistency.
Senior Data Analyst leading and executing complex data analysis projects for Xylem, a global water solutions company, with a focus on data - driven decision - making and collaboration.
Sr. Data Analyst in Financial Analytics at Sunrun connecting finance and data engineering. Responsibilities include investor reporting, report refactoring, and metrics creation.
Mobile Product Data Analyst responsible for creating dashboards and reports for mobile product teams. Analyzing data to improve outcomes and automating data distribution for stakeholders.
Analyzing productivity and sales metrics for Transamerica's marketing group. Providing insights and recommendations to optimize business performance based on data analysis.