Content Engineer maintaining web-data extraction solutions at Meltwater. Working with cross-functional teams to enhance data quality in a hybrid remote setup from Hyderabad.
Responsibilities
Master our internal web-data extraction platform to configure, optimize, and maintain crawler setups.
Analyze website structures, HTML source code, and site behaviors to create accurate XPath, and Regular Expressions for data extraction.
Continuously improve extraction quality by identifying content gaps, reducing crawler failures, and ensuring high-quality structured output.
Monitor and troubleshoot crawling issues using logs, HTTP responses, and tooling insights to ensure consistent data accuracy and coverage.
Work cross-functionally with product, QA, and content teams to improve customer satisfaction through enhanced data completeness and reliability.
Document extraction logic, website behaviors, and configuration changes for internal knowledge sharing.
Requirements
Bachelor's Degree in Computer Science, Information Technology, or related field.
Strong written and verbal communication skills in English.
Solid understanding of HTML, DOM structure, and CSS.
Good understanding of HTTP concepts (status codes, redirects, authentication, headers, etc.)
Ability to quickly learn internal tools, proprietary systems, and new web technologies.
Strong analytical and problem-solving skills, especially when dealing with ambiguous or changing website structures.
High attention to detail, accuracy, and consistency in extraction logic.
Ability to adapt quickly in a fast-changing environment.
1-2 years experience in a technical support or web-data related role (preferred).
Experience with web crawling, web scraping, or data extraction workflows (preferred).
Working knowledge of XPath and Regular Expressions (preferred).
Familiarity with analyzing website source code, APIs, and network traffic (preferred).
Ability to debug technical issues using logs, HTTP responses, and browser developer tools (preferred).
Strong teamwork ethic with the ability to manage multiple tasks in parallel (preferred).
Experience working with spreadsheets (Google Sheet or similar) to manipulate and transform data (preferred).
Basic familiarity with JavaScript, Python, or other scripting languages (preferred).
Customer-first mindset with the ability to translate content requirements actionable steps (preferred).
Benefits
Enjoy flexible paid time off options for enhanced work-life balance.
Comprehensive health insurance tailored for you.
Employee assistance programs cover mental health, legal, financial, wellness, and behavior areas to ensure your overall well-being.
Complimentary CalmApp subscription for you and your loved ones, because mental wellness matters.
Energetic work environment with a hybrid work style, providing the balance you need.
Benefit from our family leave program, which grows with your tenure at Meltwater.
Thrive within our inclusive community and seize ongoing professional development opportunities to elevate your career.
Content Marketing Manager at Endeavor4 creating demand - generating content for ERP and CRM solutions. Focusing on industries like Accounting, Finance, and Supply Chain with collaborative strategies.
Content Marketing Manager at Endeavor4 generating leads with compelling content for ERP and CRM solutions. Collaborating within the marketing team to support revenue growth.
Content Specialist creating enterprise - level content for BCM One's diverse global communications products. Collaborating with sales and marketing teams to amplify digital brand presence.
Product Manager overseeing AI - driven solutions for Netflix's content distribution and operations. Collaborate across teams to enhance machine learning in content platform and publishing tools.
Knowledge Content Author managing knowledge requirements and documentation for Ayvens' Knowledge Management System. Collaborating with stakeholders to maintain and implement knowledge standards and best practices.
Contents Adjuster managing insurance claims for property and casualty insurers. Analyzing coverage, negotiating settlements, and attending litigation hearings in a supportive environment.
Support EMEA Marketing team by maintaining and updating Columbia Threadneedle’s public websites. Ensure content is accurate, compliant and aligned with brand standards while improving usability and processes.
Content Publishing Manager ensuring regulatory - ready outputs in clinical documentation at GSK. Leading publishing expertise and driving excellence across technical compliance and publishing domains.
Content Research & Creation at Elisana Digital GmbH requiring research and formulation of structured marketing content for targeted audiences. Managing content organization with a hybrid work setup in Dorsten.