Content Engineer managing web data extraction solutions for Meltwater's Content Support team. Analyzing website data structures and optimizing crawler setups in a hybrid work environment.
Responsibilities
Master our internal web-data extraction platform to configure, optimize, and maintain crawler setups.
Analyze website structures, HTML source code, and site behaviors to create accurate XPath expressions and regular expressions for data extraction.
Continuously improve extraction quality by identifying content gaps, reducing crawler failures, and ensuring high-quality structured output.
Monitor and troubleshoot crawling issues using logs, HTTP responses, and tooling insights to ensure consistent data accuracy and coverage.
Work cross-functionally with product, QA, and content teams to improve customer satisfaction through enhanced data completeness and reliability.
Document extraction logic, website behaviors, and configuration changes for internal knowledge sharing.
Requirements
Bachelor's Degree in Computer Science, Information Technology, or related field.
Strong written and verbal communication skills in English.
Solid understanding of HTML, DOM structure, and CSS.
Good understanding of HTTP concepts (status codes, redirects, authentication, headers, etc.).
Ability to quickly learn internal tools, proprietary systems, and new web technologies.
Strong analytical and problem-solving skills, especially when dealing with ambiguous or changing website structures.
High attention to detail, accuracy, and consistency in extraction logic.
Ability to adapt quickly in a fast-changing environment.
1-2 years of experience in a technical support or web-data-related role (preferred).
Experience with web crawling, web scraping, or data extraction workflows (preferred).
Working knowledge of XPath and Regular Expressions (preferred).
Familiarity with analyzing website source code, APIs, and network traffic (preferred).
Ability to debug technical issues using logs, HTTP responses, and browser developer tools (preferred).
Strong teamwork ethic with the ability to manage multiple tasks in parallel (preferred).
Experience working with spreadsheets (Google Sheets or similar) to manipulate and transform data (preferred).
Basic familiarity with JavaScript, Python, or other scripting languages (preferred).
Benefits
Enjoy flexible paid time off options for enhanced work-life balance.
Comprehensive health insurance tailored for you.
Employee assistance programs cover mental health, legal, financial, wellness, and behavioral areas to ensure your overall well-being.
Complimentary Calm app subscription for you and your loved ones, because mental wellness matters.
Energetic work environment with a hybrid work style, providing the balance you need.
Benefit from our family leave program, which grows with your tenure at Meltwater.
Thrive within our inclusive community and seize ongoing professional development opportunities to elevate your career.