Lead Python developer for Clarivate building FastAPI services, PySpark ETL, Databricks and knowledge graph integrations. Drive technical leadership, CI/CD, and mentor engineering team.
Responsibilities
Provide Technical Leadership: Guide the development team in implementing Python-based solutions, ensuring alignment with business requirements, performance standards, and scalability best practices.
Design, build, and maintain scalable Python applications for data processing, transformation, and analytics.
Utilize SQL and relational database concepts to design efficient data models, optimize queries, and manage data quality.
Implement and uphold best practices in Python software development, testing, and CI/CD pipelines.
Leverage Apache Spark, PySpark, and related big data technologies to develop distributed data processing pipelines.
Design and optimize Spark/Databricks ETL pipelines written in PySpark for high performance and scalability.
Build core Python FastAPI services to expose data from Databricks and graph databases to downstream applications.
Contribute to building a knowledge graph interconnecting healthcare and life science data domains.
Work on backend of AI assistant leveraging LLMs and machine learning to expose platform data and extract insights.
Mentor junior developers, facilitate knowledge sharing, and foster a culture of continuous learning.
Collaborate within a cross-functional C3 Data Platform Team in a startup-like environment inside Clarivate.
Requirements
5+ years of hands-on Python software development experience.
Advanced proficiency in SQL, database design, and relational database systems.
Experience developing and deploying solutions on cloud platforms, particularly AWS (S3, ECS, EMR, Glue, RDS).
Expertise in Apache Spark and PySpark for building scalable data solutions.
Familiarity with orchestration tools like Apache Airflow.
Knowledge of graph databases like Neo4j or AWS Neptune.
Solid grasp of CI/CD principles and experience implementing automated pipelines.
Practical experience with Databricks and Snowflake (desirable).
Exposure to Cassandra, Hadoop, Apache Hive (desirable).
Experience mentoring or providing technical leadership.
C++ Developer at Sperasoft creating high - quality game titles. Responsibilities include efficient code design, quality maintenance, and team collaboration.
Senior Software Engineer developing scalable software solutions for Coupa's spend management platform. Collaborating with cross - functional teams in a dynamic, agile environment in Pune, India.
Fullstack Engineer focusing on developing high - quality promotional capabilities using React and Node.js for a fast - growing technology organization. Join a skilled team to design and deliver innovative front - end experiences.
Backend Engineer developing reliable systems for Spotify's commerce platform. Collaborating with cross - functional teams to enhance the Checkout experience for users globally.
Developer creating CAP applications on SAP BTP for various projects in Germany. Collaborate on development cycle and optimize existing solutions in a dynamic environment.
Senior Full Stack Developer at a team creating proxy and web scraping solutions for Fortune 500 companies. Leading design, development, and mentoring in a hybrid work environment.
Experienced Technical Architect needed for a contract role, ensuring compliance with GDS standards in the UK. Overseeing architectural strategy, platform design, and integration within delivery teams.
Back End Software Engineer building solutions for maritime data processing. Working with a multidisciplinary team in a hybrid environment for the Signal Ocean Platform.
Software Engineer building services in Java with Spring Boot at Charlotte Tilbury. Joining a fast - paced team to deliver high - quality solutions with an emphasis on collaboration and innovation.
C++ Developer with CUDA designing and optimizing algorithms for 5G and 6G telecommunications. Collaborating on performance improvements and engaging in continuous learning about physical layer technologies.