Data Scientist specializing in structured and unstructured data analysis and ML solutions for predictive analytics and NLP. Join a fast-paced team in Hyderabad, India.
Responsibilities
Build, deploy, and optimize ML models for predictive analytics, forecasting, classification, and regression
Perform large-scale feature engineering using PySpark and Big Data tools
Work on batch pipelines, model versioning, and experiment tracking
Develop cost estimation and risk/likelihood models using statistical and ML techniques
Build NLP pipelines using deep learning frameworks such as PyTorch, TensorFlow, or similar
Develop real‑time, low‑latency inference systems for text classification, embeddings, semantic search, summarization, and retrieval
Create prompts, context graphs, and agentic workflows for LLM-based systems
Apply knowledge of prompt engineering, context engineering, and autonomous agent frameworks to production systems
Work in Databricks for ETL, feature engineering, ML training, and orchestration
Use Azure services for model deployment, data pipelines, and infrastructure
Collaborate using Git-based workflows; leverage tools like GitHub Copilot, Claude Code, etc.
Implement model monitoring, observability, drift detection, and performance tracking
Requirements
3–7 years of experience as a Data Scientist
Strong hands-on experience with Databricks (Delta Lake, MLflow, Job Orchestration)
Excellent PySpark skills for large-scale distributed data processing
Proficiency in Azure cloud services (ADF, Azure ML, AKS, Databricks on Azure)
Strong understanding of ML algorithms, statistical methods, and data analysis
Experience with deep learning frameworks: PyTorch, TensorFlow, Transformers (HuggingFace)
Experience with model monitoring and ML observability
Senior Data Scientist designing and leading projects to enhance machine learning classifiers for cancer detection. Collaborating with cross - functional teams in a healthcare company focused on early cancer detection.
Placement Data Science Engineer at Medialab building data software and infrastructure for media advertising. Working on Python tools, automated pipelines, and AI solutions in a hybrid role based in London.
Data Scientist at Capital One leading machine learning initiatives to unlock customer behavior insights. Collaborating with product teams to enhance digital experiences across a vast dataset.
As a Data Scientist at Capital One, collaborate with cross - functional teams on AI/ML technologies. Drive innovation using big data, impacting customer financial experiences.
Lead Data Scientist focused on Reinforcement Learning algorithms at Fractal Analytics. Leading teams delivering scalable machine learning models in a fast - paced environment.
Data Scientist leading the implementation of AI use cases within a Data Factory at Capgemini Invent. Driving project success and managing teams for innovative solutions.
Data Scientist leading BI and ML modeling projects for PayPal’s Risk and Core Platforms. Collaborating with cross - functional teams to deliver scalable, data - driven solutions and insights.
Senior Marketing Data Scientist driving data - led insights for campaign performance and growth. Joining Offshore Marketing Effectiveness team at Superloop, supporting marketing strategy and analytics.
Data Scientist supporting analytics team driving business intelligence platform transition and performing data analysis. Collaborating with team to deliver insights guiding business decisions in a fast - paced environment.
Senior Data Scientist leading a team for a language - related product. Identifying data - driven methodologies and steering effective team results from a technology aspect.