Data Scientist designing and implementing analytical solutions using Python and AI technologies. Collaborating cross-functionally to deliver Generative AI solutions for business needs in a hybrid setup.
Responsibilities
Design and develop data models and analytical solutions.
Optimize data analysis processes and methodologies.
Ensure data solutions meet business and technical requirements.
Provide technical support for data science projects.
Collaborate with stakeholders to address data science needs.
Develop and maintain advanced Python-based applications in the Generative AI domain, ensuring high performance, reliability, and scalability.
Implement and optimize Generative AI models, including GPT, LLAMA, Mistral, FLAN T5 and other cutting-edge AI technologies, to create innovative solutions and knowledge graph.
Development of advanced RAG pipelines with proper embeddings, indexing, chunking, reranking, prompts and evaluation
Collaborate with cross-functional teams to integrate AI functionalities into broader systems and applications.
Utilize AWS/Azure/Databricks GPU machines to manage GPU memory effectively, maximizing performance and efficiency.
Stay updated on the latest advancements in Generative AI, Python development practices, and cloud services to continually enhance our AI capabilities.
Assist delivery leads in delivering Generative AI solutions to clients in a timely manner, ensuring client satisfaction and project success.
Requirements
4+ years of experience as a NLP and Python developer.
Experience with Pandas, NumPy, Scikit, NLP a must have
Key fundamentals in object-oriented design, data structures and systems.
Ability to integrate multiple data sources into a single system.
Familiarity with testing tools.
Ability to collaborate on projects and work independently when required.
Working knowledge of GitHub and Jira
Ability to document requirements and specifications.
Bachelor's or Master's degree in a quantitative field (CS, machine learning, mathematics, statistics) or equivalent experience.
4+ years of experience in data science, building hands-on ML models.
Candidate must be aware of entire evolution history of NLP (Traditional Language Models to Modern Large Language Models), training data creation, training set-up and finetuning
Knowledge of advanced RAG pipelines with proper embeddings, indexing, chunking, reranking, prompts and evaluation
Excellent programming skills in Python. Strong working knowledge of Pythons numerical, data analysis, or AI frameworks such as NumPy, Pandas, Scikit-learn, Jupyter, etc
SQL skills with SQL Server and Spark experience is preferred but not necessary.
Knowledge of predictive/prescriptive analytics including Machine Learning algorithms (Supervised and Unsupervised) and deep learning algorithms and Artificial Neural Networks
Experience with Natural Language Processing (NLTK) and text analytics for information extraction, parsing and topic modeling.
Excellent verbal and written communication. Strong troubleshooting and problem-solving skills. Thrive in a fast-paced, innovative environment
Experience with cloud platforms such as Azure, AWS, Databricks is preferred.
Data Scientist developing predictive models and automation workflows for the mortgage lifecycle. Collaborating with cross - functional teams to enhance operational efficiency and customer outcomes.
Data Scientist developing machine learning models and analytics solutions to improve decisions in the mortgage lifecycle from acquisition to servicing. Collaborating with teams on predictive modeling and automation workflows.
Senior Manager in Data Science and Analytics leading statistical modeling for mortgage and consumer lending. Collaborating with teams to deliver data - driven insights and improve operational efficiency.
Internship for AI in document processing at ArianeGroup, focusing on natural language processing and data analysis tasks in a collaborative environment.
Senior Data Scientist focused on Generative AI and LLM at Manulife. Develop and implement machine learning models to solve business problems and mentor peers.
Sr. Advanced Data Scientist leveraging advanced analytics and data science at Honeywell. Developing solutions for business growth and operational efficiency in the Atlanta office.
Data Scientist at Capital One leveraging technology to improve fraud prevention and customer safety. Collaborating with cross - functional teams to deliver industry - leading fraud defenses.
Data Scientist delivering insights for product and operations teams in Customer Support at Etsy. Using behavioral analysis to drive product development and strategy within a collaborative environment.
Data Scientist developing predictive models and enhancing investment strategies with AI at MDOTM. Collaborating in a dynamic research team to drive data - driven insights and innovative solutions.