Senior Software Engineer, AI/ML at Calix developing advanced AI models for content generation and data synthesis. Collaborate across teams to innovate in machine learning and AI technologies.
Responsibilities
Design and Build ML Models: Develop and implement advanced machine learning models (including deep learning architectures) for generative tasks, such as text generation, image synthesis, and other creative AI applications.
Optimize Generative AI Models: Enhance the performance of models like GPT, V AEs, GANs, and Transformer architectures for content generation, making them faster, more efficient, and scalable.
Data Preparation and Management: Preprocess large datasets, handle data augmentation, and create synthetic data to train generative models, ensuring high-quality inputs for model training.
Model Training and Fine-tuning: Train large-scale generative models and fine-tune pre-trained models (e.g., GPT, BERT, DALL-E) for specific use cases, using techniques like transfer learning, prompt engineering, and reinforcement learning.
Performance Evaluation: Evaluate models’ performance using various metrics (accuracy, perplexity, FID, BLEU, etc.), and iterate on the model design to achieve better outcomes.
Collaboration with Research and Engineering Teams: Collaborate with cross-functional teams, including AI researchers, data scientists, and software developers, to integrate ML models into production systems.
Experimentation and Prototyping: Conduct research experiments and build prototypes to test new algorithms, architectures, and generative techniques, translating research breakthroughs into real-world applications.
Deployment and Scaling: Deploy generative models into production environments, ensuring scalability, reliability, and robustness of AI solutions in real-world applications.
Stay Up-to-Date with Trends: Continuously explore the latest trends and advancements in generative AI, machine learning, and deep learning to keep our systems at the cutting edge of innovation.
Requirements
Bachelor’s, Master’s, or Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, Data Science, or a related field.
8+ years of overall software engineering in production.
3-5+ years of focus on Machine Learning.
Proven experience with generative AI models such as GPT, V AEs, GANs, or Transformer architectures.
Strong hands-on experience with deep learning frameworks such as TensorFlow, PyTorch, or JAX.
Strong coding experience in Python, Java, Go, C/C++, R.
Expertise in Python and libraries such as NumPy, Pandas, and Scikit-learn.
Experience with Natural Language Processing (NLP), image generation, or multimodal models.
Familiarity with training and fine-tuning large-scale models (e.g., GPT, BERT, DALL-E).
Knowledge of cloud platforms (AWS, GCP, Azure) and ML ops pipelines (e.g., Docker, Kubernetes) for deploying machine learning models.
Strong background in data manipulation, data engineering, and working with large datasets.
Good data skills - SQL, Pandas, exposure to various SQL and non-SQL databases.
Solid development experience with dev cycle on Testing and CICD.
Strong problem-solving abilities and attention to detail.
Excellent collaboration and communication skills to work effectively within a multidisciplinary team.
Proactive approach to learning and exploring new AI technologies.
Benefits
Flexible hybrid work model - work from Bangalore office for 20 days in a quarter
Full Stack Engineer at Schwarz IT Barcelona developing high - quality software using SOLID principles and agile methodologies. Collaborating in cross - functional teams to ensure product quality and performance.
Lead Engineer managing advanced semiconductor packaging programs at Micron. Collaborating cross - functionally to ensure compliance and operational excellence throughout project lifecycle.
Lead Engineer providing technical leadership for advanced packaging initiatives at Micron. Driving technology enablement and optimizing development workflows with a focus on innovation.
Senior Test Automation Engineer developing automated test scripts and strategies for quality assurance in medical devices. Collaborating with teams to enhance product reliability and testing methodologies.
Ada Software Engineer developing and sustaining mission - critical software for Defence sector. Contributing to software requirements, design documentation, and collaboration within Agile teams.
Full Stack Developer joining veritree, a climate tech startup, to build applications for reforestation efforts. Responsible for full - stack development and server maintenance with AWS services.
Software engineer developing generative AI technologies for enterprise solutions at WRITER. Collaborating with teams to transform business operations using AI applications.
Software engineer focusing on generative AI solutions at WRITER. Collaborating with cross - functional teams to deliver scalable applications and transform enterprise productivity.
Full Stack Developer automating core business processes in energy sector at Ørsted. Collaborating remotely with global teams on scalable and secure digital solutions.
Software Engineer at Eurobank developing banking products for the Core Banking Application Unit. Collaborating with product owners and business analysts in a team - oriented environment.