Foundational AI Research Scientist developing next-generation language models. Pioneering large-language-model architectures and attention mechanisms for efficient scaling.
Responsibilities
Research and prototype sub-quadratic attention architectures to unlock efficient scaling of large language models.
Design and evaluate efficient attention mechanisms including state-space models (e.g., Mamba), linear attention variants, and sparse attention patterns.
Lead pre-training initiatives across a range of model scales from 1B to 100B+ parameters.
Conduct rigorous experiments measuring the efficiency, performance, and scaling characteristics of novel architectures.
Collaborate closely with product and engineering teams to integrate models into production systems.
Stay at the forefront of foundational research and help shape Aldea's long-term model roadmap.
Requirements
Ph.D. in Computer Science, Engineering, or a related field.
3+ years of relevant industry experience.
Deep understanding of modern sequence modeling architectures including state-space models (SSMs), sparse attention mechanisms, Mixture of Experts (MoE), and linear attention variants.
Hands-on experience pre-training large language models across a range of scales (1B+ parameters).
Expertise in PyTorch, Transformers, and large-scale deep-learning frameworks.
Proven ability to design and evaluate complex research experiments.
Demonstrated research impact through patents, deployed systems, or core-model contributions.
Nice to Have
Experience with distributed training frameworks and multi-node optimization.
Knowledge of GPU acceleration, CUDA kernels, or Triton optimization.
Publication record in top-tier ML venues (NeurIPS, ICML, ICLR) focused on architecture research.
Experience with model scaling laws and efficiency-performance tradeoffs.
Background in hybrid architectures combining attention with alternative sequence modeling approaches.
Familiarity with training stability techniques for large-scale pre-training runs.
Benefits
Competitive base salary
Performance-based bonus aligned with research and model milestones