Foundational AI Research Scientist developing next-generation language models. Pioneering large-language-model architectures and attention mechanisms for efficient scaling.
Responsibilities
Research and prototype sub-quadratic attention architectures to unlock efficient scaling of large language models.
Design and evaluate efficient attention mechanisms including state-space models (e.g., Mamba), linear attention variants, and sparse attention patterns.
Lead pre-training initiatives across a range of model scales from 1B to 100B+ parameters.
Conduct rigorous experiments measuring the efficiency, performance, and scaling characteristics of novel architectures.
Collaborate closely with product and engineering teams to integrate models into production systems.
Stay at the forefront of foundational research and help shape Aldea's long-term model roadmap.
Requirements
Ph.D. in Computer Science, Engineering, or a related field.
3+ years of relevant industry experience.
Deep understanding of modern sequence modeling architectures including State Space Models (SSMs), Sparse Attention mechanisms, Mixture of Experts (MoE), and Linear Attention variants.
Hands-on experience pre-training large language models across a range of scales (1B+ parameters).
Expertise in PyTorch, Transformers, and large-scale deep-learning frameworks.
Proven ability to design and evaluate complex research experiments.
Demonstrated research impact through patents, deployed systems, or core-model contributions.
Nice to Have
Experience with distributed training frameworks and multi-node optimization.
Knowledge of GPU acceleration, CUDA kernels, or Triton optimization.
Publication record in top-tier ML venues (NeurIPS, ICML, ICLR) focused on architecture research.
Experience with model scaling laws and efficiency-performance tradeoffs.
Background in hybrid architectures combining attention with alternative sequence modeling approaches.
Familiarity with training stability techniques for large-scale pre-training runs.
Benefits
Competitive base salary
Performance-based bonus aligned with research and model milestones