Job Description
The Challenge
We are building the infrastructure for Artificial General Intelligence (AGI) that benefits all of humanity. As the AI landscape evolves rapidly, our 2026 roadmap focuses on ensuring these powerful systems remain safe, aligned, and controllable. We are seeking a visionary Long-term AI Safety Researcher to lead our efforts in solving the alignment problem.
The Role
In this position, you will define the safety standards for next-generation models. You will work on the frontier of research, bridging the gap between theoretical safety guarantees and practical implementation in large-scale AI systems.
Responsibilities
- Develop novel technical frameworks for long-term AI safety and alignment.
- Design and execute adversarial testing and red-teaming protocols for LLMs and autonomous agents.
- Contribute to the open-source community by releasing safety benchmarks and tools.
- Collaborate with engineering teams to integrate safety constraints into model training pipelines.
- Write high-impact research papers for top-tier AI conferences (NeurIPS, ICML, ICLR).
- Advocate for ethical AI practices and safety guidelines within the organization.
Qualifications
- PhD, Master's degree, or equivalent experience in Computer Science, Physics, Mathematics, or a relevant field.
- Strong research background in Machine Learning, Natural Language Processing, or Reinforcement Learning.
- Expert proficiency in Python and deep learning frameworks (PyTorch, JAX).
- Experience working with Large Language Models (e.g., GPT, Claude, LLaMA).
- Excellent written and verbal communication skills for technical documentation and public speaking.
- Deep curiosity about the long-term impacts of AI and a commitment to safety.