Job Description
Nebula Nexus is pioneering the next generation of artificial intelligence infrastructure. We are seeking a visionary Senior AI Systems Architect to join our elite engineering team. In this role, you will be responsible for designing and deploying scalable, high-performance systems that power our next-generation generative models. You will work at the intersection of theoretical research and practical engineering, ensuring our platforms are robust, secure, and ready to handle the demands of the 2026 landscape and beyond. Join us in shaping the future of technology.
Responsibilities
- Architect and implement high-scale distributed AI systems and microservices.
- Lead the end-to-end lifecycle of Large Language Models (LLMs) and generative AI pipelines from research to production.
- Drive optimization strategies for inference latency, throughput, and cost-efficiency.
- Collaborate with data scientists and researchers to translate cutting-edge academic concepts into production-grade software.
- Establish and enforce best practices for MLOps, CI/CD, and model monitoring.
- Design resilient cloud-native infrastructure capable of handling petabyte-scale data.
- Mentor junior engineers and conduct rigorous architecture reviews.
Qualifications
- 8+ years of experience in software engineering, with at least 4 years focused on machine learning infrastructure.
- Deep expertise in Python, PyTorch, TensorFlow, or JAX.
- Proven track record of designing systems that scale to millions of users and handle massive data loads.
- Strong understanding of cloud architecture (AWS, GCP, or Azure) and containerization (Docker, Kubernetes).
- Familiarity with edge computing paradigms and quantum-ready frameworks.
- Excellent problem-solving skills and the ability to thrive in a fast-paced, innovative environment.