Job Description
Are you ready to architect the digital backbone of tomorrow? Nexus Future Systems is seeking a visionary Senior AI Infrastructure Architect to join our elite team in San Francisco. As we prepare for our 2026 roadmap, we need an expert to design scalable, resilient, and future-proof AI ecosystems.
In this pivotal role, you will bridge the gap between theoretical AI models and high-performance, production-grade infrastructure. You will lead initiatives that optimize data flow, enhance machine learning model deployment, and ensure our systems remain secure against evolving cyber threats.
If you thrive in a fast-paced, innovative environment and want to leave a lasting impact on the technology of 2026 and beyond, we want to hear from you.
Responsibilities
- Architect and implement highly scalable distributed systems for AI workloads using Kubernetes and microservices.
- Design data pipelines that optimize the training and inference of large language models (LLMs).
- Collaborate with data scientists to translate model requirements into robust engineering solutions.
- Ensure 99.99% uptime and implement disaster recovery protocols for critical infrastructure.
- Drive cloud-native transformation strategies, focusing on cost-efficiency and performance.
- Conduct code reviews, technical architecture reviews, and mentor junior engineers.
Qualifications
- 10+ years of experience in software engineering, with at least 5 years in a high-scale infrastructure role.
- Deep expertise in Python, Go, or Rust, and proficiency in cloud platforms (AWS, GCP, or Azure).
- Strong understanding of containerization (Docker, Kubernetes) and CI/CD pipelines.
- Experience with MLOps tools and frameworks (MLflow, Kubeflow, TensorFlow Serving).
- Proven track record of designing secure, high-availability systems.
- Master's degree in Computer Science, Engineering, or a related field is preferred.