Principal DevOps Engineer - AI
New York, New York
Full Time
$200k - $250k
We are looking for a Principal DevOps/Platform Engineer to architect, scale, and optimize the AWS-based infrastructure and Kubernetes systems that form the backbone of our AI healthcare platform. This is a high-impact role where you’ll shape our cloud platform strategy and ensure the reliability, security, and scalability of our systems.
Responsibilities
- Lead the design, development, and management of scalable and secure AWS cloud infrastructure.
- Own and evolve our Kubernetes-based orchestration systems for deploying and managing microservices at scale.
- Build and optimize CI/CD pipelines to streamline development, testing, and deployment workflows.
- Collaborate with software engineers, data scientists, and security teams to ensure platform reliability and compliance with healthcare standards.
- Implement infrastructure as code (IaC) practices with tools such as Terraform or CloudFormation.
- Drive observability across systems (monitoring, logging, tracing) to ensure system health and proactive incident response.
- Establish best practices for cost optimization, high availability, disaster recovery, and performance tuning.
- Mentor and guide platform engineers while serving as a technical leader across the organization.
- Stay current with emerging cloud and containerization technologies to ensure our infrastructure evolves ahead of industry standards.
Qualifications
- 8+ years of experience in platform engineering, DevOps, or cloud infrastructure roles, including at least 3 years in a leadership or principal-level position.
- Expert-level experience with AWS services (EC2, ECS/EKS, S3, RDS, IAM, Lambda, etc.).
- Deep expertise with Kubernetes and containerization technologies (Docker, Helm).
- Strong proficiency with Infrastructure as Code (Terraform, CloudFormation, or similar).
- Experience building CI/CD pipelines (GitHub Actions, Jenkins, ArgoCD, or equivalent).
- Strong background in observability, monitoring, and logging (Prometheus, Grafana, ELK, Datadog, etc.).
- Solid understanding of cloud networking, security, and identity management in regulated environments.
- Prior experience in health-tech, life sciences, or other regulated industries is highly desirable.
- Excellent problem-solving skills, ability to thrive in fast-paced environments, and a collaborative mindset.
Benefits
- Work on cutting-edge AI technology with the potential to meaningfully impact healthcare outcomes.
- Be part of a mission-driven start-up where your contributions directly shape the platform’s success.
- Competitive compensation package, including equity.
- Flexible work environment with opportunities for growth and leadership.