Senior Infrastructure Software Engineer - AI
New York, New York
Full Time
$150k - $250k
Our client, a rapidly growing venture-backed Legal AI technology startup, is seeking a Senior Platform Cloud Infrastructure Engineer to join their core engineering team. This company is building cutting-edge AI solutions that are transforming how legal professionals research, analyze, and draft complex legal content.
This is a high-impact opportunity to shape the foundational cloud infrastructure powering enterprise-grade AI products in a highly secure, compliance-driven environment.
About the OpportunityAs the Senior Platform Cloud Infrastructure Engineer, you will lead the design, implementation, and scaling of secure, high-performance cloud infrastructure supporting advanced AI and machine learning workloads. You will work cross-functionally with engineering, data science, and security teams to ensure reliability, scalability, and regulatory compliance.
This role is ideal for someone who thrives in high-growth startup environments and enjoys building scalable systems from the ground up.
Key Responsibilities Cloud Architecture & Platform Engineering-
Architect and manage scalable, secure, and cost-efficient cloud environments (AWS, GCP, or Azure).
-
Design and maintain Kubernetes-based container orchestration for production AI systems.
-
Develop and manage Infrastructure-as-Code (Terraform preferred) to ensure consistent, repeatable deployments.
-
Build and optimize CI/CD pipelines to support rapid and reliable software releases.
-
Implement modern deployment strategies (blue/green, canary, rolling deployments).
-
Support GPU-enabled and high-performance compute environments for model training and inference.
-
Optimize infrastructure for large-scale AI workloads, including LLM-based applications.
-
Design scalable storage and data architectures for handling sensitive legal datasets.
-
Improve observability across AI systems and platform services.
-
Implement cloud security best practices (IAM, VPC design, encryption, secrets management).
-
Support SOC 2 and other enterprise compliance initiatives.
-
Establish monitoring, logging, and alerting systems (Datadog, Prometheus, Grafana, or similar).
-
Define and manage SLAs/SLOs and lead incident response efforts.
-
Drive infrastructure cost optimization and performance improvements.
-
Provide guidance on DevOps and cloud-native best practices.
-
Contribute to long-term infrastructure strategy and architecture decisions.
-
Partner in scaling engineering operations as the company grows.
-
7+ years of experience in cloud infrastructure, DevOps, or platform engineering roles.
-
Strong production experience with AWS, GCP, or Azure.
-
Deep expertise in Kubernetes and containerized environments.
-
Hands-on experience with Infrastructure-as-Code tools (Terraform strongly preferred).
-
Experience supporting AI/ML workloads or high-performance compute environments.
-
Strong understanding of cloud security and best practices.
-
Experience operating highly available, production-scale systems.
-
Proficiency in scripting or programming (Python, Go, or similar).
-
Experience in regulated industries (legal, fintech, healthcare, etc.).
-
Familiarity with large language model (LLM) infrastructure.
-
Experience with vector databases or distributed data systems.
-
Prior involvement in SOC 2 or similar compliance audits.
-
Startup experience (Seed through Series B stage).
-
Opportunity to build foundational infrastructure for a category-defining AI company.
-
Direct impact on enterprise-scale legal technology transformation.
-
Competitive compensation package including equity.
-
Remote-first culture with strong growth trajectory.