Staff Platform Engineer
New York, New York
Full Time
$170k - $260k
Our client’s services and engineering teams are in hyperscale mode. We are looking for experienced Platform Engineers to join the team and help scale cloud infrastructure and developer experience. You’ll work on a centralized Platform team whose work spans platform building, adoption, and ongoing support of existing tooling and software. This role is approximately 80% infrastructure focused and 20% application software focused.
You will help evolve the infrastructure stack to be multi-tenant and multi-cloud, drive implementation and adoption of cloud security best practices, build and manage modular Terraform, and help build and integrate new cloud infrastructure into production systems at scale. You’ll also implement service templates and developer tooling, including canary releases, feature flagging, load testing, and CI/CD pipelines.
The platform must maximize engineering velocity and security and operate at tremendous scale. Building it offers many opportunities to apply creativity, autonomy, and leadership to take systems from 0 to 1. This is a unique opportunity to rapidly grow your career at a fast-growing company leveraging emerging technologies.
What You’ll Do
- Design, build, and scale cloud infrastructure including networking, IAM, Kubernetes, databases, streaming and pub/sub platforms, storage, and distribution systems.
- Design and implement build pipelines, branching strategies, and release management tooling to support a rapidly growing engineering organization and increasing code velocity.
- Design, implement, and scale cloud security practices including CI and deployment scanning, least-privileged access controls, auditing, and maintaining SOC 2 and HIPAA compliance.
- Advocate for, design, implement, and adopt fast and scalable application testing pipelines, including end-to-end UI tests and hyperscale load testing.
- Improve incident response capabilities through enhanced observability, runbooks, and incident response processes across the organization.
- Bridge the gap between local development and production environments to maximize engineering velocity and security while minimizing issues caused by environment drift.
- Evangelize, document, and train the engineering team on platform solutions and cloud-native design strategies.
- Act as a public evangelist for our client within the global platform engineering community through conferences, open source contributions, and research, helping pioneer AI-first, cloud-native, security-first implementations at scale.

- 8+ years of software engineering experience, including 3+ years of infrastructure-as-code experience in a cloud-first organization.
- Strong experience building and scaling services on Kubernetes, including cloud-native tooling such as ArgoCD, Argo Rollouts, Istio, and related technologies.
- Hands-on experience operating Kubernetes clusters, including version upgrades, service mesh management, and maintaining Helm charts for application deployments.
- Experience creating and maintaining CI/CD pipelines for both infrastructure-as-code and application deployments (e.g., Terragrunt, Atlas, ArgoCD, Octopus Deploy, Travis CI).
- Experience with monitoring and observability practices at scale, including metrics, logs, and traces using platforms such as Grafana, Datadog, or Honeycomb.
- Comfortable implementing and securing services in Google Cloud Platform using infrastructure as code, including GCP projects, VPC networks, GKE, and IAM roles, groups, and policies.
- Experience with backend programming languages such as Python, Go, Node.js, or Rust.
- Up to date on industry best practices and tools, with a strong desire to continuously learn.
- Excited to be hands-on in a fast-moving, collaborative, and supportive environment.
- Willing to pitch in wherever needed in a fast-paced startup environment.