Site Reliability Engineer- Azure

San Francisco, CA

100% Remote

Full Time

$140k - $160k

Job Description


Hiring for a Senior Site Reliability Engineer to help improve our customer experience by defining, maintaining, and improving service-level objectives for the our athletics platform. You will also help improve the development experience for the engineering team by automating infrastructure, building CI/CD pipelines, and improving the observability of the platform. This role will report to the Site Reliability Engineering Manager. If you have a strong foundation with Azure as a Site Reliability Engineer, believe in the power of continuous improvement, and have the willingness to help where needed, this role may be the place for you!

Required Skills & Experience
  • 3+ years’ experience working in Microsoft Azure
  • Understanding of DevOps Pipelines, CI/CD practices, and Infrastructure as Code (IaC)
  • Knowledge in implementing and managing DevOps tools such as Git, Azure DevOps, Swagger, Selenium, JMeter- Scripting skills particularly with PowerShell
  • Comfortable with troubleshooting app code in C#/ Java
  • Experience with SQL Server or other relational databases
  • Infrastructure Provisioning experience – ARM Templates
  • Knowledge in cloud monitoring tools, application performance monitoring tools, and operational dash-boarding
  • Experience with Azure solutions such as App Services, App Insights, Storage Accounts, Resource Groups and monitoring tools
  • Knowledgeable about sound engineering practices like continuous delivery, automated testing, (micro)services-based architecture, etc.



Desired Skills & Experience
  • Experience with Elasticsearch a plus
  • Experience with Scrum, Kanban and other agile methodologies


What You Will Be Doing

Projects:

  • Identify and implement reliability and efficiency improvements to the platform
  • Create and maintain terraform to manage test and production environments.
  • Create and maintain CI/CD pipelines and other automation tools.
  • Work with engineers to support product development from inception to release.
  • Participate in on-call rotations.
  • Respond to, diagnose, and solve production incidents.
  • Participate in root cause analysis and blameless post-mortems


    Applicants must be currently authorized to work in the US on a full-time basis now and in the future.


    #LI-GA1

Posted by: Grace Allen


Related Jobs

    Not Ready To Apply?

    Send us your resume and we’ll get started matching you with the right job.