Back to the roster

[Remote] Senior Site Reliability Engineer (SRE) / Platform Reliability Engineer

Remote Full-time Hiring now

Note: The job is a remote job and is open to candidates in USA. Mastech Digital is seeking an experienced Site Reliability Engineer (SRE) to lead reliability engineering initiatives for large-scale, mission-critical healthcare platforms. The role involves defining reliability KPIs, driving observability strategies, and leading incident response for enterprise platforms.

Responsibilities

  • Define and monitor reliability KPIs, SLIs, and SLOs
  • Drive observability and monitoring strategies across distributed systems
  • Lead incident response, RCA, and reliability improvements
  • Build automation for infrastructure and CI/CD pipelines
  • Partner with stakeholders on SLA and service-level management
  • Support modernization of enterprise platforms

Skills

  • Proven experience implementing SRE frameworks in large enterprise environments
  • Strong background supporting complex distributed systems
  • Java, Spring Boot
  • Azure, GCP, GKE
  • Kubernetes, CI/CD, JFrog
  • MongoDB, SQL
  • AppDynamics, Splunk, Grafana
  • Experience with Prometheus is a plus
  • Healthcare or PBM platform experience
  • Platform Engineering or Reliability Engineering background

Company Overview

  • Mastech Digital provides IT associates in digital and mainstream technologies, Digital Transformation Services around Salesforce.com and SAP It was founded in 1986, and is headquartered in Pittsburgh, Pennsylvania, USA, with a workforce of 1001-5000 employees. Its website is http://www.mastechdigital.com/.
  • Company H1B Sponsorship

  • Mastech Digital has a track record of offering H1B sponsorships, with 50 in 2026, 399 in 2025, 496 in 2024, 540 in 2023, 947 in 2022, 681 in 2021, 751 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Related roles