
Lead Site Reliability Engineer (GCP & Hybrid Cloud) Hybrid

Remote Full-time Hiring now

About the position

Join Cisco’s Enterprise AI team, the core group enabling Generative AI-powered experiences across Cisco. Our mission is to build secure, scalable AI platforms that empower teams to safely develop, deploy, and operationalize AI-powered solutions. We operate at the intersection of applied AI, cloud infrastructure, and security, partnering across engineering, security, compliance, and product teams to bring trusted AI to life at enterprise scale.

We are a fast-growing, highly collaborative team of platform engineers, AI engineers, and data scientists who value technical depth, ownership, and pragmatic execution. What makes this team exciting is the opportunity to define how secure Generative AI is built and governed inside a global technology leader.

As a Lead SRE, you will own the architectural integrity of our hybrid cloud infrastructure, ensuring our GCP and on-premise Kubernetes environments are resilient and secure. You will set the standard for automation and reliability that enables our AI models to scale globally.

Responsibilities

  • Lead the architectural design of scalable hybrid-cloud environments, managing GCP and on-premise Kubernetes clusters with Anthos Service Mesh (ASM) and Istio.
  • Direct the implementation of Identity and Access Management (IAM) policies and GCP Quota management to ensure secure and cost-effective resource utilization.
  • Architect multi-region, load-balanced microservices with DDoS hardening, end-to-end encryption, and automated secrets management.
  • Design a comprehensive observability strategy using Elasticsearch and Kibana to provide proactive alerts on service performance and cost envelope management.
  • Partner with development leads to integrate "Security by Design" into the automation and AI agent lifecycle using Apigee for secure API management.

Requirements

  • Bachelor’s Degree in Computer Science, Engineering, or a related field.
  • 7+ years of experience in Cloud/On-prem Operations, SRE, or DevOps.
  • Expert-level proficiency with Terraform, Kubernetes (GKE & On-prem), and Docker.
  • Hands-on expertise with Anthos Service Mesh (ASM), Istio, and Apigee.
  • Deep understanding of IAM implementation and GCP Quota management.

Nice-to-haves

  • GCP Professional Cloud Security Engineer or Network Engineer certification.
  • Experience with the ELK stack (Elasticsearch/Kibana) for large-scale observability.
  • Strong financial acumen for cloud cost optimization and proactive budget alerting.
  • Experience managing complex traffic between cloud platforms and on-premise data centers.

