Back to the roster

Senior Customer Experience Engineer – Cloud Reliability, SLO Monitoring & Observability Specialist

Remote Full-time Hiring now

About arenaflex

arenaflex is a global leader in cloud‑enabled solutions, empowering organizations of every size to innovate, scale, and thrive in a digital‑first world. Our mission is to enable every person and every organization on the planet to achieve more through secure, reliable, and intelligent cloud services. At arenaflex, we blend the agility of a start‑up with the resources of an industry‑defining enterprise, creating an environment where bold ideas are turned into real‑world impact every day.

Our culture is built on four pillars: customer obsession, relentless innovation, inclusive collaboration, and continuous learning. We believe that when our customers succeed, we all succeed. This philosophy drives every team, from product engineering to support, and it’s the heartbeat of the Customer Experience Engineering group.

Role Overview

We are seeking a seasoned Senior Customer Experience Engineer to join our high‑impact Observability team. In this role, you will design, implement, and operate cutting‑edge Service Level Objective (SLO) monitoring solutions for our most strategic arenaflex Cloud customers. You will partner directly with customers to define measurable reliability targets, translate those targets into actionable telemetry, and build automated remediation pathways that keep applications running smoothly.

This position sits at the intersection of technical excellence and customer advocacy. You will be a trusted advisor to our customers, a catalyst for cross‑functional collaboration, and a champion for reliability across the arenaflex platform.

Key Responsibilities

Customer Collaboration & SLO Definition

  • Work hand‑in‑hand with customers to co‑create Service Level Objectives (SLOs) and Service Level Indicators (SLIs) that align with their business outcomes.
  • Translate business‑level reliability goals into concrete technical metrics that can be instrumented, measured, and reported.
  • Facilitate workshops, design reviews, and joint planning sessions to ensure mutual understanding of reliability expectations.

Instrumentation & Monitoring Development

  • Instrument application code, infrastructure, and platform services to capture SLO‑related telemetry at scale.
  • Develop and maintain automated pipelines that ingest, aggregate, and visualize SLO data in real time.
  • Build alerting and incident‑response playbooks that trigger when SLO thresholds are breached.

Automation & Remediation

  • Design self‑healing automation that can remediate common SLO violations without human intervention.
  • Create detailed troubleshooting guides and runbooks that empower both customers and internal support engineers to resolve issues quickly.
  • Integrate remediation workflows with arenaflex’s broader incident‑management platform to ensure seamless hand‑offs.

Cross‑Team Collaboration

  • Partner with service engineering, platform reliability, and product teams to align customer‑defined SLOs with arenaflex’s internal reliability targets.
  • Drive the creation of shared telemetry signals that provide a unified view of performance across the stack.
  • Advocate for reliability improvements in product roadmaps based on real‑world SLO data and trends.

Data Analysis & Continuous Improvement

  • Analyze historical SLO performance to identify patterns, emerging risks, and opportunities for optimization.
  • Produce regular reliability reports for customers, highlighting successes, areas for improvement, and actionable recommendations.
  • Lead initiatives to refine SLO definitions, adjust thresholds, and evolve monitoring strategies as workloads evolve.

Documentation & Knowledge Sharing

  • Maintain comprehensive documentation of customer‑specific SLOs, SLIs, monitoring architectures, and remediation processes.
  • Contribute to internal knowledge bases, best‑practice guides, and training materials to spread reliability expertise across arenaflex.
  • Mentor junior engineers and share insights through brown‑bag sessions, webinars, and community forums.

Required Qualifications

  • Bachelor’s degree in Engineering, Computer Science, or a related discipline, or equivalent practical experience.
  • Minimum 4 years of experience designing, implementing, debugging, and launching commercial software products or web services.
  • At least 3 years of hands‑on experience in Site Reliability Engineering (SRE) or Customer Reliability Engineering (CRE) within a cloud environment (arenaflex Cloud, AWS, or GCP).
  • Demonstrated expertise in building and managing SLOs and SLIs for enterprise‑grade cloud customers.
  • Minimum 2 years of experience in an external client‑facing role, with a track record of translating technical concepts into business value.
  • Strong programming or scripting skills (e.g., Python, Go, PowerShell, Bash) for instrumentation and automation.
  • Proficiency with observability stacks (e.g., Prometheus, Grafana, Azure Monitor, OpenTelemetry) and experience building dashboards and alerting rules.
  • Excellent communication skills, both written and verbal, with the ability to influence senior technical and business stakeholders.
  • Ability to obtain and maintain arenaflex Cloud security clearances and pass periodic background checks.

Preferred Qualifications

  • Master’s degree in Engineering, Computer Science, or a related field, combined with 6+ years of software industry experience.
  • Or a Bachelor’s degree with 8+ years of relevant experience.
  • Additional 2+ years of direct customer‑facing experience in a consulting or professional services capacity.
  • Experience with large‑scale distributed systems, microservices architectures, and container orchestration platforms (Kubernetes, Docker Swarm).
  • Familiarity with SLO‑driven reliability frameworks such as Google’s SRE book, or the “Four Golden Signals” methodology.
  • Prior involvement in building or operating a public‑cloud observability platform at scale.
  • Certifications related to cloud reliability (e.g., arenaflex Certified SRE, Certified Kubernetes Administrator).

Skills & Competencies

  • Analytical mindset: Ability to dissect complex telemetry data and surface actionable insights.
  • Customer empathy: Deep understanding of how reliability impacts business outcomes and a passion for delivering measurable value.
  • Collaboration: Proven ability to work across product, engineering, and support teams to drive shared goals.
  • Automation‑first approach: Commitment to reducing manual toil through code, scripts, and self‑service tools.
  • Problem‑solving agility: Comfortable navigating ambiguous situations, troubleshooting under pressure, and delivering rapid resolutions.
  • Communication excellence: Clear, concise, and persuasive storytelling for both technical and non‑technical audiences.
  • Continuous learning: Eagerness to stay current with emerging observability technologies, reliability best practices, and industry trends.

Career Growth & Learning Opportunities

At arenaflex, your career trajectory is limited only by your ambition. As a Senior Customer Experience Engineer, you will have access to:

  • Mentorship from senior leaders in the arenaflex Cloud Reliability organization.
  • Opportunities to lead cross‑functional reliability initiatives that influence product direction for millions of users.
  • Funding for certifications, conferences, and advanced technical training.
  • Rotational programs that allow you to explore adjacent domains such as product management, security, or data analytics.
  • A clear promotion path from senior individual contributor to principal engineer or engineering manager, based on impact and leadership.

Compensation, Perks & Benefits

arenaflex offers a competitive total rewards package that reflects the value you bring to the organization. While exact figures vary by location and experience, you can expect:

  • A base salary range that aligns with market benchmarks for senior reliability roles.
  • Annual performance bonuses tied to individual and team outcomes.
  • Equity participation through arenaflex stock options or restricted stock units.
  • Comprehensive health, dental, and vision coverage for you and your dependents.
  • Generous paid time off, parental leave, and flexible work‑from‑home policies.
  • Wellness programs, employee assistance resources, and a robust employee resource group (ERG) network.
  • Access to cutting‑edge cloud resources, sandbox environments, and a learning budget for personal development.

Why Join arenaflex?

Joining arenaflex means becoming part of a purpose‑driven organization where your work directly influences the reliability of mission‑critical applications worldwide. You will:

  • Help turn arenaflex Cloud customers into enthusiastic fans by delivering rock‑solid reliability experiences.
  • Work in a fast‑moving, start‑up‑like environment that values autonomy, creativity, and rapid iteration.
  • Collaborate with a diverse, inclusive team that celebrates different perspectives and backgrounds.
  • Contribute to a culture that encourages “no dead‑ends,” “get it done,” and “whatever it takes” mindsets.
  • Enjoy a supportive environment where personal growth, mentorship, and work‑life balance are top priorities.

Application Process

If you are passionate about cloud reliability, love turning data into action, and thrive in a customer‑obsessed culture, we want to hear from you. To apply, click the link below and submit your resume, a brief cover letter highlighting your SLO experience, and any relevant project portfolios.

Apply Now – Join arenaflex!

arenaflex is an equal‑opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. If you require accommodations during the application process, please let us know.

Apply for this job

Related roles

Remote Data Entry & Statistics Analyst – Full‑Time & Part‑Time Opportunities at arenaflex

Remote Full-time

Remote Data Entry Associate – Entry Level – Aviation & Travel Industry – $30‑40/hr – Full‑Time Remote Position at arenaflex

Remote Full-time

Remote Data Entry Specialist – College Student Friendly, Part‑Time & Full‑Time Opportunities at arenaflex

Remote Full-time

Senior Customer Logistics & Supply Chain Advisor – Remote Order‑to‑Cash Operations for Retail, Bottler & Distributor Partners

Remote Full-time

Senior GRC Data Entry Specialist – Remote (Part‑/Full‑Time) – $80,000 Yearly – arenaflex

Remote Full-time

Remote Member Services Representative – Payments & Digital Strategy – arenaflex Credit Union (Remote – Texas, USA)

Remote Full-time

Remote Customer Service Associate – Technical Support & Financial Services Specialist for Healthcare Solutions (Work‑From‑Home)

Remote Full-time

Remote Customer Service & Payments Operations Specialist – Remote (US) – arenaflex

Remote Full-time

Remote Data Entry Specialist – Call Support & Customer Assistance for Streaming Entertainment (arenaflex) – $32/hr – Work‑From‑Home

Remote Full-time

Data Platform Principal Product Manager – Remote Full‑Time/Part‑Time Role for College Students at arenaflex

Remote Full-time

Experienced Data Entry Technician – Customs Brokerage Team (Remote)

Remote Full-time

Environmental Health & Safety (EHS) Consultant

Remote Full-time

Sr CRA - FSP Oncology - California

Remote Full-time

Principal Product Design Architect

Remote Full-time

Access Engineer

Remote Full-time

Back End Developer - Mid-level job at Maximus in US National

Remote Full-time

(Part-time Work From Home Jobs) Apple Remote Jobs

Remote Full-time

Experienced Product Testers Wanted – Entry-Level Opportunity with arenaflex

Remote Full-time

Immediate Hiring: arenaflex Remote Careers Remote Jobs – Customer Service Representative

Remote Full-time

Experienced Field Customer Success Manager II - Small to Medium Business (SMB) - Cleveland, OH

Remote Full-time