Back to the roster

[Remote] Sr Platform Engineer

Remote Full-time Hiring now

Note: The job is a remote job and is open to candidates in USA. Flexential is a company focused on building and operating IT platforms, and they are seeking a Senior Platform Engineer to join their platform development team. This role involves hands-on engineering responsibilities for developing and managing critical platform subsystems, ensuring high availability and operational resiliency while utilizing native-AI capabilities.

Responsibilities

  • Design, develop and operationally manage automated, resilient, high availability, self-healing, secure platforms with native-AI capabilities for IT needs, serving both internal as well as customer business capabilities
  • Develop, and manage the Observability OpenTelemetry Central Backend Stack: Grafana Enterprise, Mimir, Loki, Tempo, and Alertmanager on Kubernetes/RKE2 via Helm and GitLab CI-CD
  • Build and manage iaC and CI-CD for automated provisiong and deployment, including Terraform modules for Infra/VM/storage provisioning, Ansible AWX playbooks for OS/App bootstrap, ArgoCD and Helm for Kubernetes configuration
  • Develop and manage OpenTelemetry Prometheus scrape profile library including SNMP exporters, REST API exporters, and cloud provider exporters (CloudWatch, Azure Monitor, GCP) for multiple device classes
  • Develop AIOps capabilities on platforms for e.g Observability use-cases: anomaly detection integrations, event correlation rules in Alertmanager, and synthetic monitoring patterns to reduce alert noise
  • Configure and maintain Zabbix auto-discovery: network range scanning, device classification, and Prometheus service discovery integration
  • Build and harden Edge Stack deployments (Prometheus + OTel collector) per data center site using GitOps templates
  • Integrate Alertmanager with ServiceNow: webhook routing, ticket enrichment, auto-close logic, and escalation policy configuration
  • Maintain platform security: Conjur/CyberArk secret injection at runtime, mTLS between stack components, RBAC in Grafana Enterprise
  • Author and maintain Grafana dashboards in JSON/GitLab — facility overview, network health, RED metrics, application telemetry
  • Mentor mid-level engineers, lead code reviews, and establish engineering standards for the team. Represent platform engineering in cross-functional architecture reviews and executive-level program updates
  • Perform other duties as required and assigned

Skills

  • DevOps / Automation - 5+ years in a production environment, Kubernetes (RKE2/k3s), Helm chart deployment, system services, Docker/container
  • LGTM Stack Development and Configuration - 4+ years: Grafana, Mimir, Loki, Tempo configuration, tuning, dash-boarding and production operations; Prometheus required
  • Senior-level Python / Scripting frameworks - 5+ years, Automation scripts, exporter development, GitLab pipeline scripting, REST API integrations
  • GitOps / CI/CD - 5+ years, GitLab CI/CD pipeline authoring; Terraform and Ansible as primary IaC tools; ArgoCD or Flux preferred
  • AIOps / Observability Engineering - 2+ years, Alertmanager rule authoring, anomaly detection integration, event correlation, noise reduction techniques
  • Working infrastructure (Linux/VM) management knowledge - 5+ years, Linux administration, VMware vCenter/VCF experience, Netapp storage management, network fundamentals (SNMP, TCP/IP)
  • Secrets Management - 2+ years, CyberArk/Conjur, HashiCorp Vault, or equivalent — runtime secret injection patterns
  • Minimal travel may be required
  • Experience and/or knowledge of ITSM processes and workflow automation e.g. Incident & Response Mgmt (IRM), Release mgmt., ServiceNow ITSM integration, alert routing, escalation policy design, SLA-driven on-call workflows
  • Hands-on experience or working knowledge of Boomi integrations PaaS(iPaaS) technologies
  • Experience working with BAS / BMS systems in a Datacenter / OT environment
  • Hands-on experience working with AWS products in a Well-architected Framework and multi-account model to develop various compute, storage, network iaaS and PaaS services for IT applications

Benefits

  • Medical, Telehealth, Dental and Vision
  • 401(k)
  • Health Savings Accounts (HSA) and Flexible Spending Accounts (FSA)
  • Life and AD&D
  • Short Term and Long-Term disability
  • Flex Paid Time Off (PTO)
  • Leave of Absence
  • Employee Assistance Program
  • Wellness Program
  • Rewards and Recognition Program

Company Overview

  • Flexential provides IT solutions including integrated colocation, interconnection, cloud, data protection, and professional services. It was founded in 2000, and is headquartered in Charlotte, North Carolina, USA, with a workforce of 501-1000 employees. Its website is https://www.flexential.com/.
  • Apply To This Job

    Related roles

    [Remote] Senior Software Engineer (PHP)

    Remote Full-time

    [Remote] Logistics & Sales Support (Internship or Ongoing Part-Time)

    Remote Full-time

    [Remote] Customer Service Representative

    Remote Full-time

    [Remote] Business Operations Intern

    Remote Full-time

    [Remote] Sr. Software Engineer - Backend

    Remote Full-time

    [Remote] Sales Specialist

    Remote Full-time

    [Remote] Remote 1099 Medicare Sales Agent - Pod Leader

    Remote Full-time

    [Remote] Principal Product Manager

    Remote Full-time

    [Remote] Medicare Sales Agent (Remote) - AL

    Remote Full-time

    [Remote] Business Development Manager - CPG

    Remote Full-time

    Experienced Data Entry and Review Writer for Mobile Apps and Games – Remote Opportunity with Flexible Hours and Comprehensive Training

    Remote Full-time

    SIU Medical Auditor (CPC/CPMA) Remote

    Remote Full-time

    Customer Service Agent – Remote Social Media & E‑Commerce Support Specialist for Premium Sugar‑Free Chocolate Brand

    Remote Full-time

    Experienced Online Live Chat Customer Support Representative – Remote Part-Time Opportunity for Exceptional Customer Service Professionals at arenaflex

    Remote Full-time

    Medical Transcriptionist (NOT REMOTE)

    Remote Full-time

    Experienced Remote Data Entry Specialist for High School Students – Flexible Part-Time Opportunities for Career Development and Skill Enhancement at blithequark

    Remote Full-time

    Experienced Customer Support Representative – Financial Institutions Software Solutions

    Remote Full-time

    Experienced Customer Support Representative – Remote Work Opportunity in the Aviation Industry with blithequark

    Remote Full-time

    Experienced Full-Time Worklife Customer Support Associate - Employee Assistance Program (EAP) and Management Consultant Resources at Blithequark (Fri-Tue 8:30AM-5:00PM EST)

    Remote Full-time

    Licensed Psychiatrist- New Hampshire

    Remote Full-time