magnifier icon

Principal Cloud Engineer - AI Agentic

Comcast

Comcast

Cable Operator

45071 West Chester, OH - USA

Senior Specialist / Project Manager

Experteer Overview

As Principal Cloud Engineer in the AI Agentic Team, you design and operate cloud-based platforms that power AI-enabled, agentic systems in production. You own infrastructure, automation, and observability to ensure reliability, security, and scalability. You collaborate with engineering, security, and AI teams to deliver production-ready platforms and guide on-call actions during incidents. This role blends hands-on engineering with operational ownership to advance safe, ethical AI-enabled operations. You will play a pivotal part in shaping resilient cloud foundations for cutting-edge AI initiatives.

Responsibilities

  • Design and operate scalable, highly available cloud infrastructure for AI-enabled platforms
  • Apply Infrastructure as Code (IaC) practices to provision resources securely
  • Monitor platform health using metrics, logs, dashboards, and alerts and diagnose failures
  • Lead troubleshooting of complex cloud and integration issues across distributed systems
  • Develop automation tooling in Python and shell to improve reliability and efficiency
  • Enhance observability with tools like Prometheus, Splunk, Grafana, CloudWatch
  • Collaborate with application, data, and AI teams to ensure operability and scalability
  • Support CI/CD pipelines and safe release workflows across environments
  • Ensure secure cloud operations including secrets management, encryption, and network security
  • Apply AI literacy to understand agent lifecycles and non-deterministic execution behaviors
  • Use AI-assisted tools for investigations and optimization with engineering judgment
  • Maintain runbooks, platform docs, and operational standards
  • Participate in on-call rotations and act as escalation point during incidents
  • Perform other duties as assigned

Key requirements

  • Cloud Infrastructure Engineering (AWS, Azure, or GCP)
  • Kubernetes & Container Platforms
  • Infrastructure as Code (Terraform, Packer, Ansible, or equivalent)
  • Python programming for automation
  • Monitoring, Logging, and Alerting Systems
  • CI/CD Tools and Release Automation
  • Security Fundamentals (IAM, secrets management, encryption, network security)

Description

As Principal Cloud Engineer in the AI Agentic Team, you design and operate cloud-based platforms that power AI-enabled, agentic systems in p…
For members onlyMobile Experteer Ad

Take your next career step

  • 1M+ top positions worldwide with salary benchmarks

  • Be discreetly found and contacted by headhunters

  • Exclusively for senior-level professionals and executives

Already a member?

Experteer uses cookies.

Information on data protection