Principal Cloud Engineer - AI Agentic
Comcast
Cable Operator
45071 West Chester, OH - USA
Senior Specialist / Project Manager
Experteer Overview
As Principal Cloud Engineer in the AI Agentic Team, you design and operate cloud-based platforms that power AI-enabled, agentic systems in production. You own infrastructure, automation, and observability to ensure reliability, security, and scalability. You collaborate with engineering, security, and AI teams to deliver production-ready platforms and guide on-call actions during incidents. This role blends hands-on engineering with operational ownership to advance safe, ethical AI-enabled operations. You will play a pivotal part in shaping resilient cloud foundations for cutting-edge AI initiatives.
Responsibilities
- Design and operate scalable, highly available cloud infrastructure for AI-enabled platforms
- Apply Infrastructure as Code (IaC) practices to provision resources securely
- Monitor platform health using metrics, logs, dashboards, and alerts and diagnose failures
- Lead troubleshooting of complex cloud and integration issues across distributed systems
- Develop automation tooling in Python and shell to improve reliability and efficiency
- Enhance observability with tools like Prometheus, Splunk, Grafana, CloudWatch
- Collaborate with application, data, and AI teams to ensure operability and scalability
- Support CI/CD pipelines and safe release workflows across environments
- Ensure secure cloud operations including secrets management, encryption, and network security
- Apply AI literacy to understand agent lifecycles and non-deterministic execution behaviors
- Use AI-assisted tools for investigations and optimization with engineering judgment
- Maintain runbooks, platform docs, and operational standards
- Participate in on-call rotations and act as escalation point during incidents
- Perform other duties as assigned
Key requirements
- Cloud Infrastructure Engineering (AWS, Azure, or GCP)
- Kubernetes & Container Platforms
- Infrastructure as Code (Terraform, Packer, Ansible, or equivalent)
- Python programming for automation
- Monitoring, Logging, and Alerting Systems
- CI/CD Tools and Release Automation
- Security Fundamentals (IAM, secrets management, encryption, network security)
Description
As Principal Cloud Engineer in the AI Agentic Team, you design and operate cloud-based platforms that power AI-enabled, agentic systems in p…
Take your next career step
1M+ top positions worldwide with salary benchmarks
Be discreetly found and contacted by headhunters
Exclusively for senior-level professionals and executives
Already a member?

