Overview

Principal Site Reliability Engineer AWS and Kubernetes Operations Jobs in Dundalk, Canada at Parallel Domain

Title: Principal Site Reliability Engineer AWS and Kubernetes Operations

Company: Parallel Domain

Location: Dundalk, Canada

Category:

Position: Principal Site Reliability Engineer for AWS and Kubernetes Operations
Location: Dundalk

Shape cloud reliability as a Principal Site Reliability Engineer leading AWS and Kubernetes operations. Enhance infrastructure for high-stake simulation workloads in autonomous vehicle technology.

In this role, you’ll be responsible for the overall health and performance of a scalable cloud architecture, managing EKS clusters, and ensuring security compliance. You will also drive initiatives for incident response and proactive issue prevention while collaborating across engineering and customer-facing teams to deliver a seamless experience.

Key Responsibilities:

• Evolve AWS infrastructure and enhance platform performance

• Lead incident investigations and deploy automated solutions

• Oversee security governance for cloud services

• Collaborate with customer teams for optimal availability

• Improve CI/CD pipelines for seamless development

Requirements:

• 5+ years in infrastructure engineering or SRE

• Proficient in infrastructure-as-code with Terraform

• Deep knowledge of AWS services and Kubernetes

• Strong networking fundamentals and security awareness

• Experience with monitoring tools like Prometheus

Become a key player in building reliable and high-performance cloud systems that foster innovation in autonomous vehicle simulation and beyond.
#J-18808-Ljbffr

 

Upload your CV/resume or any other relevant file. Max. file size: 800 MB.