Colombia

Site Reliability Engineer (Caicedonia)

Site Reliability Engineer (Caicedonia)
Descripción
As a Senior Platform Engineer, you will design, automate, and evolve cloud platforms supporting a large microservices environment across Azure Government, Azure Commercial and AWS. This role requires deep technical expertise in Kubernetes, Azure networking, Terraform, observability, and platform reliability. You will build secure, scalable systems that enable engineering teams to deliver efficiently, while ensuring compliance, governance, and high availability. Responsibilities
- Design, build, and maintain Azure-first cloud and Kubernetes platforms including networking, autoscaling, and disaster recovery.
- Develop Infrastructure as Code using Terraform; contribute to automation.
- Build reusable platform components and services that support application delivery across the organization.
- Implement cloud governance, identity, compliance, and secrets management solutions.
- Introduce and maintain policy-as-code enforcement (OPA, Conftest, tfsec).
- Work closely with other teams on patching, vulnerability remediation, observability, and readiness.
- Collaborate with development teams to streamline deployments, improve platform reliability, and automate processes.
- Provide technical input during vendor/managed-service coordination and renewals.
- Continuously evaluate platform architecture to improve performance, availability, and security.
- Create and maintain documentation.
- Troubleshoot issues across pre-production and production environments. Required Skills, Knowledge, and Experience
- Minimum 5 years of experience as a Cloud, Infrastructure, or DevOps Engineer.
- Strong expertise with Azure and Kubernetes, including managing distributed, large-scale systems.
- Proficiency with Helm charts and Kubernetes.
- Deep understanding of Azure networking such as VNETs, Private Link, Load Balancers, DNS, Firewalls, WAF, and transit architectures.
- Strong identity and security understanding including Entra ID, RBAC, Managed Identities, and PIM.
- Experience implementing governance frameworks using policy-as-code tools.
- Familiarity with observability stacks such as Grafana, Prometheus, Azure Monitor, Elasticsearch, and OpenTelemetry.
- CI/CD pipeline experience with Jenkins, GitHub Actions, and GitOps workflows.
- Scripting experience with Bash, Python, or PowerShell.
- Fluent verbal and written English communication skills (B2+) required for daily collaboration with US and Canada teams.
- Ability to work independently, challenge assumptions, and drive initiatives to completion. Technology Environment Networking: VNETs, Private Link, Application Gateway, WAF, Load Balancers, DNS, Firewalls CI/CD: Jenkins, GitHub Actions, GitOps IaC: Terraform, Conftest Postúlate en Kit Empleo: kitempleo.com.co/empleo/1a4az7
Información clave
Consejos de seguridad
Ten cuidado con trabajos prometedores que no exigen demasiado.
1 / 10
Más info sobre el anuncio

El anuncio Site Reliability Engineer (Caicedonia) fue publicado en la categoría Caicedonia Otros trabajos de Locanto.

No hay más anuncios en Caicedonia para esta categoría, ¡por ahora!

Además, en esta sección, disponemos de más anuncios clasificados en un radio de 15 km. Haz clic aquí para verlos.