Job Description:
We are looking for a skilled and proactive Platform Engineer to design, build, and maintain the foundational infrastructure that enables software development, deployment, and operations. The ideal candidate will ensure the reliability, scalability, and performance of cloud and on-premise platforms while collaborating closely with DevOps, SRE, and development teams. This role is pivotal in enabling seamless CI/CD pipelines, infrastructure as code, and platform reliability.
Key Responsibilities:
-
Design and implement highly available, scalable, and secure platforms to support development and production workloads.
-
Build reusable infrastructure components using Infrastructure as Code (IaC) tools like Terraform, Pulumi, or CloudFormation.
-
Managing and administering AKS clusters (node pools, autoscaling, RBAC, networking, etc.).
-
Deploying containerized applications using kubectl, Helm, and YAML manifests.
-
Monitoring and troubleshooting workloads running in AKS.
-
Collaborate with DevOps to implement and improve CI/CD pipelines using GitLab CI, Jenkins, ArgoCD, etc.
-
Support containerization and orchestration platforms like Docker and Kubernetes (K8s).
-
Work closely with development, QA, and SRE teams to support platform operations and troubleshoot infrastructure issues.
-
Participate in root cause analysis (RCA) and continuous improvement efforts.
-
Implement monitoring, logging, and alerting using tools like Prometheus, Grafana, ELK/EFK stack, Datadog, or Splunk.
-
Participate in root cause analysis (RCA) and continuous improvement efforts.
Professional Skills:
-
Deep hands-on experience with Azure Kubernetes Service (AKS).
-
Experience with pod security policies, config maps, secrets, ingress controllers, and network policies.
-
Scripting – PowerShell / Python.
-
Linux/Unix System Administration.