The DevOps Engineer plays a key role in designing, automating, and maintaining the infrastructure and delivery pipelines that power our applications. This position is responsible for implementing cloud infrastructure using Infrastructure as Code (IaC), managing Kubernetes-based environments, and ensuring the reliability, performance, and security of our systems. The engineer will work closely with development, operations, and security teams to streamline deployments, improve monitoring, and support continuous integration and delivery (CI/CD) processes. Daily tasks include maintaining configuration management systems, writing automation scripts, monitoring system health, troubleshooting network and deployment issues, and contributing to infrastructure design with a focus on scalability and resilience.
Responsibilities:
Infrastructure Automation & Management
· Design and manage scalable, secure cloud infrastructure using Infrastructure as Code (IaC).
· Implement and maintain automation for resource provisioning and system configuration.
· Ensure high availability, fault tolerance, and performance across cloud environments.
CI/CD Pipeline Development
· Build, maintain, and improve CI/CD pipelines to support automated testing and deployments.
· Integrate tools and processes to ensure fast, secure, and reliable code delivery.
· Monitor pipeline performance and resolve build or deployment failures.
Configuration Management
· Use configuration management tools (e.g., Ansible) to manage infrastructure state.
· Maintain environment consistency across development, staging, and production.
· Apply patches, updates, and configuration changes programmatically.
Kubernetes and Container Orchestration
· Deploy and manage Kubernetes clusters and containerized workloads.
· Optimize application deployments for performance, reliability, and scalability.
· Troubleshoot Kubernetes and container-related issues in production environments.
Security and Compliance
· Implement security best practices in cloud and DevOps workflows.
· Collaborate with security teams to enforce policies, manage vulnerabilities, and support audits.
· Monitor access controls, secrets management, and compliance-related configurations.
Monitoring and Reliability
· Monitor infrastructure health using tools such as Prometheus, Grafana, or the ELK Stack.
· Set up alerting and logging systems to proactively identify and resolve issues.
· Participate in on-call rotations and incident response efforts.
Networking and System Troubleshooting
· Configure and troubleshoot network components (DNS, VPN, routing, firewalls).
· Resolve performance bottlenecks and system-level issues across environments.
· Ensure secure, fast, and reliable communication across services.
Collaboration and Documentation
· Work closely with development and security teams to streamline delivery processes.
· Provide infrastructure support during software development and releases.
· Contribute to internal documentation, architecture reviews, and knowledge sharing.
Qualifications:
· Bachelor’s degree in Computer Science, Information Technology, or a related field.
· 2+ years of relevant experience in DevOps, system administration, or cloud engineering.
· Experience managing production environments and deploying cloud-native applications.
· Kubernetes administration experience in production environments is highly preferred.
· Certification in a major cloud platform (e.g., AWS, OCI, or Azure) is a plus.
· Strong analytical, problem-solving, and communication skills.
· Excellent English communication skills (reading, writing, and speaking).