ANURAG YADAV
DEVOPS ENGINEER
DevOps Engineer with 3+ years of hands-on experience designing, automating, and operating production-grade cloud infrastructure across AWS, Azure, and GCP. Strong expertise in CI/CD pipelines, Kubernetes orchestration, Terraform-based infrastructure automation, observability, and security-first deployments for scalable web and microservices platforms.
CORE EXPERTISE
Cloud Platforms
AWS, Azure, GCP, DigitalOcean · Multi-Account, Multi-Environment Architectures · Cost Optimization, Secure Cloud Design
CI/CD Automation
GitHub Actions, GitLab CI/CD · Pipeline Architecture, Deployment Strategies, Pipeline Optimization
Containers & Kubernetes
Docker, Podman, EKS, MicroK8s · HPA, Helm, Containerized Microservices, GPU Workloads
Scripting & Automation
Python, Bash · Infrastructure Automation, Operational Workflows, Security Hardening
Infrastructure as Code
Terraform, Ansible · Module Design, State Management, Automated Provisioning
Observability
Prometheus, Grafana, Uptime Kuma · Proactive Alerting, System Monitoring, Incident Response
Security
IAM, OIDC, Secrets Management, Trivy, OWASP ZAP · Security-First Deployments, Vulnerability Scanning, Hardening
PROFESSIONAL EXPERIENCE
DevOps Engineer – VegaStack
Oct 2022 – Present
Key Responsibilities
- Designed, automated, and operated production-grade CI/CD platforms across AWS, Azure, and GCP using GitLab CI/CD, GitHub Actions, and Terraform.
- Managed self-hosted GitLab environments for multiple clients, improving repository governance, pipeline reliability, and developer productivity.
- Built and maintained Kubernetes platforms (Amazon EKS, MicroK8s) with autoscaling (HPA) to support containerized microservices and GPU-based workloads.
- Architected and deployed scalable web and serverless applications (Django, Node.js, React) behind Nginx, with secure IAM, secrets management, and zero-downtime deployment strategies.
- Automated infrastructure provisioning and migrations across multi-account AWS and multi-subscription Azure environments using Terraform.
- Implemented monitoring, alerting, and observability stacks using Prometheus, Grafana, Zabbix, and Uptime Kuma.
- Developed Bash and Python automation for security hardening, backups, Cloudflare Zero Trust tunnels, and Slack-based operational workflows.
- Owned the end-to-end infrastructure lifecycle, including architecture design, automation, deployment, monitoring, and production support.
Key Achievements
- Reduced AWS EC2 migration effort and downtime by ~30% through automated, repeatable Terraform-based migration workflows.
- Enabled secure, scalable, multi-environment deployments on GCP across Cloud Run, Cloud Functions, IAM, and Artifact Registry.
- Delivered large-scale, batch-based Azure VM provisioning across regions and subscriptions to accelerate QA and testing workflows.
- Improved system observability and shortened incident response times through proactive monitoring and alerting.
- Led weekly production deployments with rollback readiness, ensuring stable releases and minimal service disruption.
- Led multi-account AWS service migrations in collaboration with cross-functional teams, ensuring seamless transitions with zero critical outages.
- Produced architecture diagrams, operational documentation, and onboarding guides using Notion and Loom to improve client communication and knowledge sharing.
SKILLS
SCM Tools
Git, GitHub, GitLab
Cloud Platforms
AWS, Azure, GCP
Containers & Orchestration
Docker, Podman, Kubernetes
CI/CD Tools
GitHub Actions, GitLab CI/CD, AWS DevOps
Infrastructure Automation
Terraform, Ansible
Code Analysis & Security
SonarQube, Trivy, OWASP ZAP
Linux & Web Servers
RHEL 8, Ubuntu, Nginx, Apache
Monitoring Tools
Prometheus, Grafana, Zabbix, Uptime Kuma
Programming & Scripting
Python, Bash
Soft Skills
Communication, Collaboration, Problem Solving, Teamwork, Leadership, Time Management, Adaptability