AWS DevOps Engineer with 3+ years of hands-on experience designing production-grade cloud systems, CI/CD pipelines, and multi-tenant architectures serving 1000+ customers.
~80%
AWS Cost Reduced
I'm Akshay Ghalme — an AWS DevOps Engineer currently at BytePhase Technologies, where I manage production infrastructure for a multi-tenant SaaS platform serving 1000+ customer subdomains with 99.9% uptime.
I've reduced AWS costs by ~80% through architectural optimization, built zero-downtime CI/CD pipelines, and designed secure VPC architectures. I bring hands-on expertise in Terraform, Docker, Kubernetes, Jenkins, GitHub Actions, and the full AWS ecosystem.
1000+ subdomains managed
Solutions Architect – Associate
Terraform, CloudFormation
CI/CD with automated rollback
Production work at BytePhase Technologies — a multi-tenant SaaS platform serving repair businesses in 32+ countries.
~80%
AWS Cost Reduction
1000+
Customer Subdomains
99.9%
Uptime Maintained
Zero
Public DB Exposure
The Problem → The Solution → The Result
Problem
AWS bill was growing unchecked — oversized instances, unused resources, no reserved capacity planning, and inefficient traffic routing patterns.
What I Did
Audited the full AWS infrastructure. Right-sized EC2 instances based on actual usage patterns. Migrated all EBS volumes from gp2 to gp3. Implemented Reserved Instance planning. Redesigned traffic routing to eliminate redundant data transfer costs.
Result
~80% reduction in monthly AWS spend — without any performance degradation. Savings compounded every month.
Multi-tenant SaaS at scale
Problem
SaaS platform needed to serve 1000+ repair businesses, each with their own subdomain — while keeping data isolated, performance consistent, and security audit-ready.
What I Did
Designed secure VPC architecture with public/private subnet segmentation. Deployed RDS in private subnets with VPC peering. Enforced least-privilege IAM across all services. Zero public database exposure.
Result
99.9% uptime across 1000+ subdomains. Passed all security audits. Zero data breaches. Architecture scaled without re-engineering.
CI/CD pipeline automation
Problem
Deployments were manual, error-prone, and required maintenance windows. Each release risked downtime for 1000+ active customers.
What I Did
Built Jenkins-based CI/CD pipelines with automated testing, build triggers, health checks, and rollback mechanisms. Containerized services with Docker for environment parity.
Result
Zero-downtime deployments with automated rollback. No manual intervention during releases. Consistent deployments across dev, staging, and production.
Monitoring & incident detection
Problem
No centralized monitoring. Incidents were discovered by customers before the team. Mean time to detect (MTTD) was unacceptably high.
What I Did
Implemented CloudWatch for AWS-native metrics, Prometheus for custom application metrics, and Grafana for unified dashboards. Set up alerting for proactive incident detection.
Result
Dramatically reduced MTTD. Team now detects and responds to issues before customers notice. Full visibility across the entire stack.
The skills and areas I work with every day to build, manage, and optimize cloud infrastructure.
Production-grade AWS architecture — VPCs, EC2, RDS, S3, CloudFront, Route 53, ALB, ElastiCache. Multi-AZ deployments with security-first design.
Automated build-test-deploy pipelines with Jenkins, GitHub Actions, and ArgoCD. Zero-downtime deployments with rollback mechanisms and health checks.
AWS bill auditing, instance right-sizing, gp2→gp3 migrations, Reserved Instance planning, and architecture redesign. Achieved ~80% cost reduction.
Terraform modules, CloudFormation templates, Terragrunt — version-controlled, reproducible infrastructure across dev, staging, and production.
Trivy scanning, Cosign image signing, Syft SBOM, Checkov IaC scanning, Kyverno policies, least-privilege IAM, private subnet DB deployments.
Prometheus, Grafana, Loki, OpenTelemetry, CloudWatch. Full-stack observability with SLO burn-rate alerts and distributed tracing.
Open-source projects demonstrating production-grade DevOps practices.
End-to-end microservices e-commerce platform on AWS EKS — GitOps delivery, supply-chain security, chaos engineering, and zero-downtime deployments.
Multi-AZ AWS infrastructure with Terraform — VPC segmentation, containerized deployments, failure simulations, and documented recovery runbooks.
Open to remote opportunities and interesting engineering challenges. Best way to reach me is LinkedIn or email.
linkedin.com/in/akshay-ghalme
github.com/akshayghalme
akshayghalme07@gmail.com
Pune, Maharashtra, India
Available remotely worldwide
I'll get back to you within 24 hours.
Message sent!
I'll get back to you within 24 hours.
Something went wrong. Please email me directly.
Whether you're scaling infrastructure, optimizing costs, or looking for a DevOps engineer who cares about doing things right — I'd love to connect.