I am an Applications and Systems Support Engineer with ~4 years of experience specializing in DevOps. My expertise encompasses the administration, troubleshooting, design, deployment, and automation of scalable, highly available systems across various cloud platforms. I am dedicated to ensuring efficient system operations and thrive in collaborative environments where I can deliver effective solutions. With a strong eagerness to embrace new challenges, I seek opportunities to leverage my skills and enhance organizational success, driving substantial value.
Configuring Services | Pager Oncall | Troubleshooting & Fix | Cloud and Application Support | Team Leadership | Fast Learning and Extensive Research on New Technologies | System designing, Documenting and Knowledge Sharing
• Worked on over 200+ cloud-based Service offerings across public and private cloud environments, ensuring optimal configuration, performance, and security, while supporting app teams across 10 environments (Dev, AT, PROD, RETAIL/PAYMENTS) running 3000-4500 microservice applications each.
• Spearheaded the deployment and management of cloud environments, implementing Infrastructure as Code (IaC) using Terraform and GitHub Actions to automate provisioning and lifecycle management of services.
• Supported cross-functional teams by troubleshooting System and Application issues and their resolution, sharing knowledge on scaling and optimizing cloud-native applications, and advising on cloud migration strategies.
• Developed and maintained robust IAM policies, roles, and user groups to manage access controls for AWS and other cloud resources.
• Designed and executed Disaster Recovery (DR) and Business Continuity (BC) plans, including creating technical documentation on the SDLC process, Naming Conventions, Data Governance Workflows, and recovery procedures for data warehouse and cloud services across cloud environments for $140 Billion Client.
• Automated CI/CD pipelines enabling seamless deployments for multi-cloud environments across Dev, AT, Prod, and Retail environments.
• Performed SSL certificate management for multiple environments, ensuring timely updates and proper certificate rotation to maintain system security.
• Collaborated with Business Units, Governance, Legal and Privacy, Security, and Application Teams to ensure solutions met privacy, compliance, and quality standards.
• Contributed to the automation of infrastructure management and application support using Python and Shell scripting, enabling efficiency in daily admin tasks.
• Played a key role in improving infrastructure reliability, working on scaling memory container space, and managing load balancing across multiple environments.
• Supported monitoring and optimization efforts using Wavefront, Looker, and other system and logs monitoring tools to ensure high availability and minimal latency.
• Leading the team, performance report generation monthly/quarterly, writing and assigning KPIs, mentoring juniors.
Earned a distinguished CGPA of 8.19/10