
Dibya Darshan Khanal
Cloud & Infrastructure Engineer | DevOps | SRE
Cincinnati, Ohio • 5+ years experience • AWS Certified Solutions Architect • AWS Community Builder


About
Site Reliability Engineer specializing in cloud infrastructure, automation, and system reliability. Currently pursuing Masters in Computer and Information Sciences with expertise in AWS multi-region architectures, observability stack optimization, and zero-downtime deployments.
Technical Skills
Cloud Infrastructure
AWS, Azure, Git, Atlassian
Automation & CI/CD
Jenkins, CloudFormation, Terraform, Ansible, CodePipeline, GitHub, GitLab
Containerization
ECS, EKS, ECR, Docker, Kubernetes, Nexus
Logging & Monitoring
Datadog, CloudWatch, CloudTrail, Splunk, Site24x7, ELK, Nagios, Apache Superset
Programming & Scripting
Python, Shell, JavaScript, Groovy, C, C++, C#
AI & ML Tools
OpenAI, LangChain, TensorFlow, scikit-learn, AWS SageMaker, Apache Doris
Research & Publications
Peer-reviewed Journal
Comparative Security Analysis of Serverless Computing Platforms
i-manager's Journal on Cloud Computing, 11(1), 36-42
View PublicationIn Progress
AI-Driven Autoscaling in Microservices
Enhancing Reliability While Maintaining Scalability
Experience
Site Reliability Engineer
Dec 2022 – Jan 2025UBA Solutions Pvt Ltd.
- Built and optimized observability stack (Datadog, ELK, Grafana, Splunk), improving alert accuracy by 40% and reducing MTTR by 25%
- Architected AWS multi-region failover infrastructure, enabling zero-downtime deployments and boosting rollout reliability
- Maintained 99.95% uptime for a SaaS platform serving 100K+ users/hour via proactive scaling and automated recovery
- Received the '2023 Above and Beyond Award' for exceptional contribution and performance
Software Engineer & Associate Cloud and DevOps
Apr 2020 – Dec 2022Cloudlaya
- Managed 100+ websites using WHM and cPanel, ensuring uptime and rapid issue resolution
- Automated deployments for 50+ client sites with Jenkins & CodePipeline, cutting manual effort by 60% and reducing deployment time by 88%
- Migrated 20+ services to ECS Fargate using Terraform & CloudFormation, lowering hosting costs by 30% and boosting reliability
- Refactored and modernized legacy applications (SpringBoot 1.5.x, PHP, Node.js) and contributed to new feature development using Django and Vue.js, improving maintainability and reducing security risks by 40%
Education
Masters in Computer and Information Sciences
University of Cincinnati
Bachelors of Computer Science and Information Technology
Tribhuvan University
Certifications
Personal Projects
ARPF-TI
A modern AI-powered reverse proxy firewall combining rule-based filtering with Gemini and TinyLlama threat detection, real-time monitoring, and multi-source threat intelligence for comprehensive web security.
View on GitHubDatadog Synthetic Monitor Validator
Automated health check validation and alerting with Datadog API to reduce false positives and enhance reliability.
View on GitHubJava Spring Boot CI/CD Pipeline with AWS EKS
A complete CI/CD solution for deploying Java Spring Boot applications to AWS EKS using Jenkins, SonarQube, Docker, Terraform, and Kubernetes.
View on GitHubSoccer Data ETL Pipeline
Developed ETL pipeline to load soccer data into Apache Doris along with Apache Superset for analytics and visualization.
View on GitHubSecurity Scanning with AWS Inspector
Created Python automation for EC2/ECR vulnerability scanning integrated with CI/CD pipelines using AWS Inspector V2.
View on GitHubFixit.AI
FixIt.AI is a Django-based application that provides a comprehensive platform for API testing, chaos engineering, and automated root cause analysis. It helps developers test, break, and fix their APIs in a controlled environment.
View on GitHubCommunity
Open Source Contributor
Collaborating with global communities on innovative solutions
DevOps Kathmandu
Active member engaging in seminars and knowledge sharing