Hello, I'm
Mohammed Saad
Senior Site Reliability Engineer
Senior DevOps | Platform Engineering | MLOps
Senior SRE & DevOps Engineer with 9+ years of experience designing and scaling cloud-native distributed systems on AWS, GCP, and Azure. Expert in Kubernetes, Platform Engineering, Infrastructure-as-Code (Terraform, Pulumi), and observability. Passionate about community service; currently volunteering as a Full Stack Developer at Challengers CC.

9+ Years
Experience
Cloud Expert
AWS | GCP | Azure
50+ Projects
About Me
Passionate technologist driving innovation through automation and reliability

Senior Site Reliability Engineer | Senior DevOps Engineer | Platform Engineering
Senior Site Reliability Engineer & DevOps Engineer with over 9 years of experience designing, building, and scaling infrastructure for large-scale, cloud-native systems. I specialize in AWS, GCP, and Azure, with expertise in distributed systems, security, and scalability. Proven track record of owning systems end-to-end, solving ambiguous problems, and mentoring peers while delivering highly available, performant, and secure services.
Expert in building and scaling Kubernetes clusters and containerized workloads from the ground up. Proficient in Infrastructure-as-Code using Terraform and Pulumi, with hands-on experience maintaining and optimizing databases (PostgreSQL, DynamoDB, Redis) and backend services. Led initiatives on stateless architectures, CI/CD pipelines, and observability (Prometheus, Grafana, OpenTSDB, Envoy) to enhance scalability, maintainability, and reliability.
Strong background in platform engineering, designing internal developer platforms, and implementing Golden Signals dashboards (latency, traffic, errors, saturation) with SLO-driven alerting. Passionate about fostering a culture of collaboration and continuous improvement through technical leadership and mentorship. Proficient in Python, Go, and JavaScript with a technology generalist mindset.
Beyond my professional work, I am passionate about giving back to the community. I volunteer for non-profit organizations focused on community betterment. Currently, I serve as the Full Stack Developer & Playing Director at Challengers Cricket Club, a cricket club in the London community. I built their website from scratch and manage their social media presence, helping grow the club's digital footprint and community engagement.
Available for Remote Work • Open to Contract and Full-Time Opportunities in Senior SRE, Senior DevOps, Platform Engineering, and MLOps
9+ Years Experience
Senior SRE, Senior DevOps, MLOps, and Platform Engineering across industries
Cloud Certified
AWS DevOps Professional, GCP Architect, CKAD, LFCS, KCNA, KCSA
Platform Engineering
Distributed systems, cloud-native architecture, and scalability design
Community Leader
Non-profit volunteer, Full Stack Developer & Playing Director at Challengers CC
Experience
9+ years of building and scaling infrastructure
DevOps Engineer
- Led multi-cloud deployments using GCP (Cloud Build, Container Registry, GKE) and AWS, architecting scalable containerized solutions
- Directed CI/CD strategy with Cloud Build, GitHub Actions, and Artifact Registry, enabling blue-green deployments and automated updates
- Achieved a 40% reduction in MySQL downtime through blue-green deployment strategies
- Implemented New Relic, Grafana, and Prometheus for enhanced monitoring, resulting in a 30% increase in real-time monitoring capabilities
- Configured VPCs and load balancers for regional and global deployments, and enforced IAM security policies on GCP
- Orchestrated GKE deployments with complete CI/CD pipelines for automated updates and scalability
Data Engineer
- Increased data pipeline availability on GCP by 15%, improving system uptime
- Built and optimized GCP pipelines (Cloud Storage, Dataflow, BigQuery) for predictive analytics
- Executed predictive customer churn analysis with Python and machine learning models
- Created dashboards in Power BI for business insights and data visualization
- Managed Apache Airflow workflows and GKE migrations for scalable data processing
Data Engineer
- Deployed ARIMA forecasting models on Azure, optimizing parameters and scaling via Kubernetes
- Conducted predictive analytics (customer churn, CO₂ emissions) using Python, Logistic Regression, and ARIMA
- Built automated pipelines to load data into BigQuery via Informatica and operationalized workflows
- Integrated PySpark with Cassandra and Hive for scalable ETL/ELT operations
- Connected ML models with SAP datasets to support enterprise reporting and predictive analytics
- Prepared AI/ML services for production through MLOps best practices and containerized deployments
Data Analyst
- Analyzed data for business intelligence and reporting
- Created data visualizations and dashboards for stakeholder insights
- Performed data quality assessment and validation
Software Engineering Analyst
- Developed and maintained software applications
- Collaborated with cross-functional teams on technical solutions
- Implemented software testing and quality assurance processes
Skills & Expertise
Comprehensive technical stack for modern infrastructure and ML operations
Cloud Platforms
Container & Orchestration
Platform Engineering
Infrastructure-as-Code
CI/CD & Automation
Monitoring & Observability
FinOps & Cost Optimization
Distributed Systems
Data & ML Tools
Programming & Scripting
Databases & Storage
Security & Networking
Certifications & Badges
Industry-recognized certifications demonstrating expertise in cloud, DevOps, Kubernetes, and infrastructure
LFCS: Linux Foundation Certified System Administrator
The Linux Foundation
KCNA: Kubernetes and Cloud Native Associate
The Linux Foundation
KCSA: Kubernetes and Cloud Native Security Associate
The Linux Foundation
CKAD: Certified Kubernetes Application Developer
The Linux Foundation
Professional Cloud Architect Certification
Google Cloud
AWS Certified DevOps Engineer – Professional
Amazon Web Services
6 Certifications
2 Cloud Platforms
4 Linux Foundation
2027 Valid Until
Featured Projects
A selection of projects showcasing my expertise in cloud infrastructure, DevOps, and MLOps
Multi-Cloud CI/CD Infrastructure
Led GCP and AWS multi-cloud deployments with Cloud Build, GKE, and blue-green deployment strategies, achieving a 40% reduction in downtime
Predictive Analytics Pipeline (BPCL)
Built and deployed predictive models for customer churn and CO₂ emissions using Python, containerized and scaled on Azure Kubernetes Service
Churn Prediction & Customer Insights
Designed ETL/ELT workflows with Apache Airflow and Dataflow, deployed churn prediction models with CI/CD integration
Enterprise Data Pipeline (GCP)
Built optimized GCP pipelines with Cloud Storage, Dataflow, and BigQuery, increasing pipeline availability by 15%
Monitoring & Observability Stack
Implemented Prometheus, Grafana, and New Relic across multiple projects, achieving a 30% increase in monitoring capabilities
ML Integration with SAP Data
Connected enterprise SAP ERP datasets to ML pipelines with PySpark, Cassandra, and Hive, and used Kubernetes to scale inference services
Technical Blog
Sharing insights on DevOps, MLOps, and Cloud Engineering

Effortless Database Migrations in Production with Cloud Run Jobs
Master DevOps Project 4: Learn how to automate database migrations in production environments using Cloud Run Jobs and GCP
Let's Connect
Open to contract and full-time opportunities in MLOps, Site Reliability Engineering, DevOps, and Gen AI Development.
Get in Touch
Feel free to reach out for collaborations, opportunities, or just a friendly chat. I typically respond within 24 hours.
Email
mbadru3434@gmail.com
Phone
+1 (431) 726-3434
Location
Toronto, Ontario, Canada
Currently Available
Open for new opportunities





