Available for opportunities

Hello, I'm

Mohammed Saad

Senior Site Reliability Engineer

Senior DevOps | Platform Engineering | MLOps

Senior SRE & DevOps Engineer with 9+ years of experience designing and scaling cloud-native distributed systems on AWS, GCP, and Azure. Expert in Kubernetes, Platform Engineering, Infrastructure-as-Code (Terraform, Pulumi), and observability. Passionate about community service; currently volunteering as a Full Stack Developer at Challengers CC.

9+ Years Experience

Cloud Expert

AWS | GCP | Azure

50+ Projects

About Me

Passionate technologist driving innovation through automation and reliability

Toronto, Ontario, Canada
AWS | GCP | Kubernetes | Terraform

Senior Site Reliability Engineer | Senior DevOps Engineer | Platform Engineering

Senior Site Reliability Engineer & DevOps Engineer with over 9 years of experience designing, building, and scaling infrastructure for large-scale, cloud-native systems. I specialize in AWS, GCP, and Azure, with expertise in distributed systems, security, and scalability. Proven track record of owning systems end-to-end, solving ambiguous problems, and mentoring peers while delivering highly available, performant, and secure services.

Expert in building and scaling Kubernetes clusters and containerized workloads from the ground up. Proficient in Infrastructure-as-Code using Terraform and Pulumi, with hands-on experience maintaining and optimizing databases (PostgreSQL, DynamoDB, Redis) and backend services. Led initiatives on stateless architectures, CI/CD pipelines, and observability (Prometheus, Grafana, OpenTSDB, Envoy) to enhance scalability, maintainability, and reliability.

Strong background in platform engineering, designing internal developer platforms, and implementing Golden Metrics dashboards (latency, traffic, errors, saturation) with SLO-driven alerting. Passionate about fostering a culture of collaboration and continuous improvement through technical leadership and mentorship. Proficient in Python, Go, and JavaScript with a technology generalist mindset.

Beyond my professional work, I am passionate about giving back to the community. I volunteer for non-profit organizations focused on community betterment. Currently, I serve as the Full Stack Developer & Playing Director at Challengers Cricket Club, a cricket club in the London community. I built their website from scratch and manage their social media presence, helping grow the club's digital footprint and community engagement.

Available for Remote Work • Open to Contract and Full-Time Opportunities in Senior SRE, Senior DevOps, Platform Engineering, and MLOps

9+ Years Experience

Senior SRE, Senior DevOps, MLOps, and Platform Engineering across industries

Cloud Certified

AWS DevOps Professional, GCP Architect, CKAD, LFCS, KCNA, KCSA

Platform Engineering

Distributed systems, cloud-native architecture, and scalability design

Community Leader

Non-profit volunteer, Full Stack Developer & Playing Director at Challengers CC

Experience

9+ years of building and scaling infrastructure

DevOps Engineer

Hotspex Media
May 2023 - Present
  • Led multi-cloud deployments using GCP (Cloud Build, Container Registry, GKE) and AWS, architecting scalable containerized solutions
  • Directed CI/CD strategy with Cloud Build, GitHub Actions, and Artifact Registry, enabling blue-green deployments and automated updates
  • Achieved a 40% reduction in MySQL downtime through blue-green deployment strategies
  • Implemented New Relic, Grafana, and Prometheus for enhanced monitoring, resulting in a 30% increase in real-time monitoring capabilities
  • Configured VPCs and load balancers for regional and global deployments, and enforced IAM security policies on GCP
  • Orchestrated GKE deployments with complete CI/CD pipelines for automated updates and scalability

Data Engineer

Insight2Actions
March 2022 - May 2023
  • Increased data pipeline availability on GCP by 15%, improving system uptime
  • Built and optimized GCP pipelines: Cloud Storage, Dataflow, BigQuery for predictive analytics
  • Executed predictive customer churn analysis with Python and machine learning models
  • Created dashboards in PowerBI for business insights and data visualization
  • Managed Apache Airflow workflows and GKE migrations for scalable data processing

Data Engineer

BPCL (Bharat Petroleum)
March 2020 - March 2022
  • Deployed ARIMA forecasting models on Azure, optimizing parameters and scaling via Kubernetes
  • Conducted predictive analytics (customer churn, CO₂ emissions) using Python, Logistic Regression, and ARIMA
  • Built automated pipelines to load data into BigQuery via Informatica and operationalized workflows
  • Integrated PySpark with Cassandra and Hive for scalable ETL/ELT operations
  • Connected ML models with SAP datasets to support enterprise reporting and predictive analytics
  • Prepared AI/ML services for production using MLOps best practices and containerized deployments

Data Analyst

Legacy Designs Inc
2019 - 2020
  • Analyzed data for business intelligence and reporting
  • Created data visualizations and dashboards for stakeholder insights
  • Performed data quality assessment and validation

Software Engineering Analyst

Legacy Designs Inc
2016 - 2019
  • Developed and maintained software applications
  • Collaborated with cross-functional teams on technical solutions
  • Implemented software testing and quality assurance processes

Skills & Expertise

Comprehensive technical stack for modern infrastructure and ML operations

Cloud Platforms

Google Cloud Platform (GCP): 95%
AWS: 93%
Microsoft Azure: 88%

Container & Orchestration

Kubernetes (GKE/EKS/AKS): 95%
Docker: 95%
Container Registry: 90%
Helm: 88%
Envoy: 85%

Platform Engineering

Internal Developer Platforms: 90%
Self-Service Infrastructure: 88%
Golden Paths & Templates: 87%
Service Catalogs: 85%
Platform APIs: 86%

Infrastructure-as-Code

Terraform: 92%
Pulumi: 85%
Ansible: 88%
CloudFormation: 85%

CI/CD & Automation

Cloud Build: 93%
GitHub Actions: 92%
Jenkins: 88%
GitLab CI: 86%
ArgoCD: 85%

Monitoring & Observability

Prometheus: 92%
Grafana: 92%
OpenTSDB: 85%
New Relic: 88%
Datadog: 85%
ELK Stack: 85%

FinOps & Cost Optimization

New Relic CCI: 90%
Looker Dashboards: 88%
Cost Anomaly Detection: 88%
Resource Rightsizing: 90%
Cloud Tagging Strategy: 92%

Distributed Systems

High Availability Design: 92%
Stateless Architectures: 90%
Scalability Patterns: 90%
Cloud-Native Architecture: 92%
Microservices: 88%

Data & ML Tools

Apache Airflow: 90%
BigQuery: 92%
Cloud Dataflow: 88%
Apache Spark/PySpark: 87%
MLOps: 88%

Programming & Scripting

Python: 93%
Go: 85%
Bash/Shell: 92%
TypeScript/JavaScript: 85%
Java: 82%

Databases & Storage

PostgreSQL: 90%
DynamoDB: 88%
Redis: 88%
Cloud SQL (MySQL): 90%
MongoDB: 85%

Security & Networking

IAM & Security Policies: 90%
VPC Configuration: 88%
Load Balancers: 90%
Linux Administration: 92%
Debugging & Troubleshooting: 90%

Credentials

Certifications & Badges

Industry-recognized certifications demonstrating expertise in cloud, DevOps, Kubernetes, and infrastructure

LFCS: Linux Foundation Certified System Administrator

The Linux Foundation

Issued: Nov 2025 | Valid until: Nov 2027
Linux | CentOS | Disk Partitioning | +1

KCNA: Kubernetes and Cloud Native Associate

The Linux Foundation

Issued: Nov 2025 | Valid until: Nov 2027
Kubernetes | Containers | Cloud Native | +2

KCSA: Kubernetes and Cloud Native Security Associate

The Linux Foundation

Issued: Nov 2025 | Valid until: Nov 2027
Kubernetes Security | Cloud Native Security | Container Security

CKAD: Certified Kubernetes Application Developer

The Linux Foundation

Issued: May 2025 | Valid until: May 2027
Kubernetes | Docker | Python | +3

Professional Cloud Architect Certification

Google Cloud

Issued: Feb 2025 | Valid until: Feb 2027
GCP | Cloud Architecture | Cloud Security | +2

AWS Certified DevOps Engineer – Professional

Amazon Web Services

Issued: Mar 2023 | Valid until: Mar 2026
AWS | DevOps | CI/CD | +2

6 Certifications

2 Cloud Platforms

4 Linux Foundation

Valid Until 2027

My Work

Featured Projects

A selection of projects showcasing my expertise in cloud infrastructure, DevOps, and MLOps

DevOps
40% less downtime

Multi-Cloud CI/CD Infrastructure

Led GCP and AWS multi-cloud deployments with Cloud Build, GKE, and blue-green deployment strategies, achieving a 40% reduction in downtime

GCP | Cloud Build | GKE | +1

MLOps
92% accuracy

Predictive Analytics Pipeline (BPCL)

Built and deployed predictive models for customer churn and CO₂ emissions using Python, containerized and scaled on Azure Kubernetes Service

Azure | Kubernetes | Python | +1

MLOps
25% reduced churn

Churn Prediction & Customer Insights

Designed ETL/ELT workflows with Apache Airflow and Dataflow, deployed churn prediction models with CI/CD integration

Airflow | BigQuery | ML | +1

Data Engineering
15% more uptime

Enterprise Data Pipeline (GCP)

Built optimized GCP pipelines with Cloud Storage, Dataflow, and BigQuery, increasing pipeline availability by 15%

Cloud Dataflow | BigQuery | Apache Beam

DevOps
30% better monitoring

Monitoring & Observability Stack

Implemented Prometheus, Grafana, and New Relic across multiple projects, achieving a 30% increase in monitoring capabilities

Prometheus | Grafana | New Relic

MLOps
3x faster processing

ML Integration with SAP Data

Connected enterprise SAP ERP datasets to ML pipelines with PySpark, Cassandra, and Hive. Leveraged Kubernetes for scaling inference services

PySpark | Kubernetes | SAP | +1

Technical Blog

Sharing insights on DevOps, MLOps, and Cloud Engineering

DevOps | Cloud Run | Database

Effortless Database Migrations in Production with Cloud Run Jobs

Master DevOps Project 4: Learn how to automate database migrations in production environments using Cloud Run Jobs and GCP

Contact

Let's Connect

Open to contract and full-time opportunities in MLOps, Site Reliability Engineering, DevOps, and Gen AI Development.

Get in Touch

Feel free to reach out for collaborations, opportunities, or just a friendly chat. I typically respond within 24 hours.

Currently Available

Open for new opportunities
