CLOUD · GPU · AI · AGENTIC INFRASTRUCTURE

Hi, I'm Dhia Gharsallaoui

AI Infrastructure Architect & Senior SRE

Building production-grade AI infrastructure at scale. I architect Kubernetes and GPU platforms, multi-cloud and edge deployments, and the agentic-AI layer on top — from LLM gateways to AI-driven operations and infrastructure automation.

5+ yrs
AI infra & cloud-native
1000s
GPUs orchestrated
Multi-cloud
sovereign & edge
Dhia Gharsallaoui
About

Deep systems engineering, at platform scale.

A Senior SRE and AI Infrastructure Architect with 5+ years of experience across cloud-native systems and GPU platforms. I architect and operate production-grade GPU infrastructure, Kubernetes platforms, and multi-cloud/edge orchestration for AI workloads at scale — and increasingly, the agentic-AI layer that runs and automates them.

I’m passionate about 🤖 AI infrastructure, ☸️ Kubernetes orchestration, 🎮 GPU computing, 🧠 agentic systems, and ⚡ high-performance distributed systems. My approach pairs deep technical expertise with architecture leadership on mission-critical AI platforms.

Currently architecting a multi-tenant sovereign GPU cloud platform at Dapple — extending managed Kubernetes into sovereign and edge datacenters at a scale of thousands of GPUs — and contributing to cloud-native open-source projects.

Experience

Where I've built.

From blockchain backends to sovereign GPU clouds — a decade of shipping production infrastructure.

AI Infrastructure Architect · Dapple
April 2026 - Present

Technical lead and architect for a multi-tenant GPU cloud platform that extends managed Kubernetes into sovereign and edge datacenters at a scale of thousands of GPUs — owning the stack end-to-end, from cross-site GPU networking and node provisioning to the agentic infrastructure and AI-driven operations layers on top.

Infrastructure Team Lead · FlexAI
September 2025 - June 2026

Building production-grade infrastructure for AI-as-a-Service platform, orchestrating multi-cloud GPU compute for training, inference, and AI workloads at scale.

Lead Backend/DevOps Engineer · Perplex
April 2025 - September 2025

Led backend architecture and infrastructure operations for an on-chain trading platform serving DeFi protocols and traders.

Solution Architect · Skilld
August 2021 - April 2025

Designed and implemented scalable backend systems and cloud-native solutions with focus on distributed architectures.

Blockchain Engineer · ChainsAtlas
March 2024 - April 2025

Specialized in blockchain backend development and cross-chain protocol implementation.

Software Engineer · idatase GmbH
January 2024 - March 2024

Developed scalable data ingestion and processing systems.

  • Developed scalable data ingestion pipeline using Python/Flask and PostgreSQL
  • Implemented containerized microservices using Docker Compose and Kubernetes
  • Created automated testing and deployment pipeline using GitHub Actions
  • Designed real-time metrics collection system
Projects

Open source & production work.

Libraries, tools and platforms I've built or contributed to across cloud, GPU and developer infrastructure.

Mage-ai preview
PythonApache DruidLDAP

Mage-ai

Implemented LDAP authentication, Apache Druid integration, and centralized logging system for this open-source data pipeline tool with 8K+ GitHub stars.

View Project
Go-mail preview
GoOpenPGPMiddleware

Go-mail

Developed middleware architecture pattern and implemented OpenPGP encryption middleware with comprehensive test coverage.

View Project
Go-RTE preview
GoAPIClient Library

Go-RTE

Created a Go client library for RTE APIs with clean and uniform way to interact with different API endpoints.

View Project
AuthGuard preview
GoNginxAuthenticationCachingMonitoring

AuthGuard

Lightweight, high-performance authentication service designed for nginx's auth_request module. Provides composable authentication with pluggable providers, built-in caching, and comprehensive monitoring.

View Project
Go ElevenLabs preview
GoText-to-SpeechAPI ClientStreaming

Go ElevenLabs

Production-grade Go client library for the ElevenLabs Text-to-Speech API. Built with idiomatic Go practices, comprehensive error handling, and full support for streaming audio generation.

View Project
Kratos Admin UI preview
ReactTypeScriptOry KratosIdentity Management

Kratos Admin UI

A modern, responsive admin interface for Ory Kratos identity management system. Features identity management, session monitoring, analytics dashboard, and schema inspection.

View Project
Stack

Tools I run in production.

GPU Infrastructure (Nvidia H100, A10G, AMD MI300)Kubernetes OrchestrationKubernetes Operators DevelopmentAI Workload OrchestrationTraining & Inference PipelinesMulti-Cloud GPU ComputingBare Metal GPU Clustersnvidia-device-pluginROCm gpu-operatorAgentic AI InfrastructureAI Agents (LangGraph)Model Context Protocol (MCP)LLM / AI GatewaysAgentic Infrastructure-as-CodeMulti-Tenant Platform ArchitectureAzure Kubernetes Service (AKS)Entra ID / Workload IdentityCilium / eBPF NetworkingTemporal WorkflowsWireGuard Cross-Site NetworkingBare-Metal Provisioning (PXE / Redfish)Sovereign & Edge KubernetesGo (Golang) - ExpertPython - ProductionRustScalaTypeScriptBash/Shell ScriptingTerraform IaCPulumi IaCAnsible ConfigurationArgoCD GitOpsFluxCD GitOpsHelm ChartsDocker ContainerizationAWSAzureScalewaygRPC ServicesREST APIsMicroservices ArchitectureEvent-Driven SystemsDomain-Driven DesignHigh Availability DesignSkupper Edge NetworkingLow-Latency NetworkingCross-Cluster CommunicationService DiscoveryLoad BalancingPrometheus & GrafanaOpenTelemetryDistributed TracingLoki Log AggregationAlerting & Incident ResponsePerformance MonitoringSLI/SLO ImplementationApache KafkaNATSPostgreSQLTimescaleDBRedis CachingApache DruidJuiceFS Distributed StorageData ModelingGitHub ActionsGitLab CI/CDJenkins Pipelines
Education

Foundations.

2020 - 2021

Master's Degree in Computer Science

Minor in Mathematics Focus on Distributed Systems and Cloud Computing

2018 - 2021

Engineer's Degree in Industrial Engineering

Minor in Data Science Specialization in Big Data Analytics

Contact

Let's build something solid.

Have a cloud, GPU or AI-infrastructure challenge? Send a note and I'll get back to you.

Prefer LinkedIn? Reach me there →