Michał Wojdylak

AI Infrastructure Engineer

Building production AI systems, LLM infrastructure, inference platforms and cloud-native ML solutions.

Latest writing

All posts →

Featured projects

All projects →

LLM Inference Gateway

A high-throughput inference gateway for serving open-weight LLMs with token streaming, request batching, and per-tenant rate limiting. Built to run on Kubernetes with autoscaling backed by GPU node pools.

  • Python
  • vLLM
  • FastAPI
  • Kubernetes
  • Triton

MLOps Platform Blueprint

Reference architecture and Terraform modules for an end-to-end MLOps platform on AWS — feature store, model registry, CI/CD for models, and automated rollout with shadow deployments.

  • Terraform
  • AWS
  • SageMaker
  • MLflow
  • GitHub Actions

GPU Cost Observability

A monitoring stack that attributes GPU utilization and cloud spend to individual models and teams, with Grafana dashboards and Prometheus exporters for inference workloads.

  • Go
  • Prometheus
  • Grafana
  • DCGM
  • Kubernetes