Mert Cobanov
senior ai engineer · los angeles / istanbul
building ai agents with complex memory. backend systems, vector databases, training diffusion models, and generative ai pipelines. mostly python, fastapi, and pydantic ai.
writing about ai engineering, homelabs, and whatever i'm tinkering with.
// Writing
RECENT- KV Cache & Flash Attention LLM inference optimizations, one at a time — a smarter cache, a faster kernel, a better scheduler. Eleven sections pairing Python with animated diagrams.
- How AI agent memory works Language models forget the moment they finish replying. Memory is everything the system around them does to make that not matter.
- From latent spaces to JWTs: how agents taught me backend I wrote code every day for years and somehow felt like I wasn't really writing code. Let me explain.
- How I connected my Tesla to Claude I built an MCP server so I could chat with my Tesla. Here's how.
- The generative AI revolution I witnessed An arc from Disco Diffusion to today, written by someone who watched it happen and never bet against it.
// Libraries
06 ITEMS-
Semantic text clustering using sentence embeddings and agglomerative clustering.
-
Aspect ratio bucketing for diffusion model training. PyTorch native, DDP correct.
-
Unsupervised image clustering with modern deep embeddings, PCA and K-Means.
-
Convert depth map images to RGB normal maps for shading and 3D workflows.
-
Summarize web pages and YouTube videos with pluggable LLM backends.
-
Realtime webcam streaming and video processing with a FastAPI UI.
// Projects
09 ITEMSteslamate-mcp
MCP server that lets AI assistants query your TeslaMate database in natural language.
Instagram Unfollowers
Find and unfollow people who don't follow you back on Instagram.
autocut
Drop a video, remove silent gaps, export MP4 or send the timeline to DaVinci Resolve / Premiere.
SemVault
JWT-scoped semantic key-value store.
Pathbook
Create, share, fork, and track learning paths.
Bartainer
macOS menu bar app for managing local and remote Docker containers.
Screenfit
Quick screen-fit utility.
Memory
Interactive explainer for AI agent memory concepts.
better-tailscale-ls
A nicer tailscale status: colored, sorted, filterable, with online/offline counts and badges.
// Contact
Based in Los Angeles. Open to interesting projects, collaborations, and long emails about AI, art, and tooling.