Taimuraz Kaitmazov

Work

Selected projects and roles. The home page shows the headline claims with their receipts; this is the longer record.

Selected projects

On-device NPU inference engine — AMD Ryzen AI (XDNA2), Linux

From-scratch engine running whole model graphs natively on the laptop NPU via the open MLIR-AIE / IRON stack — where the prior open-source high-water mark was a single matmul at ~0.6% utilization and AMD’s own Linux stack offloads zero ops. 16 models across 11 architectures (BERT · Whisper/Parakeet/GigaAM · ViT/DINOv2/ResNet-18/CLIP · ESM-2), parity ~4e-3. Core: fuses a 12-layer decode into one NPU dispatch (72→1 per token), weights + KV resident. −29% energy / ~half the package power vs CPU at equal accuracy (WER 0.117). Making it compile at scale meant fixing AMD’s compiler & operators upstream.

MLIR-AIE · IRON · Peano · XRT · Rust · sole architect · mlir-aie #3178 · amd/IRON #123

AI payroll-compliance agent over 1С:ZUP

For a top-3 Russian B2B accounting-software publisher. Daily-reconciliation AI agent on the dominant Russian payroll/ERP platform + a Go/Rust MCP server; 30+ validation-rule registry, tax/labor-regulation cross-references, on-premise connector. Closed-beta with real client accountants surfaced specific compliance discrepancies in 10+-year ERP legacy.

Go · Rust · Anthropic SDK · sole architect · closed beta

Document-intelligence pipeline for accounting primaries

Self-hosted OCR → LLM extraction of structured fields from Russian accounting primaries, PII kept on-premise (dots.ocr on H100 + an LLM extraction stage). Built a gold-set, a human-in-the-loop labeler and a scorer, then benchmarked OCR→LLM architectures on it — quantifying the privacy-vs-accuracy tradeoff to choose the production design on ground truth, not impression.

vLLM · H100 · FastAPI · sole engineer · production

kde-mcp — computer-use MCP for the Linux desktop

The kind of tool Anthropic’s computer-use covers for macOS/Windows, still rare on Linux/Wayland — built from scratch, solo. Drives apps through the accessibility tree (like a screen reader) rather than pixel-clicks: more reliable and safer. Safety policy gate, ADR-documented.

Rust · KDE Plasma 6 / Wayland · open-source · github.com/atassis/kde-mcp

Multi-tenant medical SaaS — OCR + LLM PII depersonalization

Local PaddleOCR-VL + Presidio + LLM-based PII detection for Russian PII (SNILS / INN / passport) with bounding boxes; per-tenant isolation (nginx routing) + a ClickHouse dedup buffer in front of Postgres.

PaddleOCR-VL · Presidio · Werf/Helm/K8s · designer & lead

Three-language MCP server stack for 1С

Coordinated Go + Rust + TypeScript/Bun stack exposing 1С enterprise data to agents over MCP, behind a unified backend-neutral query abstraction with pluggable backends. 15+ MCP tools.

Go · Rust · TS/Bun · sole architect · private

Experience

2025 — now

Independent · Tech Lead, AI Platform

Long-running contract with a top-3 Russian B2B accounting publisher (1С AI agent, OCR + LLM pipelines, vLLM-on-H100) + own R&D (kde-mcp, the NPU engine).

2024 — 2025

Buildmate (US startup) · Lead Engineer

Construction project-management marketplace to production-ready MVP; ~80% of the codebase (backend, React, Helm/K8s + AWS); led a developer, mentored the designer.

2024 — 2025

Shopmate (US startup) · Scraping team lead

Led a team of 4. Kubernetes scraping for 20+ US retail sources; Kafka + Protobuf streaming, >1M messages/day.

2020 — 2024

Deeplay · TypeScript team lead

Led a team of up to 4 on a high-throughput data-aggregation service; 99% uptime; ClickHouse-optimized storage up to 10M records/day.

2018 — 2020

Overgear · Senior fullstack / Tech Lead

Led a team of up to 5. TypeScript adoption, payment-system integration, a frontend design system, monitoring/analytics.

2017 — 2018

Biletix · Backend developer

Service rewrite to Docker + stateless; database optimization.

More

Full CV on request · github.com/atassis · home