Skip to main content

Principal backend engineer · AI infrastructure architect

Building memory, tools, and bridges for AI coding agents.

Principal backend engineer & AI infrastructure architect. 15+ years shipping production systems. Now building the infrastructure that lets AI agents actually do the work.

Novi Sad, Serbia · independent · US / AU / EU clients
Vitalii Cherepanov

Vitalii Cherepanov

Principal Backend Engineer · AI Infrastructure Architect · Open Source Builder

  • 73 GitHub stars
  • 1,038 LinkedIn followers
  • 86,730 LinkedIn impressions / yr
  • 1,500 reads on top article
  • 15+ years in software
  • 6 pinned OSS projects

The story

Principal backend engineer who saw AI agents coming and started building infrastructure for them.

Started in 1998 writing Visual Basic for forestry enterprises. Spent the 2000s and 2010s shipping production PHP, Symfony, and Bitrix at every scale — fiscal payments, identity, e-commerce, ERP integrations. 100+ commercial Bitrix modules in production across the CIS.

Picked up Go in 2019, never put it down. Built OAuth2 / OIDC platforms, gRPC microservices, message-driven cores. Got tired of rebuilding the same observability primitives, so I codified them as ClearVibeArchitecture (CVA).

In late 2024, AI coding agents went from toy to tool. The bottleneck moved from the model to the surrounding infrastructure: memory, observability, multi-agent communication, statusbars that don't lie. So I started building it. total-agent-memory, claude-statusbar, braincore (getbraincore.com), a2abridge, phpfilebroker — open source, MCP-native, self-hosted.

Currently living and working in Serbia (Novi Sad), running independent contracts for clients in the US, Australia and Europe. Open to relocation for senior backend / AI-tooling roles, and to async remote work in the same direction.

  1. Feb 2022 — Present 4 yrs 4 mos

    Senior Backend Developer / Architect — independent contractor for US, Australian and European teams.

    Cherepanov Engineering

    • Full-time

    AI tooling and agent infrastructure

    • Authored 7 open source projects for the AI coding agent era (Claude Code, Codex CLI ecosystem).
    • Built MCP servers for persistent memory, status observability, and agent-to-agent communication.
    • Designed RAG pipelines in PHP and Go with pgvector + HNSW, BGE/FastEmbed embeddings.
    • Codified backend patterns as ClearVibeArchitecture (CVA): hexagonal + outbox/inbox + observability.

    Backend engineering (production)

    • Designed discount and pricing engine logic improving performance and scalability.
    • Built analytics microservices delivering real-time business insights.
    • Implemented custom OAuth2 / OIDC server securing multi-tenant access.
    • Developed AI-powered interaction platform reducing manual workflows.
    • Maintained 100+ commercial Bitrix modules across CIS projects.
    • Integrated fiscal payment APIs (ATOL, T-Bank/Tinkoff).

    Infrastructure

    • Built unified Docker environments for PHP/Go/Vue services with GitLab CI/CD.
    • Production deployments on Hetzner bare metal with Docker Swarm and Traefik.
    • Resolved PostgreSQL race conditions in financial transaction systems.
    Stack
    • PHP 8.4 / Symfony 7.x
    • Go 1.25
    • Python
    • PostgreSQL + pgvector
    • Redis
    • RabbitMQ
    • Docker
    • Hetzner
    • Claude Code
    • MCP servers
  2. Feb 2022 — Nov 2022 10 mos

    Back End Engineer

    GEGI LLC

    • Contract
    • Remote
    • Development and maintenance of high-load web applications.
    • Implementation of new features with a strong focus on performance, security, and scalability.
    • Maintenance and optimization of existing systems, resolving technical debt.
    • Gradual migration of a monolithic product to a microservices architecture, applying modern integration and communication patterns.
    • Development of REST APIs and integrations with both external services and internal modules.
    • Working with databases, caching systems, and message queues.
    • Applying best practices in CI/CD, automating deployment and testing processes.
    • Close collaboration with frontend developers and business analysts to deliver comprehensive business solutions.
    Stack
    • PHP 8
    • Symfony
    • AWS
    • Docker
    • RabbitMQ
    • PostgreSQL
    • Redis
  3. Nov 2020 — Jan 2022 1 yr 3 mos

    Lead Software Engineer

    DeMedia LLC

    • Full-time
    • On-site
    • Led development and maintenance of large-scale backend projects using PHP and Symfony.
    • Built an internal game-store monitoring system and a review analysis tool, enabling data-driven business decisions.
    • Designed and developed an HRM platform from scratch (PHP + Vue), implementing Docker containerization and CI/CD deployment pipelines.
    • Configured and stabilised infrastructure for testing and production environments, improving deployment reliability.
    Stack
    • PHP
    • Symfony
    • Vue.js
    • Docker
    • PostgreSQL
    • Redis
    • Nginx
  4. Jul 2020 — Nov 2020 5 mos

    Senior PHP Developer / Team Lead

    PE Ikonnikova (D-Partners)

    • Contract
    • Remote
    • Led backend development for corporate projects using PHP and Symfony.
    • Designed REST APIs and SPA frontends with Vue.js, improving user experience and system responsiveness.
    • Trained junior developers, introduced code review practices and testing standards, raising overall team quality.
    • Implemented CI/CD pipelines and standardized development environments, reducing onboarding time and deployment friction.
    Stack
    • PHP 7.4
    • Vue.js
    • Docker
    • GitLab CI
    • Redis
    • PostgreSQL
  5. Sep 2014 — Nov 2020 6 yrs 3 mos

    Backend Systems Consultant

    Cherepanov Engineering

    • On-site
    • Designed and delivered 100+ commercial backend modules and SaaS services used across multiple client projects.
    • Led migration from monolithic architecture to microservices, improving system reliability and scalability.
    • Since 2019, adopted Go for high-performance backend and API development, expanding the technology stack and reducing response times.
    Stack
    • PHP (Symfony)
    • Go
    • MySQL
    • Redis
    • Docker
    • Nginx
  6. Aug 2012 — Nov 2013 1 yr 4 mos

    Head of Software Development Department

    Akrit-Postavka LLC

    • Contract
    • Led a team of 4 developers building corporate websites and Bitrix modules.
    • Delivered 15+ projects, including CRM and ERP modules for internal use.
    • Introduced version control and release management processes.
    Stack
    • PHP
    • Bitrix
    • MySQL
    • JavaScript
    • HTML
    • CSS
    • Git
  7. Apr 2013 — Jul 2013 4 mos

    Software Engineer / Lead Engineer

    Post Office

    • Full-time
    • Maintained and updated software used in postal operations.
    • Fixed 50+ critical issues in accounting and delivery systems.
    • Built automation scripts for reporting and monitoring.
    Stack
    • C++
    • PHP
    • SQL
  8. Jul 2012 — Jan 2013 7 mos

    Full-Stack Engineer

    PE Cherepanov

    • Full-time
    • Developed websites, internal tools, and custom modules for client projects.
    • Worked with a wide stack — from PHP, Bitrix and JavaScript.
    • Integrated CRM systems, optimized performance, and maintained Linux servers.
    Stack
    • PHP
    • MySQL
    • Bitrix
    • JavaScript
    • Nginx
    • Linux
  9. Apr 2011 — Sep 2012 1 yr 6 mos

    System Software Engineer

    PE Romas

    • Developed a corporate website and integrated it with 1C accounting software.
    • Modernized the company's ERP system, increasing turnover by 12%.
    • Set up automated synchronization between retail locations.
    Stack
    • 1C
    • PHP
    • MySQL
    • HTML
    • JavaScript
  10. Oct 2010 — Apr 2011 7 mos

    Head of IT Automation Department

    Central Library

    • Managed IT operations and technical infrastructure across multiple library branches.
    • Implemented private networks between departments, improving system performance by 40%.
    • Developed internal web tools for catalog and circulation management.
    Stack
    • PHP
    • C++ Builder
    • MySQL
    • Windows Server
  11. Aug 2010 — Oct 2010 3 mos

    Automation Consultant

    A-Soft LLC

    • Contract
    • On-site
    • Implemented and configured 1C:Enterprise systems for small business clients.
    • Automated accounting and warehouse processes, designed custom reports and print forms.
    Stack
    • 1C:Enterprise
    • SQL
    • Windows Server
  12. Dec 2004 — Jun 2010 5 yrs 7 mos

    Web Application Engineer

    Freelance Project

    • Full-time
    • Worked as an independent developer creating web solutions and CRM systems for small and medium-sized businesses.
    • Delivered 25+ commercial websites, including corporate portals, online stores, and order-tracking systems.
    • Integrated web platforms with internal accounting and warehouse systems, reducing manual data entry errors by 35%.
    • Implemented automated financial and sales reports to improve business transparency.
    Stack
    • PHP 5
    • MySQL 5
    • Apache
    • JavaScript
    • HTML
    • CSS
    • jQuery
    • AJAX
    • XML
    • JSON
    • cron
    • Linux (CentOS/Debian)
  13. Aug 2003 — Dec 2004 1 yr 5 mos

    Head of Automation Department

    Sibinstrom JSC

    • On-site
    • Led a small automation department implementing IT solutions for construction management.
    • Developed an automated database backup system with CD partitioning, completely eliminating data-loss incidents.
    • Created the company's first corporate website and an electronic document-archive system, cutting search time by 5×.
    Stack
    • C++ Builder
    • PHP
    • MySQL
    • Apache
    • JavaScript
  14. Oct 2002 — Jul 2003 10 mos

    Software Engineer

    Baikal Global Service LLC

    • Full-time
    • IT company providing paging and internet communication services.
    • Developed the company's corporate website and an online message-sending system.
    • Refined the internal billing platform, reducing invoicing errors and improving server stability by 18%.
    • Configured Apache servers and mini-PBX systems to ensure reliable internal communications.
    Stack
    • C++ Builder 6
    • PHP 4
    • MySQL
    • Apache
    • JavaScript
    • HTML
  15. Jul 1998 — Oct 2002 4 yrs 4 mos

    Junior Application Engineer

    Freelance Project

    • Full-time
    • Remote
    • Developed business applications for forestry enterprises.
    • Built a wood volume accounting system that automated calculations and increased the sales department's productivity by 30%.
    • Handled data integration, user interface design, and employee training.
    Stack
    • Visual Basic
    • C

Featured work

Six pinned projects. All open source.

Production-grade infrastructure for the AI coding agent era. Self-hosted, MCP-native, MIT-licensed.

  • AISWARM

    in progress (testing)

    Distributed cognitive workforce runtime — the AISWARM + a2abridge + BrainCore triad.

    Pure-Go orchestrator that runs Claude Code, Codex, DeepSeek and local Llama (LM Studio / llama.cpp) side-by-side on one codebase. Phase 6 closed: 33 internal packages, 8 binaries, distributed cognitive workforce runtime. KindPlanner expands tasks into 12 typed agent roles; auto_choose routes 30/30/20/20 across providers; a 3-tier verifier (command → regression-on-integration → LLM judge) gates merges; peer-helper consult (ASK_PEER) spawns cross-model helpers on demand; soft pause/resume protects dependency cascades; subtasks decomposed over MCP; tokens + cost tracked per provider. Workers run inside sandbox-exec / bwrap, each in its own tmux + git-worktree, with SQLite (modernc.org, no CGO) in ~/.aiswarm. Bubble Tea TUI and a Web UI with REST API show A2A and memory tabs live. Memory is a side-channel through MCP (total-agent-memory or BrainCore) — the runtime does not embed its own. Two open peers stand under it: a2abridge (A2A 1.0 mesh) and BrainCore (cognitive memory plane).

    • Go 1.25
    • tmux
    • git worktree
    • SQLite (modernc.org)
    • MCP
    • A2A 1.0
    • Triad positioning: AISWARM (runtime) + a2abridge (messaging) + BrainCore (memory) — each shippable on its own, stronger together.
    • KindPlanner expands plans into 12 typed agent roles; auto_choose routes 30/30/20/20 across Claude, Codex, DeepSeek and local Llama.
    • 3-tier verifier (cmd → regression-on-integration → LLM judge) gates merges; peer-helper consult escalates stuck workers cross-model with a hard cap.
  • total-agent-memory totalmemory.dev

    42

    Persistent memory for Claude Code & Codex CLI.

    Auto-extracted knowledge graph, multi-representation embeddings, and a 3D WebGL visualisation of every decision the agent has ever made. Self-hosted, Ollama-optional.

    • Python
    • SQLite
    • FAISS
    • BGE
    • Ollama
    • MCP
    Key metric: LongMemEval R@5 97.45 %
    • LongMemEval R@5 = 97.45 % on default config.
    • 6-stage retrieval pipeline (FTS5 + BM25 + semantic + fuzzy + graph + cross-encoder + MMR).
    • Knowledge graph with cognitive activation spreading.
  • braincore getbraincore.com

    0

    Local-first cognitive memory for AI coding agents.

    Attention, decision graph, "the right to say I don't know" — MCP-native, plugs into Claude Code, Codex, Cursor, Cline. Pure-Go binary, zero CGO. getbraincore.com

    • Go
    • SQLite
    • MCP
    • tree-sitter
    • pluggable LLM
    • ~50 internal packages, 19 SQLite migrations, hundreds of tests.
    • Internal git versioning + time-travel for the entire memory store.
    • Cognitive bridge: associations, analogies, contradictions, temporal queries.
  • claude-statusbar

    25

    Rich two-line status bar for Claude Code CLI.

    Real-time model, context usage, tokens, cost, duration, git branch, cache stats — without leaving the terminal.

    • Bash
    • jq
    • Claude Code
    • Single-file install via curl.
    • Per-block colour theming follows Claude Code config.
    • Caches API metadata between hooks for sub-50 ms render.
  • a2abridge

    0

    Open A2A 1.0 mesh for Claude Code, Codex, Cursor, Cline, Continue, Gemini.

    Local directory daemon on 127.0.0.1:7777 speaking JSON-RPC + SSE + Push Notifications. Lets multiple coding agents broadcast contract changes and FYI without spam.

    • Go
    • JSON-RPC
    • SSE
    • A2A 1.0
    • Inbox hook auto-injects unread peer messages before the agent's next reply.
    • Self-labels ([cwd-hex]) so peers know which agent spoke.
    • No central server — peers discover via local mDNS-style directory.
  • phpfilebroker

    0

    Filesystem as a message broker — exchanges, streams, dead letters.

    Persistent streams, consumer groups, priority queues, DLQs, retries and metrics. Everything you expect from RabbitMQ, Kafka and NATS — but it's a single PHP package and a directory tree.

    • PHP 8.4
    • POSIX file locks
    • inotify
    • Zero external services — works on a fresh shared host.
    • Per-stream priority lanes with configurable visibility timeout.
    • Built-in metrics endpoint compatible with Prometheus.
  • b-docker

    4

    Bitrix Docker tooling that finally feels modern.

    Repeatable, multi-stage Bitrix dev environments — PHP 8.x, MariaDB, push, sphinx, scheduled agents — all behind a single Makefile.

    • Docker Compose
    • PHP 8.x
    • Bitrix D7
    • Make
    • Cold-start to logged-in admin in < 2 minutes.
    • Xdebug 3 + step-debug profiles, IDE-ready.
    • Stateless: nuke and rebuild without losing the database.
  • php-llamacpp-benchmarks

    2

    Six llama.cpp-inspired optimisation patterns, measured in PHP 8.4.

    A reproducible benchmark suite that translates llama.cpp ideas — mmap'd weights, flat dense buffers, value pools, table dispatch, generator streaming, columnar layout — into idiomatic PHP 8.4 with JIT, plus a naive-vs-optimised case study importing 100K CSV rows into Postgres. Docker + Make, PHPStan level 8, no cooking the numbers.

    • PHP 8.4
    • llama.cpp
    • FFI
    • JIT
    • Docker
    • PostgreSQL 16
    Key metric: Patterns / benchmarks 6 / 7
    • 6 benchmarks (B01–B06) + 1 end-to-end case study, all reproducible via `make all`.
    • FFI + mmap binary table beats JSON-loaded array on 10M-entry lookups (B01).
    • Optimised importer combines all six patterns: ~6× faster than naive on the same workload.

Writing

Notes from the workshop.

Long-form pieces on memory, AI tooling, distributed systems and the boring backend work behind them.

Reviews

What people say after they ship something with one of my tools.

  • “This is the first article on AI memory I've read that doesn't reduce it to vector search. Thank you.”

    Andrey — Senior ML Engineer Medium
  • “Plugged total-agent-memory into our Codex setup. Recall went from 60 % to 94 % overnight. The MMR diversity flag is genius.”

    Pavel — Tech lead, fintech Telegram
  • “Vitalii ships open source the way most people ship internal tickets — with documentation, benchmarks, and zero excuses.”

    Daria — Head of Engineering LinkedIn
  • “I've forwarded "RAG isn't memory" to every PM at our company. Should be required reading.”

    Marko — Founder, dev-tools startup Medium
  • “After two days of dogfooding braincore I cannot go back to a stateless agent. The "right to say I don't know" idea alone justified the install.”

    Mikhail — Independent consultant Telegram

Toolkit

What I reach for.

Grouped by job, not by skill bar.

Languages

  • PHP / Symfony 23 yrs · Symfony 7.x, Doctrine, async with Messenger
  • Go 5 yrs · primary language for new services and OSS
  • Python AI/ML tooling, RAG pipelines, MCP servers
  • TypeScript / Vue / Nuxt 5 yrs · internal tooling, dashboards, SPA/SSR

Infrastructure

  • Docker · Swarm 11 yrs · running production myself
  • PostgreSQL 5 yrs · pgvector + HNSW for embeddings
  • Hetzner bare metal self-hosted multi-tenant deploys
  • Traefik cf cert resolver, Let's Encrypt automation
  • RabbitMQ · Kafka · NATS event-driven by default

AI / ML

  • Claude Code daily driver · subagents, hooks, MCP
  • MCP servers authored 7+ in production
  • Ollama · local LLMs local-first when latency matters
  • RAG · pgvector BGE / FastEmbed embeddings, HNSW
  • Multi-agent orchestration A2A 1.0 mesh, parallel waves

Specialty

  • Bitrix24 / 1C-Bitrix D7 API, 100+ commercial modules shipped
  • Fiscal payments ATOL, T-Bank integrations
  • OAuth2 / SSO AegisID identity platform
  • VR observability Quest 3 microservice flow visualiser

Observability

  • OpenTelemetry tracing across PHP/Go/Python
  • Prometheus · Grafana SLOs, alerts, custom dashboards
  • Loki · ELK structured logs as a contract
  • Sentry release-tagged error budgets

Tools

  • git workflow surgeon · rebases, bisects, hooks
  • neovim · VS Code switch by task, not by religion
  • claude-statusbar authored my own
  • Custom CLI tooling every recurring chore becomes a binary

Now

Currently building.

  • BrainCore (getbraincore.com) — local-first cognitive memory for AI coding agents. Pure-Go, MCP-native, daemon.

  • total-agent-memory, claude-statusbar, a2abridge, phpfilebroker — six pinned OSS projects, all in active dogfooding.

  • Writing on Medium about AI tooling, memory architecture, and the infrastructure agents need to do real work.

  • Independent contracts with US / Australian / European teams. Open to relocation and senior AI-tooling roles.

Connect

Got something ambitious? Let's talk.

Looking for Principal-level backend & AI infrastructure work — async remote contracts or on-site staff/principal roles. Open to relocation. Based in Novi Sad, working with US / AU / EU teams.