Aaron
Cohen

AI Engineer

Charlotte, NC

Available

I build production AI systems — safety harnesses for coding agents, RAG pipelines over knowledge graphs, and fintech tools pulling live SEC data. Most of my work sits between LLMs and real systems.

01 About

AI engineer, prototypes to production

BA Computer Science from UNC Charlotte, Azure AI certified. I build production RAG systems, LLM-powered tools, and agent infrastructure — from safety harnesses that block dangerous AI actions in under 5ms to fintech products pulling live SEC data.

Currently focused on AI agent safety, MCP tooling, knowledge graph pipelines, and mechanistic interpretability research. Strong at explaining technical concepts to non-technical stakeholders.

Languages
Python · Rust · TypeScript · SQL
AI & LLMs
Claude API · OpenAI · RAG · LangChain
Infrastructure
Neo4j · PostgreSQL · Docker · Qdrant
Automation
MCP · Claude Code Hooks · BullMQ
Frontend
React Native · D3 · FastAPI
Credentials
BA CS (UNCC) · Azure AI Certified
02 Experience
Jun – Aug 2024
Tel Aviv, Israel

AI Prototype Engineer Intern

Orange-Pay · Fintech Startup

Built AI prototypes from concept to deployment. Partnered with Product, Finance, and Operations to identify automation opportunities. Delivered demos and training to non-technical stakeholders.

Sep 2024 – Present
Matthews, NC

Client Services

Turn-Key Solutions

Translate technical requirements for non-technical clients. Maintain cross-functional relationships across teams.

03 Education & Certifications
Degree
B.A. Computer Science, IT Concentration
UNC Charlotte · 2020 – 2025
Certification
Azure AI Engineer Associate
Microsoft · Feb 2025
Certification
AI Engineer for Data Scientists Associate
DataCamp · 2026
04 Selected Work
001

dev-loop

Safety harness for AI coding agents

Sits between the AI agent and your codebase. Intercepts every file write, edit, and shell command — blocks dangerous ones in under 5ms. Runs Semgrep, secret detection, and tests at commit time. Seven-layer closed loop with LLMOps prompt optimization. Open source, zero paid dependencies.

997 tests <5ms Tier 1 7 layers 7/7 tracer bullets
Python Rust Claude Code OpenTelemetry
$ dl status daemon: running (pid 4821) uptime: 3h 12m sessions: 2 active events: 1,247 processed tier1: ██████████ 100% tier2: ████████░░ 82% blocked: 14 dangerous ops secrets: 3 caught
002

topo-confidence

LLM hidden-state geometry as a window into reasoning

Research project probing whether a model's residual-stream activations can predict chain-of-thought correctness. Found that a single direction in the prefill residual stream predicts answer correctness better than final-token activations after reasoning. Selective prediction at 50% coverage yields 71.6% accuracy vs 48.6% unconditional.

0.77 AUROC 11 pathways 4 models tested 91/91 claims verified
Python PyTorch Mechanistic Interp
Finding F-1 (STRONG) Dimensional breathing: residual-stream PR rises during CoT generation, collapses at answer token. Universal across: · Qwen-1.5B · Qwen-7B · Phi-3-mini · Llama-3.2-1B Random-token control: flat (PR ≈ 10)
003

OmniSwipe

AI agent platform with swipe-based code review

AI agent system with 20 MCP tools, YAML-defined pipelines, and human-in-the-loop review — swipe right to approve AI-generated code improvements, left to reject. React Native with 15 Zustand stores, Socket.io real-time updates, and 68 Maestro E2E flows.

React Native TypeScript MCP Private
004

Link Forge

Knowledge graph forager + MCP server

Discord bot scrapes links from an entire guild, categorizes them with Claude, embeds them into a Neo4j knowledge graph (9 node types, 384-dim vectors), and exposes 8 MCP tools for RAG-powered Q&A from any Claude Code session.

TypeScript Neo4j MCP D3 Private
005

BurryDCF

Dilution-aware stock valuation calculator

Live fintech tool integrating 3 external APIs (SEC EDGAR, Polygon.io, FRED) into a unified valuation calculator. Implements Michael Burry's Owner's Earnings methodology — compares traditional Gordon Growth Model against a dilution-aware model that accounts for the real cash cost of stock-based compensation.

Python FastAPI GCP SEC EDGAR Live Product
burrydcf.com
SEC EDGAR · Polygon.io · FRED
Owner's Earnings methodology
SBC dilution correction
Real-time multi-API integration
05 Contact

Let's build
something real

Open to collaborations, interesting problems, and conversations about AI tooling, developer infrastructure, or fintech.