AI-Powered Insights & Core Backend Services - PeerSuites
PeerSuite is a financial performance analytics SaaS platform serving 700+ credit unions and vendors with AI-powered peer benchmarking, executive dashboards, and regulatory data insights.
- Designed and shipped an end-to-end AI-powered peer benchmarking system — across distributed backend (ASP.NET Core, FastAPI) and Angular frontend, surfacing LLM-generated insights and recommendations across the entire PeerSuites platform at sub-100ms latency.
- Built a conversational AI analytics assistant that translates natural-language queries into SQL-backed metrics and citation-grounded insights, powered by a RAG pipeline for hybrid retrieval over financial publications and structured data, reducing analysis time from days to secs.
- Engineered prompt engineering and evaluation infrastructure — including few-shot dynamic templating, fact-grounding, LLM-as-Judge and human feedback loops — driving iterative prompt refinements and optimizations, lifting response accuracy from 71% to 94%.
- Designed a multi-tenant LLM gateway routing requests across models by subscription tier and prompt complexity — giving premium users access to higher-capability models while cutting inference costs by 35% and API failures by 95% via priority queuing and backoff retries.
- Decoupled LLM and embedding workloads into async microservices with Redis result caching, cutting P95 latency from 22s to 5s across 500K+ monthly queries.
- Developed and scaled a unified analytics data platform, ingesting and normalizing regulatory datasets into a canonical financial schema enabling cross-dataset analysis across 1B+ rows, powering frontend dashboards and downstream BI pipelines.
ASP.NET Core (C#)FastAPI (Python)
LangChainpgvector
OpenAI EmbeddingsAngular
SQL Server RLSRedis
Snowflake
Data Processing & Report Generation Platform
An internal full-stack platform enabling file ingestion, asynchronous loan data processing, human-in-the-loop validation, and automated report generation with real-time job observability.
- Built and shipped a full-stack web application (React, FastAPI) supporting partner teams to configure file ingestion worflows, monitor async jobs, human-in-the-loop validation, and trigger automated report generation with real-time job observability.
- Architected fault-tolerant data orchestration workflows in Temporal, processing 10M+ records with human-in-the-loop review, reducing pipeline failure recovery from 3 hours to 15 seconds.
- Engineered event-driven background processing using Python, Celery, and RabbitMQ to offload analytical computations, achieving 3x throughput while keeping user-facing APIs under sub-100ms latency.
- Scaled production-grade ETL pipelines with automated schema validation, data quality gates, and observability alerting, achieving 99.9% pipeline uptime.
React (Typescript)FastAPI (Python)
Temporal
CeleryRabbitMQ
ETL PipelinesObservability