Architecture

This page describes the system topology for the Oversight beta release, including all major components, data flows, and external dependencies.

Architecture Diagram

System Topology

System Overview

Oversight is a full-stack Next.js application deployed on Vercel. The application uses the App Router for both the frontend pages and API routes, communicates with external LLM providers for analysis, and stores data in a PostgreSQL database hosted on Neon.

Core Components

Component	Technology	Purpose
Frontend	Next.js 16 App Router, React 19, Tailwind CSS 4	All user-facing pages (upload, dashboard, chat, monitor, trends, settings)
API Layer	Next.js API Routes (serverless)	18 API endpoints handling auth, uploads, chat, analysis, monitoring, settings
Core Logic	TypeScript modules in `lib/`	Analysis pipeline, live monitoring, chat reply generation, email alerts, rate limiting
Database	PostgreSQL via Neon	All persistent data (users, uploads, analyses, chat sessions, settings)
ORM	Prisma 7 with Neon adapter	Database access with connection pooling
Authentication	NextAuth.js v5 (beta)	Email/password auth, JWT sessions, bcrypt password hashing
File Storage	Vercel Blob	Uploaded JSON conversation files
Email	Resend	Analyst alert notifications

External LLM Providers

Provider	Models	Used For
Google Gemini	`gemini-2.5-flash`, `gemini-3.1-flash-lite` (fallback)	Hallucination, bias, and toxicity analysis in Gemini and Both modes
Groq	`openai/gpt-oss-120b`, Kimi K2 variants (fallback)	Analysis in Groq and Both modes, live monitoring, chatbot reply generation

Data Flow

File Upload Flow: User → Upload Page → POST /api/upload → Vercel Blob (file storage) → lib/run-analysis.ts → Gemini/Groq API → Analysis results → PostgreSQL
Chat Flow: Customer → Chat Page → POST /api/chat → Groq (bot reply) → lib/live-monitor.ts → Groq (monitoring) → PostgreSQL (message + monitoring data)
Alert Flow: Chat session completes → lib/run-analysis.ts (full analysis) → lib/send-alert-email.ts → Resend API → Analyst inbox

Database Schema

The database consists of 9 models:

Model	Purpose	Key Relations
User	Analyst accounts (email, password hash, name)	Has many: Uploads, Feedbacks, GroundTruths; Has one: UserPreferences
UserPreferences	Per-user settings (default mode, alert email, bias threshold, T&C acceptance)	Belongs to: User
Upload	A conversation file upload or chat-sourced analysis	Belongs to: User (optional), GroundTruth (optional); Has many: Analyses; Has one: ChatSession
Analysis	A single analysis result (e.g., hallucination-gemini)	Belongs to: Upload
ChatSession	A live chatbot conversation session	Has many: ChatMessages; Has one: Upload
ChatMessage	A single message in a chat session (with optional monitoring data)	Belongs to: ChatSession
GroundTruth	A reference document for factual verification	Belongs to: User (optional); Has many: Uploads
Feedback	In-app user feedback (bug/feature/general)	Belongs to: User
RateLimit	Per-user/IP rate limiting counters	Unique on: (identifier, type)

Architecture Rationale

The system topology is unchanged since the alpha release. The same components — Next.js, Neon PostgreSQL, Vercel Blob, Google Gemini, Groq Llama, Resend, and NextAuth — remain in the same architectural positions. No components were added, removed, or re-positioned, and no new external services were introduced.

The alpha release served as a validation milestone for the full architecture. Through alpha testing, we confirmed that:

The dual-LLM analysis pipeline (Gemini + Groq with cross-checking) produces reliable detection results across all three categories
Vercel serverless functions handle the analysis workload within the 120-second timeout, including "Both" mode which runs providers sequentially
Neon connection pooling via the Prisma adapter handles concurrent database access without connection exhaustion
Live chat monitoring delivers real-time per-message checks with acceptable latency for the customer-facing chatbot

Alpha testing also helped us identify and document known issues. A formal bug severity audit classified all discovered bugs by severity (Critical, High, Medium, Low). The key finding was that zero Critical or High severity bugs remained open — all were either resolved or classified as Medium/Low with documented mitigations:

Bug ID	Severity	Description	Status
BUG-001	Medium	Simulated progress bar does not reflect actual backend progress	Documented; deferred to GA
BUG-002	High	No rate limiting on public chat API	Resolved — enforces 5/min, 40/day per IP
BUG-003	Medium	Session completion endpoint lacks ownership check	Mitigated by CUID non-enumerability
BUG-004	Low	Chat-originated uploads have null userId	Accepted trade-off for internal analyst scope
BUG-005	Low	Vercel Blob URLs are publicly accessible if known	Accepted; non-enumerable filenames

This validation gave us confidence that the architecture is sound for beta release without structural changes.

Architecture Diagram​

System Overview​

Core Components​

External LLM Providers​

Data Flow​

Database Schema​

Architecture Rationale​