A static Next.js application for visualizing LongMemEval and LoComo benchmark results, optimized for deployment on Vercel.
- π Static site generation - fast, CDN-distributed
- π¨ Beautiful UI with Tailwind CSS + shadcn/ui theme
- ποΈ Supports both
.jsonand.json.gzresult files - π Optimized for Vercel deployment
- π Category breakdown - view accuracy by question category
- π Advanced filters - search by ID, filter by correctness and category
- π― Item detail pages - dive deep into individual questions with reasoning and retrieved memories
- Node.js 18+
- npm or yarn
- Install dependencies:
npm install- Run development server:
npm run devBuild the static site:
npm run buildThis generates a static site in the out/ directory.
# Install Vercel CLI
npm i -g vercel
# Deploy
cd visualizer-web
vercel- Push to GitHub
- Import project in Vercel dashboard
- Vercel will auto-detect Next.js and deploy
visualizer-web/
βββ app/
β βββ page.tsx # Home/benchmark selector
β βββ longmemeval/
β β βββ page.tsx # LongMemEval results
β βββ locomo/
β βββ page.tsx # LoComo results
βββ lib/
β βββ data.ts # Data loading utilities
βββ results/ # Copy your result files here
βββ longmemeval.json.gz
βββ locomo.json.gz
- Result files are loaded at build time (not runtime)
- The app reads from
../results/relative to the build directory - Supports both
.jsonand.json.gzfiles automatically - For Vercel deployment, compressed files (
*.json.gz) are recommended to stay under size limits
- Free on Vercel for hobby projects
- Unlimited bandwidth on free tier
- Static site = no server costs