Commit 97ef437
Add video transcripts, embeddings, catalog + update documentation
- 5,407 Whisper transcripts (44MB) from Khan Academy videos
- 5,044 sliding-window embedding .npy files (247MB)
- Video catalog.json (5,044 videos, 77K windows)
- AGENTS.md: complete rewrite for current Vite/ES6 modular architecture
- README.md: updated features (50 domains, 2,450 questions, 5,000+ videos)
- Session notes: pipeline refresh status and key insights
- .gitignore: un-ignore video data, ignore duplicate transcripts_raw/
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent a0c03dd commit 97ef437
File tree
10,610 files changed
+59502
-185
lines changed- data/videos/.working/embeddings
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
10,610 files changed
+59502
-185
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
297 | 297 | | |
298 | 298 | | |
299 | 299 | | |
300 | | - | |
301 | 300 | | |
302 | 301 | | |
303 | 302 | | |
304 | 303 | | |
305 | | - | |
306 | | - | |
| 304 | + | |
307 | 305 | | |
308 | | - | |
309 | | - | |
310 | | - | |
| 306 | + | |
| 307 | + | |
| 308 | + | |
311 | 309 | | |
312 | 310 | | |
313 | 311 | | |
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
0 commit comments