Skip to content

Commit 584add1

Browse files
committed
Move still-relevant things from TADA.md (crudely) to TODO.md
1 parent c6a0211 commit 584add1

File tree

2 files changed

+38
-124
lines changed

2 files changed

+38
-124
lines changed

TADA.md

Lines changed: 0 additions & 124 deletions
This file was deleted.

TODO.md

Lines changed: 38 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,43 @@
11
# TODO for the Python knowpro port
22

3+
# Leftover TODOs from TADA.md
4+
5+
6+
## Software
7+
8+
Minor:
9+
10+
- Distinguish between release deps and build/dev deps?
11+
- Improve load_dotenv() (don't look for `<repo>/ts/.env`, use one loop)
12+
13+
### Specifically for VTT import (minor)
14+
15+
- Reduce duplication between ingest_vtt.py and typeagent/transcripts/
16+
- `get_transcript_speakers` and `get_transcript_duration` should not
17+
re-parse the transcript -- they should just take the parsed vtt object.
18+
19+
### Later
20+
21+
- Fix MCP service (should use host's LLM, not its own)
22+
- Handle embeddings in MCP, even though MCP doesn't support it yet
23+
- GPT5 suggests to run a separate MCP service for this
24+
- Batch 128-256 items at a time
25+
- Explicitly handle truncation by counting tokens
26+
- Handle caching using sha256() of text?
27+
28+
## Documentation
29+
30+
- Document what should go in `.env` and where it should live
31+
- And alternatively (first?) what to put in shell env directly
32+
- Document how o reproduce the demos from the talk (and Kevin/Adrian)
33+
- Document test/build/release process
34+
- Document how to use gmail_dump.py (set up a project etc.)
35+
36+
Maybe later:
37+
38+
- Document how to run evaluations (but don't reveal all the data)
39+
40+
341
# TODOs for fully implementing persistence through SQLite
442

543
## Now

0 commit comments

Comments
 (0)