Closed
10 changes: 9 additions & 1 deletion README.md
@@ -14,6 +14,14 @@ poetry install --no-root
poetry self add poetry-plugin-shell
poetry shell

+# Set up pre-commit hooks
+poetry run pre-commit install
+
+# Verify pre-commit hooks are working
+poetry run pre-commit run --all-files
+
+# See notebook_precommit.md for more details on how notebook outputs are automatically stripped
+
# Install Jupyter kernel for this environment (needed for Jupyter notebooks)
python -m ipykernel install --user --name=tgov-scraper --display-name="TGOV Scraper"
```
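
The README addition says notebook outputs are stripped automatically by a pre-commit hook, but the hook configuration and `notebook_precommit.md` are not part of this diff. As a rough illustration of what stripping outputs means for a `.ipynb` file, here is a minimal sketch that assumes the standard nbformat 4 JSON layout. The function name `strip_notebook_outputs` is hypothetical and is not taken from the repository; the project's actual hook may work differently.

```python
import json
from typing import Any


def strip_notebook_outputs(notebook: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of an nbformat-4 notebook dict with code-cell outputs cleared.

    Hypothetical helper for illustration; not the hook used by this repository.
    """
    # Round-trip through JSON for a simple deep copy, so the input is not mutated.
    stripped = json.loads(json.dumps(notebook))
    for cell in stripped.get("cells", []):
        if cell.get("cell_type") == "code":
            cell["outputs"] = []            # drop results, plots, tracebacks
            cell["execution_count"] = None  # drop the execution counter
    return stripped


if __name__ == "__main__":
    example = {
        "cells": [
            {
                "cell_type": "code",
                "source": ["1 + 1"],
                "outputs": [{"output_type": "execute_result"}],
                "execution_count": 3,
            },
            {"cell_type": "markdown", "source": ["Notes"]},
        ],
        "nbformat": 4,
        "nbformat_minor": 5,
    }
    print(json.dumps(strip_notebook_outputs(example), indent=1))
```

Stripping outputs before commit keeps notebook diffs limited to source changes, which is presumably the motivation for the hook.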
@@ -44,7 +52,7 @@ pytest -v
- `scripts/`: one-off scripts for downloading, conversions, etc.
- `tests/`: Test files
- `notebooks/`: Jupyter notebooks for analysis and exploration
-- `data/`: output from notebooks
+- `data/`: output from notebooks


## Running the transcription scripts
File renamed without changes.
6 changes: 3 additions & 3 deletions notebooks/vtt_subtitles.ipynb
@@ -70,17 +70,17 @@
"\n",
"# Import from the new subtitles module\n",
"from src.subtitles import create_track, load_transcript\n",
-"from src.models.subtitles import TrackFormat\n",
+"from src.models.subtitles import SubtitleTrack\n",
"\n",
"# Path to the transcript file\n",
"transcript_file = Path(\n",
" \"../data/transcripts/regular_council_meeting___2025_02_26.diarized.json\"\n",
")\n",
"\n",
"# Create VTT track\n",
-"vtt_track = create_track(\n",
+"vtt_track: SubtitleTrack = create_track(\n",
" transcript_data=transcript_file,\n",
-" track_format='vtt',\n",
+" format='vtt',\n",
" max_duration=5.0,\n",
" include_speaker_prefix=False,\n",
")\n",
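
The notebook cell above calls `create_track` with `format='vtt'`, `max_duration=5.0`, and `include_speaker_prefix=False`, but the implementation in `src.subtitles` is not shown in this diff. To make those parameters concrete, here is a self-contained sketch of one plausible way to turn transcript segments into WebVTT text. Everything in it is an assumption for illustration: the `Segment` dataclass, the `to_vtt` function, and the choice to cap each cue at `max_duration` seconds are hypothetical and may not match what `create_track` actually does.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical transcript segment; the real diarized JSON schema is not shown."""
    start: float   # seconds from the start of the recording
    end: float     # seconds from the start of the recording
    speaker: str
    text: str


def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def to_vtt(
    segments: list[Segment],
    max_duration: float = 5.0,
    include_speaker_prefix: bool = False,
) -> str:
    """Render segments as WebVTT text, one cue per segment."""
    lines = ["WEBVTT", ""]
    for seg in segments:
        # Assumption: cap the cue length at max_duration seconds. A real
        # implementation might instead split long segments into several cues.
        end = min(seg.end, seg.start + max_duration)
        text = f"{seg.speaker}: {seg.text}" if include_speaker_prefix else seg.text
        lines += [f"{format_timestamp(seg.start)} --> {format_timestamp(end)}", text, ""]
    return "\n".join(lines)


if __name__ == "__main__":
    demo = [Segment(0.0, 8.0, "Speaker 1", "Call to order.")]
    print(to_vtt(demo, max_duration=5.0, include_speaker_prefix=False))
```

With `include_speaker_prefix=True`, the same cue text would read `Speaker 1: Call to order.` in this sketch.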
176 changes: 0 additions & 176 deletions scripts/download_m3u8.py

This file was deleted.
