Skip to content

Commit ad4faa3

Browse files
committed
docs(readme): update readme with reading experience enhancements
1 parent 7271afa commit ad4faa3

File tree

1 file changed

+9
-3
lines changed

1 file changed

+9
-3
lines changed

README.md

Lines changed: 9 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ OpenReader WebUI is an open source text to speech document reader web app built
2222
- 🚀 **(New) Optimized TTS Pipeline**: Next.js TTS backend with in-memory LRU audio cache, ETag-aware responses, and in-flight request de-duplication for faster repeat playback
2323
- 💾 **Local-First Architecture**: IndexedDB browser storage for documents and settings (now using Dexie.js)
2424
- 🛜 **Optional Server-side documents**: Manually upload documents to the Next.js backend (and Docker `docstore`) for all users to download
25-
- 📖 **Read Along Experience**: Follow along with highlighted text as the TTS narrates PDF files, with per-sentence navigation and skip controls
25+
- 📖 **Read Along Experience**: Follow along with real-time highlighted text as the TTS narrates PDF files, using an overlay-based highlighter, per-sentence navigation, and skip controls
2626
- 📄 **Document formats**: EPUB, PDF, TXT, MD, DOCX (with libreoffice installed, plus hardened DOCX→PDF conversion for better reliability)
2727
- 🎨 **Customizable Experience**:
2828
- 🔑 Select TTS provider (OpenAI, Deepinfra, or Custom OpenAI-compatible)
@@ -38,8 +38,14 @@ OpenReader WebUI is an open source text to speech document reader web app built
3838
</summary>
3939

4040
- 🧠 **Smart sentence continuation**
41+
- Improved NLP handling of complex structures and quoted dialogue provides more natural sentence boundaries and a smoother audio-text flow.
4142
- EPUB and PDF playback now use smarter sentence splitting and continuation metadata so sentences that cross page/chapter boundaries are merged before hitting the TTS API.
42-
- This yields more natural narration and fewer awkward pauses when a sentence spans multiple pages or EPUB spine items
43+
- This yields more natural narration and fewer awkward pauses when a sentence spans multiple pages or EPUB spine items.
44+
- 📄 **Modernized PDF text highlighting pipeline**
45+
- Real-time PDF text highlighting is now offloaded to a dedicated Web Worker so scrolling and playback controls remain responsive during narration.
46+
- A new overlay-based highlighting system draws independent highlight layers on top of the PDF, avoiding interference with the underlying text layer.
47+
- Upgraded fuzzy matching with Dice-based similarity improves the accuracy of mapping spoken words to on-screen text.
48+
- A new per-device setting lets you enable or disable real-time PDF highlighting during playback for a more tailored reading experience.
4349
- 🎧 **Chapter/page-based audiobook export with resume & regeneration**
4450
- Per-chapter/per-page generation to disk with persistent `bookId`
4551
- Resumable generation (can cancel and continue later)
@@ -61,7 +67,7 @@ OpenReader WebUI is an open source text to speech document reader web app built
6167
- PDF/EPUB/HTML readers use a full-height app shell with a sticky bottom TTS bar, improved scrollbars, and refined focus styles.
6268
-**End-to-end Playwright test suite with TTS mocks**
6369
- Deterministic TTS responses in tests via a reusable Playwright route mock.
64-
- Coverage for accessibility, upload, navigation, folder management, deletion flows, and playback across all document types.
70+
- Coverage for accessibility, upload, navigation, folder management, deletion flows, audiobook generation/export and playback across all document types.
6571

6672
</details>
6773

0 commit comments

Comments
 (0)