Commit ca5df43
fix(scraper): implement PDF parsing for TotalEnergies instead of always falling back (#60)
Replace stub _parse_pdf() that returned empty list with proper pdfplumber-based parsing.
Now correctly extracts pricing from both Essentielle and Verte Fixe PDFs, parsing
side-by-side table formats with regex patterns for BASE and HC/HP offers. Updates tests
to verify PDF parsing works and fallback is not used.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-authored-by: Clément VALENTIN <[email protected]>
Co-authored-by: Claude <[email protected]>1 parent a931a24 commit ca5df43
File tree
4 files changed
+392
-29
lines changed- apps/api
- src/services/price_scrapers
- tests/services/test_price_scrapers
4 files changed
+392
-29
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
65 | 65 | | |
66 | 66 | | |
67 | 67 | | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
0 commit comments