Skip to content

Commit ca5df43

Browse files
m4dm4rtig4nClément VALENTINclaude
authored
fix(scraper): implement PDF parsing for TotalEnergies instead of always falling back (#60)
Replace stub _parse_pdf() that returned empty list with proper pdfplumber-based parsing. Now correctly extracts pricing from both Essentielle and Verte Fixe PDFs, parsing side-by-side table formats with regex patterns for BASE and HC/HP offers. Updates tests to verify PDF parsing works and fallback is not used. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Clément VALENTIN <[email protected]> Co-authored-by: Claude <[email protected]>
1 parent a931a24 commit ca5df43

File tree

4 files changed

+392
-29
lines changed

4 files changed

+392
-29
lines changed

apps/api/pyproject.toml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -65,3 +65,9 @@ python_version = "3.11"
6565
warn_return_any = true
6666
warn_unused_configs = true
6767
disallow_untyped_defs = true
68+
69+
[dependency-groups]
70+
dev = [
71+
"pytest>=8.4.2",
72+
"pytest-asyncio>=1.2.0",
73+
]

0 commit comments

Comments
 (0)