Skip to content

Commit 86e7dd2

Browse files
committed
Ensure correct images are extracted
1 parent 886d718 commit 86e7dd2

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

src/fundus/publishers/de/der_freitag.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -49,4 +49,7 @@ def images(self) -> List[Image]:
4949
return image_extraction(
5050
doc=self.precomputed.doc,
5151
paragraph_selector=self._paragraph_selector,
52+
upper_boundary_selector=CSSSelector("header.bc-article-intro"),
53+
lower_boundary_selector=CSSSelector("span.freitag-article-end"),
54+
image_selector=CSSSelector("figure img,div[role='figure'] img"),
5255
)

0 commit comments

Comments
 (0)