[Proposal]: use case for SG AI #524

@DiTo97

Problem statement

Every publisher currently maintained takes a lot of manual work and tuning, and still needs constant monitoring in case anything changes in the supported news outlets or web sources.

Solution

Replace the manual, labour-intensive scraping code with SG AI, whose you-only-scrape-once (YOSO) concept serves exactly that purpose: you write the scraping pipeline once and leverage powerful LLMs (open-source or closed-source) to extract articles in the desired format, regardless of the web source or of changes to its HTML over time.

Write a single smart scraper graph tailored to crawling news and articles, producing the desired relational format common to all available publishers and outlets.
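To make the idea concrete, here is a minimal sketch of what "one pipeline, one output format" could look like. It is not SG AI's API: the `Article` fields, the `PROMPT` text, and the `scrape_article` helper are all hypothetical names chosen for illustration, and the `extractor` argument stands in for whatever LLM-backed scraper graph would actually be used.

```python
from dataclasses import dataclass
from typing import Any, Callable, Mapping, Optional, Tuple


@dataclass(frozen=True)
class Article:
    """Hypothetical target record shared by every publisher (field names are assumptions)."""
    url: str
    title: str
    body: str
    authors: Tuple[str, ...] = ()
    published: Optional[str] = None  # date string, if the extractor found one


# Written once and reused for every outlet; the wording is only an example.
PROMPT = "Extract the article's title, body, authors and publication date."

# An extractor takes (prompt, url) and returns a mapping of extracted fields.
# In practice this would wrap the LLM-backed scraper graph (an assumption here).
Extractor = Callable[[str, str], Mapping[str, Any]]


def scrape_article(url: str, extractor: Extractor) -> Article:
    """Run the publisher-agnostic extractor and normalise its output into an Article."""
    raw = extractor(PROMPT, url)
    # Title and body are treated as mandatory; everything else is optional.
    missing = [key for key in ("title", "body") if not raw.get(key)]
    if missing:
        raise ValueError(f"extractor returned no value for: {', '.join(missing)}")
    return Article(
        url=url,
        title=str(raw["title"]).strip(),
        body=str(raw["body"]).strip(),
        authors=tuple(raw.get("authors") or ()),
        published=raw.get("published"),
    )


def fake_extractor(prompt: str, url: str) -> Mapping[str, Any]:
    """Stand-in for the real LLM call so the sketch runs offline."""
    return {"title": " Example headline ", "body": "Text.", "authors": ["A. Writer"]}


article = scrape_article("https://example.com/news/1", fake_extractor)
```

Keeping the extractor behind a plain callable means the per-publisher code disappears while the output schema and its validation stay under the project's control, whichever LLM backend ends up being chosen.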

Draft

Open Questions

No response
