📰 news-fetch

news-fetch is an open-source, easy-to-use news crawler that extracts structured information from almost any news website 🌐. It can recursively follow internal hyperlinks and read RSS feeds to fetch both recent and archived articles 📚. To crawl a news website completely, you only need to provide its root URL 🔍. news-fetch combines several established libraries and tools.

I built this tool to minimize NaN or empty values when scraping data from various news websites 🚀. It's platform-independent and written in Python 3, making it easy for programmers and developers to access news data for their applications 💻.

📦 Dependencies

📝 Extracted Information

news-fetch extracts the following attributes from news articles. You can also check out an example JSON file generated by news-please.

📰 Headline
✍️ Author(s)
📅 Publication date
🗞️ Publication
📂 Category
🌍 Source domain
📑 Article content
📝 Summary
🔑 Keywords
🌐 URL
🌐 Language
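
As a rough illustration of how the attributes above might be held in one record, here is a minimal sketch. The `ArticleRecord` class, its field names, and the `to_json` and `missing_fields` helpers are hypothetical and are not part of news-fetch; they only mirror the list above and show one way to spot empty values after a scrape.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List, Optional


@dataclass
class ArticleRecord:
    """Hypothetical container mirroring the attributes listed above."""
    headline: Optional[str] = None
    authors: List[str] = field(default_factory=list)
    publication_date: Optional[str] = None
    publication: Optional[str] = None
    category: Optional[str] = None
    source_domain: Optional[str] = None
    content: Optional[str] = None
    summary: Optional[str] = None
    keywords: List[str] = field(default_factory=list)
    url: Optional[str] = None
    language: Optional[str] = None


def to_json(record: ArticleRecord) -> str:
    """Serialize a record to a JSON string, keeping non-ASCII text readable."""
    return json.dumps(asdict(record), ensure_ascii=False, indent=2)


def missing_fields(record: ArticleRecord) -> List[str]:
    """Return the names of fields that are None, an empty string, or an empty list."""
    return [name for name, value in asdict(record).items()
            if value in (None, "", [])]
```

For example, `missing_fields(ArticleRecord(headline="Example"))` lists every field except `headline`, which makes it easy to see which attributes a given site failed to provide.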

🔧 Dependency Installation

Use the package manager pip to install the required dependencies:

pip install -r requirements.txt

🚀 Usage

You can download news-fetch by clicking the green download button.

To scrape all the news details, use the newspaper function:
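
A minimal sketch of what such a call might look like follows. It assumes that `newspaper` can be imported from `newsfetch.news`, that it takes an article URL, and that the result exposes a `headline` attribute; these details are assumptions and should be checked against the package source. The `fetch_headline` wrapper and its `fetcher` parameter are hypothetical, added only so the sketch can be tried without network access.

```python
from typing import Callable, Optional


def fetch_headline(url: str, fetcher: Optional[Callable] = None) -> str:
    """Return the headline of the article at `url`.

    `fetcher` is a hypothetical hook: any callable that takes a URL and
    returns an object with a `headline` attribute.
    """
    if fetcher is None:
        # Assumed import path; imported lazily so this module still loads
        # when the newsfetch package is not installed.
        from newsfetch.news import newspaper
        fetcher = newspaper
    article = fetcher(url)
    # Assumed attribute name, matching the "Headline" item listed above.
    return article.headline
```

Calling `fetch_headline` with only an article URL would use news-fetch itself, provided the assumed import path and attribute name hold.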

🤝 Contributing

Pull requests are welcome! For major changes, please open an issue first to discuss what you would like to change.

Make sure to update tests as appropriate.
