Sitemap Crawler

Overview

Sitemap Crawler is a simple web-based tool built with Node.js, Express, and SimpleCrawler. It allows users to input a URL, crawl the website, and generate lists of successful and failed URLs. The results can be downloaded for further analysis.

Features

Web-based UI for entering a URL and starting a crawl.
Uses SimpleCrawler to fetch and analyze pages.
Lists successful and failed URLs separately.
Allows downloading results for further analysis.
Built with Node.js, Express, and Bootstrap for easy use and customization.

Installation & Setup

Prerequisites

Ensure you have Node.js installed on your system.

Clone the Repository

git clone https://github.com/Nuraj250/sitemap-crawler.git
cd sitemap-crawler

Install Dependencies

npm install

Run the Server

node app.js

By default, the server runs on http://localhost:3000.

Usage

Open index.html in your browser.
Enter a URL in the input field.
Click "Start Crawl" to begin.
The progress will be displayed in a modal.
Once completed, view the results and download the successful or failed URLs.

Technologies Used

Node.js - Backend framework
Express.js - Server setup
SimpleCrawler - Web crawling library
Bootstrap - UI framework
JavaScript - Frontend scripting

Contribution

Feel free to fork this repository and submit pull requests with improvements or additional features.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
app.js		app.js
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
script.js		script.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sitemap Crawler

Overview

Features

Installation & Setup

Prerequisites

Clone the Repository

Install Dependencies

Run the Server

Usage

Technologies Used

Contribution

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Sitemap Crawler

Overview

Features

Installation & Setup

Prerequisites

Clone the Repository

Install Dependencies

Run the Server

Usage

Technologies Used

Contribution

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages