This project demonstrates how to scrape tabular data from Wikipedia using pandas and save it into a CSV file for further analysis.
- Scrapes HTML tables directly with pandas.read_html()
- Cleans and structures the data
- Exports the results into a CSV file
- Scraping from wikipedia.ipynb – Jupyter Notebook with the scraping code
- Companies.csv – Example output data
- Python 3.x
- pandas
- Jupyter Notebook (optional)
Clone the repository:
git clone https://github.com/aminasaeed223/Webscraping-with-pandas.git
cd Webscraping-with-pandas
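
The scrape, clean, and export steps listed above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the helper names (`scrape_first_table`, `clean_table`, `export_csv`) and the specific cleaning steps are assumptions made for the example. It also assumes an HTML parser that pandas.read_html() can use is installed (lxml, or BeautifulSoup4 with html5lib).

```python
from io import StringIO

import pandas as pd


def scrape_first_table(html: str) -> pd.DataFrame:
    """Parse an HTML document and return its first <table> as a DataFrame.

    read_html returns a list with one DataFrame per table found; the text
    is wrapped in StringIO because passing literal HTML strings directly
    is deprecated in recent pandas versions.
    """
    tables = pd.read_html(StringIO(html))
    return tables[0]


def clean_table(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical cleaning step: tidy column names and drop empty rows."""
    df = df.copy()
    # Strip stray whitespace from the header labels.
    df.columns = [str(col).strip() for col in df.columns]
    # Remove rows where every cell is missing, then renumber the index.
    return df.dropna(how="all").reset_index(drop=True)


def export_csv(df: pd.DataFrame, path_or_buf) -> None:
    """Write the DataFrame to a CSV file (or file-like object) without the index."""
    df.to_csv(path_or_buf, index=False)
```

With these helpers, a page's HTML (fetched however you prefer) could be passed to `scrape_first_table`, the result to `clean_table`, and the cleaned frame to `export_csv(df, "Companies.csv")`. pandas.read_html() also accepts a URL directly, though some sites reject requests that lack a browser-like user agent.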