Commit 7b4ad4f

Merge pull request #169 from Harshitmishra001/main
Updated README.MD
2 parents 87d510d + 836bf60 commit 7b4ad4f

1 file changed

+47
-6
lines changed

README.md

Lines changed: 47 additions & 6 deletions
Original file line number | Diff line number | Diff line change
@@ -1,18 +1,30 @@
11
<img src="https://raw.githubusercontent.com/me-shweta/Design-Den/main/Reviews%20Scraping%20Image.png" align="center"/>
22

3-
<h2 align="center">🎬IMDB Movie review Scrapping📊</h2>
4-
<blockquote align="center">Scrapping the movie review ✏️ using python programming language💻. </blockquote>
3+
<h2 align="center"><picture>
4+
<source srcset="https://fonts.gstatic.com/s/e/notoemoji/latest/1faa9/512.webp" type="image/webp">
5+
<img src="https://fonts.gstatic.com/s/e/notoemoji/latest/1faa9/512.gif" alt="🪩" width="32" height="32">
6+
</picture>IMDB Movie review Scrapping<picture>
7+
<source srcset="https://fonts.gstatic.com/s/e/notoemoji/latest/2699_fe0f/512.webp" type="image/webp">
8+
<img src="https://fonts.gstatic.com/s/e/notoemoji/latest/2699_fe0f/512.gif" alt="" width="32" height="32">
9+
</picture></h2>
10+
<blockquote align="center"><b>Scrapping the movie review ✏️ using python programming language💻. </b> </blockquote>
511

612
🔍Welcome to the IMDb Movie Review Scraper project! 🌟 This Python script is designed to scrape movie reviews from IMDb, providing valuable data for analysis and research purposes. The IMDb Movie Review Scraping project aims to gather a new dataset by automatically extracting movie reviews from IMDb. This dataset will support various natural language processing tasks, including sentiment analysis and recommendation systems. Using web scraping techniques, such as Beautiful Soup, movie reviews are collected, preprocessed, and structured into a CSV format suitable for analysis, including Support Vector Machine classification. 📈
7-
## Features
13+
## <picture>
14+
<source srcset="https://fonts.gstatic.com/s/e/notoemoji/latest/2699_fe0f/512.webp" type="image/webp">
15+
<img src="https://fonts.gstatic.com/s/e/notoemoji/latest/2699_fe0f/512.gif" alt="" width="32" height="32">
16+
</picture><b style="font-size:3vw">Features</b>
817

918
**`Semi-supervised-sequence-learning-Project`** : replication process is done over here and for further analysis creation of new data is required.
1019

1120
1. Scraping Movie Reviews 🕵️‍♂️
1221
- `Movie_review_imdb_scrapping.ipynb` - The script fetches user reviews from IMDb, providing access to a diverse range of opinions and feedback for different movies. It utilizes BeautifulSoup, a powerful Python library for web scraping, to extract data from IMDb's web pages efficiently and accurately. 🎥🔎
1322

1423
2. Customizable Scraper 🛠️
15-
- `rename_files.ipynb` - Users can customize the scraper to target specific time periods, ratings, and other parameters, enabling focused data collection based on their requirements. This flexibility allows researchers, analysts, and enthusiasts to tailor the scraping process to their specific needs. 🎯🔧
24+
- `rename_files.ipynb` - Users can customize the scraper to target specific time periods, ratings, and other parameters, enabling focused data collection based on their requirements. This flexibility allows researchers, analysts, and enthusiasts to tailor the scraping process to their specific needs. <picture>
25+
<source srcset="https://fonts.gstatic.com/s/e/notoemoji/latest/1f3af/512.webp" type="image/webp">
26+
<img src="https://fonts.gstatic.com/s/e/notoemoji/latest/1f3af/512.gif" alt="🎯" width="32" height="32">
27+
</picture>
1628

1729
3. CSV Output 📁
1830
- `convert_texts_to_csv.ipynb` - The scraped data is saved into a CSV file, allowing for easy import into data analysis software or further processing. The CSV format ensures compatibility with a wide range of tools and platforms, making it convenient to incorporate the scraped data into various workflows and projects. 💾💼
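
The three features listed above (fetch reviews, extract fields, save as CSV) can be illustrated with a minimal sketch. This is not the code from the project's notebooks: it uses the standard library's `html.parser` in place of Beautiful Soup so that it runs without extra dependencies, it parses an inline sample instead of fetching a live page, and the markup, the class names (`review`, `rating`, `text`), and the helper names `ReviewParser` and `reviews_to_csv` are all hypothetical.

```python
import csv
import io
from html.parser import HTMLParser

# Hypothetical markup for illustration only; real IMDb pages use
# different class names, and those change over time.
SAMPLE_HTML = """
<div class="review"><span class="rating">8</span><p class="text">Great pacing, strong cast.</p></div>
<div class="review"><span class="rating">3</span><p class="text">Slow, and far too long.</p></div>
"""


class ReviewParser(HTMLParser):
    """Collects one {rating, text} dict per review block in the sample markup."""

    def __init__(self):
        super().__init__()
        self.reviews = []    # finished rows
        self._field = None   # name of the field currently being read, if any
        self._current = {}   # row under construction

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if css_class == "review":
            self._current = {}
        elif css_class in ("rating", "text"):
            self._field = css_class

    def handle_data(self, data):
        # Only keep text that sits inside a rating or text element.
        if self._field:
            self._current[self._field] = data.strip()

    def handle_endtag(self, tag):
        if self._field and tag in ("span", "p"):
            self._field = None
        elif tag == "div" and self._current:
            self.reviews.append(self._current)
            self._current = {}


def reviews_to_csv(html):
    """Parse the review markup and return the rows as CSV text."""
    parser = ReviewParser()
    parser.feed(html)
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["rating", "text"])
    writer.writeheader()
    writer.writerows(parser.reviews)
    return buffer.getvalue()


csv_text = reviews_to_csv(SAMPLE_HTML)
print(csv_text)
```

The `csv` module quotes any review text that contains commas, so the output can be read back with `csv.DictReader` or loaded into a data analysis tool without splitting a review across columns.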
@@ -80,10 +92,39 @@ When prompted by the app, upload a CSV (comma separated value) file containing t
8092
Streamlit app link: https://scrape-review-analysis.streamlit.app
8193
8294
## Contribution
83-
🎉Contributions are welcome! If you have any suggestions for improvements or new features, please feel free to submit a pull request. Your contributions help make this project better for everyone. 🚀
95+
<picture>
96+
<source srcset="https://fonts.gstatic.com/s/e/notoemoji/latest/1f389/512.webp" type="image/webp">
97+
<img src="https://fonts.gstatic.com/s/e/notoemoji/latest/1f389/512.gif" alt="🎉" width="32" height="32">
98+
</picture>Contributions are welcome! If you have any suggestions for improvements or new features, please feel free to submit a pull request. Your contributions help make this project better for everyone. <picture>
99+
<source srcset="https://fonts.gstatic.com/s/e/notoemoji/latest/1f680/512.webp" type="image/webp">
100+
<img src="https://fonts.gstatic.com/s/e/notoemoji/latest/1f680/512.gif" alt="🚀" width="32" height="32">
101+
</picture>
102+
<div align="Left">
103+
<h2><font size="6">
104+
<picture>
105+
<source srcset="https://fonts.gstatic.com/s/e/notoemoji/latest/1f525/512.webp" type="image/webp">
106+
<img src="https://fonts.gstatic.com/s/e/notoemoji/latest/1f525/512.gif" alt="🔥" width="32" height="32">
107+
</picture>Contribution</font></h2>
108+
</div>
109+
<h3>This project thanks all the contributors for having your valuable contribution to our project</h3>
110+
<br>
111+
112+
<center>
113+
<a href="https://github.com/Recode-Hive/Scrape-ML/graphs/contributors">
114+
<img src="https://contrib.rocks/image?repo=Recode-Hive/Scrape-ML" />
115+
</a>
116+
</center>
117+
<br>
118+
84119
## Final Dataset
85120
86121
🔬Here is the Link to **Final Dataset:** [Drive Link](https://drive.google.com/file/d/1sTNAeuy-99Hao0V5AOVznLXyDJC2zuFn/view?usp=sharing) containing the scraped IMDb movie reviews. This dataset can be used for analysis, research, or any other purposes you require. 📦
87122
## Support
88123
89-
🤝For any issues regarding the scraper, feel free to open an issue on GitHub. We'll be happy to assist you with any problems or inquiries you may have. 🛠️
124+
<picture>
125+
<source srcset="https://fonts.gstatic.com/s/e/notoemoji/latest/2728/512.webp" type="image/webp">
126+
<img src="https://fonts.gstatic.com/s/e/notoemoji/latest/2728/512.gif" alt="✨" width="20" height="20">
127+
</picture>For any issues regarding the scraper, feel free to open an issue on GitHub. We'll be happy to assist you with any problems or inquiries you may have. 🛠️
128+
129+
<p align="right"><a href="#top">Back to top</a></p>
130+
*
