You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Now let's use the framework to create a new version of our scraper. In the same project directory where our `main.py` file lives, create a file `newmain.py`. This way, we can keep peeking at the original implementation while working on the new one. The initial content will look like this:
48
48
49
-
<!-- vale off -->
50
49
```py title="newmain.py"
51
50
import asyncio
52
51
from crawlee.beautifulsoup_crawler import BeautifulSoupCrawler
@@ -63,7 +62,6 @@ async def main():
63
62
if__name__=='__main__':
64
63
asyncio.run(main())
65
64
```
66
-
<!-- vale on -->
67
65
68
66
In the code, we do the following:
69
67
@@ -427,15 +425,15 @@ If you export the dataset as JSON, it should look something like this:
The [Global Top 10](https://www.netflix.com/tudum/top10) page has a table listing the most popular Netflix films worldwide. Scrape the movie names from this page, then search for each movie on [IMDb](https://www.imdb.com/). Assume the first search result is correct and retrieve the film's rating. Each item you push to Crawlee's default dataset should include the following data:
0 commit comments