This is a Scrapy project to scrape quotes and author information from http://quotes.toscrape.com.
This project is only meant for educational purposes.
Main Site
Quotes Selection
Comment Selection
Author Selection
Tags Selection
Author Details Selection
This project extracts quotes, combined with the respective author names and tags. The extracted data looks like this sample:
{
'Author': 'Douglas Adams',
'Comment': '“I may not have gone where I intended to go, but I think I ...”',
'Tags': ['life', 'navigation'],
'Author Born Location': 'in Atlanta, Georgia, The United States',
'Author Description': "Martin Luther King, Jr. was one of the pivotal leaders of the American civil rights movement. King was a'
}
This project contains one spider and you can list them using the list
command:
$ scrapy list
quotesSpider
Spider extract the data from quotes page and visit author hyperlink and extract auther infomation also.
You can run a spider using the scrapy crawl command, such as:
$ scrapy crawl quotesSpider
If you want to save the scraped data to a file, you can pass the -o option:
$ scrapy crawl quotesSpider -o output.json





