TPF2017/Scrapy_crawl_qichacha

Scrapy_crawl_qichacha

Requirements:

Python version: Python 3.6

Make sure that Scrapy is installed on your computer.

Usage:

First, visit https://www.qichacha.com/user_login and log in to get your cookie.

Then clear any cookies already in cookies.txt and paste your own cookie into it.

You can paste more than one cookie into the file. This improves the crawler's performance, because the website limits how often it can be visited.
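As a rough illustration of how several cookies could be shared across requests, here is a minimal sketch. It assumes cookies.txt holds one raw cookie string (`name=value; name2=value2`) per non-empty line, which this README does not confirm. The helpers `parse_cookie_header`, `load_cookie_pool`, and `rotating_cookies` are hypothetical names and are not taken from this project.

```python
from itertools import cycle
from pathlib import Path


def parse_cookie_header(raw):
    """Turn 'k1=v1; k2=v2' into {'k1': 'v1', 'k2': 'v2'}."""
    cookies = {}
    for pair in raw.split(";"):
        # partition keeps any '=' inside the value intact
        name, sep, value = pair.strip().partition("=")
        if sep and name:
            cookies[name] = value
    return cookies


def load_cookie_pool(path="cookies.txt"):
    """Read one cookie string per non-empty line (assumed file layout)."""
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    return [parse_cookie_header(line) for line in lines if line.strip()]


def rotating_cookies(pool):
    """Return an endless iterator that hands out cookies in round-robin order."""
    return cycle(pool)
```

Each dictionary produced this way could be passed as the `cookies` argument of a `scrapy.Request`, so that consecutive requests are spread over different accounts.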

Open a command window and change to the project's root directory:

cd Scrapy_crawl_qichacha-master

Then start to crawl:

scrapy crawl qcc

Then follow the instructions shown in the window.

Note:

Crawling too fast triggers a verification code, which makes the program report an error. The program therefore uses a relatively slow crawl speed, which you can change in settings.py.
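For orientation, the standard Scrapy settings that control crawl speed look like the fragment below. The values are illustrative assumptions, not the ones shipped in this project's settings.py, so check the file itself before changing anything.

```python
# settings.py (fragment): illustrative values, not this project's defaults

# Seconds to wait between consecutive requests to the same site
DOWNLOAD_DELAY = 3

# Vary the actual wait between 0.5x and 1.5x of DOWNLOAD_DELAY
RANDOMIZE_DOWNLOAD_DELAY = True

# Send at most one request at a time to each domain
CONCURRENT_REQUESTS_PER_DOMAIN = 1
```

Lowering `DOWNLOAD_DELAY` speeds up the crawl but makes the verification code more likely to appear.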

When the program reports an error, a verification code has appeared for your account. Visit the URL in a browser and enter the verification code.

About

Crawls basic company information from Qichacha (企查查).
