Open
Labels
bug: Something isn't working
Description
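🔍 Issue checklist
- I have carefully read the project's summary of frequently asked questions
- I have searched and reviewed the closed issues
- I confirm this is not caused by common problems such as slider CAPTCHAs, expired cookies, cookie extraction errors, or platform risk control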
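🐛 Problem description
- Douyin returns no data. The QR-code login succeeds and the login state persists. The web page opens and renders normally, and videos can be clicked, but no data is retrieved.
- A separate problem: after the program finishes running, the window does not close automatically.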
📝 Steps to reproduce
Change only the search keyword, then run the program.
💻 Environment
- Operating system: Windows 11
- Python version: 3.12
- Using an IP proxy: No
- Using VPN software: No
- Target platform (Douyin/Xiaohongshu/Weibo, etc.): Douyin
📋 Error log
Peter ❯ python main.py
C:\Users\Peter\AppData\Local\miniconda3\envs\mediac\Lib\site-packages\jieba\_compat.py:18: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
import pkg_resources
2025-10-31 15:05:40 MediaCrawler INFO (core.py:60) - [DouYinCrawler] 使用CDP模式启动浏览器
2025-10-31 15:05:40 MediaCrawler INFO (cdp_browser.py:94) - [CDPBrowserManager] 检测到浏览器: Google Chrome (正在现有的浏览器会话中打开。)
2025-10-31 15:05:40 MediaCrawler INFO (cdp_browser.py:97) - [CDPBrowserManager] 浏览器路径: C:\Program Files\Google\Chrome\Application\chrome.exe
2025-10-31 15:05:40 MediaCrawler INFO (cdp_browser.py:137) - [CDPBrowserManager] 用户数据目录: D:\new_media_tools\MediaCrawler\browser_data\cdp_dy_user_data_dir
2025-10-31 15:05:40 MediaCrawler INFO (browser_launcher.py:154) - [BrowserLauncher] 启动浏览器: C:\Program Files\Google\Chrome\Application\chrome.exe
2025-10-31 15:05:40 MediaCrawler INFO (browser_launcher.py:155) - [BrowserLauncher] 调试端口: 9223
2025-10-31 15:05:40 MediaCrawler INFO (browser_launcher.py:156) - [BrowserLauncher] 无头模式: False
2025-10-31 15:05:40 MediaCrawler INFO (browser_launcher.py:186) - [BrowserLauncher] 等待浏览器在端口 9223 上准备就绪...
2025-10-31 15:05:40 MediaCrawler INFO (browser_launcher.py:195) - [BrowserLauncher] 浏览器已在端口 9223 上准备就绪
2025-10-31 15:05:41 MediaCrawler INFO (cdp_browser.py:111) - [CDPBrowserManager] CDP端口 9223 可访问
2025-10-31 15:05:42 httpx INFO (_client.py:1740) - HTTP Request: GET http://localhost:9223/json/version "HTTP/1.1 200 OK"
2025-10-31 15:05:42 MediaCrawler INFO (cdp_browser.py:175) - [CDPBrowserManager] 获取到浏览器WebSocket URL: ws://localhost:9223/devtools/browser/4b09025a-d07f-40b0-a74c-99bf9a22beab
2025-10-31 15:05:42 MediaCrawler INFO (cdp_browser.py:194) - [CDPBrowserManager] 正在通过CDP连接到浏览器: ws://localhost:9223/devtools/browser/4b09025a-d07f-40b0-a74c-99bf9a22beab
2025-10-31 15:05:42 MediaCrawler INFO (cdp_browser.py:200) - [CDPBrowserManager] 成功连接到浏览器
2025-10-31 15:05:42 MediaCrawler INFO (cdp_browser.py:201) - [CDPBrowserManager] 浏览器上下文数量: 1
2025-10-31 15:05:42 MediaCrawler INFO (cdp_browser.py:226) - [CDPBrowserManager] 使用现有的浏览器上下文
2025-10-31 15:05:42 MediaCrawler INFO (cdp_browser.py:258) - [CDPBrowserManager] 已添加反检测脚本: libs/stealth.min.js
2025-10-31 15:05:42 MediaCrawler INFO (core.py:353) - [DouYinCrawler] CDP浏览器信息: {'version': '141.0.7390.123', 'contexts_count': 1, 'debug_port': 9223, 'is_connected': True}
2025-10-31 15:05:45 MediaCrawler INFO (core.py:108) - [DouYinCrawler.search] Begin search douyin keywords
2025-10-31 15:05:45 MediaCrawler INFO (core.py:115) - [DouYinCrawler.search] Current keyword: 兼职
2025-10-31 15:05:45 MediaCrawler INFO (core.py:121) - [DouYinCrawler.search] Skip 0
2025-10-31 15:05:45 MediaCrawler INFO (core.py:125) - [DouYinCrawler.search] search douyin keyword: 兼职, page: 1
2025-10-31 15:05:46 httpx INFO (_client.py:1740) - HTTP Request: GET https://www.douyin.com/aweme/v1/web/general/search/single/?search_channel=aweme_general&enable_history=1&keyword=%E5%85%BC%E8%81%8C&search_source=tab_search&query_correct_type=1&is_filter_search=0&from_group_id=7378810571505847586&offset=0&count=15&need_filter_settings=1&list_type=multi&search_id=&device_platform=webapp&aid=6383&channel=channel_pc_web&version_code=190600&version_name=19.6.0&update_version_code=170400&pc_client_type=1&cookie_enabled=true&browser_language=zh-CN&browser_platform=MacIntel&browser_name=Chrome&browser_version=125.0.0.0&browser_online=true&engine_name=Blink&os_name=Mac+OS&os_version=10.15.7&cpu_core_num=8&device_memory=8&engine_version=109.0&platform=PC&screen_width=2560&screen_height=1440&effective_type=4g&round_trip_time=50&webid=7883741341431410401&msToken=50hztnFaU6MM5IWdDzVEVixerCiai-Vzwcm1CB8gZ0fvA4RfOcZ7sZ0CfcB0RrGgoN5E55pOitLwi3Y0QQWR_wp81bwCY-8dZwlLJb4jcY7bbKz-nPfGUvimZM2-YTyIkUy_3PtLuHx-cjLEtl9GwyqegyNWZPjary6XeXqmiKxz&a_bogus=xXRZ%2F5gkdkgsXDyk5-9LfY3q6UZ3YZ7w0trEMD2fqx3v9L39HMPM9exoPL7vgwDjiT%2FQIeYjy4hbT3ohrQ2y8qwf9W0L%2F25gsDSkKl12so0j53inCLf%2FE0iE5hsAtFH8svr4iKi8owICSYyhldAJ5kIlO62-zo0%2F94f%3D "HTTP/1.1 200 OK"
2025-10-31 15:05:46 MediaCrawler INFO (core.py:133) - [DouYinCrawler.search] search douyin keyword: 兼职, page: 1 is empty,[]`
2025-10-31 15:05:46 MediaCrawler INFO (core.py:155) - [DouYinCrawler.search] keyword:兼职, aweme_list:[]
2025-10-31 15:05:46 MediaCrawler INFO (core.py:105) - [DouYinCrawler.start] Douyin Crawler finished ...
📷 Error screenshots
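One detail in the error log above may be worth checking during triage: the search request reports `browser_platform=MacIntel`, `os_name=Mac+OS` and `browser_version=125.0.0.0`, while the environment is Windows 11 and the log shows Chrome 141.0.7390.123. Whether this mismatch has anything to do with the empty result is unconfirmed. The sketch below only shows one way to pull those parameters out of a logged URL with the Python standard library. The helper name `fingerprint_params` and the shortened URL are illustrative assumptions, not part of MediaCrawler.

```python
from urllib.parse import parse_qs, urlsplit

# Shortened copy of the request URL from the log; only a few of the logged
# query parameters are kept here for illustration.
LOGGED_URL = (
    "https://www.douyin.com/aweme/v1/web/general/search/single/"
    "?search_channel=aweme_general&keyword=%E5%85%BC%E8%81%8C"
    "&offset=0&count=15&browser_platform=MacIntel&browser_name=Chrome"
    "&browser_version=125.0.0.0&os_name=Mac+OS&os_version=10.15.7"
)

# Parameters that describe the client the request claims to come from.
KEYS_OF_INTEREST = (
    "keyword",
    "browser_platform",
    "browser_version",
    "os_name",
    "os_version",
)


def fingerprint_params(url: str) -> dict[str, str]:
    """Hypothetical helper: return selected, URL-decoded query parameters."""
    query = parse_qs(urlsplit(url).query)
    # parse_qs maps each key to a list of values; take the first one.
    return {key: query[key][0] for key in KEYS_OF_INTEREST if key in query}


params = fingerprint_params(LOGGED_URL)
for name, value in params.items():
    print(f"{name} = {value}")
```

Comparing this output with the actual machine (Windows 11, Chrome 141) makes the mismatch easy to see, but it does not by itself explain why `aweme_list` comes back empty.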

antecanis8