Commit 777a9af

Docs: Update features
1 parent 88e35aa commit 777a9af

2 files changed: +15, -21 lines

README.md

Lines changed: 10 additions & 13 deletions
```diff
@@ -10,14 +10,14 @@ x-crawl is a flexible Node.js multifunctional crawler library. Flexible usage an
 
 - **🔥 Asynchronous Synchronous** - Just change the mode property to toggle asynchronous or synchronous crawling mode.
 - **⚙️ Multiple purposes** - It can crawl pages, crawl interfaces, crawl files and poll crawls to meet the needs of various scenarios.
+- **☁️ Crawl SPA** - Crawl SPA (Single Page Application) to generate pre-rendered content (aka "SSR" (Server Side Rendering)).
+- **⚒️ Control Page** - Automate form submission, UI testing, keyboard input, event manipulation, open browser, etc.
 - **🖋️ Flexible writing style** - The same crawling API can be adapted to multiple configurations, and each configuration method is very unique.
 - **⏱️ Interval Crawling** - No interval, fixed interval and random interval to generate or avoid high concurrent crawling.
 - **🔄 Failed Retry** - Avoid crawling failure due to short-term problems, and customize the number of retries.
 - **➡️ Proxy Rotation** - Auto-rotate proxies with failure retry, custom error times and HTTP status codes.
 - **👀 Device Fingerprinting** - Zero configuration or custom configuration, avoid fingerprinting to identify and track us from different locations.
 - **🚀 Priority Queue** - According to the priority of a single crawling target, it can be crawled ahead of other targets.
-- **☁️ Crawl SPA** - Crawl SPA (Single Page Application) to generate pre-rendered content (aka "SSR" (Server Side Rendering)).
-- **⚒️ Control Page** - You can submit form, keyboard input, event operation, generate screenshots of the page, etc.
 - **🧾 Capture Record** - Capture and record crawling, and use colored strings to remind in the terminal.
 - **🦾 TypeScript** - Own types, implement complete types through generics.
 
```

```diff
@@ -136,7 +136,7 @@ Take the automatic acquisition of some photos of experiences and homes around th
 import xCrawl from 'x-crawl'
 
 // 2.Create a crawler instance
-const myXCrawl = xCrawl({maxRetry: 3,intervalTime: { max: 3000, min: 2000 }})
+const myXCrawl = xCrawl({ maxRetry: 3, intervalTime: { max: 3000, min: 2000 } })
 
 // 3.Set the crawling task
 /*
```
```diff
@@ -164,12 +164,9 @@ myXCrawl.startPolling({ d: 1 }, async (count, stopPolling) => {
 await new Promise((r) => setTimeout(r, 300))
 
 // Gets the URL of the page image
-const urls = await page.$$eval(
-  `${elSelectorMap[id - 1]} img`,
-  (imgEls) => {
-    return imgEls.map((item) => item.src)
-  }
-)
+const urls = await page.$$eval(`${elSelectorMap[id - 1]} img`, (imgEls) => {
+  return imgEls.map((item) => item.src)
+})
 targets.push(...urls)
 
 // Close page
```
```diff
@@ -283,7 +280,7 @@ myXCrawl.crawlPage('https://www.example.com').then((res) => {
 
 #### Browser Instance
 
-When you call crawlPage API to crawl pages in the same crawler instance, the browser instance used is the same, because the crawlPage API of the browser instance in the same crawler instance is shared. For specific usage, please refer to [Browser](https://pptr.dev/api/puppeteer.browser).
+When you call crawlPage API to crawl pages in the same crawler instance, the browser instance used is the same, because the crawlPage API of the browser instance in the same crawler instance is shared. For specific usage, please refer to [Browser](https://pptr.dev/api/puppeteer.browser).
 
 **Note:** The browser will keep running and the file will not be terminated. If you want to stop, you can execute browser.close() to close it. Do not call [crawlPage](#crawlPage) or [page](#page) if you need to use it later. Because the crawlPage API of the browser instance in the same crawler instance is shared.
 
```

```diff
@@ -332,9 +329,9 @@ Disable running the browser in headless mode.
 import xCrawl from 'x-crawl'
 
 const myXCrawl = xCrawl({
-  maxRetry: 3,
-  // Cancel running the browser in headless mode
-  crawlPage: { launchBrowser: { headless: false } }
+  maxRetry: 3,
+  // Cancel running the browser in headless mode
+  crawlPage: { launchBrowser: { headless: false } }
 })
 
 myXCrawl.crawlPage('https://www.example.com').then((res) => {})
```

docs/cn.md

Lines changed: 5 additions & 8 deletions
```diff
@@ -10,14 +10,14 @@ x-crawl is a flexible Node.js multifunctional crawler library. Flexible usage and
 
 - **🔥 Async & Sync** - Switch between asynchronous and synchronous crawling mode just by changing the mode property.
 - **⚙️ Multiple purposes** - Crawl pages, crawl interfaces, crawl files and poll crawls to meet the needs of various scenarios.
+- **☁️ Crawl SPA** - Crawl SPA (Single Page Application) to generate pre-rendered content (i.e. "SSR" (Server Side Rendering)).
+- **⚒️ Control Page** - Automate form submission, UI testing, keyboard input, event operation, opening the browser, etc.
 - **🖋️ Flexible writing style** - The same crawling API fits multiple configurations, and each configuration method is distinctive.
 - **⏱️ Interval Crawling** - No interval, fixed interval and random interval, to generate or avoid high-concurrency crawling.
 - **🔄 Failed Retry** - Avoid crawling failures caused by transient problems, with a customizable number of retries.
 - **➡️ Proxy Rotation** - Together with failed retry, rotate proxies automatically based on custom error counts and HTTP status codes.
 - **👀 Device Fingerprinting** - Zero or custom configuration to keep fingerprinting from identifying and tracking us from different locations.
 - **🚀 Priority Queue** - A single crawling target can be crawled ahead of other targets according to its priority.
-- **☁️ Crawl SPA** - Crawl SPA (Single Page Application) to generate pre-rendered content (i.e. "SSR" (Server Side Rendering)).
-- **⚒️ Control Page** - Form submission, keyboard input, event operation, generating screenshots of the page, etc.
 - **🧾 Capture Record** - Capture and record crawling, with colored-string reminders in the terminal.
 - **🦾 TypeScript** - Ships its own types, with complete typing implemented through generics.
 
```

```diff
@@ -162,12 +162,9 @@ myXCrawl.startPolling({ d: 1 }, async (count, stopPolling) => {
 await new Promise((r) => setTimeout(r, 300))
 
 // Get the URLs of the page images
-const urls = await page.$$eval(
-  `${elSelectorMap[id - 1]} img`,
-  (imgEls) => {
-    return imgEls.map((item) => item.src)
-  }
-)
+const urls = await page.$$eval(`${elSelectorMap[id - 1]} img`, (imgEls) => {
+  return imgEls.map((item) => item.src)
+})
 targets.push(...urls)
 
 // Close page
```
