Commit 797677f

Remove duplicated text from actions
1 parent d814102 commit 797677f

1 file changed: +3 -11 lines changed

specs/crawler/common/schemas/action.yml

Lines changed: 3 additions & 11 deletions
@@ -25,17 +25,12 @@ Action:
 Use `discoveryPatterns` to define pages that should be visited _just_ for their links to other pages,
 _not_ their content.
 It functions similarly to the `pathsToMatch` action but without record extraction.
-
-`discoveryPatterns` uses [micromatch](https://github.com/micromatch/micromatch) to support matching with wildcards,
-negation, and other features.
 items:
 $ref: '#/urlPattern'
 fileTypesToMatch:
 type: array
 description: |
 File types for crawling non-HTML documents.
-
-For more information, see [Extract data from non-HTML documents](https://www.algolia.com/doc/tools/crawler/extracting-data/non-html-documents/).
 maxItems: 100
 items:
 $ref: '#/fileTypes'
@@ -91,7 +86,8 @@ Action:
 maxItems: 100
 items:
 type: string
-description: DOM selector. Negation is supported. This lets you ignore pages that match the selector.
+description: |
+Prefix a selector with `!` to ignore matching pages.
 example:
 - .products
 - '!.featured'
@@ -114,8 +110,6 @@ ActionSchedule:
 fileTypes:
 type: string
 description: |
-Supported file types for indexing non-HTML documents.
-
 For more information, see [Extract data from non-HTML documents](https://www.algolia.com/doc/tools/crawler/extracting-data/non-html-documents/).
 enum:
 - doc
@@ -131,9 +125,7 @@ fileTypes:
 urlPattern:
 type: string
 description: |
-Pattern for matching URLs.
-
-Uses [micromatch](https://github.com/micromatch/micromatch) for negation, wildcards, and more.
+Use [micromatch](https://github.com/micromatch/micromatch) for negation, wildcards, and more.
 example: https://www.algolia.com/**
 
 hostnameAliases:
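
As a rough illustration of the fields whose descriptions this commit edits, a crawler action might combine them as in the sketch below. This is not taken from the spec: the property name `selectorsToMatch`, the `/blog/` path, and the overall layout are assumptions made for the example. Only `discoveryPatterns`, `pathsToMatch`, `fileTypesToMatch`, the `doc` file type, the `https://www.algolia.com/**` pattern, and the `.products` / `'!.featured'` selectors appear in the diff itself.

```yaml
# Hypothetical action fragment; all values are illustrative only.
pathsToMatch:
  - https://www.algolia.com/**            # micromatch wildcard, as in the urlPattern example
  - '!https://www.algolia.com/blog/**'    # assumed path; a leading `!` negates a micromatch pattern
discoveryPatterns:
  - https://www.algolia.com/**            # visited only for links, no record extraction
fileTypesToMatch:
  - doc                                   # one of the values listed in the fileTypes enum
selectorsToMatch:                         # assumed property name; not shown in this diff
  - .products                             # match pages containing this selector
  - '!.featured'                          # `!` prefix: ignore pages matching this selector
```

Patterns and selectors that start with `!` are quoted because an unquoted leading `!` is read by YAML as a tag indicator, not as part of the string.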
