Skip to content

Commit d71e89b

Browse files
committed
Check documentation for double-word typos
1 parent 6885221 commit d71e89b

18 files changed

+653
-657
lines changed

ChangeLog

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1174,7 +1174,7 @@ gets rid of the warnings. There were also two missing casts in pcre2test.
11741174
Version 10.32 10-September-2018
11751175
-------------------------------
11761176

1177-
1. When matching using the the REG_STARTEND feature of the POSIX API with a
1177+
1. When matching using the REG_STARTEND feature of the POSIX API with a
11781178
non-zero starting offset, unset capturing groups with lower numbers than a
11791179
group that did capture something were not being correctly returned as "unset"
11801180
(that is, with offset values of -1).
@@ -1349,7 +1349,7 @@ assumed empty second branch cannot be anchored. Demonstrated by test patterns
13491349
such as /(?(1)^())b/ or /(?(?=^))b/.
13501350

13511351
40. A repeated conditional subpattern that could match an empty string was
1352-
always assumed to be unanchored. Now it it checked just like any other
1352+
always assumed to be unanchored. Now it is checked just like any other
13531353
repeated conditional subpattern, and can be found to be anchored if the minimum
13541354
quantifier is one or more. I can't see much use for a repeated anchored
13551355
pattern, but the behaviour is now consistent.

doc/html/pcre2_compile.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -98,7 +98,7 @@ <h1>pcre2_compile man page</h1>
9898
NULL immediately. Otherwise, the yield of this function is a pointer to a
9999
private data structure that contains the compiled pattern, or NULL if an error
100100
was detected. In the error case, a text error message can be obtained by
101-
passing the value returned via the <i>errorcode</i> argument to the the
101+
passing the value returned via the <i>errorcode</i> argument to the
102102
<b>pcre2_get_error_message()</b> function. The offset (in code units) where the
103103
error was encountered is returned via the <i>erroroffset</i> argument.
104104
</P>

doc/html/pcre2api.html

Lines changed: 10 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -1096,10 +1096,9 @@ <h1>pcre2api man page</h1>
10961096
matching can continue.
10971097
</P>
10981098
<P>
1099-
The default value for the limit can be set when PCRE2 is built; the default
1100-
default is 10 million, which handles all but the most extreme cases. A value
1101-
for the match limit may also be supplied by an item at the start of a pattern
1102-
of the form
1099+
The default value for the limit can be set when PCRE2 is built; the default is
1100+
10 million, which handles all but the most extreme cases. A value for the match
1101+
limit may also be supplied by an item at the start of a pattern of the form
11031102
<pre>
11041103
(*LIMIT_MATCH=ddd)
11051104
</pre>
@@ -2626,7 +2625,7 @@ <h1>pcre2api man page</h1>
26262625
A minimum of at least 1 pair is imposed by <b>pcre2_match_data_create()</b>, so
26272626
it is always possible to return the overall matched string in the case of
26282627
<b>pcre2_match()</b> or the longest match in the case of
2629-
<b>pcre2_dfa_match()</b>. The maximum number of pairs is 65535; if the the first
2628+
<b>pcre2_dfa_match()</b>. The maximum number of pairs is 65535; if the first
26302629
argument of <b>pcre2_match_data_create()</b> is greater than this, 65535 is
26312630
used.
26322631
</P>
@@ -3109,8 +3108,8 @@ <h1>pcre2api man page</h1>
31093108
also set to PCRE2_UNSET. For example, if the string "abc" is matched against
31103109
the pattern (abc)(x(yz)?)? groups 2 and 3 are not matched. The return from the
31113110
function is 2, because the highest used capture group number is 1. The offsets
3112-
for for the second and third capture groups (assuming the vector is large
3113-
enough, of course) are set to PCRE2_UNSET.
3111+
for the second and third capture groups (assuming the vector is large enough,
3112+
of course) are set to PCRE2_UNSET.
31143113
</P>
31153114
<P>
31163115
Elements in the ovector that do not correspond to capturing parentheses in the
@@ -3268,7 +3267,7 @@ <h1>pcre2api man page</h1>
32683267
<pre>
32693268
PCRE2_ERROR_NOMEMORY
32703269
</pre>
3271-
Heap memory is used to remember backgracking points. This error is given when
3270+
Heap memory is used to remember backtracking points. This error is given when
32723271
the memory allocation function (default or custom) fails. Note that a different
32733272
error, PCRE2_ERROR_HEAPLIMIT, is given if the amount of memory needed exceeds
32743273
the heap limit. PCRE2_ERROR_NOMEMORY is also returned if
@@ -3863,7 +3862,7 @@ <h1>pcre2api man page</h1>
38633862
match. If the value is not zero, the current replacement is not accepted. If
38643863
the value is greater than zero, processing continues when
38653864
PCRE2_SUBSTITUTE_GLOBAL is set. Otherwise (the value is less than zero or
3866-
PCRE2_SUBSTITUTE_GLOBAL is not set), the the rest of the input is copied to the
3865+
PCRE2_SUBSTITUTE_GLOBAL is not set), the rest of the input is copied to the
38673866
output and the call to <b>pcre2_substitute()</b> exits, returning the number of
38683867
matches so far.
38693868
</P>
@@ -4141,9 +4140,9 @@ <h1>pcre2api man page</h1>
41414140
</P>
41424141
<br><a name="SEC43" href="#TOC1">REVISION</a><br>
41434142
<P>
4144-
Last updated: 08 December 2023
4143+
Last updated: 19 January 2024
41454144
<br>
4146-
Copyright &copy; 1997-2023 University of Cambridge.
4145+
Copyright &copy; 1997-2024 University of Cambridge.
41474146
<br>
41484147
<p>
41494148
Return to the <a href="index.html">PCRE2 index page</a>.

doc/html/pcre2callout.html

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -350,12 +350,12 @@ <h1>pcre2callout man page</h1>
350350
processed in the pattern string. When the callout is at the end of the pattern,
351351
the length is zero. When the callout precedes an opening parenthesis, the
352352
length includes meta characters that follow the parenthesis. For example, in a
353-
callout before an assertion such as (?=ab) the length is 3. For an an
354-
alternation bar or a closing parenthesis, the length is one, unless a closing
355-
parenthesis is followed by a quantifier, in which case its length is included.
356-
(This changed in release 10.23. In earlier releases, before an opening
357-
parenthesis the length was that of the entire group, and before an alternation
358-
bar or a closing parenthesis the length was zero.)
353+
callout before an assertion such as (?=ab) the length is 3. For an alternation
354+
bar or a closing parenthesis, the length is one, unless a closing parenthesis
355+
is followed by a quantifier, in which case its length is included. (This
356+
changed in release 10.23. In earlier releases, before an opening parenthesis
357+
the length was that of the entire group, and before an alternation bar or a
358+
closing parenthesis the length was zero.)
359359
</P>
360360
<P>
361361
The <i>pattern_position</i> and <i>next_item_length</i> fields are intended to
@@ -471,9 +471,9 @@ <h1>pcre2callout man page</h1>
471471
</P>
472472
<br><a name="SEC8" href="#TOC1">REVISION</a><br>
473473
<P>
474-
Last updated: 03 February 2019
474+
Last updated: 19 January 2024
475475
<br>
476-
Copyright &copy; 1997-2019 University of Cambridge.
476+
Copyright &copy; 1997-2024 University of Cambridge.
477477
<br>
478478
<p>
479479
Return to the <a href="index.html">PCRE2 index page</a>.

doc/html/pcre2matching.html

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ <h1>pcre2matching man page</h1>
2727
This document describes the two different algorithms that are available in
2828
PCRE2 for matching a compiled regular expression against a given subject
2929
string. The "standard" algorithm is the one provided by the <b>pcre2_match()</b>
30-
function. This works in the same as as Perl's matching function, and provide a
30+
function. This works in the same as Perl's matching function, and provide a
3131
Perl-compatible matching operation. The just-in-time (JIT) optimization that is
3232
described in the
3333
<a href="pcre2jit.html"><b>pcre2jit</b></a>
@@ -244,9 +244,9 @@ <h1>pcre2matching man page</h1>
244244
</P>
245245
<br><a name="SEC8" href="#TOC1">REVISION</a><br>
246246
<P>
247-
Last updated: 28 August 2021
247+
Last updated: 19 January 2024
248248
<br>
249-
Copyright &copy; 1997-2021 University of Cambridge.
249+
Copyright &copy; 1997-2024 University of Cambridge.
250250
<br>
251251
<p>
252252
Return to the <a href="index.html">PCRE2 index page</a>.

doc/html/pcre2pattern.html

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1436,7 +1436,7 @@ <h1>pcre2pattern man page</h1>
14361436
</P>
14371437
<P>
14381438
Perl treats a hyphen as a literal if it appears before or after a POSIX class
1439-
(see below) or before or after a character type escape such as as \d or \H.
1439+
(see below) or before or after a character type escape such as \d or \H.
14401440
However, unless the hyphen is the last character in the class, Perl outputs a
14411441
warning in its warning mode, as this is most likely a user error. As PCRE2 has
14421442
no facility for warning, an error is given in these cases.
@@ -3728,7 +3728,7 @@ <h1>pcre2pattern man page</h1>
37283728
fail. However, if A and B match, but C fails, the backtrack to (*THEN) causes
37293729
the next alternative (ABD) to be tried. This behaviour is consistent, but is
37303730
not always the same as Perl's. It means that if two or more backtracking verbs
3731-
appear in succession, all the the last of them has no effect. Consider this
3731+
appear in succession, all but the last of them has no effect. Consider this
37323732
example:
37333733
<pre>
37343734
...(*COMMIT)(*PRUNE)...
@@ -3844,9 +3844,9 @@ <h1>pcre2pattern man page</h1>
38443844
</P>
38453845
<br><a name="SEC32" href="#TOC1">REVISION</a><br>
38463846
<P>
3847-
Last updated: 12 October 2023
3847+
Last updated: 19 January 2024
38483848
<br>
3849-
Copyright &copy; 1997-2023 University of Cambridge.
3849+
Copyright &copy; 1997-2024 University of Cambridge.
38503850
<br>
38513851
<p>
38523852
Return to the <a href="index.html">PCRE2 index page</a>.

doc/html/pcre2posix.html

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -207,7 +207,7 @@ <h1>pcre2posix man page</h1>
207207
</P>
208208
<P>
209209
In the absence of these flags, no options are passed to the native function.
210-
This means the the regex is compiled with PCRE2 default semantics. In
210+
This means that the regex is compiled with PCRE2 default semantics. In
211211
particular, the way it handles newline characters in the subject string is the
212212
Perl way, not the POSIX way. Note that setting PCRE2_MULTILINE has only
213213
<i>some</i> of the effects specified for REG_NEWLINE. It does not affect the way
@@ -370,9 +370,9 @@ <h1>pcre2posix man page</h1>
370370
</P>
371371
<br><a name="SEC10" href="#TOC1">REVISION</a><br>
372372
<P>
373-
Last updated: 14 November 2023
373+
Last updated: 19 January 2024
374374
<br>
375-
Copyright &copy; 1997-2023 University of Cambridge.
375+
Copyright &copy; 1997-2024 University of Cambridge.
376376
<br>
377377
<p>
378378
Return to the <a href="index.html">PCRE2 index page</a>.

doc/html/pcre2test.html

Lines changed: 13 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -90,14 +90,14 @@ <h1>pcre2test man page</h1>
9090
unless you really want that action.
9191
</P>
9292
<P>
93-
The input is processed using using C's string functions, so must not
94-
contain binary zeros, even though in Unix-like environments, <b>fgets()</b>
95-
treats any bytes other than newline as data characters. An error is generated
96-
if a binary zero is encountered. By default subject lines are processed for
97-
backslash escapes, which makes it possible to include any data value in strings
98-
that are passed to the library for matching. For patterns, there is a facility
99-
for specifying some or all of the 8-bit input characters as hexadecimal pairs,
100-
which makes it possible to include binary zeros.
93+
The input is processed using C's string functions, so must not contain binary
94+
zeros, even though in Unix-like environments, <b>fgets()</b> treats any bytes
95+
other than newline as data characters. An error is generated if a binary zero
96+
is encountered. By default subject lines are processed for backslash escapes,
97+
which makes it possible to include any data value in strings that are passed to
98+
the library for matching. For patterns, there is a facility for specifying some
99+
or all of the 8-bit input characters as hexadecimal pairs, which makes it
100+
possible to include binary zeros.
101101
</P>
102102
<br><b>
103103
Input for the 16-bit and 32-bit libraries
@@ -1543,7 +1543,7 @@ <h1>pcre2test man page</h1>
15431543
If the <b>substitute_callout</b> modifier is set, a substitution callout
15441544
function is set up. The <b>null_context</b> modifier must not be set, because
15451545
the address of the callout function is passed in a match context. When the
1546-
callout function is called (after each substitution), details of the the input
1546+
callout function is called (after each substitution), details of the input
15471547
and output strings are output. For example:
15481548
<pre>
15491549
/abc/g,replace=&#60;$0&#62;,substitute_callout
@@ -1814,9 +1814,8 @@ <h1>pcre2test man page</h1>
18141814
If the strings contain any non-printing characters, they are output as \xhh
18151815
escapes if the value is less than 256 and UTF mode is not set. Otherwise they
18161816
are output as \x{hh...} escapes. See below for the definition of non-printing
1817-
characters. If the <b>aftertext</b> modifier is set, the output for substring
1818-
0 is followed by the the rest of the subject string, identified by "0+" like
1819-
this:
1817+
characters. If the <b>aftertext</b> modifier is set, the output for substring 0
1818+
is followed by the rest of the subject string, identified by "0+" like this:
18201819
<pre>
18211820
re&#62; /cat/aftertext
18221821
data&#62; cataract
@@ -2193,9 +2192,9 @@ <h1>pcre2test man page</h1>
21932192
</P>
21942193
<br><a name="SEC21" href="#TOC1">REVISION</a><br>
21952194
<P>
2196-
Last updated: 11 August 2023
2195+
Last updated: 19 January 2024
21972196
<br>
2198-
Copyright &copy; 1997-2023 University of Cambridge.
2197+
Copyright &copy; 1997-2024 University of Cambridge.
21992198
<br>
22002199
<p>
22012200
Return to the <a href="index.html">PCRE2 index page</a>.

0 commit comments

Comments
 (0)