Skip to content

Commit fbb93c9

Browse files
committed
Updates document translation matches and rectifies some line breaks in api.rst
1 parent ad17ecd commit fbb93c9

File tree

3 files changed

+16
-12
lines changed

3 files changed

+16
-12
lines changed
3.74 KB
Binary file not shown.

docs/locales/ja/LC_MESSAGES/document.po

Lines changed: 14 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1085,20 +1085,18 @@ msgid "**Class API**"
10851085
msgstr "**クラスAPI**"
10861086

10871087
#: ../../document.rst:174 b482340231f44738bb9cf3c571f4f7d5
1088-
#, fuzzy
10891088
msgid "Create a ``Document`` object."
1090-
msgstr "*Document* オブジェクトを作成します。"
1089+
msgstr "``Document`` オブジェクトを作成します。"
10911090

10921091
#: ../../document.rst:176 1c55311b5fa64850a06598e2a8ce9a7c
10931092
msgid "With default parameters, a **new empty PDF** document will be created."
10941093
msgstr "デフォルトのパラメータを使用すると、**新しい空の PDF** ドキュメントが作成されます。"
10951094

10961095
#: ../../document.rst:177 22c55c2d255f491f89be960f4babb5e5
10971096
msgid "If ``stream`` is given, then the document is created from memory."
1098-
msgstr ""
1097+
msgstr "``stream`` が指定された場合、ドキュメントはメモリから作成されます。"
10991098

11001099
#: ../../document.rst:178 3a8f75233dd54040a52ad9ae44d26e74
1101-
#, fuzzy
11021100
msgid ""
11031101
"If ``stream`` is `None`, then a document is created from the file given "
11041102
"by ``filename``."
@@ -1154,15 +1152,21 @@ msgid ""
11541152
"``filetype`` parameter is ignored, except when content inspection was "
11551153
"unsuccessful. This is regularly the case for plain text types like "
11561154
"\"txt\", \"html\", \"xml\" etc. with a wrong or missing file extension."
1157-
msgstr ""
1155+
msgstr ""ファイルパスを含むUTF-8文字列または ``pathlib.Path`` オブジェクトです。"
1156+
"ドキュメントタイプは常にファイルの内容から判定されます。"
1157+
"``filetype`` パラメータは、内容の検査が失敗した場合を除いて無視されます。"
1158+
"これは、誤った、または欠落したファイル拡張子を持つ "
1159+
"\"txt\"\"html\"\"xml\" などのプレーンテキストタイプでよくあるケースです。""
11581160

11591161
#: ../../document.rst:182 42a021002ca24463805b8c682c3ef247
11601162
msgid ""
11611163
"A memory area containing file data. The document type is always detected "
11621164
"from the data content. The ``filetype`` parameter is ignored, except when"
11631165
" content inspection was unsuccessful. This is regularly the case for "
11641166
"plain text types like \"txt\", \"html\", \"xml\" etc."
1165-
msgstr ""
1167+
msgstr ""ファイルデータを含むメモリ領域です。ドキュメントタイプは常にデータの内容から検出されます。"
1168+
"``filetype`` パラメータは、内容の検査が失敗した場合を除いて無視されます。"
1169+
"これは、\"txt\"\"html\"\"xml\" などのプレーンテキストタイプでよくあるケースです。""
11661170

11671171
#: ../../document.rst:184 df0bacd926f548f7993192643669e3da
11681172
msgid ""
@@ -1171,10 +1175,12 @@ msgid ""
11711175
" etc. cannot be disambiguated by their content. When such files are "
11721176
"provided in memory or being provided with the wrong file extension, this "
11731177
"parameter **must** be used."
1174-
msgstr ""
1178+
msgstr ""ドキュメントのタイプを指定する文字列です。これは、ファイル内容の検査が失敗した場合にのみ必要です。"
1179+
"\"txt\"\"html\"\"xml\" などのテキストタイプは、その内容によって識別することができません。"
1180+
"このようなファイルがメモリ内で提供される場合、または誤ったファイル拡張子で提供される場合、"
1181+
"このパラメータを **必ず** 使用してください。""
11751182

11761183
#: ../../document.rst:186 78759e58f1034e9b8808547679e5f0c7
1177-
#, fuzzy
11781184
msgid ""
11791185
"a rectangle specifying the desired page size. This parameter is only "
11801186
"meaningful for documents with a variable page layout (\"reflowable\" "

docs/pymupdf4llm/api.rst

Lines changed: 2 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -224,8 +224,7 @@ The |PyMuPDF4LLM| API
224224

225225
Return appropriate markdown header prefix. This is either "" or a string of "#" characters followed by a space.
226226

227-
Given a text span from a "dict"" extraction, determine the
228-
markdown header prefix string of 0 to n concatenated '#' characters.
227+
Given a text span from a "dict" extraction, determine the markdown header prefix string of 0 to n concatenated '#' characters.
229228

230229
:arg dict span: a dictionary containing the text span information. This is the same dictionary as returned by `page.get_text("dict")`.
231230

@@ -332,8 +331,7 @@ This user function uses the document's Table of Contents -- under the assumption
332331

333332
Create an object which uses the document's Table of Contents (TOC) to determine header levels. Upon object creation, the table of contents is read via the `Document.get_toc()` method. The TOC data is then used to determine header levels in the `to_markdown()` method.
334333

335-
This is an alternative to :class:`IdentifyHeaders`. Instead of running through the full document to identify font sizes, it uses the document's Table Of
336-
Contents (TOC) to identify headers on pages. Like :class:`IdentifyHeaders`, this also is no guarantee to find headers, but for well-built Table of Contents, there is a good chance for more correctly identifying header lines on document pages than the font-size-based approach.
334+
This is an alternative to :class:`IdentifyHeaders`. Instead of running through the full document to identify font sizes, it uses the document's Table Of Contents (TOC) to identify headers on pages. Like :class:`IdentifyHeaders`, this also is no guarantee to find headers, but for well-built Table of Contents, there is a good chance for more correctly identifying header lines on document pages than the font-size-based approach.
337335

338336
It also has the advantage of being much faster than the font-size-based approach, as it does not execute a full document scan or even access any of the document pages.
339337

0 commit comments

Comments
 (0)