README.md (10 additions & 9 deletions)
@@ -23,15 +23,17 @@
 - [Troubleshooting](docs/troubleshooting.md)
 - [Licence](docs/LICENCE.md)
 
-pgEdge Document Loader is a command-line tool for loading documents from various formats into PostgreSQL databases. Full documentation is available at:
+pgEdge Document Loader is a command-line tool for loading documents from various formats into PostgreSQL databases. Full documentation is available [here](https://docs.pgedge.com/pgedge-docloader/).
 
 The pgEdge Document Loader automatically converts documents (HTML, Markdown, reStructuredText, and SGML/DocBook) to Markdown format and loads them into a PostgreSQL database with extracted metadata.
 
 **Features**
 
-- **Multiple Format Support**: HTML, Markdown, reStructuredText, and SGML/DocBook
+The pgEdge Document Loader automatically converts documents (HTML, Markdown, reStructuredText, and DocBook SGML/XML) to Markdown format and loads them into a PostgreSQL database with extracted metadata.
+
+**Features**
+
+- **Multiple Format Support**: HTML, Markdown, reStructuredText, and DocBook SGML/XML
 - **Automatic Conversion**: All formats converted to Markdown
docs/index.md (4 additions & 3 deletions)
@@ -7,9 +7,10 @@ before storing them in the database.
 pgEdge Document Loader supports the following document formats:
 
-- **HTML** (`.html`, `.htm`) - Extracts the document title from `<title>` tag
-- **Markdown** (`.md`) - Extracts the title from first `#` heading
-- **reStructuredText** (`.rst`) - Extracts the title from underlined headings
+- **HTML** (`.html`, `.htm`) - Extracts the document title from `<title>` tags.
+- **Markdown** (`.md`) - Extracts the title from first `#` headings.
+- **reStructuredText** (`.rst`) - Extracts the title from underlined headings.
+- **DocBook SGML/XML** (`.sgml`, `.xml`) - Extracts the title from `<title>` or `<refentrytitle>` tags (PostgreSQL-style reference pages use `<refentrytitle>`).
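
The per-format title rules listed above can be illustrated with a rough sketch. This is not the pgEdge Document Loader's actual implementation: the function name `extract_title`, the regular expressions, and the extension-based dispatch are all assumptions made for illustration, and a real converter would likely use proper parsers rather than regexes.

```python
import re
from pathlib import Path
from typing import Optional

# Characters commonly used for reStructuredText heading underlines.
RST_ADORNMENTS = set("=-~^\"'`#*+:._")


def extract_title(filename: str, text: str) -> Optional[str]:
    """Hypothetical helper: best-effort title lookup keyed on file extension."""
    ext = Path(filename).suffix.lower()

    if ext in (".html", ".htm"):
        # First <title>...</title> element.
        m = re.search(r"<title\b[^>]*>(.*?)</title>", text, re.I | re.S)
        return m.group(1).strip() if m else None

    if ext == ".md":
        # First level-one ATX heading ("# Title"); "## ..." does not match.
        m = re.search(r"^#[ \t]+(.+?)[ \t]*$", text, re.M)
        return m.group(1) if m else None

    if ext == ".rst":
        # A text line followed by a line made of one repeated adornment
        # character that is at least as long as the text.
        lines = text.splitlines()
        for heading, underline in zip(lines, lines[1:]):
            h, u = heading.strip(), underline.strip()
            if h and u and len(set(u)) == 1 and u[0] in RST_ADORNMENTS \
                    and len(u) >= len(h) and h[0] not in RST_ADORNMENTS:
                return h
        return None

    if ext in (".sgml", ".xml"):
        # Whichever of <title> or <refentrytitle> appears first.
        m = re.search(r"<(refentrytitle|title)\b[^>]*>(.*?)</\1>",
                      text, re.I | re.S)
        return m.group(2).strip() if m else None

    return None
```

For example, `extract_title("select.sgml", "<refentrytitle>SELECT</refentrytitle>")` would return `"SELECT"` under these assumptions, while an unrecognized extension returns `None`.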