You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
"descriptions": "Structured information from images.",
30
+
"descriptions": "Description of image.",
31
31
"fields": {
32
-
"Title": {
32
+
"Description": {
33
33
"type": "string",
34
-
"description": "Title for the image (either taken from the image directly or a good short title based off content)",
35
-
},
36
-
"ImageType": {
37
-
"type": "string",
38
-
"description": "The type of image.",
39
-
"kind": "classify",
40
-
"enum": [
41
-
"chart",
42
-
"diagram",
43
-
"table",
44
-
"figure",
45
-
"photo",
46
-
"screenshot",
47
-
"logo",
48
-
"icon",
49
-
"map",
50
-
"infographic",
51
-
"other",
52
-
],
53
-
},
54
-
"MarkdownDescription": {
55
-
"type": "string",
56
-
"description": "Description of the image in markdown format. Start with a 2-sentence summary. If the image is a chart, diagram, or table, include the underlying data in tabular markdown format, with valid syntax and accurate numbers. If the image is a chart, describe any axis or legends.",
34
+
"description": "Description of the image. If the image has a title, start with the title. Include a 2-sentence summary. If the image is a chart, diagram, or table, include the underlying data in an HTML table tag, with accurate numbers. If the image is a chart, describe any axis or legends. The only allowed HTML tags are the table/thead/tr/td/tbody tags.",
Copy file name to clipboardExpand all lines: app/backend/prepdocslib/page.py
+5-1Lines changed: 5 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -3,7 +3,7 @@ class Page:
3
3
A single page from a document
4
4
5
5
Attributes:
6
-
page_num (int): Page number
6
+
page_num (int): Page number (0-indexed)
7
7
offset (int): If the text of the entire Document was concatenated into a single string, the index of the first character on the page. For example, if page 1 had the text "hello" and page 2 had the text "world", the offset of page 2 is 5 ("hellow")
Copy file name to clipboardExpand all lines: docs/data_ingestion.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -69,7 +69,7 @@ A [recent change](https://github.com/Azure-Samples/azure-search-openai-demo/pull
69
69
70
70
You may want to remove documents from the index. For example, if you're using the sample data, you may want to remove the documents that are already in the index before adding your own.
71
71
72
-
To remove all documents, use `scripts/prepdocs.sh --removeall` or `scripts/prepdocs.ps1 --removeall`.
72
+
To remove all documents, use `./scripts/prepdocs.sh --removeall` or `./scripts/prepdocs.ps1 --removeall`.
73
73
74
74
You can also remove individual documents by using the `--remove` flag. Open either `scripts/prepdocs.sh` or `scripts/prepdocs.ps1` and replace `/data/*` with `/data/YOUR-DOCUMENT-FILENAME-GOES-HERE.pdf`. Then run `scripts/prepdocs.sh --remove` or `scripts/prepdocs.ps1 --remove`.
0 commit comments