You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/QnAMaker/Concepts/content-types.md
+3-129Lines changed: 3 additions & 129 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -3,7 +3,7 @@ title: Content types - QnA Maker
3
3
description: Content types include many standard structured documents such as PDF, DOC, and TXT.
4
4
services: cognitive-services
5
5
ms.topic: conceptual
6
-
ms.date: 01/27/2020
6
+
ms.date: 02/24/2020
7
7
---
8
8
9
9
# Content types of documents you can add to a knowledge base
@@ -40,134 +40,8 @@ You can add files to a knowledge base from a public source, or your local file s
40
40
41
41
## Content format guidelines
42
42
43
-
QnA Maker adds file and URL content, converting the content into QnA sets, stored as markdown (`.md`). Content is fit for a knowledge base where the content is organized in some structured form, and is represented in well-defined sections. The sections can further be broken into subsections, or subtopics. Extraction works best on content that has a clear structure with hierarchical headings.
44
-
45
-
QnA Maker identifies sections and subsections and relationships in the content based on visual clues like font size, font style, numbering, colors, etc. Semi-structured content includes manuals, FAQs, guidelines, policies, brochures, flyers, and many other types of files.
46
-
47
-
## FAQ URLs
48
-
49
-
QnA Maker can support FAQ web pages in 3 different forms: Plain FAQ pages, FAQ pages with links, FAQ pages with a Topics Homepage.
50
-
51
-
52
-
### Plain FAQ pages
53
-
54
-
This is the most common type of FAQ page, in which the answers immediately follow the questions in the same page.
55
-
56
-
Below is an example of a plain FAQ page:
57
-
58
-

59
-
60
-
61
-
### FAQ pages with links
62
-
63
-
In this type of FAQ page, questions are aggregated together and are linked to answers that are either in different sections of the same page, or in different pages.
64
-
65
-
Below is an example of an FAQ page with links in sections that are on the same page:
66
-
67
-

68
-
69
-
70
-
### FAQ pages with a Topics homepage
71
-
72
-
This type of FAQ has a home page with the Topics where each Topic is a link to its relevant QnAs in a different page. Here, QnA Maker crawls all the linked pages to extract the corresponding questions & answers.
73
-
74
-
Below is an example of an FAQ page where a topics homepage has links to FAQ sections in different pages.
75
-
76
-

77
-
78
-
79
-
### Support Urls
80
-
81
-
QnA Maker can process semi-structured support web pages, such as web articles that would describe how to perform a given task, how to diagnose and resolve a given problem, and what are the best practices for a given process. Extraction works best on content that has a clear structure with hierarchical headings.
82
-
83
-
> [!NOTE]
84
-
> Extraction for support articles is a new feature and is in early stages. It works best for simple pages, that are well structured, and do not contain complex headers/footers.
85
-
86
-

87
-
88
-
89
-
## PDF/ DOC files
90
-
91
-
QnA Maker can process semi-structured content in a PDF or DOC file, and convert it into QnAs. A good file that can be extracted well is one where content is organized in some structured form, and is represented in well-defined sections. The sections can further be broken inTO subsections, or subtopics. Extraction works best on documents that have a clear structure with hierarchical headings.
92
-
93
-
QnA Maker identifies sections and subsections and relationships in the file based on visual clues like font size, font style, numbering, colors, etc. Semi-structured PDF or DOC files could be Manuals, FAQs, Guidelines, Policies, Brochures, Flyers, and many other types of files. Below are some example types of these files.
94
-
95
-
### Product manuals
96
-
97
-
A manual is typically guidance material that accompanies a product. It helps the user to set up, use, maintain, and troubleshoot the product. When QnA Maker processes a manual, it extracts the headings and subheadings as questions and the subsequent content as answers. See an example [here](https://download.microsoft.com/download/2/9/B/29B20383-302C-4517-A006-B0186F04BE28/surface-pro-4-user-guide-EN.pdf).
98
-
99
-
Below is an example of a manual with an index page, and hierarchical content
100
-
101
-

102
-
103
-
> [!NOTE]
104
-
> Extraction works best on manuals that have a table of contents and/or an index page, and a clear structure with hierarchical headings.
105
-
106
-
### Brochures, guidelines, papers, and other files
107
-
108
-
Many other types of documents can also be processed to generate QA pairs, provided they have a clear structure and layout. These include: Brochures, guidelines, reports, white papers, scientific papers, policies, books, etc. See an example [here](https://qnamakerstore.blob.core.windows.net/qnamakerdata/docs/Manage%20Azure%20Blob%20Storage.docx).
109
-
110
-
Below is an example of a semi-structured doc, without an index:
The format for structured Question-Answers in DOC files, is in the form of alternating Questions and Answers per line, one question per line followed by its answer in the following line, as shown below:
117
-
118
-
```text
119
-
Question1
120
-
121
-
Answer1
122
-
123
-
Question2
124
-
125
-
Answer2
126
-
```
127
-
128
-
Below is an example of a structured QnA word document:
129
-
130
-

131
-
132
-
## Structured *TXT*, *TSV* and *XLS* Files
133
-
134
-
QnAs in the form of structured *.txt*, *.tsv* or *.xls* files can also be uploaded to QnA Maker to create or augment a knowledge base. These can either be plain text, or can have content in RTF or HTML.
Any additional columns in the source file are ignored.
142
-
143
-
### Example of structured Excel file
144
-
145
-
Below is an example of a structured QnA *.xls* file, with HTML content:
146
-
147
-

148
-
149
-
### Example of alternate questions for single answer in Excel file
150
-
151
-
Below is an example of a structured QnA *.xls* file, with several alternate questions for a single answer:
152
-
153
-

154
-
155
-
After the file is imported, the question-and-answer pair is in the knowledge base as shown below:
156
-
157
-

158
-
159
-
## Formatting considerations
160
-
161
-
After importing a file or URL, QnA Maker converts and stores your content in the [markdown format](https://en.wikipedia.org/wiki/Markdown). The conversion process adds new lines in the text, such as `\n\n`. A knowledge of the markdown format helps you to understand the converted content and manage your knowledge base content.
162
-
163
-
If you add or edit your content directly in your knowledge base, use **markdown formatting** to create rich text content or change the markdown format content that is already in the answer. QnA Maker supports much of the markdown format to bring rich text capabilities to your content. However, the client application, such as a chat bot may not support the same set of markdown formats. It is important to test the client application's display of answers.
164
-
165
-
Learn more from the [QnA Maker markdown reference documentation](../reference-markdown-format.md).
166
-
167
-
## Testing your Markdown
168
-
169
-
Use the **[CommonMark](https://commonmark.org/help/tutorial/index.html)** tutorial to validate your Markdown. The tutorial has a **Try it** feature for quick copy/paste validation.
43
+
Learn more about the [format guidelines](../reference-document-format-guidelines.md) for the different files.
170
44
171
45
## Next steps
172
46
173
-
*Understand how to design and manage [question and answer (QnA) sets](question-answer-set.md)
47
+
Understand what information is stored in a [question and answer (QnA) set](question-answer-set.md).
Copy file name to clipboardExpand all lines: articles/cognitive-services/QnAMaker/How-To/add-sharepoint-datasources.md
+2-1Lines changed: 2 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,7 +15,8 @@ If the QnA Maker knowledge base manager is not the Active Directory manager, you
15
15
16
16
## Prerequisites
17
17
18
-
QnA Maker uses Microsoft Graph for permissions. If your SharePoint is on-premises, you won't be able to extract from SharePoint because Microsoft Graph won't be able to determine permissions.
18
+
* Cloud-based SharePoint - QnA Maker uses Microsoft Graph for permissions. If your SharePoint is on-premises, you won't be able to extract from SharePoint because Microsoft Graph won't be able to determine permissions.
19
+
* URL format - QnA Maker only supports SharePoint urls which are generated for sharing and are of format https://*.sharepoint.com
0 commit comments