Replies: 1 comment
-
@maxin9966 ContextGem provides Aspects API that is designed to extract text segments (topics, sections) from a document. But using Aspects API right away requires knowing what aspects you need to extract, since each Aspect instance requires Quick example:
Since you are working with long documents (books, academic papers, etc.), you will need to configure the extraction params to account for the content length. Check out the Dealing with Long Documents guide. For instance, you will probably need to adjust Hope this helps. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Objective:
I want to use the contextgem project to semantically segment a piece of text, preferably with structured output.
The core requirement is segmentation—for example, if the input is a book, the system should extract elements like the book title, table of contents, chapters, subsections, etc.
Text Definition:
The input text could be:
Problem:
Since the uploaded document content is unknown in advance, I need a generic semantic segmentation logic to preprocess the text. This will facilitate downstream tasks like summarization and aggregation for individual segments.
Beta Was this translation helpful? Give feedback.
All reactions