Skip to content

Commit 1b3e450

Browse files
committed
Update changelog
1 parent e48b71c commit 1b3e450

File tree

1 file changed

+349
-45
lines changed

1 file changed

+349
-45
lines changed

CHANGELOG.md

Lines changed: 349 additions & 45 deletions
Original file line numberDiff line numberDiff line change
@@ -1,47 +1,351 @@
1-
Poe the Poet - A task runner that works well with poetry.
2-
version 0.31.1
3-
4-
Error: Unrecognised task 'semversioner'
5-
6-
Usage:
7-
poe [global options] task [task arguments]
8-
9-
Global options:
10-
-h, --help Show this help page and exit
11-
--version Print the version and exit
12-
-v, --verbose Increase command output (repeatable)
13-
-q, --quiet Decrease command output (repeatable)
14-
-d, --dry-run Print the task contents but don't actually run it
15-
-C PATH, --directory PATH
16-
Specify where to find the pyproject.toml
17-
-e EXECUTOR, --executor EXECUTOR
18-
Override the default task executor
19-
--ansi Force enable ANSI output
20-
--no-ansi Force disable ANSI output
21-
22-
Configured tasks:
23-
semversioner_add
24-
coverage_report
25-
check_format
26-
fix
27-
fix_unsafe
28-
test_unit
29-
test_integration
30-
test_smoke
31-
test_notebook
32-
test_verbs
33-
index
34-
update
35-
init
36-
query
37-
prompt_tune
38-
test_only
39-
serve_docs
40-
build_docs
41-
release
42-
convert_docsite_notebooks
43-
format
44-
check
45-
test
1+
# Changelog
2+
Note: version releases in the 0.x.y range may introduce breaking changes.
463

4+
## 2.4.0
475

6+
- minor: Allow injection of custom pipelines.
7+
- minor: Refactored StorageFactory to use a registration-based approach
8+
- patch: Fix default values for tpm and rpm limiters on embeddings
9+
- patch: Update typer.
10+
- patch: cleaned up logging to follow python standards.
11+
12+
## 2.3.0
13+
14+
- minor: Remove Dynamic Max Retries support. Refactor typer typing in cli interface
15+
- minor: Update fnllm to latest. Update default graphrag configuration
16+
- patch: A few fixes and enhancements for better reuse and flow.
17+
- patch: Add full llm response to LLM PRovider output
18+
- patch: Fix Drift Reduce Response for non streaming calls
19+
- patch: Fix global search prompt to include missing formatting key
20+
- patch: Upgrade pyarrow dependency to >=17.0.0 to fix CVE-2024-52338
21+
22+
## 2.2.1
23+
24+
- patch: Fix Community Report prompt tuning response
25+
- patch: Fix graph creation missing edge weights.
26+
- patch: Update as workflows
27+
28+
## 2.2.0
29+
30+
- minor: Support OpenAI reasoning models.
31+
- patch: Add option to snapshot raw extracted graph tables.
32+
- patch: Added batching logic to the prompt tuning autoselection embeddings workflow
33+
- patch: Align config classes and docs better.
34+
- patch: Align embeddings table loading with configured fields.
35+
- patch: Brings parity with our latest NLP extraction approaches.
36+
- patch: Fix fnllm to 0.2.3
37+
- patch: Fixes to basic search.
38+
- patch: Update llm args for consistency.
39+
- patch: add vector store integration tests
40+
41+
## 2.1.0
42+
43+
- minor: Add support for JSON input files.
44+
- minor: Updated the prompt tunning client to support csv-metadata injection and updated output file types to match the new naming convention.
45+
- patch: Add check for custom model types while config loading
46+
- patch: Adds general-purpose pipeline run state object.
47+
48+
## 2.0.0
49+
50+
- major: Add children to communities to avoid re-compute.
51+
- major: Reorganize and rename workflows and their outputs.
52+
- major: Rework API to accept callbacks.
53+
- minor: Add LMM Manager and Factory, to support provider registration
54+
- minor: Add NLP graph extraction.
55+
- minor: Add pipeline_start and pipeline_end callbacks.
56+
- minor: Move embeddings snapshots to the workflow runner.
57+
- minor: Remove config inheritance, hydration, and automatic env var overlays.
58+
- minor: Rework the update output storage structure.
59+
- patch: Add caching to NLP extractor.
60+
- patch: Add vector store id reference to embeddings config.
61+
- patch: Export NLP community reports prompt.
62+
- patch: Fix DRIFT search on Azure AI Search.
63+
- patch: Fix StopAsyncIteration catch.
64+
- patch: Fix missing embeddings workflow in FastGraphRAG.
65+
- patch: Fix proper use of n_depth for drift search
66+
- patch: Fix report generation recursion.
67+
- patch: Fix summarization over large datasets for inc indexing. Fix relationship summarization
68+
- patch: Optimize data iteration by removing some iterrows from code
69+
- patch: Patch json mode for community reports
70+
- patch: Properly increment text unit IDs during updates.
71+
- patch: Refactor config defaults from constants to type-safe, hierarchical dataclass.
72+
- patch: Require explicit azure auth settings when using AOI.
73+
- patch: Separates graph pruning for differential usage.
74+
- patch: Tuck flow functions under their workflow modules.
75+
- patch: Update fnllm. Remove unused libs.
76+
- patch: Use ModelProvider for query module
77+
- patch: Use shared schema for final outputs.
78+
- patch: add dynamic retry logic.
79+
- patch: add option to prepend metadata into chunks
80+
- patch: cleanup query code duplication.
81+
- patch: implemented multi-index querying for api layer
82+
- patch: multi index query cli support
83+
- patch: remove unused columns and change property document_attribute_columns to metadata
84+
- patch: update multi-index query to support new workflows
85+
86+
## 1.2.0
87+
88+
- minor: Add Drift Reduce response and streaming endpoint
89+
- minor: add cosmosdb vector store
90+
- patch: Fix example notebooks
91+
- patch: Set default rate limits.
92+
- patch: unit tests for text_splitting
93+
94+
## 1.2.0
95+
96+
- patch: Basic Rag minor fix
97+
98+
## 1.1.1
99+
100+
- patch: Fix a bug on creating community hierarchy for dynamic search
101+
- patch: Increase LOCAL_SEARCH_COMMUNITY_PROP to 15%
102+
103+
## 1.1.0
104+
105+
- minor: Make gleanings independent of encoding
106+
- minor: Remove DataShaper (first steps).
107+
- minor: Remove old pipeline runner.
108+
- minor: new search implemented as a new option for the api
109+
- patch: Fix gleanings loop check
110+
- patch: Implement cosmosdb storage option for cache and output
111+
- patch: Move extractor code to co-locate with operations.
112+
- patch: Remove config input models.
113+
- patch: Ruff update
114+
- patch: Simplify and streamline internal config.
115+
- patch: Simplify callbacks model.
116+
- patch: Streamline flows.
117+
- patch: fix instantiation of storage classes.
118+
119+
## 1.0.1
120+
121+
- patch: Fix encoding model config parsing
122+
- patch: Fix exception on error callbacks
123+
- patch: Manage llm instances inside a cached singleton. Check for empty dfs after entity/relationship extraction
124+
- patch: Respect encoding_model option
125+
126+
## 1.0.0
127+
128+
- patch: Add Parent id to communities data model
129+
- patch: Add migration notebook.
130+
- patch: Create separate community workflow, collapse subflows.
131+
- patch: Dependency Updates
132+
- patch: cleanup and refactor factory classes.
133+
134+
## 0.9.0
135+
136+
- minor: Refactor graph creation.
137+
- patch: Dependency updates
138+
- patch: Fix Global Search with dynamic Community selection bug
139+
- patch: Fix question gen.
140+
- patch: Optimize Final Community Reports calculation and stabilize cache
141+
- patch: miscellaneous code cleanup and minor changes for better alignment of style across the codebase.
142+
- patch: replace llm package with fnllm
143+
- patch: replaced md5 hash with sha256
144+
- patch: replaced md5 hash with sha512
145+
- patch: update API and add a demonstration notebook
146+
147+
## 0.5.0
148+
149+
- minor: Data model changes.
150+
- patch: Add Parquet as part of the default emitters when not pressent
151+
- patch: Centralized prompts and export all for easier injection.
152+
- patch: Cleanup of artifact outputs/schemas.
153+
- patch: Config and docs updates.
154+
- patch: Implement dynamic community selection to global search
155+
- patch: fix autocompletion of existing files/directory paths.
156+
- patch: move import statements out of init files
157+
158+
## 0.4.1
159+
160+
- patch: Add update cli entrypoint for incremental indexing
161+
- patch: Allow some CI/CD jobs to skip PRs dedicated to doc updates only.
162+
- patch: Fix a file paths issue in the viz guide.
163+
- patch: Fix optional covariates update in incremental indexing
164+
- patch: Raise error on empty deltas for inc indexing
165+
- patch: add visualization guide to doc site
166+
- patch: fix streaming output error
167+
168+
## 0.4.0
169+
170+
- minor: Add Incremental Indexing
171+
- minor: Added DRIFT graph reasoning query module
172+
- minor: embeddings moved to a different workflow
173+
- patch: Add DRIFT search cli and example notebook
174+
- patch: Add config for incremental updates
175+
- patch: Add embeddings to subflow.
176+
- patch: Add naive community merge using time period
177+
- patch: Add relationship merge
178+
- patch: Add runtime-only storage option.
179+
- patch: Add text units update
180+
- patch: Allow empty workflow returns to avoid disk writing.
181+
- patch: Apply pandas optimizations to create final entities
182+
- patch: Calculate new inputs and deleted inputs on update
183+
- patch: Collapse covariates flow.
184+
- patch: Collapse create-base-entity-graph.
185+
- patch: Collapse create-final-community-reports.
186+
- patch: Collapse create-final-documents.
187+
- patch: Collapse create-final-entities.
188+
- patch: Collapse create-final-nodes.
189+
- patch: Collapse create_base_documents.
190+
- patch: Collapse create_base_text_units.
191+
- patch: Collapse create_final_relationships.
192+
- patch: Collapse entity extraction.
193+
- patch: Collapse entity summarize.
194+
- patch: Collapse intermediate workflow outputs.
195+
- patch: Dependency updates
196+
- patch: Extract DataShaper-less flows.
197+
- patch: Fix Community ID loading for DRIFT search over existing indexes
198+
- patch: Fix embeddings faulty assignments
199+
- patch: Fix init defaults for vector store and drift img in docs
200+
- patch: Fix nested json parsing
201+
- patch: Fix some edge cases on Drift Search over small input sets
202+
- patch: Fix var name for embedding
203+
- patch: Merge existing and new entities, updating values accordingly
204+
- patch: Merge text_embed into create-final-relationships subflow.
205+
- patch: Move embedding verbs to operations.
206+
- patch: Moving verbs around.
207+
- patch: Optimize Create Base Documents subflow
208+
- patch: Optimize text unit relationship count
209+
- patch: Perf optimizations in map_query_to_entities()
210+
- patch: Remove aggregate_df from final coomunities and final text units
211+
- patch: Remove duplicated relationships and nodes
212+
- patch: Remove unused column from final entities
213+
- patch: Reorganized api,reporter,callback code into separate components. Defined debug profiles.
214+
- patch: Small cleanup in community context history building
215+
- patch: Transient entity graph and snapshotting.
216+
- patch: Update Incremental Indexing to new embeddings workflow
217+
- patch: Use mkdocs for documentation
218+
- patch: add backwards compatibility patch to vector store.
219+
- patch: add-autogenerated-cli-docs
220+
- patch: fix docs image path
221+
- patch: refactor use of vector stores and update support for managed identity
222+
- patch: remove redundant error-handling code from global-search
223+
- patch: reorganize cli layer
224+
225+
## 0.3.6
226+
227+
- patch: Collapse create_final_relationships.
228+
- patch: Dependency update and cleanup
229+
230+
## 0.3.5
231+
232+
- patch: Add compound verbs with tests infra.
233+
- patch: Collapse create_final_communities.
234+
- patch: Collapse create_final_text_units.
235+
- patch: Covariate verb collapse.
236+
- patch: Fix duplicates in community context builder
237+
- patch: Fix prompt tune output path
238+
- patch: Fix seed hardcoded init
239+
- patch: Fix seeded random gen on clustering
240+
- patch: Improve logging.
241+
- patch: Set default values for cli parameters.
242+
- patch: Use static output directories.
243+
244+
## 0.3.4
245+
246+
- patch: Deep copy txt units on local search to avoid race conditions
247+
- patch: Fix summarization including empty descriptions
248+
249+
## 0.3.3
250+
251+
- patch: Add entrypoints for incremental indexing
252+
- patch: Clean up and organize run index code
253+
- patch: Consistent config loading. Resolves #99 and Resolves #1049
254+
- patch: Fix circular dependency when running prompt tune api directly
255+
- patch: Fix default settings for embedding
256+
- patch: Fix img for auto tune
257+
- patch: Fix img width
258+
- patch: Fixed a bug in prompt tuning process
259+
- patch: Refactor text unit build at local search
260+
- patch: Update Prompt Tuning docs
261+
- patch: Update create_pipeline_config.py
262+
- patch: Update prompt tune command in docs
263+
- patch: add querying from azure blob storage
264+
- patch: fix setting base_dir to full paths when not using file system.
265+
- patch: fix strategy config in entity_extraction
266+
267+
## 0.3.2
268+
269+
- patch: Add context data to query API responses.
270+
- patch: Add missing config parameter documentation for prompt tuning
271+
- patch: Add neo4j community notebook
272+
- patch: Ensure entity types to be str when running prompt tuning
273+
- patch: Fix weight casting during graph extraction
274+
- patch: Patch "past" dependency issues
275+
- patch: Update developer guide.
276+
- patch: Update query type hints.
277+
- patch: change-lancedb-placement
278+
279+
## 0.3.1
280+
281+
- patch: Add preflight check to check LLM connectivity.
282+
- patch: Add streaming support for local/global search to query cli
283+
- patch: Add support for both float and int on schema validation for community report generation
284+
- patch: Avoid running index on gh-pages publishing
285+
- patch: Implement Index API
286+
- patch: Improves filtering for data dir inferring
287+
- patch: Update to nltk 3.9.1
288+
289+
## 0.3.0
290+
291+
- minor: Implement auto templating API.
292+
- minor: Implement query engine API.
293+
- patch: Fix file dumps using json for non ASCII chars
294+
- patch: Stabilize smoke tests for query context building
295+
- patch: fix query embedding
296+
- patch: fix sort_context & max_tokens params in verb
297+
298+
## 0.2.2
299+
300+
- patch: Add a check if there is no community record added in local search context
301+
- patch: Add sepparate workflow for Python Tests
302+
- patch: Docs updates
303+
- patch: Run smoke tests on 4o
304+
305+
## 0.2.1
306+
307+
- patch: Added default columns for vector store at create_pipeline_config. No change for other cases.
308+
- patch: Change json parsing error in the map step of global search to warning
309+
- patch: Fix Local Search breaking when loading Embeddings input. Defaulting overwrite to True as in the rest of the vector store config
310+
- patch: Fix json parsing when LLM returns faulty responses
311+
- patch: Fix missing community reports and refactor community context builder
312+
- patch: Fixed a bug that erased the vector database, added a new parameter to specify the config file path, and updated the documentation accordingly.
313+
- patch: Try parsing json before even repairing
314+
- patch: Update Prompt Tuning meta prompts with finer examples
315+
- patch: Update default entity extraction and gleaning prompts to reduce hallucinations
316+
- patch: add encoding-model to entity/claim extraction config
317+
- patch: add encoding-model to text chunking config
318+
- patch: add user prompt to history-tracking llm
319+
- patch: update config reader to allow for zero gleans
320+
- patch: update config-reader to allow for empty chunk-by arrays
321+
- patch: update history-tracking LLm to use 'assistant' instead of 'system' in output history.
322+
- patch: use history argument in hash key computation; add history input to cache data
323+
324+
## 0.2.0
325+
326+
- minor: Add content-based KNN for selecting prompt tune few shot examples
327+
- minor: Add dynamic community report rating to the prompt tuning engine
328+
- patch: Add Minute-based Rate Limiting and fix rpm, tpm settings
329+
- patch: Add N parameter support
330+
- patch: Add cli flag to overlay default values onto a provided config.
331+
- patch: Add exception handling on file load
332+
- patch: Add language support to prompt tuning
333+
- patch: Add llm params to local and global search
334+
- patch: Fix broken prompt tuning link on docs
335+
- patch: Fix delta none on query calls
336+
- patch: Fix docsite base url
337+
- patch: Fix encoding model parameter on prompt tune
338+
- patch: Fix for --limit exceeding the dataframe length
339+
- patch: Fix for Ruff 0.5.2
340+
- patch: Fixed an issue where base OpenAI embeddings can't work with Azure OpenAI LLM
341+
- patch: Modify defaults for CHUNK_SIZE, CHUNK_OVERLAP and GLEANINGS to reduce time and LLM calls
342+
- patch: fix community_report doesn't work in settings.yaml
343+
- patch: fix llm response content is None in query
344+
- patch: fix the organization parameter is ineffective during queries
345+
- patch: remove duplicate file read
346+
- patch: support non-open ai model config to prompt tune
347+
- patch: use binary io processing for all file io operations
348+
349+
## 0.1.0
350+
351+
- minor: Initial Release

0 commit comments

Comments
 (0)