Commit 95567d3

add versioning entries for 0.1.2 through 0.1.5 (backfilled from git history), with 0.1.5 covering the quote normalization fixes
1 parent e505f8e commit 95567d3

2 files changed: +32 -1 lines changed

CHANGELOG.md

Lines changed: 31 additions & 0 deletions
@@ -5,6 +5,37 @@ All notable changes to this project will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

+## [0.1.5] - 2026-02-25
+
+### Fixed
+
+- Pre-model Unicode normalization of raw OCR text (NFKD decomposition, smart/curly quote replacement) to prevent unescaped double-quotes inside JSON string values
+- General unescaped inline scare-quote repair in `_repair_unescaped_quotes()` as a JSON parse safety net
+
+## [0.1.4] - 2026-02-24
+
+### Added
+
+- Regex-based identifier extraction from raw poster text (DOI, ORCID, arXiv, ROR, Crossref Funder ID)
+- Automatic scheme/schemeURI inference for identifiers
+- Caption ID auto-generation (`fig1`, `fig2`, `table1`, etc.)
+- New `identifiers.py` module with `enrich_json_with_identifiers()`
+
+## [0.1.3] - 2026-02-20
+
+### Fixed
+
+- CLI test fixtures updated for required conference fields
+- Schema sync: publisher mandatory, conferenceYear required
+
+## [0.1.2] - 2026-02-14
+
+### Changed
+
+- Updated model ID from jimnoneill to fairdataihub HuggingFace org
+- Synced prompts and schema with poster_schema v0.1
+- Updated field proportion threshold to 0.5-2.0
+
 ## [0.1.1] - 2026-02-04

 ### Added

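The 0.1.5 entry above describes normalizing raw OCR text before it reaches the model. The sketch below illustrates that idea only and is not the project's implementation: the function name `normalize_ocr_text`, the quote table, and the choice to map every double-quote variant to an apostrophe are assumptions, since the commit does not show the actual code.

```python
import unicodedata

# Hypothetical quote table: the characters and their replacement are assumptions.
# Double-quote variants become apostrophes so that text copied into a JSON
# string value cannot terminate the string early.
_QUOTE_TABLE = str.maketrans({
    "\u201c": "'",  # left double quotation mark
    "\u201d": "'",  # right double quotation mark
    "\u201e": "'",  # double low-9 quotation mark
    "\u00ab": "'",  # left-pointing double angle quotation mark
    "\u00bb": "'",  # right-pointing double angle quotation mark
    "\u2018": "'",  # left single quotation mark
    "\u2019": "'",  # right single quotation mark
    '"': "'",       # plain ASCII double quote
})


def normalize_ocr_text(raw: str) -> str:
    """Apply NFKD decomposition, then replace quote characters."""
    # NFKD expands compatibility characters (ligatures, fullwidth forms).
    # It leaves curly quotes untouched, so they are handled by the table.
    decomposed = unicodedata.normalize("NFKD", raw)
    return decomposed.translate(_QUOTE_TABLE)


if __name__ == "__main__":
    print(normalize_ocr_text("\u201cdeep\u201d \ufb01t"))
```

Running NFKD first means fullwidth quotation marks are folded to ASCII before the table is applied, so a single table covers both cases. Note that NFKD also splits accented letters into a base letter plus combining marks, which may or may not be desirable for author names.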
pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -1,7 +1,7 @@
 [tool.poetry]

 name = "poster2json"
-version = "0.1.4"
+version = "0.1.5"
 description = "Convert scientific posters (PDF/images) to structured JSON metadata using Large Language Models"

 packages = [{ include = "poster2json" }]

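The 0.1.4 changelog entry in this commit mentions regex-based identifier extraction with scheme/schemeURI inference. A minimal sketch of that technique follows, covering only DOI, ORCID, and arXiv. Everything in it is an assumption made for illustration: the function name `extract_identifiers`, the patterns, the `identifier` key, and the scheme URIs are not taken from `identifiers.py`, whose contents are not part of this diff.

```python
import re

# Hypothetical patterns and scheme URIs, keyed by scheme name.
_PATTERNS = {
    "DOI": (re.compile(r'\b10\.\d{4,9}/[^\s"<>]+'), "https://doi.org/"),
    "ORCID": (re.compile(r"\b\d{4}-\d{4}-\d{4}-\d{3}[\dX]\b"), "https://orcid.org/"),
    "arXiv": (
        re.compile(r"\barXiv:\s*(\d{4}\.\d{4,5}(?:v\d+)?)", re.IGNORECASE),
        "https://arxiv.org/",
    ),
}


def extract_identifiers(text: str) -> list[dict[str, str]]:
    """Return de-duplicated identifier records found in raw poster text."""
    found: list[dict[str, str]] = []
    for scheme, (pattern, scheme_uri) in _PATTERNS.items():
        for match in pattern.finditer(text):
            # Use the capture group when the pattern has one, else the whole match.
            value = match.group(match.lastindex or 0)
            # Trim sentence punctuation that the pattern may have swallowed.
            value = value.rstrip(".,;)")
            entry = {"identifier": value, "scheme": scheme, "schemeURI": scheme_uri}
            if entry not in found:
                found.append(entry)
    return found


if __name__ == "__main__":
    sample = "See doi:10.1234/abcd.5678. Contact 0000-0002-1825-0097, arXiv:2101.12345v2"
    for record in extract_identifiers(sample):
        print(record)
```

Keeping the scheme URI next to its pattern means the scheme is inferred from whichever pattern matched, so no second lookup step is needed.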