- Replaced iscc-sum with iscc-lib for Data/Instance/Sum code generation
- Simplified
code_data,code_instance, andcode_sumusing iscc-lib's optimized generators - Removed iscc-sum dependency
- Migrated from iscc-core to iscc-lib for core ISCC algorithms
- Removed iscc-core from dependencies (blake3, loguru, bitarray, xxhash now explicit)
- Added optional
outpathparameter toembed_metadatafor custom output file paths (closes #71) - Added support for passing
dictas metadata toembed_metadata - Fixed
code_iscc_mtto acceptname,description, andmetaparameters likecode_iscc - Migrated from Poetry to uv for dependency management and build backend
- Replaced IPFS binary tool with pure Python CIDv1 computation (no external dependency)
- Fixed installer script referencing removed IPFS tool functions
- Fixed memory usage in IPFS CID computation for large files (stream-hash per chunk)
- Updated CI GitHub Actions to latest versions
- Updated dependencies
- Fixed TypeError with Semantic-Code image in granular processing mode.
- Fixed issue with passing incompatible options to iscc-sct in experimental mode
- Improved test coverage
- Added
code_sumfunction for efficient combined Data-Code and Instance-Code generation - Updated
code_iscc&code_metato accept custom metadata and optionally skip file metadata extraction - Updated
code_isccto compute Data & Instance Code in one go and process data sequentially - Added separate
code_iscc_mtfor multihreaded processing - Updated dependencies
- Added experimental semantic codes for text and image with optional iscc-sct and iscc-sci packages
- Added
byte_offsetsoption to support UTF-8 byte positions in granular simprints - Improved image transparency handling with more robust conversion logic
- Improved text sanitization
- Updated lock file dependencies
- Added
text_keepoption to store extracted text onIsccMeta.textproperty - Changed PDF text extraction to use pypdfium2 with reading order reconstruction
- Re-added pillow-avif-plugin (Pillow dropped bundling libavif)
- Added support for optional ISCC-UNITS and configurable bit size (256-bit).
- Added
extractcommand to CLI for text extraction with error handling. - Added AVIF thumbnail format support and metadata stripping for thumbnails.
- Added robust EPUB cover image extraction with multiple fallback strategies.
- Added fixed layout EPUB detection with namespace support.
- Added container processing support for embedded elements in epub files.
- Added min_image_size option to filter images during EPUB processing.
- Added text sanitization and HTML cleaning for metadata extraction.
- Added image_strip_metadata function for thumbnail generation.
- Added AVIF thumbnail format support.
- Modified code_iscc to conditionally generate meta-code based on create_meta option.
- Updated text features generation to use 256-bit MinHash algorithm.
- Updated video feature extraction with improved hash bits and metadata structure.
- Simplified media processing functions by removing redundant optional parameters.
- Updated dependencies.
- Added Python 3.13 support
- Added support for scene based granular video code processing
- Added ffprobe video metadata extraction (duration, fps, width, height, language)
- Added support str and Path objects for all file inpunts
- Added
generator(name and version of software) to ISCC metadata - Added optional fallback to ISCC-SUM for unsupported media types
- Handle unsupported SVG files gracefully
- Replaced python-magic with puremagic wrapper for cross-platform compatibility
- Replaced exiv2 CLI tool with native python bindings
- Replaced Tika Java dependency with native extractous package
- Fix bug with thumbnail generation for image modes
- Update and relax dependencies
- Update ISO project status
- Improved robustness of granular text features
- Added
extract_meta&create_thumbparameters - Fixed redundant metadata extraction
- Updated dependencies
- Add avif and heic image format support
- Update to Exiv2 0.27.7
- Update to Tika 2.9.1
- Update to FFMPEG 6.1
- Add Python 3.12 Support
- Add Pydantic v2 Support
- Dropped Python 3.8 Support
- Fix stdout decoding
- Add
installcli command - Ignore signature files in batch cli command
- Fix encoding of result files
- Updated dependencies
- Added parallel processing of ISCC-UNITs
- Handle video thumbnail extraction errors gracefully
- Add basic command line interface for batch processing
- Add option to keep MP7 Video Signature
- Support concurrent audio metadata extraction
- Updated dependencies
- Fixed mkdocstrings
- Don´t install tika more than once per session
- Don´t pin poetry build requirement version
- Publish wheels
- Use latest poetry with default settings in CI
- Update dependencies
- Switch to official pytaglib distribution
- Update dependencies
- Added global
extract_metadataoption - Removed taglib installation from CI
- Update to iscc-core 1.0.3 using binary wheels
- Added Python 3.11 support
- Added WavPack support
- Added docx metadata embedding support
- Improved error messages for unsupported mediatypes
- Improved robustness of audio metadata extraction
- Normalize mediatype application/xml to text/xml
- Nomralize mediatype application/vnd.ms-asf video/x-ms-asf
- Normalize mediatype application/vnd.adobe.flash.movie to application/x-shockwave-flash
- Add docx and xlsx extensions to mediatypes
- Updated to iscc-core 1.0.1
- Updated various dependencies
- Fix audio title embedding
- Add audio cover art thumbnailing
- Integrate EPUB metadata embedding
- Add EPUB cover image thumbnail extraction
- Add EPUB metadata embedding
- Retain existing PDF DocInfo metadata when embedding
- Separeta ISCC_CORE and ISCC_SDK options
- Document ISCC_SDK option
- Integrate PDF thumbnail extraction
- Add metadata embedding support for PDFs
- Add thumbnail support for PDFs
- Improve text metadata extraction mapping
- Improve documentation
- Update dependencies
- Add support for earlier Pillow versions
- Add option to configure
image_max_pixels - Don´t embed empty titles in images
- Handle extracted metadata gracefully
- Fixed CVE-2007-4559
- Update to iscc-schema v0.4.0
- Update to iscc-core v0.2.12
- Update to tika 2.6.0
- Update dependencies
- Update to iscc-schema v0.3.9
- Update to iscc-core v0.2.11
- Fix issue with embedded identifiers
- Add support for granular text features
- Update to iscc-schema v0.3.8
- Updated dependencies
- Moved changelog to seperate file
- Fixed pillow resampling deprecation warning
- Fixed exiv2 error on older linux versions
- Added support for lazy installation of cli-tools
- Added cleanup of cli-tool archives after installation
- Removed obsolete ffprobe tool
- Updated dependencies
- Fix metadata extraction failure with long texts
- Embed Dublin Core metadata in images
- Support path object inputs for text_name_from_uri
- Added video thumbnail support
- Added support for IPFS wrap-with-directory
- Added Text-Code generation
- Added Video-Code generation
- Added text document metadata extraction
- Added video metadata embedding and extraction
- Added custom exeptions
- Changed embedding to create a new mediafile
- Set filename on IsccMeta
- Set @type on IsccMeta
- Added support for Audio-Code with metadata embedding/extraction
- Added IPFS support
- Initial release with support for ISCC Content-Code Image