Summary
Parsing an E-ARK SIP via EARKSIP.parse(...) has two related issues:
- contentInformationType is never populated from METS, so it stays at the default MIXED.
- contentType is derived from CONTENTINFORMATIONTYPE / OTHERCONTENTINFORMATIONTYPE instead of TYPE / OTHERTYPE.
Steps to Reproduce
- Create a SIP with root METS attributes:
  - TYPE="Other"
  - csip:OTHERTYPE="Moving images - on tangible media"
  - csip:CONTENTINFORMATIONTYPE="OTHER"
  - csip:OTHERCONTENTINFORMATIONTYPE="MOVINGIMAGES-PROFILE-1.0"
- Parse it with new EARKSIP().parse(path).
- Inspect results:
  - sip.getContentType().asString() → incorrect (taken from content information fields)
  - sip.getContentInformationType().asString() → remains MIXED
Expected
- contentType should come from TYPE / OTHERTYPE
- contentInformationType should come from csip:CONTENTINFORMATIONTYPE / csip:OTHERCONTENTINFORMATIONTYPE
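
To make the expected mapping concrete, here is a minimal sketch that reads the four root attributes with the JDK DOM parser only, independent of commons-ip. The class name MetsRootTypes, the helper resolve, and the rule that a main value of "OTHER" (any case) defers to its companion attribute are assumptions for illustration, as is the CSIP namespace URI; check it against the METS file under test.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;

public class MetsRootTypes {
    // Assumed CSIP extension namespace URI; verify against the METS file under test.
    static final String CSIP_NS = "https://DILCIS.eu/XML/METS/CSIPExtensionMETS";

    /** The two values a parsed SIP would be expected to expose (hypothetical holder). */
    record Types(String contentType, String contentInformationType) {}

    /** Returns main, unless main is "OTHER" (any case) and a companion value is present. */
    static String resolve(String main, String other) {
        return "OTHER".equalsIgnoreCase(main) && !other.isEmpty() ? other : main;
    }

    /** Reads both type pairs from the root element of a METS document. */
    static Types read(String metsXml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // needed for getAttributeNS to resolve csip:*
        Element root = factory.newDocumentBuilder()
                .parse(new ByteArrayInputStream(metsXml.getBytes(StandardCharsets.UTF_8)))
                .getDocumentElement();
        // contentType comes from TYPE / csip:OTHERTYPE only.
        String contentType = resolve(
                root.getAttribute("TYPE"),
                root.getAttributeNS(CSIP_NS, "OTHERTYPE"));
        // contentInformationType comes from the csip:*CONTENTINFORMATIONTYPE pair only.
        String contentInformationType = resolve(
                root.getAttributeNS(CSIP_NS, "CONTENTINFORMATIONTYPE"),
                root.getAttributeNS(CSIP_NS, "OTHERCONTENTINFORMATIONTYPE"));
        return new Types(contentType, contentInformationType);
    }

    public static void main(String[] args) throws Exception {
        String mets = "<mets xmlns=\"http://www.loc.gov/METS/\" xmlns:csip=\"" + CSIP_NS + "\""
                + " TYPE=\"Other\" csip:OTHERTYPE=\"Moving images - on tangible media\""
                + " csip:CONTENTINFORMATIONTYPE=\"OTHER\""
                + " csip:OTHERCONTENTINFORMATIONTYPE=\"MOVINGIMAGES-PROFILE-1.0\"/>";
        Types types = read(mets);
        System.out.println(types.contentType());            // Moving images - on tangible media
        System.out.println(types.contentInformationType()); // MOVINGIMAGES-PROFILE-1.0
    }
}
```

With the attributes from the reproduction steps, the two values differ, which is what makes the mix-up visible in the parsed SIP.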
Actual
- contentType incorrectly uses content information attributes
- contentInformationType remains default MIXED
Likely Cause
In EARKUtils.setIPContentType(...), the value is taken from CONTENTINFORMATIONTYPE / OTHERCONTENTINFORMATIONTYPE rather than TYPE / OTHERTYPE.
Also, processMainMets(...) (and representation parsing) never calls a setter for contentInformationType.
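
A possible shape for the fix, sketched with stand-in types rather than the real commons-ip classes. RootAttributes, ParsedIp, resolve, and setIpContentInformationType are hypothetical names; only setIPContentType and processMainMets echo methods named in this issue, and their real signatures may differ. Keeping the default MIXED when the csip attribute is absent is also an assumption.

```java
public class ContentTypeFixSketch {
    /** Hypothetical stand-in for the root METS attributes relevant to this issue. */
    record RootAttributes(String type, String otherType,
                          String contentInformationType, String otherContentInformationType) {}

    /** Hypothetical stand-in for the parsed IP; the real model uses typed values, not strings. */
    static final class ParsedIp {
        String contentType;
        String contentInformationType = "MIXED"; // default described in this issue
    }

    /** Returns main, unless main is "OTHER" (any case) and a companion value is present. */
    static String resolve(String main, String other) {
        boolean useOther = "OTHER".equalsIgnoreCase(main) && other != null && !other.isEmpty();
        return useOther ? other : main;
    }

    /** Fix 1: derive contentType from TYPE / OTHERTYPE only. */
    static void setIpContentType(RootAttributes mets, ParsedIp ip) {
        ip.contentType = resolve(mets.type(), mets.otherType());
    }

    /** Fix 2: new setter fed by the content information attributes. */
    static void setIpContentInformationType(RootAttributes mets, ParsedIp ip) {
        if (mets.contentInformationType() != null) { // assumption: keep the default when absent
            ip.contentInformationType =
                    resolve(mets.contentInformationType(), mets.otherContentInformationType());
        }
    }

    /** Main METS processing calls both setters, so neither field is left stale. */
    static ParsedIp processMainMets(RootAttributes mets) {
        ParsedIp ip = new ParsedIp();
        setIpContentType(mets, ip);
        setIpContentInformationType(mets, ip);
        return ip;
    }

    public static void main(String[] args) {
        ParsedIp ip = processMainMets(new RootAttributes(
                "Other", "Moving images - on tangible media",
                "OTHER", "MOVINGIMAGES-PROFILE-1.0"));
        System.out.println(ip.contentType);            // Moving images - on tangible media
        System.out.println(ip.contentInformationType); // MOVINGIMAGES-PROFILE-1.0
    }
}
```

The same pair of calls would presumably be needed wherever representation METS files are parsed.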