naming scheme for generating ResourceSync documents #8

@markmatney

I've observed the following about how this implementation names generated sitemaps (resourcelist, changelist, etc.) when ResourceSync.execute() is invoked for a set of 50,000 or fewer resources:

  • With strategy 0 (resourcelist):
    • On first invocation, a file resourcelist_0000.xml is created.
    • Subsequent invocations modify that file and keep the same name.
  • With strategy 1 (new_changelist):
    • On first invocation, a file changelist_0000.xml is created.
    • On subsequent invocations, a file changelist-index.xml is created that points to a series of changelist_xxxx.xml files.
  • With strategy 2 (inc_changelist):
    • On first invocation, a file changelist_0000.xml is created.
    • Subsequent invocations modify that file and keep the same name.

I haven't tested collections with more than 50,000 resources, but per the spec I'd expect a resourcelist-index.xml to be generated that points to multiple resourcelists.
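
To make that expectation concrete, here is a rough sketch of the file names I'd expect for a given number of resources. This is not code from the library: the function name expected_resourcelist_files and the constant MAX_ITEMS_PER_LIST are made up for illustration, and the behavior above 50,000 resources is my assumption, not something I've observed.

```python
import math

# Assumed per-document limit, matching the 50,000 figure mentioned above.
MAX_ITEMS_PER_LIST = 50000


def expected_resourcelist_files(resource_count, max_items=MAX_ITEMS_PER_LIST):
    """Return the file names one might expect for resource_count resources.

    Hypothetical helper: mirrors the observed resourcelist_0000.xml naming
    and assumes an index file appears once more than one list is needed.
    """
    # At least one list is written, even for an empty set (assumption).
    list_count = max(1, math.ceil(resource_count / max_items))
    names = ["resourcelist_%04d.xml" % i for i in range(list_count)]
    if list_count > 1:
        # More than one list: an index pointing at all of them is expected.
        names.insert(0, "resourcelist-index.xml")
    return names
```

With 50,000 resources this gives a single resourcelist_0000.xml; with 50,001 it gives resourcelist-index.xml plus two numbered lists.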

I think that whenever there is only one resourcelist or changelist for a set of resources, it should be named simply resourcelist.xml or changelist.xml, without the _0000 suffix. A suffix like that is misleading and unnecessary when the file is the only such sitemap for the set.
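
A minimal sketch of the naming rule I'm proposing, assuming the generator knows how many documents of a given kind it is about to write. The function proposed_sitemap_name and its parameters are hypothetical and do not exist in the current code.

```python
def proposed_sitemap_name(kind, index, total):
    """Return the proposed file name for one sitemap document.

    kind  -- "resourcelist" or "changelist"
    index -- zero-based position of this document within its set
    total -- number of documents of this kind being written

    Hypothetical helper illustrating the proposal, not existing behavior.
    """
    if total == 1:
        # Only one document: drop the numeric suffix entirely.
        return "%s.xml" % kind
    # Several documents: keep the zero-padded suffix used today.
    return "%s_%04d.xml" % (kind, index)
```

So a lone resourcelist would be written as resourcelist.xml, while the third of five changelists would still be changelist_0002.xml.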

Thoughts?
