More powerful deduplication #7

@philippeitis

Currently, deduplication is limited to books with matching titles and authors.

In the future, it would be nice to be able to specify both how duplicates are detected and how they are handled.

In particular, we should be able to use hashes, titles, authors, and even ISBNs to detect duplicates. We should also allow specifying which of these matter, and provide a way to handle duplicates that cannot be merged seamlessly with the provided strategies (e.g. when metadata has been altered).
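To make the idea concrete, here is a minimal sketch of detection with selectable checks. Everything in it is an assumption for illustration: the `Book` record, its field names, the `CHECKS` table, and the rule that two books count as duplicates when they match on any one enabled check (matching on all enabled checks would be an equally valid choice). It is not how the project currently works.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, List, Optional, Tuple


@dataclass(frozen=True)
class Book:
    # Hypothetical record; the field names are illustrative only.
    title: str
    authors: Tuple[str, ...]
    isbn: Optional[str] = None
    content_hash: Optional[str] = None


def _norm(text: str) -> str:
    # Case-insensitive comparison with collapsed whitespace.
    return " ".join(text.casefold().split())


# Each check maps a book to a comparable key, or None when the field is missing.
CHECKS: Dict[str, Callable[[Book], Optional[Hashable]]] = {
    "hash": lambda b: b.content_hash,
    "isbn": lambda b: b.isbn.replace("-", "") if b.isbn else None,
    "title": lambda b: _norm(b.title) or None,
    "authors": lambda b: tuple(sorted(_norm(a) for a in b.authors)) or None,
}


def find_duplicates(books: List[Book], enabled: List[str]) -> List[Tuple[int, int]]:
    """Return (earlier_index, later_index) pairs matching on any enabled check."""
    seen: Dict[Tuple[str, Hashable], int] = {}
    pairs: List[Tuple[int, int]] = []
    for i, book in enumerate(books):
        match = None
        keys = []
        for name in enabled:
            key = CHECKS[name](book)
            if key is None:
                continue  # a missing field never counts as a match
            keys.append((name, key))
            if match is None and (name, key) in seen:
                match = seen[(name, key)]
        if match is not None:
            pairs.append((match, i))
        for k in keys:
            seen.setdefault(k, i)  # remember the first book seen for each key
    return pairs


# Sample data, made up for this example.
books = [
    Book("Dune", ("Frank Herbert",), isbn="978-0-441-17271-9", content_hash="aa"),
    Book("DUNE  ", ("frank herbert",), content_hash="bb"),
    Book("Dune (retitled)", ("F. Herbert",), isbn="9780441172719", content_hash="cc"),
    Book("Emma", ("Jane Austen",)),
]
```

With this sample data, enabling only `hash` and `isbn` pairs the first and third books (same ISBN despite altered metadata), while enabling only `title` pairs the first and second. Pairs flagged this way that cannot be merged automatically would be the ones handed to the user.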

Nothing in the database or the implementation of this project prevents such a feature.

The implementation would require the following:

  • Specifying the de-duplication strategy via the interface (e.g. "keepfirst checkhash checkisbn checktitle")
  • Creating a screen that displays the conflicting books side by side, where the user can pick a book by typing an index or using arrow keys + Enter, discard a book outright, or apply new metadata
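As a rough sketch of the first item, a strategy string such as "keepfirst checkhash checkisbn checktitle" could be parsed into a small settings object. The `DedupStrategy` type, the `parse_strategy` function, and the `keep…` / `check…` prefix convention are assumptions inferred from the example string, not an existing interface.

```python
from dataclasses import dataclass, field
from typing import Set


@dataclass
class DedupStrategy:
    # Hypothetical settings object; names are illustrative only.
    keep: str = "first"                              # which duplicate survives
    checks: Set[str] = field(default_factory=set)    # fields used for detection


def parse_strategy(spec: str) -> DedupStrategy:
    """Parse whitespace-separated tokens like 'keepfirst checkhash'."""
    strategy = DedupStrategy()
    for token in spec.split():
        if token.startswith("keep") and len(token) > len("keep"):
            strategy.keep = token[len("keep"):]
        elif token.startswith("check") and len(token) > len("check"):
            strategy.checks.add(token[len("check"):])
        else:
            # Reject anything that does not follow the assumed convention.
            raise ValueError(f"unknown deduplication token: {token!r}")
    return strategy


strategy = parse_strategy("keepfirst checkhash checkisbn checktitle")
```

Here `strategy.keep` is `"first"` and `strategy.checks` holds `hash`, `isbn`, and `title`. Rejecting unknown tokens up front keeps typos from silently disabling a check.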
