Skip to content

Indexing Atlassian Confluence  #154

@pudo

Description

@pudo

We have this recurring request from some editors to index project Confluence wikis into Aleph. The idea is to index all the reporters notes from a given wiki space into an investigation casefile. What we'd need to figure out:

  • How do we authenticate with Confluence in a way that ships around 2FA on SSO. Do they have some sort of app passwords?
  • Do we want to index wiki pages as HTML, or is it better to index for example a PDF export?
    • What do we do with comments?
    • How do we represent the hierarchy of wiki pages? Do we create pseudo-folders?
  • Need to make sure we also pull in page attachments
  • Need to generate good foreign IDs so changed pages don't duplicate based on hash

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions