Separate “Other” into “None of the above” and “Algorithm can’t decide” #10

@jucor

Dear Jigsaw team,

As discussed by email, it would be very useful if the topic analysis part of the Sensemaking tools could distinguish between “Other” (in the sense of “None of the above”) and “The Algorithm Can’t Decide”. See the rationale at compdemocracy/polis#1878.

I believe this can be achieved in src/tasks/categorization.ts, in the function assignDefaultCategory (https://github.com/Jigsaw-Code/sensemaking-tools/blame/8eb482e35c44d2399ab68d684a12d51a74472ad4/src/tasks/categorization.ts#L381), which is called by the function categorizeWithRetry.

I do note that SenseMaker does in some cases provide a form of distinction by using the Uncategorized subtopic of the category Other. However, this:

  • does not work for conversations which do not need subtopics, i.e. when the argument includeSubtopics of categorizeWithRetry is set to false, due to the line
    includeSubtopics
    ? ({ name: "Other", subtopics: [{ name: "Uncategorized" }] } as NestedTopic)
    : ({ name: "Other" } as FlatTopic),
    In that case, there is no distinction,
  • is not documented (but yay open source, it can be found in the code :) ),
  • is thus easy to overlook even when includeSubtopics is true,
  • and, because of that, can very easily confuse users and muddle the interpretation of the different topics, be it for debugging purposes (“Why is ‘Other’ so big?”) or for actual use.

It would thus be terrific to make Uncategorized a top-level topic rather than a subtopic.
Thanks!
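
To make the request concrete, here is a rough sketch of the kind of change I have in mind. It is only an illustration under assumptions, not a patch: the FallbackReason type and the defaultTopicFor function are hypothetical names that do not exist in the library, the FlatTopic and NestedTopic interfaces are simplified stand-ins for the real ones, and the placeholder subtopic in the nested case is just one possible choice.

```typescript
// Simplified stand-ins for the library's topic types (assumption: the real
// FlatTopic / NestedTopic definitions may carry more fields).
interface FlatTopic {
  name: string;
}
interface NestedTopic {
  name: string;
  subtopics: FlatTopic[];
}
type Topic = FlatTopic | NestedTopic;

// Hypothetical reason codes, not part of the current library:
// - "none_of_the_above": the model looked at the topics and none fits.
// - "cannot_decide": categorization failed, e.g. retries were exhausted.
type FallbackReason = "none_of_the_above" | "cannot_decide";

// Hypothetical helper: picks a distinct top-level topic per fallback reason,
// so the two cases stay separate whether or not subtopics are requested.
function defaultTopicFor(
  reason: FallbackReason,
  includeSubtopics: boolean
): Topic {
  const name = reason === "cannot_decide" ? "Uncategorized" : "Other";
  // Assumption: in the nested case a single same-named placeholder subtopic
  // is used; the real code may prefer something else.
  return includeSubtopics ? { name, subtopics: [{ name }] } : { name };
}
```

With something along these lines, a comment the algorithm failed to categorize would land in a top-level Uncategorized topic even when includeSubtopics is false, and Other would only hold comments judged to fit none of the listed topics.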
