Use tsv file instead of excel for infobox data #7

@michaelsilver

From @alvaromorales on August 12, 2015 5:37

I noticed you added the infoboxes.xlsx spreadsheet to the repo. I created that file manually a few months ago for quick analysis of infobox frequency. I wasn't really intending for it to be machine-readable.

Now that the data has turned out to be useful for wikithingsdb and wikimap, I'll make elasticstart generate this file automatically on every reindex. For simplicity, I'll output a tab-separated file with the following format:

class               count
settlement          354090
person              102384

The TSV format should simplify how you read in the data.
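
For illustration, here is a minimal sketch of how a consumer might read that format with Python's standard `csv` module. The function name `read_infobox_counts` and the in-memory sample are hypothetical, and the sketch assumes the file keeps the `class` and `count` header row shown above; it is not the actual wikithingsdb code.

```python
import csv
import io


def read_infobox_counts(tsv_file):
    """Return a {class: count} dict from a tab-separated file with a header row.

    Assumes the header names are exactly "class" and "count" (hypothetical,
    based on the format sketched in this issue).
    """
    reader = csv.DictReader(tsv_file, delimiter="\t")
    return {row["class"]: int(row["count"]) for row in reader}


# In-memory stand-in for the generated file, using the rows from this issue.
sample = "class\tcount\nsettlement\t354090\nperson\t102384\n"
counts = read_infobox_counts(io.StringIO(sample))
print(counts["settlement"])  # 354090
```

To read a real file, something like `open(path, newline="")` could be passed in place of the `io.StringIO` object, where `path` is wherever the generated file ends up.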

Copied from original issue: infolab-csail/wikithingsdb#1
