Skip to content

Benchmarking different python whois library against registrant value #3

@baderdean

Description

@baderdean

Parsing Whois data is hard, especially because the format differ depending on the TLD. I've a specific issue with registrant value. So I decided to test multiple python whois library (pythonwhoisalt, asyncwhois, whoisit, whoisdomain) against the registrant field using google's domain dataset and check for their speed too. Initially, I was using whoisdomain, so here the initial post on its github: mboot-github/WhoisDomain#21

Here the script I wrote: https://gist.github.com/baderdean/cc4643ecd95d3ccde31dee80ebdbea28

And here the results for pythonwhois alt (not good) but I don't know why, since it's the older I thought it would be the most complete:

{'pythonwhoisalt': {'count': 7,
                    'duration': 1055.0711162629996,
                    'percentage': '4%'}}

Other results:

{'asyncwhois': {'count': 49,
                'duration': 285.84409061399947,
                'end': 6710.12183199,
                'percentage': 26,
                'start': 6424.277741376},
 'whoisdomain': {'count': 44,
                 'duration': 195.54051797400007,
                 'end': 6396.365353012,
                 'percentage': 24,
                 'start': 6200.824835038},
 'whoisit': {'count': 6,
             'duration': 27.91238160300054,
             'end': 6424.27773957,
             'percentage': 3,
             'start': 6396.365357967}}

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions