Skip to content

Mobox email indexing complete with 0 recorded files.  #1

@unit92

Description

@unit92

Execute docker environment with :
docker run --rm -ti -p 5601:5601 --mount type=bind,source=F:/comms-analyzer-toolbox/Files,target=/files --mount type=bind,source=F:/comms-analyzer-toolbox/Elastic,target=/toolbox/elasticsearch/data comms-analyzer-toolbox:latest python /toolbox/elasticsearch-gmail/src/index_emails.py --infile=/files/incoming.mbox --init=True --index-bodies=True --index-bodies-ignore-content-types=application,image --index-bodies-html-parser=html5lib --index-name=comm_data

testing with :
import mailbox
i = 1
for message in mailbox.mbox('incoming.mbox'):
i=+1
print(i)

from inside docker container via exec indicates 15000 records

I refactored from --v as it was also failing.

Result is:
{"type":"log","@timestamp":"2018-07-26T14:49:20Z","tags":["warning","elasticsearch","admin"],"pid":61,"message":"Unable to revive connection: http://localhost:9200/"}
{"type":"log","@timestamp":"2018-07-26T14:49:20Z","tags":["warning","elasticsearch","admin"],"pid":61,"message":"No living connections"}

Launching MBOX email indexer....

INFO:root:Starting import from file /files/incoming.mbox
INFO:root:Import done - total count 0

MBOX email indexing is complete!

Elastic appears to be running as per process

Not sure if anyone is still working on tis but i thought i would ask before going in for a full refactor.....

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions