Skip to content

automate/improve infrastructure #59

@andrewljohnson

Description

@andrewljohnson

This issue describes how deeposm works now. Then it describes changes needed to improve the infrastructure. Also see notes/scripts on these issues: #8, #23, #30, and #39 (these issues were closed to merge with this issue, not completed).

Test Data and Training

  • one app does both the data prep, and neural net training
  • it uses a GPU/Tensorflow on a Linux box in my office
  • findings are then uploaded to S3

Display on deeposm.org

  • when a deeposm.org page is loaded, it checks S3, grabs findings, and updates the database
  • deeposm.org shows where DeepOSM detects mis-registered roads

Issues with this Setup

  • The scripts to gather data, train, and upload findings should run on a cycle, not manually when I press a button
  • The data prep and training modules should be separate - DeepOSM has one monster Dockerfile.gpu-devel that includes GDAL, Tensorflow, and more. This makes the build fragile and hard to deploy.
  • Actual work includes:
    • move the analysis to AWS, run on a cycle
    • set up a cron job to have deeposm.org check for new findings
    • parallelize the analysis, so we can do deeper nets and more area
    • import NAIPs into Postgres, instead of hacking them up and caching files
    • use Overpass or other approach to getting OSM data, instead of hacking up PBF extracts with Osmium

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions