automate/improve infrastructure

This issue describes how deeposm works now. Then it describes changes needed to improve the infrastructure. Also see notes/scripts on these issues: #8, #23, #30, and #39 (these issues were closed to merge with this issue, not completed).
## Test Data and Training
- one app does both the data prep, and neural net training
- it uses a GPU/Tensorflow on a Linux box in my office
- findings are then uploaded to S3
## Display on deeposm.org
- when a deeposm.org page is loaded, it checks S3, grabs findings, and updates the database
- deeposm.org shows where DeepOSM detects mis-registered roads
## Issues with this Setup
- The scripts to gather data, train, and upload findings should run on a cycle, not manually when I press a button
- The data prep and training modules should be separate - DeepOSM has one monster Dockerfile.gpu-devel that includes GDAL, Tensorflow, and more. This makes the build fragile and hard to deploy.
- Actual work includes:
  - move the analysis to AWS, run on a cycle
  - set up a cron job to have deeposm.org check for new findings
  - parallelize the analysis, so we can do deeper nets and more area
  - import NAIPs into Postgres, instead of hacking them up and caching files
  - use Overpass or other approach to getting OSM data, instead of hacking up PBF extracts with Osmium


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

automate/improve infrastructure #59

Test Data and Training

Display on deeposm.org

Issues with this Setup

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

automate/improve infrastructure #59

Description

Test Data and Training

Display on deeposm.org

Issues with this Setup

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions