-
Notifications
You must be signed in to change notification settings - Fork 184
Open
Description
This issue describes how deeposm works now. Then it describes changes needed to improve the infrastructure. Also see notes/scripts on these issues: #8, #23, #30, and #39 (these issues were closed to merge with this issue, not completed).
Test Data and Training
- one app does both the data prep, and neural net training
- it uses a GPU/Tensorflow on a Linux box in my office
- findings are then uploaded to S3
Display on deeposm.org
- when a deeposm.org page is loaded, it checks S3, grabs findings, and updates the database
- deeposm.org shows where DeepOSM detects mis-registered roads
Issues with this Setup
- The scripts to gather data, train, and upload findings should run on a cycle, not manually when I press a button
- The data prep and training modules should be separate - DeepOSM has one monster Dockerfile.gpu-devel that includes GDAL, Tensorflow, and more. This makes the build fragile and hard to deploy.
- Actual work includes:
- move the analysis to AWS, run on a cycle
- set up a cron job to have deeposm.org check for new findings
- parallelize the analysis, so we can do deeper nets and more area
- import NAIPs into Postgres, instead of hacking them up and caching files
- use Overpass or other approach to getting OSM data, instead of hacking up PBF extracts with Osmium