This is the utility tool for Mochuan Zhan's MSc project, "Registration of UAV Imagery to Aerial and Satellite Imagery", at the University of Manchester (https://www.manchester.ac.uk/), supervised by Dr. Terence Patrick Morley.
This tool has four main parts:
- A Python program for automatic camera calibration and image undistortion (checkerboard images required).
- A Python program for obtaining the corresponding Google/Bing satellite maps for a batch of given images (GPS data required).
- A Python program to read/copy/delete/modify EXIF data for a batch of images.
- A Python program with a GUI for annotating ground control points (GCPs) in both the sensed and reference images and automatically generating a config file.
The basic idea of the project is to investigate various methods for registering downward-facing (nadir) drone imagery to higher-altitude aerial and satellite imagery (Google, Bing), and possibly to develop a system for visual localization and navigation of drones (UAVs) as an alternative to a satellite navigation system such as GPS. This project builds on research into local feature detectors and high-throughput computing techniques.
If you want to cite this code, please use:
The dataset used in this project is MSDI (Manchester Surface Drone Imagery), which was collected and processed by Mochuan Zhan, supervised by Dr. Terence Patrick Morley.
The image dataset can be found through the
The dataset contains 599 drone images of Manchester (447 facing downward, 89 facing forward at 45 degrees, and 64 facing forward at 0 degrees), 26 checkerboard images taken by the drone for camera calibration, the camera intrinsic (internal parameter) matrix, and the distortion coefficients.
The corresponding Google/Bing satellite maps can be obtained with this program. A larger reference map can be obtained by stitching several maps together with this program. Note that this stitching method will be replaced by a tile-based approach in an upcoming version (example: see simonw/datasette-tiles#17). Due to copyright issues, these images are not included in the dataset, but users can apply for their own API key and obtain them. The following diagram demonstrates the principle of this process:
The stitching method is based on the correlation coefficient: it searches line by line for the correct stitching position, and once all positions have been found, the images are assembled into a single larger map. The watermark can also be cropped off in this step, which is very helpful when performing registration tasks. Detailed steps and principles can be found in my thesis; the link will be posted later.
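The exact line-by-line search lives in map.py; the sketch below only illustrates the underlying idea, assuming two vertically overlapping tiles of equal width and using OpenCV's normalised correlation coefficient (cv2.matchTemplate) to locate the overlap. The file names and the 40-pixel search band are hypothetical.

```python
import cv2
import numpy as np

# Hypothetical example tiles; in practice they come from the static-map download step.
top = cv2.imread("tile_top.png")
bottom = cv2.imread("tile_bottom.png")

# Take a thin strip from the top edge of the lower tile and search for it in the
# upper tile using the normalised correlation coefficient.
band = bottom[:40, :, :]
result = cv2.matchTemplate(top, band, cv2.TM_CCOEFF_NORMED)
_, _, _, max_loc = cv2.minMaxLoc(result)

# Everything in `top` from the matched row downwards duplicates `bottom`,
# so keep only the rows above it and stack the two tiles.
stitched = np.vstack([top[:max_loc[1], :, :], bottom])
cv2.imwrite("stitched.png", stitched)
```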
In our calibrated images, all EXIF data has been copied from the raw images, including the GPS data. The aerial images retrieved from Google/Bing are PNG files, so EXIF metadata is not available for them.
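The EXIF handling is implemented in myexif.py. As a rough sketch of the general idea, copying the EXIF block (including GPS) from a raw JPEG to its calibrated counterpart could look like this with the piexif package (an assumption; the project may use a different library, and the file names are illustrative only):

```python
import piexif

def copy_exif_single(raw_path: str, calibrated_path: str) -> None:
    """Copy all EXIF blocks (including GPS) from a raw image into its calibrated version."""
    exif_dict = piexif.load(raw_path)           # read EXIF from the raw JPEG
    exif_bytes = piexif.dump(exif_dict)         # serialise it back to bytes
    piexif.insert(exif_bytes, calibrated_path)  # write it into the calibrated JPEG in place

copy_exif_single("raw_images/EV_001.JPG", "calibrated_images/EV_001.JPG")
```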
The following file structure shows the content of the whole project, including the images that cannot be uploaded due to copyright issues, so that users can better understand the complete dataset and see how this program can be reused.
# The complete file structure (currently only utils is included)
.
├── README.md
├── LICENSE
├── data # Image dataset
│ ├── bing_advance_static_images # Stitched Bing static map without watermark
│ ├── calibrated_images # Calibrated drone images
│ ├── checkboard_images # Checkerboard images taken by the drone
│ ├── google_advance_static_images # Stitched Google static map without watermark
│ ├── google_static_images # Google static map with watermark
│ ├── raw_images # Raw drone images
│ └── README.txt
├── model
│ ├── model_distortion.txt # Distortion coefficients for the drone camera
│ └── model_matrix.txt # Camera internal parameter matrix
├── utils
│ ├── calibration.py # Class for camera calibration and image undistortion
│ ├── config.py # Configuration file for the utils tools
│ ├── GCP_selector.py # Ground control point selector for calculating reprojection error in feature matching
│ ├── main.py # main function
│ ├── map.py # Class for requesting static maps from Google/Bing and creating advanced (stitched) maps
│ └── myexif.py # Class for obtaining/copying/modifying images' EXIF data
└── requirements.txt
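The two files under model/ store the calibration result for the drone camera. Assuming they contain plain whitespace-separated numbers (an assumption about the file format), they could be loaded like this:

```python
import numpy as np

# Assumed plain-text layout: a 3x3 intrinsic matrix and a row of distortion coefficients.
camera_matrix = np.loadtxt("model/model_matrix.txt").reshape(3, 3)
distortion = np.loadtxt("model/model_distortion.txt").reshape(1, -1)
print(camera_matrix, distortion, sep="\n")
```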
The GCP selector is a tool for annotating ground control points in two images that need to be matched. By computing the homography from these points and projecting one image onto the other, the average reprojection error of the GCPs can be calculated as a measure of the quality of a feature-matching algorithm.
The user can choose to resize the first image and then select GCPs in both images simultaneously. Resizing helps because registration between two images that differ greatly in the amount of data (resolution) can be very difficult; applying a Gaussian blur and resizing reduces and summarises the image information and usually gives better matching results. If you choose to resize images before registration, remember to input W and H to resize the image when selecting GCPs as well.
This is how I choose the GaussianBlur kernel size (the square is written as ** in Python, not ^):

KERNEL = (WIDTH_1 // WIDTH_2) ** 2  # WIDTH_1 > WIDTH_2
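A minimal sketch of this preprocessing step with OpenCV, assuming the larger (sensed) image is blurred and shrunk towards the reference image's size; the file names are hypothetical, and the kernel is bumped to the nearest odd value because cv2.GaussianBlur requires an odd kernel size:

```python
import cv2

# Hypothetical inputs: a large drone image and a smaller reference map.
sensed = cv2.imread("EV_001.JPG")
reference = cv2.imread("google_static_map.png")

width_1 = sensed.shape[1]     # WIDTH_1 (larger image)
width_2 = reference.shape[1]  # WIDTH_2 (smaller image)

# Kernel size from the squared ratio of the image widths (see formula above).
kernel = (width_1 // width_2) ** 2
kernel = kernel if kernel % 2 == 1 else kernel + 1  # GaussianBlur needs an odd kernel size

# Blur away fine detail the reference map cannot contain, then resize to match it.
blurred = cv2.GaussianBlur(sensed, (kernel, kernel), 0)
resized = cv2.resize(blurred, (reference.shape[1], reference.shape[0]))
cv2.imwrite("EV_001_resized.JPG", resized)
```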
The following is a sample GCP.cfg file:
# SRC<im_num>,im_name (im_num can be 1 or 2.)
SRC1,EV_001.JPG
SRC2,EV_002.JPG
# GCP,im_num,x,y (im_num can be 1 or 2. Order of im2 GCPs must be the same as those for im1.)
GCP,1,370,218
GCP,1,359,968
GCP,1,683,615
GCP,2,329,191
GCP,2,299,1052
GCP,2,678,660
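Given such a file, the annotated point pairs can be used to estimate a homography and the average reprojection error. The evaluation code in the project may differ; the sketch below just parses the GCP lines of a file in this format and measures the error with OpenCV (note that cv2.findHomography needs at least four point pairs, while the sample above shows only three):

```python
import cv2
import numpy as np

def load_gcps(cfg_path):
    """Parse the GCP lines of a GCP.cfg file into two corresponding point arrays."""
    pts = {1: [], 2: []}
    with open(cfg_path) as f:
        for line in f:
            parts = line.strip().split(",")
            if parts and parts[0] == "GCP":
                pts[int(parts[1])].append([float(parts[2]), float(parts[3])])
    return np.float32(pts[1]), np.float32(pts[2])

pts1, pts2 = load_gcps("GCP.cfg")

# Homography mapping image 1 onto image 2, estimated from the annotated GCPs.
H, _ = cv2.findHomography(pts1, pts2)

# Project the image-1 GCPs with H and measure their distance to the image-2 GCPs.
projected = cv2.perspectiveTransform(pts1.reshape(-1, 1, 2), H).reshape(-1, 2)
errors = np.linalg.norm(projected - pts2, axis=1)
print("mean reprojection error (pixels):", errors.mean())
```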
- Install Python 3.7 and the required packages:
pip install -r requirements.txt
- Apply for a Google/Bing static map API key (Google is not free, but a free trial can be used for a couple of months).
- Modify config.py:
# ==================== File Paths =================================================
# ADD ABSOLUTE PATHS FOR THE FOLLOWING FILES
CHECKER_BOARD_PATH = r'PATH'
RAW_IMAGE_FILE = r'PATH'
CALIBRATED_FILE = r'PATH'
GOOGLE_STATIC_MAP_FILE = r'PATH'
GOOGLE_ADVANCE_STATIC_MAP_FILE = r'PATH'
BING_ADVANCE_STATIC_MAP_FILE = r'PATH'
# ==================== Calibration model path =====================================
# ADD THE ABSOLUTE PATH FOR THE FOLLOWING FILE
MODEL_NAME = r'PATH'
# ==================== API KEY FOR GOOGLE MAP =====================================
GOOGLE_API_KEY = "INPUT YOUR API KEY HERE"
GOOGLE_URL = "http://maps.googleapis.com/maps/api/staticmap?maptype=satellite"
# ==================== API KEY FOR BING MAP =======================================
BING_API_KEY = "INPUT YOUR API KEY HERE"
BING_URL = "https://dev.virtualearth.net/REST/v1/Imagery/Map/Aerial/"
# ==================== EXIF data that you want to write ===========================
ARTIST = 'MOCHUAN ZHAN'
COPYRIGHT = 'THE UNIVERSITY OF MANCHESTER'
USER_COMMENT = 'CALIBRATED'
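With these values in place, map.py builds its requests from GOOGLE_URL/BING_URL. For the Google Static Maps API, such a request roughly looks like the sketch below, using the requests package and a hypothetical GPS coordinate (the real program derives the centre and zoom from each image's EXIF GPS data):

```python
import requests

from config import GOOGLE_API_KEY, GOOGLE_URL

# Hypothetical map centre; the project reads it from a drone image's EXIF GPS tags.
lat, lon = 53.4668, -2.2339

params = {
    "center": f"{lat},{lon}",  # map centre in "lat,lon" form
    "zoom": 18,                # zoom level (assumed value)
    "size": "640x640",         # image size in pixels
    "key": GOOGLE_API_KEY,
}
response = requests.get(GOOGLE_URL, params=params)  # maptype=satellite is already in GOOGLE_URL
with open("google_static_map.png", "wb") as f:
    f.write(response.content)
```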
- Run main.py
from calibration import Calibration
from config import *
from map import Map
from myexif import copy_exif, modify_exif

if __name__ == "__main__":
    # Tag the raw images before calibration.
    print("add information to raw images!")
    modify_exif(RAW_IMAGE_FILE, {'artist': ARTIST, 'copyright': COPYRIGHT, 'user_comment': 'RAW IMAGE'})
    # Build the calibration model from the checkerboard images and undistort the raw images.
    calib = Calibration()
    print("start create calibrate model!")
    calib.create_model(WIDTH, HEIGHT, CHECKER_BOARD_PATH)
    print("start undistort!")
    calib.undistort(RAW_IMAGE_FILE, CALIBRATED_FILE)
    # Carry the EXIF data (including GPS) over to the calibrated images and tag them.
    print("copy exif to calibrated images!")
    copy_exif(RAW_IMAGE_FILE, CALIBRATED_FILE)
    modify_exif(CALIBRATED_FILE, {'artist': ARTIST, 'copyright': COPYRIGHT, 'user_comment': USER_COMMENT})
    # Download the static maps that correspond to the calibrated images' GPS positions.
    print("get static google map!")
    static_map = Map(CALIBRATED_FILE, GOOGLE_STATIC_MAP_FILE)
    static_map.get_maps(advance=0)
    print("get advance static google map")
    ad_map = Map(CALIBRATED_FILE, GOOGLE_ADVANCE_STATIC_MAP_FILE, 'google')
    ad_map.get_maps(advance=1)
    print("get advance static bing map")
    ad_map = Map(CALIBRATED_FILE, BING_ADVANCE_STATIC_MAP_FILE, 'bing')
    ad_map.get_maps(advance=1)
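The Calibration class wraps the standard OpenCV checkerboard calibration. A minimal sketch of the underlying steps (not the project's exact implementation; the 9x6 pattern size and the file paths are placeholders):

```python
import glob

import cv2
import numpy as np

pattern = (9, 6)  # inner-corner count of the checkerboard (placeholder values)

# 3D coordinates of the pattern corners in the board's own plane (z = 0).
objp = np.zeros((pattern[0] * pattern[1], 3), np.float32)
objp[:, :2] = np.mgrid[0:pattern[0], 0:pattern[1]].T.reshape(-1, 2)

obj_points, img_points = [], []
for path in glob.glob("checkboard_images/*.JPG"):
    gray = cv2.cvtColor(cv2.imread(path), cv2.COLOR_BGR2GRAY)
    found, corners = cv2.findChessboardCorners(gray, pattern)
    if found:
        obj_points.append(objp)
        img_points.append(corners)

# Estimate the intrinsic matrix and distortion coefficients, then undistort one image.
_, mtx, dist, _, _ = cv2.calibrateCamera(obj_points, img_points, gray.shape[::-1], None, None)
undistorted = cv2.undistort(cv2.imread("raw_images/EV_001.JPG"), mtx, dist)
cv2.imwrite("calibrated_images/EV_001.JPG", undistorted)
```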