Kaggle IEEE's Signal Processing Society - Camera Model Identification

Implementation of the camera model identification system by team "[ods.ai] GPU_muscles", which placed 2nd overall and 1st among student-eligible teams in the Kaggle competition IEEE's Signal Processing Society - Camera Model Identification.

If you have any questions about the solution, feel free to contact me on Telegram or by e-mail at ikibardin@gmail.com.

Our team

Artur Kuzin [linkedin]
Valeriy Babushkin [linkedin]
Artur Fattakhov [kaggle]
Ilya Kibardin [linkedin]
Andrey Kiselev [linkedin]

Requirements

To train models and get predictions the following is required:

OS: Ubuntu 16.04
Python 3.6
Hardware:
- Any modern computer with an x86-64 CPU
- 32 GB RAM
- 4 x Nvidia GeForce GTX 1080 Ti

Installation

Install the required OS and Python version.
Install the packages with pip install -r requirements.txt
Create a data folder at the root of the repository and populate it:
- Place the train dataset from the Kaggle competition in data/train.
- Place the test dataset from the Kaggle competition in data/test.
- Place the additional validation images in data/val_images.
Place se_resnet50.pth and se_resnext50.pth in the imagenet_pretrain folder.
Place the following final weights in the final_weights folder:
- densenet161_28_0.08377413648371115.pth
- densenet161_55_0.08159203971706519.pth
- densenet161_45_0.0813179751742137.pth
- dpn92_tune_11_0.1398952918197271.pth
- dpn92_tune_23_0.12260739478774665.pth
- dpn92_tune_29_0.14363511492280367.pth
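
Before running anything, it can help to confirm that the layout described above is in place. The following is a minimal sketch, not part of the repository: the function name missing_paths is hypothetical, and the paths are copied from the installation steps.

```python
from pathlib import Path

# Paths copied from the installation steps; the check itself is an
# illustrative helper and is not shipped with the repository.
REQUIRED_DIRS = ["data/train", "data/test", "data/val_images"]
REQUIRED_FILES = [
    "imagenet_pretrain/se_resnet50.pth",
    "imagenet_pretrain/se_resnext50.pth",
    "final_weights/densenet161_28_0.08377413648371115.pth",
    "final_weights/densenet161_55_0.08159203971706519.pth",
    "final_weights/densenet161_45_0.0813179751742137.pth",
    "final_weights/dpn92_tune_11_0.1398952918197271.pth",
    "final_weights/dpn92_tune_23_0.12260739478774665.pth",
    "final_weights/dpn92_tune_29_0.14363511492280367.pth",
]


def missing_paths(repo_root):
    """Return the required folders and files that are absent under repo_root."""
    root = Path(repo_root)
    missing = [d for d in REQUIRED_DIRS if not (root / d).is_dir()]
    missing += [f for f in REQUIRED_FILES if not (root / f).is_file()]
    return missing


if __name__ == "__main__":
    for path in missing_paths("."):
        print("missing:", path)
```

Run from the repository root, the script prints one line per missing folder or file and prints nothing when the layout is complete.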

Producing the final submission

Run bash final_submit.sh -d <folder with test images> -o <output .csv filename>
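
The README does not describe what final_submit.sh does internally. As a rough illustration of how an ensemble can turn per-model outputs into a submission file, the sketch below averages class probabilities across models and writes the most likely class per image. Everything in it is an assumption: the function names, the input format, the class names, and the fname/camera column headers are hypothetical and should be checked against the competition's sample submission.

```python
import csv


def average_probabilities(per_model_probs):
    """Average class probabilities over models.

    per_model_probs: one dict per model, each mapping a file name to a
    list of class probabilities (same files and class order in every dict).
    """
    averaged = {}
    for name in sorted(per_model_probs[0]):
        rows = [probs[name] for probs in per_model_probs]
        averaged[name] = [sum(col) / len(rows) for col in zip(*rows)]
    return averaged


def write_submission(averaged, class_names, out_path):
    """Write one row per image with the highest-probability class."""
    with open(out_path, "w", newline="") as f:
        writer = csv.writer(f)
        # Column names are assumed, not taken from the repository.
        writer.writerow(["fname", "camera"])
        for name, probs in averaged.items():
            best = max(range(len(probs)), key=probs.__getitem__)
            writer.writerow([name, class_names[best]])
```

Plain averaging is only one way to combine models; weighted averaging or voting are common alternatives, and the authors' script may use any of them.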

Training ensemble from scratch

This section describes the steps required to train our ensemble.

1. Download external dataset

Images from both Yandex.Fotki and Flickr are essential for reproducing our solution.

Downloading images from Yandex.Fotki

Run bash download_from_yandex.sh

Downloading images from Flickr

Unfortunately, this step involves some manual actions.

Change into the downloader/flickr directory.
For every phone model, open its group page listed in flickr_groups.txt. Scroll each gallery page to the end and save it as an .html file in the corresponding folder. As a result, the html_pages folder will contain one folder of .html files per phone model.
Run python pages_to_image_links.py. The script produces a links folder containing .csv files with links to the photos of each phone model.
Run python download_from_links.py to download the images from these links. The previous two steps can be skipped, because the links folder in the repository already contains the necessary files.
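
To give an idea of what converting saved gallery pages into image links involves, here is a minimal sketch using only the Python standard library. It is not the code of pages_to_image_links.py: the class and function names are hypothetical, and it assumes the photo URLs sit in the src attribute of img tags, which may not match how Flickr pages are actually structured.

```python
from html.parser import HTMLParser


class ImageLinkParser(HTMLParser):
    """Collect the src attribute of every <img> tag, without duplicates."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        src = dict(attrs).get("src")
        if src and src not in self.links:
            self.links.append(src)


def extract_image_links(html_text):
    """Return the image URLs found in one saved HTML page, in page order."""
    parser = ImageLinkParser()
    parser.feed(html_text)
    return parser.links
```

Applying extract_image_links to the text of each saved page and writing the results to one .csv file per phone model would give a links folder similar in shape to the one described above.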

2. Filter external dataset

Run bash filter.sh

3. Train the ensemble

Download and filter external dataset as described above.
Run bash init_train.sh to train 9 models.
Run bash make_pseudo.sh to get predictions from these models for the images in data/test and create pseudo labels.
Run bash final_train.sh to train the same 9 models again, this time using the pseudo labels.
Run bash predict.sh -d <folder with test images> -o <output .csv filename> to get predictions from the ensemble.
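
The pseudo-labeling step above can be sketched as follows. This is a generic illustration of the technique, not the logic of make_pseudo.sh: the function name, the input format, the class names, and the confidence threshold are all assumptions, and the README does not say whether the authors filter predictions by confidence at all.

```python
def make_pseudo_labels(probabilities, class_names, threshold=0.9):
    """Turn model predictions on unlabeled images into training labels.

    probabilities: dict mapping a file name to a list of class probabilities.
    Only predictions whose top probability reaches the (assumed) threshold
    are kept, so uncertain images are left out of the pseudo-labeled set.
    """
    pseudo = {}
    for name, probs in probabilities.items():
        best = max(range(len(probs)), key=probs.__getitem__)
        if probs[best] >= threshold:
            pseudo[name] = class_names[best]
    return pseudo
```

For example, with hypothetical classes camera_a and camera_b, an image predicted as [0.95, 0.05] would be labeled camera_a, while one predicted as [0.6, 0.4] would be skipped. The resulting labels are then added to the training set for the second training round.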
