This is a project that compares three reference databases for the number of unique and shared genera present. Bacteria is classified taxonomically ranging from Phylum (high level) to species, but most classifications using 16S data classify at the level of genus.
The three databases analyzed were Greengenes, Silva, and Human Oral Microbiome Database and the links embedding into the python notebook were accessed on 3/2020.
The main project python notebook is listed in "final-project.ipynb" Data is housed in the data_input folder Any images that were used in the markdown are saved in the img folder Extra code or code that errored was stored in a seperate python notebook called "Error_extra_commands.ipynb"
The code in this repository is released under the MIT license. Read more at the Open Source Initiative.