camilla-eldridge/SNP-Ranger

SNP-Ranger

SNP-Ranger is a Python tool for functional SNP analysis.

Overview

Given a GBK annotation file and the output from the SNP exporter tool in Mauve, SNP-Ranger identifies whether each SNP falls within the range of an annotated feature.

This method streamlines the process of identifying functionally relevant SNPs while reducing the likelihood of reporting SNPs from contaminants.
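The range check described above can be illustrated with a minimal sketch. This is not SNP-Ranger's own code: the `Feature` type, the `features_containing` helper, and the coordinates are all hypothetical, and feature ranges are assumed to be 1-based and inclusive.

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    """Hypothetical record for one annotated feature."""
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive
    kind: str   # e.g. "CDS", "tRNA", "rRNA"
    label: str  # e.g. a product name


def features_containing(position: int, features: List[Feature]) -> List[Feature]:
    """Return every feature whose range includes the SNP position."""
    return [f for f in features if f.start <= position <= f.end]


# Made-up annotations, for illustration only.
features = [
    Feature(100, 400, "CDS", "hypothetical protein"),
    Feature(350, 420, "tRNA", "tRNA-Ala"),
    Feature(900, 1200, "rRNA", "16S ribosomal RNA"),
]

for snp_position in (375, 500, 1000):
    hits = features_containing(snp_position, features)
    labels = [f.label for f in hits] or ["no annotated feature"]
    print(snp_position, labels)
```

A linear scan like this is adequate for a handful of SNPs; for large SNP sets, sorting the features and using a binary search or an interval tree would be a reasonable alternative.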

Usage

SNP-Ranger requires two inputs:

  • The reference (or first) GBK file used for progressiveMauve alignment.
  • The SNP export file generated by Mauve.
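To give a feel for what the GBK input contains, the sketch below pulls simple feature ranges out of a GenBank feature table using the standard-library `re` module. It is an illustration only and not how SNP-Ranger itself parses the file: the `simple_feature_ranges` helper is hypothetical, the sample text is made up, and compound locations such as `join(...)` are deliberately ignored.

```python
import re
from typing import List, Tuple

# Feature keys start after five spaces in a GenBank feature table.
# Only simple "start..end" and "complement(start..end)" locations are handled.
FEATURE_LINE = re.compile(
    r"^ {5}(?P<kind>CDS|tRNA|rRNA) +"
    r"(?P<complement>complement\()?"
    r"<?(?P<start>\d+)\.\.>?(?P<end>\d+)\)?[ \t]*$",
    re.MULTILINE,
)


def simple_feature_ranges(gbk_text: str) -> List[Tuple[str, int, int, str]]:
    """Return (kind, start, end, strand) for each simple CDS/tRNA/rRNA line."""
    ranges = []
    for match in FEATURE_LINE.finditer(gbk_text):
        strand = "-" if match.group("complement") else "+"
        ranges.append(
            (match.group("kind"), int(match.group("start")),
             int(match.group("end")), strand)
        )
    return ranges


# Made-up excerpt of a feature table, for illustration only.
sample = """\
FEATURES             Location/Qualifiers
     source          1..5000
     CDS             100..400
                     /product="hypothetical protein"
     tRNA            complement(450..520)
                     /product="tRNA-Ala"
"""

print(simple_feature_ranges(sample))
```

For real data, a dedicated GenBank parser is likely to be more robust than a regular expression, since it also copes with joined and partial locations.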

Output

SNP-Ranger generates a plain text output listing SNPs along with their parent nodes, relative positions, and annotations.

It excludes SNPs outside the range of annotated tRNA, rRNA, or CDS regions. If an SNP is found in an unannotated region, the script will notify the user for further investigation.
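The filtering behaviour described above could look roughly like the following sketch. It is a hedged illustration, not the tool's implementation: `partition_snps`, `REPORTED_KINDS`, and the example coordinates are hypothetical, and the warning text is invented for the example.

```python
import sys
from typing import Dict, List, Tuple

# Feature types the README says are reported.
REPORTED_KINDS = {"tRNA", "rRNA", "CDS"}

# Hypothetical feature representation: (start, end, kind), inclusive range.
FeatureRange = Tuple[int, int, str]


def partition_snps(
    positions: List[int], features: List[FeatureRange]
) -> Tuple[Dict[int, List[FeatureRange]], List[int]]:
    """Split SNP positions into those inside a reported feature and the rest."""
    reported: Dict[int, List[FeatureRange]] = {}
    unannotated: List[int] = []
    for pos in positions:
        hits = [
            f for f in features
            if f[2] in REPORTED_KINDS and f[0] <= pos <= f[1]
        ]
        if hits:
            reported[pos] = hits
        else:
            unannotated.append(pos)
    return reported, unannotated


# Made-up data: one feature type that is not reported, one CDS.
features = [(10, 50, "repeat_region"), (100, 200, "CDS")]
reported, unannotated = partition_snps([30, 150, 500], features)

for pos, hits in reported.items():
    print(pos, [kind for _, _, kind in hits])
for pos in unannotated:
    # Flag positions outside tRNA/rRNA/CDS ranges for manual follow-up.
    print(f"SNP at {pos} is outside annotated tRNA/rRNA/CDS ranges", file=sys.stderr)
```

Writing the notifications to standard error keeps them out of any file the main listing is redirected to; whether SNP-Ranger does this is not stated in this README.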

Running the tool

The tool can be run through a graphical interface (GUI) as follows:

  1. Clone this repo:

      git clone https://github.com/camilla-eldridge/SNP-Ranger.git

  2. Navigate into the repo directory and run:

       python gui.py

  3. Use the Browse feature to find your input files. Remember to select the first GBK file that was selected during the progressiveMauve alignment.

  4. Click on Find my SNPs!

Example run

Once run, you should see text output indicating which SNPs were located within an annotated range.

[Screenshot: example SNP-Ranger output]

Command-line use

The snp_ranger script can also be run from the command line:

snp_ranger.py Exported_SNPS.txt first_gbk_file > snps.txt  

Dependencies

The following Python 3 modules are required:

  • sys
  • regex
  • typing
  • tkinter
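One quick way to confirm that the modules listed above can be found in your environment is sketched below. The `missing_modules` helper is hypothetical and not part of SNP-Ranger; it relies on `importlib.util.find_spec`, which only checks that a module can be located, not that it imports without error.

```python
import importlib.util
from typing import Iterable, List


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the names that cannot be located in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Module names taken from the dependency list above.
required = ["sys", "regex", "typing", "tkinter"]
missing = missing_modules(required)

if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All listed modules are importable.")
```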

Additional Notes

  • This script was tested using GBK files generated by Prokka v1.7.
  • Ensure the alignment is correctly interpreted: SNPs may arise from annotated regions of contaminant species aligning, or from misannotations between the reference and target genomes.

License

This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). You are free to share and adapt this code, provided you credit the original author and indicate any modifications made.
