Skip to content

code for parsing and visualizing abricate results#125

Open
stephenturner wants to merge 1 commit intomasterfrom
stephenturner-patch-1
Open

code for parsing and visualizing abricate results#125
stephenturner wants to merge 1 commit intomasterfrom
stephenturner-patch-1

Conversation

@stephenturner
Copy link

  • parse filename to extract assembler, trimming, dataset
  • reduce coverage across a gene, with an example toy problem illustrating procedure
  • drop parsed and reduced results
  • visualize percent coverage of samples faceted by gene

files:

  • original dataset
  • parsed/reduced dataset
  • rmarkdown with code
  • compiled html

@kternus
Copy link
Collaborator

kternus commented Jul 27, 2018

I thought this code was helpful to visualize the ABRicate results. In particular, @stephenturner implemented a way to get the consensus coverage information for genes with overlapping intervals in the ABRicate output, and he parsed the files by dataset, assembler, and trim value. This might be a future idea for visualizing results in the antibiotic resistance Jupyter notebook, or it could be a standalone script for data visualization. The dotted line shows the 90% coverage threshold, which is the default for SRST2 gene detection.

@stephenturner
Copy link
Author

not sure what kind of capability python has for doing this. i had to treat the coverage intervals as genomic ranges and use plyranges (a dplyr-like interface for manipulating ranges) to do a grouped interval reduction.

@kternus
Copy link
Collaborator

kternus commented Jul 27, 2018

I don't know either, but I appreciate you sharing this! It looks much better than what I was trying to sketch out as a visualization idea.

@stephenturner
Copy link
Author

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants