Skip to content

Commit 53d2fd8

Browse files
committed
ENH update 06_Compare_with_other_datasets
1 parent 073dab7 commit 53d2fd8

File tree

2 files changed

+4
-14
lines changed

2 files changed

+4
-14
lines changed

General_Scripts/06_Compare_with_other_datasets/02_filter_sp_dedup.py

Lines changed: 3 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -11,11 +11,7 @@ def filter_dedup(infile,outfile):
1111
out.write(f'>{header}\n{seq}\n')
1212
seqset.add(seq)
1313

14-
infile1 = '~/refseq/arc/refseq_arc.faa.gz'
15-
outfile1 = '~/refseq/arc/refseq_sp_arc_dedup.faa'
14+
infile1 = 'database.faa'
15+
outfile1 = 'database_dedup.faa'
1616

17-
infile2 = '~/refseq/bac/refseq_bac.faa.gz'
18-
outfile2 = '~/refseq/bac/refseq_sp_bac_dedup.faa'
19-
20-
filter_dedup(infile1,outfile1)
21-
filter_dedup(infile2,outfile2)
17+
filter_dedup(infile1,outfile1)

General_Scripts/06_Compare_with_other_datasets/03_align.sh

Lines changed: 1 addition & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -3,10 +3,4 @@
33
# Concept:
44
# Use Diamond to align sequences to GMSC
55

6-
diamond blastp -q refseq_sp_arc_dedup.faa -d 100AA_GMSC.dmnd -o refseq_arc.tsv --more-sensitive -e 0.00001 --outfmt 6 qseqid sseqid full_qseq full_sseq qlen slen pident length evalue qcovhsp scovhsp -p 64
7-
8-
diamond blastp -q refseq_sp_bac_dedup.faa -d 100AA_GMSC.dmnd -o refseq_bac.tsv --more-sensitive -e 0.00001 --outfmt 6 qseqid sseqid full_qseq full_sseq qlen slen pident length evalue qcovhsp scovhsp -p 64
9-
10-
diamond blastp -q cluster.faa -d 100AA_GMSC.dmnd -o cluster.tsv --more-sensitive -e 0.00001 --outfmt 6 qseqid sseqid full_qseq full_sseq qlen slen pident length evalue qcovhsp scovhsp -p 64
11-
12-
diamond blastp -q family.faa -d 100AA_GMSC.dmnd -o family.tsv --more-sensitive -e 0.00001 --outfmt 6 qseqid sseqid full_qseq full_sseq qlen slen pident length evalue qcovhsp scovhsp -p 64
6+
diamond blastp -q database_dedup.faa -d 100AA_GMSC.dmnd -o database.tsv --more-sensitive -e 0.00001 --outfmt 6 qseqid sseqid full_qseq full_sseq qlen slen pident length evalue qcovhsp scovhsp -p 64

0 commit comments

Comments
 (0)