You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: bin/bioaln
+8-2Lines changed: 8 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -59,6 +59,7 @@ GetOptions(
59
59
"select-third",
60
60
"shuffle-sites|S",
61
61
"slice|s=s",
62
+
"slice-orfs=s",
62
63
"split-cdhit=s",
63
64
"trim-ends",
64
65
"uniq|u",
@@ -416,7 +417,7 @@ Generate an alignment of every-third (mostly synonymous) bases (assuming a CDS a
416
417
417
418
Make a shuffled (not bootstrapped, which is sampling I<with> replacement) alignment. This operation I<permutes> alignment columns. It is used for testing the significance of long-runs of conserved sites in an alignment (e.g., conserved intergenic spacer sequences).
418
419
419
-
=item --slice, -s 'start|-,end|-'; or -s"file:orfs.tsv"
420
+
=item --slice, -s 'start|-,end|-'
420
421
421
422
Get a slice of the alignment.
422
423
@@ -425,10 +426,15 @@ Using a '-' character in the first or second position defaults to the beginning
425
426
--slice '20,80' or --slice '20,80' or -s='20,80' or --slice='20,80': Slice from position 20 to 80, inclusive.
426
427
--slice '-,80': Slice from beginning up to, and including position 80
427
428
--slice '20,-': Slice from position 20 up to, and including, the end of the alignment
428
-
--slice 'file:orfs.tsv': slice using coordinates from a file. The file should contain 4 tab/space-delimited columns: locus_tag (no space), start (numerical), end (numerical), strand(0/1)
429
429
430
430
NOTE: --slice'-,x' (where x is '-' or a position) without a space does NOT work. Use --slice='-,x' (or a space in place of =) instead.
431
431
432
+
=item --slice-orfs orfs.tsv
433
+
434
+
Get slices of the alignment based on an input interval file. The file should contain 4 tab/space-delimited columns: locus_tag (no space), start (numerical), end (numerical), strand(negative strand marked as 0, -1, or "-").
435
+
436
+
Each line of the interval file would generate a single alignment file. This method is designed with a gff file in mind, each line defines an ORF location. The input file is an in-frame whole-genome alignment originating from a VCF file made with a reference genome. This method would (ideally) split the whole-genome alignment into ORF-by-ORF in-frame alignments.
437
+
432
438
=item --split-cdhit <cdhit clrs file>
433
439
434
440
Generate alignment for each CDHIT family (based on .clrs file). Ignore if you don't use cdhit for family clustering.
0 commit comments