Skip to content

Commit 063378c

Browse files
committed
v3.3.1
1 parent 6fbd2fe commit 063378c

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

64 files changed

+2669
-1439
lines changed

doc/Release.html

Lines changed: 55 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,61 @@
66
<a id=top>
77
<!--#include virtual="./ssi/start1.html" -->
88

9+
<h4>v3.3.1 1-Sept-2021</h4>
910

11+
The singleTCW database has a small schema update that will be applied the first time an existing
12+
database is viewed.
13+
14+
<p><ttp>runSingleTCW</ttp> -- Annotate
15+
<ul>
16+
<li>sTCWdb version sdb6.0: The percent similarity (identity) is stored as a real number instead of an integer.
17+
18+
<li><font color=green>NEW</font> Prune Hits - there are many hits with the exact same coordinates and/or descriptions.
19+
A new function removes all but the best based on same alignment values or same description.
20+
This function can be set to run in the AnnoDBs <ttl>Options</ttl> or from the command line.
21+
<li>Tiny changes;
22+
<ul>
23+
24+
25+
<li>DIAMOND - removed '--max-hsps 1' as TCW defaults because it is a DIAMOND default.
26+
Removed '--top 20' as this misses some good descriptions.
27+
<li>TCW was only loading 25 hits per annoDB. This restriction has been removed since the user
28+
can set the limit in the search programs parameters.
29+
<li>This really isn't a problem, but I fixed it any way as it could be confusing: If there were multiple hits with the
30+
same bit_score and E-value, it was using the one with the highest rank for the "Best Bit" assignment,
31+
which was changed to lowest.
32+
<li>Tiny bug fix in Multi-Frame: If a sequence had an NT hit, it was typically incorrectly marked as 'Multi-frame'.
33+
</ul>
34+
</ul>
35+
<ttp>viewSingleTCW</ttp> - tiny adjustments
36+
<ul>
37+
<li>Sequence Detail:
38+
<ul>
39+
<li>A new <ttl>Show</ttl> button shows all columns for a selected hit (the Hit Table
40+
only provides a subset of all columns).
41+
<li>The Hit Table now sorts on bit-score, e-value, %sim in that order (the addition of %sim is new).
42+
</ul>
43+
<li>Basic Hit:
44+
<ul>
45+
<li>The Bit-score has been added to the columns.
46+
<li>DE values of "-" will sort to the bottom of the table.
47+
</ul>
48+
49+
<li>A reading frame was being assigned to an NT hit -- it now has a value of "-".
50+
<li>Tiny bug fix in alignments: In rare instance, aligning an NT-NT following by a NT-AA could create a bogus NT-AA alignment.
51+
</ul>
52+
53+
<ttp>viewMultiTCW</ttp>
54+
<ul>
55+
<li>Sequence Table: DE values of "-" will sort to the bottom of the table and the sort will ignore
56+
the minus sign before an DE value (as <ttp>viewSingleTCW</ttp> does).
57+
</ul>
58+
59+
<ttl>Software details</ttl>
60+
<ul>
61+
<li>The Best Bits, etc assignment was moved from DoUniProt.java to DoUniAssign.java
62+
</ul>
63+
<!-------------------------------------------------------------------->
1064
<h4>v3.3.0 15-August-2021</h4>
1165
<i>Update existing multiTCW databases for this release</i>:
1266
There is a small mTCWdb schema change, which will be updated the first time you access the database. However,
@@ -154,7 +208,7 @@ <h4>v3.2.6 6-July-21</h4>
154208
on a selected DE p-value, it will overwrite an existing column.
155209
<li>The DE and GO p-value columns will be displayed in <ttp>viewSingleTCW</ttp> in the order that
156210
the DE p-value columns are created (only guaranteed if created with v3.2.6 or later).
157-
<li>The online <a href=stcw/DiffExp.html>DE User Manual</a> was significantly updated.
211+
<li>The online <a href="stcw/DiffExp.html">DE User Manual</a> was significantly updated.
158212
</ul>
159213

160214
<ttp>runSingleTCW</ttp> - <b>Overview</b>

doc/Summary.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@
1818
</ul>
1919
<a id=task></a>
2020
<h2>Programs</h2>
21-
<a href="img/bullet2.gif"><img src="img/bullet2.gif" alt="bullet" style="border: 0px; width: 20px"></a>
21+
<img src="img/bullet2.gif" alt="bullet" style="border: 0px; width: 20px">
2222
<i><u>singleTCW contains four major programs:</u></i>
2323
<table class="tablex">
2424
<tr>
@@ -84,7 +84,7 @@ <h2>Programs</h2>
8484
<td class="top"><a href="./tour/viewSingle/img/mainTable.png"><img src="./tour/viewSingle/img/mainTable.png" alt="" style="border: 1px solid black; width: 100px"></a>
8585
</table>
8686

87-
<a href="img/bullet2.gif"><img src="img/bullet2.gif" alt="bullet" style="border: 0px; width: 20px"></a>
87+
<img src="img/bullet2.gif" alt="bullet" style="border: 0px; width: 20px">
8888
<i><u>multiTCW contains two major programs</u></i>:
8989
<table class="tablex">
9090
<tr><td class="top">

doc/index.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -36,8 +36,8 @@
3636

3737
<tr> <td colspan=3>&nbsp;&nbsp;
3838
<tr><td colspan=3> Details</td></tr>
39-
<tr> <td>&nbsp;&nbsp;<td><a href="stcw/AnnoGuide.html">Updating annotation</a></td>
40-
<td>Removing, adding, updating annotation</td></tr>
39+
<tr> <td>&nbsp;&nbsp;<td><a href="stcw/AnnoGuide.html">Annotate Details</a></td>
40+
<td>Removing, adding, updating annotation; pruning hits</td></tr>
4141
<tr> <td>&nbsp;&nbsp;<td><a href="stcw/ORF.html">ORF finder</a></td>
4242
<td>Description of options, algorithm, outputs, and results. </td></tr>
4343
<tr> <td>&nbsp;&nbsp;<td><a href="stcw/AssemGuide.html">Assembly Details</a></td>

doc/ov/sDemo.html

Lines changed: 48 additions & 51 deletions
Original file line numberDiff line numberDiff line change
@@ -5,10 +5,10 @@
55
<h2>TCW overview for tra </h2>
66
<table width=700 border=1><tr><td>
77
<pre>
8-
Project: tra #Seqs: 211 #Hits: 9,584 #GOs: 5,294 TPM Seq-DE GO-Enrich Pairs
8+
Project: tra #Seqs: 211 #Hits: 12,267 #GOs: 6,651 TPM Seq-DE GO-Enrich Pairs
99

10-
12-Aug-21 Build Database sequences loaded from external source
11-
12-Aug-21 Last Annotation with sTCW v3.3.0
10+
26-Aug-21 Build Database sequences loaded from external source
11+
26-Aug-21 Last Annotation with sTCW v3.3.1
1212

1313
INPUT
1414
Counts:
@@ -23,41 +23,40 @@ <h2>TCW overview for tra </h2>
2323

2424
ANNOTATIONS
2525
Hit Statistics:
26-
Sequences with hits 210 (99.5%) Bases covered by hit 180,729 (79.4%)
27-
Unique hits 9,584 Total bases 227,683
28-
Total sequence hits 13,224
26+
Sequences with hits 210 (99.5%) Bases covered by hit 180,582 (79.3%)
27+
Unique hits 12,267 Total bases 227,683
28+
Total sequence hits 18,013
2929

30-
annoDBs (Annotation databases): 8 (see Legend below)
30+
annoDBs (Annotation databases): 7 (see Legend below)
3131
ANNODB ONLY BITS ANNO UNIQUE TOTAL AVG Rank HAS (%Seqs) AVG COVER COVER
3232
%SIM =1 HIT %SIM >=50 >=90
33-
SP-plants 0 0 0 781 1,311 65.3 | 193 (91.5%) 70.1 62.7% 3.6%
34-
SP-invertebrates 0 0 1 329 590 48.3 | 126 (59.7%) 47.6 30.2% 0.8%
35-
SP-fungi 0 0 0 640 1,067 46.7 | 118 (55.9%) 47.0 27.1% 0.8%
36-
SP-bacteria 0 0 0 659 944 44.9 | 63 (29.9%) 45.8 25.4% 0%
37-
SP-fullSubset 0 0 0 757 1,515 48.7 | 137 (64.9%) 48.7 33.6% 0.7%
38-
TR-plants 7 207 207 4,122 4,858 77.4 | 210 (99.5%) 81.8 68.6% 8.6%
39-
TR-invertebrates 0 3 2 2,165 2,701 52.3 | 179 (84.8%) 54.5 39.7% 2.8%
40-
TF-PlantTFDB 0 0 0 131 238 47.0 | 123 (58.3%) 51.2 7.3% 0%
41-
42-
Top 15 species from total: 1,312
33+
SP-plants 0 0 0 1,320 2,956 53.2 | 193 (91.5%) 69.6 61.1% 3.1%
34+
SP-invertebrates 0 0 0 610 1,252 42.0 | 126 (59.7%) 47.0 29.4% 0.8%
35+
SP-fungi 0 0 0 850 1,459 42.9 | 118 (55.9%) 46.4 24.6% 0.8%
36+
SP-bacteria 0 0 0 786 1,121 43.7 | 63 (29.9%) 45.4 25.4% 0%
37+
SP-fullSubset 0 0 0 1,261 2,291 44.2 | 137 (64.9%) 48.1 32.1% 0.7%
38+
TR-plants 9 207 207 4,535 5,217 75.5 | 210 (99.5%) 81.4 68.6% 8.6%
39+
TR-invertebrates 0 3 3 2,905 3,717 49.5 | 179 (84.8%) 54.0 39.1% 2.2%
40+
41+
Top 15 species from total: 1,508
4342
SPECIES (25 char) BITS ANNO TOTAL SPECIES BITS ANNO TOTAL
44-
Musa acuminata 73 53 325 Anthurium amnicola 2 4 93
45-
Musa balbisiana 62 45 312 Vitis vinifera 2 3 91
46-
Ensete ventricosum 19 14 199 Dendrobium catenatum 2 2 63
47-
Elaeis guineensis 10 24 344 Macleaya cordata 2 2 43
48-
Phoenix dactylifera 9 19 319 Nelumbo nucifera 1 3 95
49-
Zingiber officinale 7 7 24 Apostasia shenzhenica 1 2 59
50-
Ananas comosus 4 11 432 Ricinus communis 1 2 31
51-
Meloidogyne enterolobii 3 0 9 Other 12 19 10,785
43+
Musa acuminata 75 54 325 Anthurium amnicola 2 4 101
44+
Musa balbisiana 59 42 313 Vitis vinifera 2 3 89
45+
Ensete ventricosum 19 14 214 Apostasia shenzhenica 2 3 67
46+
Elaeis guineensis 10 24 347 Dendrobium catenatum 2 2 73
47+
Phoenix dactylifera 9 19 318 Macleaya cordata 2 2 41
48+
Zingiber officinale 7 7 27 Nelumbo nucifera 1 3 95
49+
Ananas comosus 4 11 458 Ricinus communis 1 2 43
50+
Meloidogyne enterolobii 3 0 10 Other 12 20 15,492
5251

5352
Gene Ontology Statistics:
54-
Unique GOs 5,294 Unique hits with GOs 8,124 (84.8%)
55-
Sequences with GOs 208 (98.6%) Seq best hit has GOs 194 (91.9%)
56-
Has goslim_plant 95
57-
58-
biological_process 3,858 (72.9%) is_a 8,481
59-
molecular_function 821 (15.5%) part_of 898
60-
cellular_component 615 (11.6%)
53+
Unique GOs 6,651 Unique hits with GOs 10,670 (87.0%)
54+
Sequences with GOs 208 (98.6%) Seq best hit has GOs 194 (91.9%)
55+
Has goslim_plant 95
56+
57+
biological_process 4,879 (73.4%) is_a 10,776
58+
molecular_function 1,026 (15.4%) part_of 1,176
59+
cellular_component 746 (11.2%)
6160

6261
EXPRESSION
6362
TPM: (% of 211)
@@ -72,11 +71,11 @@ <h2>TCW overview for tra </h2>
7271
RoOl 1(<1%) 16 (8%) 37(18%) 74(35%) 110(52%)
7372
StOl 0 (0%) 2 (1%) 12 (6%) 65(31%) 101(48%)
7473

75-
Gene ontology enrichment: (% of 5,294)
74+
Gene ontology enrichment: (% of 6,651)
7675
<1E-5 <1E-4 <0.001 <0.01 <0.05
77-
RoSt 0 (0%) 1(<1%) 3(<1%) 54 (1%) 289 (5%)
78-
RoOl 0 (0%) 0 (0%) 3(<1%) 20(<1%) 187 (4%)
79-
StOl 0 (0%) 0 (0%) 0 (0%) 0 (0%) 36 (1%)
76+
RoSt 0 (0%) 0 (0%) 1(<1%) 51 (1%) 309 (5%)
77+
RoOl 0 (0%) 0 (0%) 2(<1%) 22(<1%) 166 (2%)
78+
StOl 0 (0%) 0 (0%) 0 (0%) 1(<1%) 48 (1%)
8079

8180
SEQUENCES
8281
Sequence lengths:
@@ -98,10 +97,8 @@ <h2>TCW overview for tra </h2>
9897
Pos2 13.7% %GC 43.71 48.80 36.27 Length 21k 182k 25k
9998
Pos3 16.6% CpG-O/E 0.87 0.71 0.63 AvgLen 101.2 860.8 117.1
10099

101-
Similar pairs: 434
102-
Nucleotide 20
103-
Translated nucleotide 415
104-
Translated ORFs 369
100+
Similar pairs: 20
101+
Nucleotide 20
105102

106103
LOCATIONS
107104
Sequences with location: 12 unique locations: 12
@@ -114,17 +111,17 @@ <h2>TCW overview for tra </h2>
114111
-------------------------------------------------------------------
115112
PROCESSING INFORMATION:
116113
AnnoDB Files:
117-
Type Taxo FILE DB DATE ADD DATE EXECUTE
118-
sp plants uniprot_sprot_plants.fasta 27-Jan-21 12-Aug-21 diamond --max-hsps 1 --masking 0 --top 20
119-
sp invertebrates uniprot_sprot_invertebrates.fasta 27-Jan-21 12-Aug-21 diamond --max-hsps 1 --masking 0 --top 20
120-
sp fungi uniprot_sprot_fungi.fasta 27-Jan-21 12-Aug-21 diamond --max-hsps 1 --masking 0 --top 20
121-
sp bacteria uniprot_sprot_bacteria.fasta 27-Jan-21 12-Aug-21 diamond --max-hsps 1 --masking 0 --top 20
122-
sp fullSubset uniprot_sprot_fullSubset.fasta 27-Jan-21 12-Aug-21 diamond --max-hsps 1 --masking 0 --top 20
123-
tr plants uniprot_trembl_plants.fasta 27-Jan-21 12-Aug-21 diamond --max-hsps 1 --masking 0 --top 20
124-
tr invertebrates uniprot_trembl_invertebrates.fasta 27-Jan-21 12-Aug-21 diamond --max-hsps 1 --masking 0 --top 20
125-
tf PlantTFDB PlantTFDB_TCW.fa 25-Oct-20 12-Aug-21 diamond --max-hsps 1 --masking 0 --top 20
126-
127-
Gene Ontology: go-basic.obo-Feb2021 GOdb: go_demo [GOs added with sTCW v3.3.0]
114+
Type Taxo FILE DB DATE ADD DATE EXECUTE
115+
sp plants uniprot_sprot_plants.fasta 27-Jan-21 26-Aug-21 diamond --masking 0
116+
sp invertebrates uniprot_sprot_invertebrates.fasta 27-Jan-21 26-Aug-21 diamond --masking 0
117+
sp fungi uniprot_sprot_fungi.fasta 27-Jan-21 26-Aug-21 diamond --masking 0
118+
sp bacteria uniprot_sprot_bacteria.fasta 27-Jan-21 26-Aug-21 diamond --masking 0
119+
sp fullSubset uniprot_sprot_fullSubset.fasta 27-Jan-21 26-Aug-21 diamond --masking 0
120+
tr plants uniprot_trembl_plants.fasta 27-Jan-21 26-Aug-21 diamond --masking 0
121+
tr invertebrates uniprot_trembl_invertebrates.fasta 27-Jan-21 26-Aug-21 diamond --masking 0
122+
123+
124+
Gene Ontology: go-basic.obo-Feb2021 GOdb: go_demo [GOs added with sTCW v3.3.1]
128125
GO Slim: goslim_plant
129126

130127
ORF finder:

doc/stcw/AnnoDBs.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
<!DOCTYPE html>
22
<html>
3-
<!--#set var=TITLE value="Annotation Setup" -->
3+
<!--#set var=TITLE value="Annotation Setup and runAS" -->
44
<!--#include virtual="../ssi/head.html" -->
55
<body onload="init('set')">
66
<a id="top"></a>

0 commit comments

Comments
 (0)