Skip to content

Commit 7ef4a02

Browse files
committed
v4.0.0
1 parent 917f6c7 commit 7ef4a02

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

83 files changed

+1748
-1203
lines changed

doc/Install.html

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -62,9 +62,9 @@ <h2>Installation</h2>
6262

6363
<i><b><u>Step 1:</u></b></i> Decide where you want the main TCW directory to go, and untar the package in that location.
6464
<pre>
65-
tar -xvf TCW_3.tar.gz
65+
tar -xvf TCW_4.tar.gz
6666
</pre>
67-
The untar will create a <ttp>TCW_3</ttp> directory, with the following content:
67+
The untar will create a <ttp>TCW_4</ttp> directory, with the following content:
6868
<pre>
6969
Executables:
7070
execLoadLib execAssm execAnno
@@ -299,7 +299,7 @@ <h4>Blast and Diamond</h4>
299299
If you are going to use the <ttp>runSingleTCW</ttp> assembler, you MUST specify the full path of where the Blast executables reside,
300300
even if your going to use the default executables, e.g
301301
<pre>
302-
blast_path = /Users/cari/Workspace/TCW_3/Ext/mac/blast
302+
blast_path = /Users/cari/Workspace/TCW_4/Ext/mac/blast
303303
</pre>
304304
To verify what diamond and blast paths are being used, run
305305
<pre>

doc/Release.html

Lines changed: 28 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@
1313

1414
There have been many improvements, but the following provides some recent critical points.
1515
<table>
16-
<tr><td><a href=#v341>v3.4.2</a><td>08-Mar-22<td><ttp>Assorted<td>Make color of buttons consistent.
16+
<tr><td><a href=#v343>v4.0.0</a><td>22-Mar-22<td><ttp>Assorted<td>Tidy up somethings in order to start v4.
1717
<tr><td><a href=#v339>v3.3.9</a><td>27-Dec-21<td><ttp><ttp>runAS</ttp></ttp><td>Improved interface for full subset UniProt; updated Demo.
1818
<tr><td><a href=#v335>v3.3.5</a><td>04-Nov-21<td><ttp>viewSingle</ttp><td>log2FC TPM analysis
1919
<tr><td><a href=#v334>v3.3.4</a><td>18-Oct-21<td><ttp>runSingle</ttp><td><ttl>ORF finder</ttl> improvements
@@ -27,6 +27,33 @@
2727
<tr><td><a href=#early>Earlier</a><td>2020<td>&nbsp;<td>&nbsp;
2828
</table>
2929

30+
<a name=v343>
31+
<h4>v4.0.0 22-Mar-2022</h4>
32+
33+
<p>General
34+
<ul>
35+
<li>Documentation and Help updates - mostly for mTCW.
36+
</ul>
37+
38+
<p><ttp>runAS</ttp>
39+
<ul>
40+
<li>The <ttc>Check</ttc> function will write to the terminal the UniProts added to an existing goDB.
41+
For existing goDBs, they will be updated on first <ttc>Check</ttc> and stored in the goDB.
42+
</ul>
43+
44+
<p><ttp>viewMultiTCW</ttp>
45+
<ul>
46+
<li><ttc>Explain</ttc> on <ttc>Overview</ttc>: rewrote much of it for clarity.
47+
<li>A lot of little tidy up things.
48+
(1) <ttc>Overview</ttc>: removed "(Avg %Sim)" on AA and NT pairs. Slight change to KaKs counts.
49+
(2) <ttc>Pair Filter</ttc>: If KaKs set to be at least &gt;=0.0, got
50+
Kaks with no values or NA. If pairs&clusters removed, pairs added but no clusters, the
51+
pairs filter crashed.
52+
(3) <ttc>Sequence Table</ttc>: check to see if it is !NTonly database, if so, do
53+
not give options of Export CDS/NT sequence. If !GO, do not give option of exports GOs.
54+
<li><ttc>Details</ttc>: Add <ttc>Copy...</ttc> option to copy the displayed Detail sequence.
55+
</ul>
56+
3057
<a name=v342>
3158
<h4>v3.4.2 8-Mar-2022</h4>
3259
A few improvements and tiny little bug fixes.

doc/index.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,6 +11,7 @@
1111

1212
<tr><td colspan=3> Getting started</td></tr>
1313
<tr> <td>&nbsp;&nbsp;<td><a href="Summary.html">Summary</a> <td>A summary of the input, processing and suite of programs.
14+
<tr> <td>&nbsp;&nbsp;<td><a href="./tour">TCW Tour</a> </td> <td>Snapshots of different TCW screens</td></tr>
1415

1516
<tr> <td>&nbsp;&nbsp;<td><a href="Install.html">Installation</a><td>Installing TCW and other necessary and optional software.
1617
<tr> <td colspan=3>&nbsp;&nbsp;
@@ -46,7 +47,6 @@
4647
<tr> <td colspan=3>&nbsp;&nbsp;
4748
<tr><td colspan=3> Other docs</td></tr>
4849
<tr> <td>&nbsp;&nbsp;<td><a href="External.html">External Software</a></td><td>Packages used by TCW</td></tr>
49-
<tr> <td>&nbsp;&nbsp;<td><a href="./tour">TCW Tour</a> </td> <td>Snapshots of different TCW screens</td></tr>
5050

5151
<tr> <td>&nbsp;&nbsp;<td><a href="Mac-OSX.html">MacOS</a></td> <td>Installing MySQL, etc on MacOS</td></tr>
5252
<tr> <td>&nbsp;&nbsp;<td><a href="./Trouble.html">Trouble Shooting</a></td>

doc/mtcw/UserGuide.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -558,8 +558,8 @@ <h4>Details on running <ttp>KaKs_calculator</ttp></h4>
558558
called <ttx>runKaKs</ttx> which has the commands to run the <ttp>KaKs_calculator</ttp>
559559
on each file, e.g.
560560
<pre>
561-
/Users/cari/Workspace/TCW_3/Ext/mac/KaKs_Calculator -i oTCW1.awt -o iTCW1.tsv -m YN &amp;
562-
/Users/cari/Workspace/TCW_3/Ext/mac/KaKs_Calculator -i oTCW2.awt -o iTCW2.tsv -m YN &amp;
561+
/Users/cari/Workspace/TCW_4/Ext/mac/KaKs_Calculator -i oTCW1.awt -o iTCW1.tsv -m YN &amp;
562+
/Users/cari/Workspace/TCW_4/Ext/mac/KaKs_Calculator -i oTCW2.awt -o iTCW2.tsv -m YN &amp;
563563
</pre>
564564
The only reason the method "YN" is used here is because it is fastest; this should probably
565565
be changed. The default method is "MA", which will be used if you remove the "-m YN" on all lines.

doc/ov/mDemo.html

Lines changed: 13 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -1,24 +1,23 @@
11
<html><title>mTCW_ex overview</title><body><center>
22
<h2>mTCW overview for ex</h2>
3-
<table width=600 border=1><tr><td><pre>Project: ex Cluster: 1.3k Pairs: 846 Seqs: 707 Hits: 4.0k PCC Stats KaKs Multi
3+
<table width=600 border=1><tr><td><pre>Project: ex Cluster: 1.0k Pairs: 846 Seqs: 707 Hits: 4.0k GOs: 8.0k PCC Stats KaKs Multi
44

5-
Created: 28-Feb-22 v3.4.2
5+
Created: 15-Mar-22 v4.0.0 Last Update: 20-Mar-22 v4.0.0
66

77
DATASETS: 3
88
Type #Seq #annotated #annoDB Created Remark
9-
bar NT 250 250 3 25-Feb-22 exBar
10-
fly NT 207 207 3 13-Aug-21 exFly
11-
foo NT 250 250 3 13-Aug-21 exFoo
9+
bar NT 250 250 3 15-Mar-22 exBar
10+
fly NT 207 207 3 15-Mar-22 exFly
11+
foo NT 250 250 3 15-Mar-22 exFoo
1212

13-
CLUSTER SETS: 6
13+
CLUSTER SETS: 5
1414
Statistics
1515
Prefix Method conLen sdLen Score1 SD Score2 SD
1616
CL Closure 579.80 58.50 0.48 0.08 0.78 0.15
17-
OM orthoMCL.OM-4 606.37 85.26 0.45 0.10 0.75 0.16
17+
OM orthoMCL.OM-4 606.32 85.26 0.45 0.10 0.75 0.16
1818
B12 BBH bar,fly 540.22 62.63 0.50 0.07 0.72 0.18
1919
B13 BBH bar,foo 558.75 34.11 0.53 0.07 0.79 0.15
2020
B23 BBH fly,foo 559.57 70.82 0.47 0.08 0.80 0.21
21-
OM2 orthoMCL.OM-4 606.32 85.26 0.45 0.10 0.75 0.16
2221

2322
Sizes
2423
Prefix =2 =3 4-5 6-10 11-15 16-20 21-25 >25 Total #Seqs
@@ -27,12 +26,11 @@ <h2>mTCW overview for ex</h2>
2726
B12 157 0 0 0 0 0 0 0 157 44.4%
2827
B13 239 0 0 0 0 0 0 0 239 67.6%
2928
B23 166 0 0 0 0 0 0 0 166 47.0%
30-
OM2 50 186 3 3 0 0 0 0 242 98.0%
3129

3230
PAIRS: 846
3331
Hits
34-
AA Diff 753 Same 60 Similarity 76.4% (77.0%) Coverage 84.7% (86.6%)
35-
NT Diff 663 Same 4 Similarity 86.4% (87.0%) Coverage 59.4% (63.7%)
32+
AA Diff 753 Same 60 Similarity 76.4% Coverage 84.7%
33+
NT Diff 663 Same 4 Similarity 86.4% Coverage 59.4%
3634

3735
Aligned: 700 CDS: 1.0Mb 5UTR: 61.5kb 3UTR: 96.5kb
3836
Codons 311.0k Amino Acids Nucleotides
@@ -49,10 +47,10 @@ <h2>mTCW overview for ex</h2>
4947

5048
KaKs method: YN Pairs: 700
5149
Ka/Ks Quartiles Average P-value
52-
NA 16 Q1(Lower) 0.019 Ka 0.102 <1E-100 419
53-
KaKs~1 0 Q2(Median) 0.044 Ks 2.793 <1E-10 85
54-
KaKs<1 675 Q3(Upper) 0.113 P-value 0.036 <0.001 44
55-
KaKs>1 9 Other 152
50+
NA 16 Q1(Lower) 0.019 Ka 0.102 NA 48
51+
KaKs<1 675 Q2(Median) 0.044 Ks 2.793 <1E-100 419
52+
KaKs~1 0 Q3(Upper) 0.113 P-value 0.036 <1E-10 85
53+
KaKs>1 9 <1.0 148
5654

5755
SEQUENCES: 707
5856
Average Lengths %GC CpG O/E
@@ -86,6 +84,5 @@ <h2>mTCW overview for ex</h2>
8684
B12 BBH Sim 60; Cov 40(Both); AA; bar,fly
8785
B13 BBH Sim 60; Cov 40(Both); AA; bar,foo
8886
B23 BBH Sim 60; Cov 40(Both); AA; fly,foo
89-
OM2 User Defined orthoMCL.OM-4
9087

9188
</pre></table></body></html>

doc/ov/sDemo.html

Lines changed: 38 additions & 38 deletions
Original file line numberDiff line numberDiff line change
@@ -5,10 +5,10 @@
55
<h2>Overview for sTCW_demoTra </h2>
66
<table width=700 border=1><tr><td>
77
<pre>
8-
Project: tra #Seqs: 211 #Hits: 11,957 #GOs: 6,076 TPM Seq-DE GO-Enrich Pairs
8+
Project: tra #Seqs: 211 #Hits: 12,296 #GOs: 6,144 TPM Seq-DE GO-Enrich Pairs
99

10-
28-Feb-22 Build Database sequences loaded from external source
11-
06-Mar-22 Last Annotation with sTCW v3.4.2
10+
15-Mar-22 Build Database sequences loaded from external source
11+
15-Mar-22 Last Annotation with sTCW v4.0.0
1212

1313
INPUT
1414
Counts:
@@ -23,40 +23,40 @@ <h2>Overview for sTCW_demoTra </h2>
2323

2424
ANNOTATION
2525
Hit statistics:
26-
Sequences with hits 210 (99.5%) Bases covered by hit 180,288 (79.2%)
27-
Unique hits 11,957 Total bases 227,683
28-
Total sequence hits 17,549
26+
Sequences with hits 210 (99.5%) Bases covered by hit 180,339 (79.2%)
27+
Unique hits 12,296 Total bases 227,683
28+
Total sequence hits 17,763
2929

3030
Annotation databases (annoDBs): 7 (see Legend below)
3131
ANNODB ONLY BITS ANNO UNIQUE TOTAL AVG Rank HAS (%Seqs) AVG COVER COVER
3232
%SIM =1 HIT %SIM >=50 >=90
3333
SP-plants 0 0 0 1,344 2,959 53.8 | 193 (91.5%) 70.0 61.7% 3.1%
34-
SP-invertebrates 0 0 0 624 1,271 42.5 | 126 (59.7%) 47.5 30.2% 0.8%
35-
SP-fungi 0 0 0 895 1,516 43.1 | 121 (57.3%) 46.5 24.8% 0.8%
34+
SP-invertebrates 0 0 0 624 1,270 42.5 | 126 (59.7%) 47.5 30.2% 0.8%
35+
SP-fungi 0 0 0 896 1,517 43.0 | 122 (57.8%) 46.4 24.6% 0.8%
3636
SP-bacteria 0 0 0 790 1,126 44.1 | 63 (29.9%) 45.8 25.4% 0%
37-
SP-full_BFIP 0 0 0 740 1,742 45.4 | 144 (68.2%) 47.5 25.0% 0.7%
38-
TR-plants 8 207 207 4,614 5,235 76.0 | 210 (99.5%) 82.3 68.6% 11.0%
39-
TR-invertebrates 0 3 3 2,950 3,700 50.0 | 181 (85.8%) 54.3 38.7% 2.2%
37+
SP-full_BFIP 0 0 0 744 1,696 45.7 | 125 (59.2%) 49.1 32.0% 0.8%
38+
TR-plants 7 207 207 4,626 5,241 76.0 | 210 (99.5%) 82.3 68.6% 11.0%
39+
TR-invertebrates 0 3 3 3,272 3,954 48.9 | 186 (88.2%) 53.8 37.6% 2.2%
4040

41-
Top 15 species from total: 1,571
41+
Top 15 species from total: 1,594
4242
SPECIES (25 char) BITS ANNO TOTAL SPECIES BITS ANNO TOTAL
43-
Musa acuminata 97 2 491 Dendrobium catenatum 2 2 70
44-
Musa balbisiana 52 71 313 Macleaya cordata 2 2 38
45-
Elaeis guineensis 12 45 351 Pinus tabuliformis 2 2 4
46-
Ensete ventricosum 11 14 207 Curcuma alismatifolia 2 2 2
47-
Zingiber officinale 7 7 27 Vitis vinifera 1 3 86
48-
Ananas comosus 3 17 452 Apostasia shenzhenica 1 3 60
49-
Meloidogyne enterolobii 3 2 10 Ricinus communis 1 2 41
50-
Anthurium amnicola 2 4 96 Other 12 32 15,301
43+
Musa acuminata 97 2 490 Dendrobium catenatum 2 2 70
44+
Musa balbisiana 52 71 312 Macleaya cordata 2 2 38
45+
Elaeis guineensis 12 45 347 Curcuma alismatifolia 2 2 2
46+
Ensete ventricosum 11 14 206 Vitis vinifera 1 3 86
47+
Zingiber officinale 7 7 27 Apostasia shenzhenica 1 3 59
48+
Ananas comosus 3 17 456 Ricinus communis 1 2 41
49+
Meloidogyne enterolobii 3 2 10 Pinus tabuliformis 1 2 4
50+
Anthurium amnicola 2 4 96 Other 13 32 15,519
5151

5252
Gene ontology statistics:
53-
Unique GOs 6,076 Unique hits with GOs 10,332 (86.4%)
53+
Unique GOs 6,144 Unique hits with GOs 10,632 (86.5%)
5454
Sequences with GOs 208 (98.6%) Seq best hit has GOs 107 (50.7%)
5555
Has goslim_plant 94
5656

57-
biological_process 4,387 (72.2%) is_a 9,762
58-
molecular_function 967 (15.9%) part_of 1,044
59-
cellular_component 722 (11.9%)
57+
biological_process 4,442 (72.3%) is_a 9,866
58+
molecular_function 979 (15.9%) part_of 1,057
59+
cellular_component 723 (11.8%)
6060

6161
EXPRESSION
6262
TPM: (% of 211)
@@ -71,11 +71,11 @@ <h2>Overview for sTCW_demoTra </h2>
7171
StOl 0 (0%) 2 (1%) 12 (6%) 65(31%) 101(48%)
7272
RoOl 1(<1%) 16 (8%) 37(18%) 74(35%) 110(52%)
7373

74-
Gene ontology enrichment: (% of 6,076)
74+
Gene ontology enrichment: (% of 6,144)
7575
<1E-5 <1E-4 <0.001 <0.01 <0.05
76-
StRo 0 (0%) 0 (0%) 4(<1%) 59 (1%) 310 (5%)
77-
StOl 0 (0%) 0 (0%) 0 (0%) 1(<1%) 40 (1%)
78-
RoOl 0 (0%) 1(<1%) 2(<1%) 26(<1%) 182 (3%)
76+
StRo 0 (0%) 1(<1%) 6(<1%) 59 (1%) 306 (5%)
77+
StOl 0 (0%) 0 (0%) 0 (0%) 1(<1%) 43 (1%)
78+
RoOl 0 (0%) 1(<1%) 2(<1%) 26(<1%) 183 (3%)
7979

8080
SEQUENCES
8181
Sequence lengths:
@@ -91,7 +91,7 @@ <h2>Overview for sTCW_demoTra </h2>
9191
0(0%) 62(29%) 92(44%) 47(22%) 4(2%) 6(3%) 0(0%) 0(0%)
9292

9393
ORF stats: Average length 862
94-
Has Hit 210 (99.5%) Both Ends 69 (32.7%) Multi-frame 4 (1.9%)
94+
Has Hit 210 (99.5%) Both Ends 69 (32.7%) Multi-frame 6 (2.8%)
9595
Is Longest ORF 192 (91.0%) ORF>=300 190 (90.0%) Stops in Hit 6 (2.8%)
9696
Markov Best Score 210 (99.5%) ORF=Hit 109 (51.7%) >=9 Ns in ORF 1 (<1%)
9797
All of the above 191 (90.5%) with Ends 32 (15.2%)
@@ -116,18 +116,18 @@ <h2>Overview for sTCW_demoTra </h2>
116116
-------------------------------------------------------------------
117117
PROCESSING INFORMATION
118118
AnnoDB files:
119-
Type Taxo FILE DB DATE ADD DATE EXECUTE
120-
sp plants uniprot_sprot_plants.fasta 27-Jan-21 06-Mar-22 Load tra_SPpla.dmnd.tab
121-
sp invertebrates uniprot_sprot_invertebrates.fasta 27-Jan-21 06-Mar-22 Load tra_SPinv.dmnd.tab
122-
sp fungi uniprot_sprot_fungi.fasta 27-Jan-21 06-Mar-22 Load tra_SPfun.dmnd.tab
123-
sp bacteria uniprot_sprot_bacteria.fasta 27-Jan-21 06-Mar-22 Load tra_SPbac.dmnd.tab
124-
sp full_BFIP uniprot_sprot_xBFxIxPxxx.fasta 27-Jan-21 06-Mar-22 Load tra_SPful.dmnd.tab
125-
tr plants uniprot_trembl_plants.fasta 27-Jan-21 06-Mar-22 Load tra_TRpla.dmnd.tab
126-
tr invertebrates uniprot_trembl_invertebrates.fasta 27-Jan-21 06-Mar-22 Load tra_TRinv.dmnd.tab
119+
Type Taxo FILE DB DATE ADD DATE EXECUTE
120+
sp plants uniprot_sprot_plants.fasta 27-Jan-21 15-Mar-22 diamond --masking 0
121+
sp invertebrates uniprot_sprot_invertebrates.fasta 27-Jan-21 15-Mar-22 diamond --masking 0
122+
sp fungi uniprot_sprot_fungi.fasta 27-Jan-21 15-Mar-22 diamond --masking 0
123+
sp bacteria uniprot_sprot_bacteria.fasta 27-Jan-21 15-Mar-22 diamond --masking 0
124+
sp full_BFIP uniprot_sprot_xBFxIxPxxx.fasta 27-Jan-21 15-Mar-22 diamond --masking 0
125+
tr plants uniprot_trembl_plants.fasta 27-Jan-21 15-Mar-22 diamond --masking 0
126+
tr invertebrates uniprot_trembl_invertebrates.fasta 27-Jan-21 15-Mar-22 diamond --masking 0
127127

128128
Prune: none
129129

130-
Gene ontology: go-basic.obo-Nov2021 GOdb: go_demo [GOs added with sTCW v3.4.2]
130+
Gene ontology: go-basic.obo-Nov2021 GOdb: go_demo [GOs added with sTCW v4.0.0]
131131
GO slim: goslim_plant
132132

133133
ORF finder:

doc/stcw/AnnoDBs.html

Lines changed: 5 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -106,7 +106,7 @@ <h2>Demo annotation setup </h2>
106106
<td class="top"> The TCW package provides subsets of UniProt for annotating the demo.
107107
In order to add GO annotations, a local GO mySQL 'demo' database needs to be created.
108108
<ol type="A">
109-
<li>From the TCW_3 directory, execute:
109+
<li>From the TCW_4 directory, execute:
110110
<br><ttx>./runAS -d</ttx>
111111
<br>The "-d" will cause it to enter the demo parameters, as shown on the right.
112112
The highlighted entries already exist. Its only necessary to build the GO database.
@@ -168,11 +168,10 @@ <h2>Java graphical interface -- <ttp>runAS</ttp> </h2>
168168
that contains all the information downloaded; this can be used as input to <ttp>runSingleTCW</ttp>
169169
(see <a href="#runstcw">Import AnnoDBs</a>). <br>
170170
&nbsp;
171-
<li><ttc>Check</ttc>: Selecting this button highlights everything that has been done.
172-
For example, the figure on the right shows that fungi and plant SwissProt have been downloaded and processed.
173-
If <ttc>Build Tax</ttc> is run again, 3 more SwissProt and plant TrEMBL will be downloaded and processed.
174-
The Full SwissProt and the GOdb have not been done.
175-
The <ttc>Check</ttc> automatically runs on startup.
171+
<li><ttc>Check</ttc>: The Check function automatically runs on startup and after any Build.
172+
It highlights everything that has been done.
173+
For example, the figure on the right shows that fungi and plant SwissProt have been downloaded and processed.
174+
<i>To force a check, or to view the UniProts in an existing goDB, select this button.</i>
176175
</ol>
177176
<td><a href="img/runAS.png"><img src="img/runAS.png" alt="runAS" style="border: 1px solid black; width: 350px"></a>
178177
</tr>

0 commit comments

Comments
 (0)