You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+29-40Lines changed: 29 additions & 40 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
# read2tree
2
2
3
-
read2tree is a software tool that allows to obtain alignment matrices for tree inference. For this purpose it makes use of the OMA database and a set of reads. Its strength lies in the fact that it bipasses the several standard steps when obtaining such a matrix in regular analysis. These steps are read filtereing, assembly, gene prediction, gene annotation, all vs all comparison, orthology prediction, alignment and concatination.
3
+
read2tree is a software tool that allows to obtain alignment matrices for tree inference. For this purpose it makes use of the OMA database and a set of reads. Its strength lies in the fact that it bipasses the several standard steps when obtaining such a matrix in regular analysis. These steps are read filtereing, assembly, gene prediction, gene annotation, all vs all comparison, orthology prediction, alignment and concatenation.
4
4
5
5
read2tree works in linux with [](https://www.python.org/downloads/release/python-310/)
Besides, you need softwares including [mafft](http://mafft.cbrc.jp/alignment/software/) (multiple sequence aligner), [iqtree](http://www.iqtree.org/) (phylogenomic inference), [ngmlr](https://github.com/philres/ngmlr), [ngm/nextgenmap](https://github.com/Cibiv/NextGenMap) (long and short read mappers), and [samtools](http://www.htslib.org/download/) which could be installed using conda.
50
+
Besides, you need softwares including [mafft](http://mafft.cbrc.jp/alignment/software/) (multiple sequence aligner), [iqtree](http://www.iqtree.org/) (phylogenomic inference), [minimap2](https://github.com/lh3/minimap2) (long and short read mappers), and [samtools](http://www.htslib.org/download/) which could be installed using conda.
51
+
For this version, the `--read_type` argument accepts any minimap2 options string that defines how reads are aligned to the reference. For example, it could be `-ax sr`, `-ax map-hifi` or `-ax map-ont`. You can also pass `--threads 40` to be used with minimap2.
Alternatively, you could also try using [mamba](https://mamba.readthedocs.io/en/latest/). Caution: please read about compatiblity of conda and mamba in one envirnoment.
73
-
74
-
### 3) Installation using Docker
75
-
The Dockerfile is also available in this repository. There is an example how to run in the [test example](#test-example) section.
76
-
77
-
A prebuild container can be loaded from dockerhub:
The output of Read2Tree is the concatenated alignments as a fasta file where each record corresponds to one species. We also provide the option `--tree` for inferring the species tree using IQTREE as defualt.
84
+
The output of Read2Tree is the concatenated alignments as a fasta file where each record corresponds to one species. We also provide the option `--tree` for inferring the species tree using IQTREE as default.
logging.debug(' We are counting the number of fa files in folder _ogs_aa and _ogs_daa which are '+str(num_ogs_aa) +" and "+str(num_ogs_dna) +" in folder "+str(self._folder_ref_ogs_aa) +" and "+str(self._folder_ref_ogs_dna))
162
167
if (num_ogs_expected-num_ogs_aa) ==0and (num_ogs_expected-num_ogs_dna) ==0:
168
+
logging.debug(' We are counting the number of fa files in folder _ogs_aa and _ogs_daa, which are '+str(num_ogs_aa) +" and "+str(num_ogs_dna) +" the same as expected"+str(num_ogs_expected) +". So this step is done")
163
169
returnTrue
164
170
else:
171
+
logging.debug(' We are counting the number of fa files in folder _ogs_aa and _ogs_daa, which are '+str(num_ogs_aa) +" and "+str(num_ogs_dna) +" but not the same as expected "+str(num_ogs_expected) +". So this step is done")
if (num_ogs_expected-num_ogs_aa) <=0and (num_ogs_expected-num_ogs_dna) <=0:
186
+
logging.debug("Number of ogs expected after appending is"+str(num_ogs_expected)+" the same as ogs number in dna and aa "+str(self._folder_ref_ogs_dna) +". So this step is done")
179
187
returnTrue
180
188
else:
189
+
logging.debug("Number of ogs expected after appending is"+str(num_ogs_expected)+" but the number in dna and aa "+str(self._folder_ref_ogs_dna) +" are "+str(num_ogs_aa) +" and "+str(num_ogs_dna)+". So this step is not done")
logging.debug("Number of ref expected is "+str( num_ref_expected) +" and number in "+str( self._folder_ref_ogs_dna) +" is "+str(num_references)+". So this step is done")
194
205
returnTrue
195
206
else:
207
+
logging.debug("Number of ref expected is "+str( num_ref_expected) +" but number in "+str( self._folder_ref_ogs_dna) +" is "+str(num_references)+". So this step is not done")
if (num_aligns_expected-num_align_aa) ==0and (num_aligns_expected-num_align_dna) ==0:
222
+
logging.debug("Number of aligns expected is "+str( num_aligns_expected) +" and number in "+str( self._folder_ref_ogs_dna) +" is "+str(num_align_dna)+", similarly for aa. So this step is done")
210
223
returnTrue
211
224
else:
225
+
logging.debug("Number of aligns expected is "+str( num_aligns_expected) +" and number in "+str( self._folder_ref_ogs_dna) +" and aa versions are "+str(num_align_dna)+" and "+str(num_align_aa)+". So this step is not done")
if (num_aligns_expected-num_align_aa) ==0and (num_aligns_expected-num_align_dna) ==0:
240
+
logging.debug("Number of aligns expected is "+str(num_aligns_expected) +" and number in "+str(self._folder_align_append_aa) +" is "+str(num_align_aa) +", similarly for aa. So this step is done")
226
241
returnTrue
227
242
else:
243
+
logging.debug("Number of aligns expected is "+str(num_aligns_expected) +" and number in "+str(self._folder_align_append_aa) +" and aa versions are "+str(num_align_aa) +" and "+str(num_align_dna) +". So this step is not done")
0 commit comments