You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
GMSC database / habitat / taxonomy / quality file path and output directory path can be assigned on your own.Default is `GMSC-mapper/db` and `GMSC-mapper/output`.
111
+
GMSC database / habitat / taxonomy / quality / domain file path and output directory path can be assigned on your own.Default is `GMSC-mapper/db` and `GMSC-mapper/output`.
110
112
111
113
1. Input is genome contig sequences.
112
114
@@ -133,11 +135,11 @@ If you want to change alignment tool (Diamond / MMseqs2), you can use `--tool`.
@@ -191,17 +193,17 @@ The output folder will contain
191
193
192
194
- Habitat annotation of smORFs (optional) (habitat.out.smorfs.tsv)
193
195
194
-
A file listing the habitat annotation for each smORF homologous to GMSC.
196
+
This file lists the habitat annotations of the query/predicted sequence, where the habitat is obtained from the sequence annotations of its homologous origin in GMSC.
195
197
196
198
There are two columns in the file:
197
199
198
200
`qseqid`: Query seq id
199
201
200
-
`habitat`: Habitat, ',' separated if the sequences is from multiple habitats
202
+
`habitat`: Habitat, ',' separated if the sequence is from multiple habitats
201
203
202
204
- Taxonomy annotation of smORFs (optional) (taxonomy.out.smorfs.tsv)
203
205
204
-
A file listing the taxonomy annotation for each smORF homologous to GMSC.
206
+
This file lists the taxonomy annotations of the query/predicted sequence, where the taxonomy is obtained from the sequence annotations of its homologous origin in GMSC.
205
207
206
208
There are two columns in the file:
207
209
@@ -211,13 +213,21 @@ The output folder will contain
211
213
212
214
- Quality annotation of smORFs (optional) (quality.out.smorfs.tsv)
213
215
214
-
A file listing the quality annotation for each smORF homologous to GMSC.
216
+
This file lists the quality annotations of the query/predicted sequence, where the quality is obtained from the sequence annotations of its homologous origin in GMSC.
215
217
216
218
`qseqid`: Query seq id
217
219
218
220
`quality`: Quality label
219
221
220
-
- Summry (summary.txt)
222
+
- Conserved domain annotation of smORFs (optional) (domain.out.smorfs.tsv)
223
+
224
+
This file lists the conservative domain annotations of the query/predicted sequence, where the conservative domain is obtained from the sequence annotations of its homologous origin in GMSC.
225
+
226
+
`qseqid`: Query seq id
227
+
228
+
`cdd`: Identifiers from Conserved domain database, ',' separated if the sequence is annotated with multiple conserved domains.
229
+
230
+
- Summary (summary.txt)
221
231
222
232
A file providing a human-readable summary of the results.
223
233
@@ -244,11 +254,13 @@ The output folder will contain
244
254
245
255
*`--filter`: Use this to filter <100 aa or <303 nt input sequences. (default: False)
246
256
247
-
*`--nohabitat`: Use this if no need to annotate habitat. (default: False)
257
+
*`--no-habitat`: Use this if no need to annotate habitat. (default: False)
258
+
259
+
*`--no-taxonomy`: Use this if no need to annotate taxonomy. (default: False)
248
260
249
-
*`--notaxonomy`: Use this if no need to annotate taxonomy. (default: False)
261
+
*`--no-quality`: Use this if no need to annotate quality. (default: False)
250
262
251
-
*`--noquality`: Use this if no need to annotate quality. (default: False)
263
+
*`--no-domain`: Use this if no need to annotate conserved domain. (default: False)
0 commit comments