You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+45-46Lines changed: 45 additions & 46 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -3,12 +3,12 @@
3
3
GMSC-mapper is a command line tool to query the Global Microbial smORFs Catalog (GMSC).
4
4
5
5
GMSC-mapper can be used to
6
-
- Find query smORFs (< 100aa) homologous to Global Microbial smORFs Catalog (GMSC) by alignment.
6
+
- Find query smORFs (< 100aa) homologous to Global Microbial smORFs Catalogue (GMSC) by alignment.
7
7
- Support 3 types of input:
8
8
- contigs (GMSC-mapper will predict smORFs from contigs first)
9
9
- amino acid sequences
10
10
- nucleotide gene sequences
11
-
- Annotate query/predicted smORFs with quality, habitat and taxonomy information constructed manually in detail.
11
+
- Annotate query / predicted smORFs with quality, habitat and taxonomy information constructed manually in detail.
12
12
13
13
## Installation
14
14
@@ -44,10 +44,8 @@ cd GMSC-mapper
44
44
python setup.py install
45
45
```
46
46
47
-
### Example test
48
-
Because the whole GMSC database is large, and takes some minutes to process.
49
-
50
-
If you want to check if the installation works well, you can test with mock datasets easily and fast.
47
+
#### Example test
48
+
As the whole GMSC database is large and takes some minutes to process. To check if the installation works well, you can test with mock datasets easily and fast.
We recommend to use `GMSC-mapper` as your current work directory. You can derectly follow the commonds below.
100
+
#### Download GMSC database and create index
101
101
102
-
`--dbdir`: Path to GMSC database annotation index files. (default: `./db`. If `GMSC-mapper`is your current work directory, the database files will be downloaded at `GMSC-mapper/db`)
102
+
We recommend to use `GMSC-mapper`as your current work directory. You can derectly follow the commonds below.
103
103
104
104
```bash
105
105
cd GMSC-mapper
106
106
```
107
107
108
+
Download GMSC database
109
+
108
110
```bash
109
111
gmsc-mapper downloaddb --dbdir ./db
110
112
```
111
113
112
-
Otherwise if you want to use custom `--dbdir` directory, it should be consistent with `-o` (Path to database index output of Diamond and MMseqs2) in the creating index step
113
-
114
-
### Create GMSC database index of Diamond/MMseqs2
115
-
116
-
We also recommend to use `GMSC-mapper` as your current work directory. You can derectly follow the commonds below.
117
-
118
-
The input (`i`) is the fasta file (`GMSC10.90AA.faa.gz`) downloaded to the dbdir (default: `./db`. If `GMSC-mapper` is your current work directory, the dbdir is `GMSC-mapper/db`) in the downloading step.
119
-
120
-
`-o`: Path to database index output of Diamond and MMseqs2. (default: `./db`. If `GMSC-mapper` is your current work directory, the database files will be created at `GMSC-mapper/db`)
121
-
122
-
`-m`: Alignment tool (Diamond / MMseqs2).
114
+
The default `--dbdir` is `./db`. If you want to use custom `--dbdir` directory, it should be consistent with `-o` in the next creating database index step.
Otherwise if you want to use custom `-o` directory, it should be consistent with `--dbdir` (Path to GMSC database annotation index files) in the download step.
122
+
The input (`i`) is the fasta file (`GMSC10.90AA.faa.gz`) downloaded to the dbdir (default: `./db`) in the downloading step.
123
+
124
+
The default `-o` is `./db`. If you want to use custom `-o` directory, it should be consistent with `--dbdir` in the last downloading database step.
125
+
126
+
#### GMSC Annotation
137
127
138
-
### Default
139
-
GMSC Database directory (`--dbdir`) and output directory (`-o`) can be assigned on your own. Default is `./db` and `./output`. If `GMSC-mapper` is your current work directory, they will be `GMSC-mapper/db` and `GMSC-mapper/output`.
128
+
GMSC Database directory (`--dbdir`) and output directory (`-o`) can be assigned on your own. Default is `./db` and `./output`.
140
129
141
130
If you use `GMSC-mapper` as your current work directory. You can derectly follow the commonds below. Otherwise, you need to assign your custom `--dbdir` which contains database files.
0 commit comments