You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+37-15Lines changed: 37 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,12 +1,20 @@
1
1
[](http://bioconda.github.io/recipes/metabuli/README.html)
2
2
# Metabuli
3
-
Metabuli is metagenomic classifier that jointly analyze both DNA and amino acid (AA) sequences.
4
-
DNA-based classifiers can make specific classifications, exploiting point mutations to distinguish close taxa.
5
-
AA-based classifiers have higher sensitivity in detecting homology between query and reference sequences, leverageing higher conservation of AA sequences.
6
-
Metabuli combines the information of both sequence types using a novel k-mer structure, _metamer_, to enable both specific and sensitive characterization of metagenomic samples.
7
-
In addition, it can classify reads against a database of any size as long as it fits in the hard disk.
3
+
***Metabuli*** classifies metagenomic reads by comparing them to reference genomes. You can use Metabuli to profile the taxonomic composition of your samples or to detect specific (pathogenic) species.
8
4
9
-
For more details of Metabuli, please see
5
+
***Sensitive and Specific.*** Metabuli uses a novel k-mer structure, called *metamer*, to analyze both amino acid (AA) and DNA sequences. It leverages AA conservation for sensitive homology detection and DNA mutations for specific differentiation between closely related taxa.
6
+
7
+
***A laptop is enough.*** Metabuli operates within user-specified RAM limits, allowing it to search any database that fits in storage. A PC with 8 GiB of RAM is sufficient for most analyses.
8
+
9
+
***A few clicks are enough.*** A GUI is available [here](https://github.com/steineggerlab/Metabuli-App). You can run Metabuli and browse the results with just a few clicks on your PC.
10
+
11
+
***Short reads, long reads, and contigs.*** Metabuli can classify all types of sequences.
### 🖥️ GUI apps for Windows, MacOS, and Linux are [here](https://github.com/steineggerlab/Metabuli-App).
28
+
---
29
+
### Update in v1.0.7
30
+
-**Metabuli became faster 🚀**
31
+
- Windows: *8.3* times faster
32
+
- MacOS: *1.7* times faster
33
+
- Linux: *1.3* times faster
34
+
- Test details are in release note.
35
+
- Fixed a bug in score calculation that could affect classification results.
36
+
### Update in v1.0.6
19
37
- Windows OS is supported.
20
-
> We found Metabuli is too slow with Windows OS. Currently making it faster.
38
+
> Metabuli v1.0.6 is too slow on Windows OS. Please use v1.0.7 or later.
39
+
21
40
22
-
## Update in v1.0.4
41
+
###Update in v1.0.4
23
42
- Fixed a minor reproducibility issue.
24
43
- Fixed a performance-harming bug occurring with sequences containing lowercased bases.
25
44
- Auto adjustment of `--match-per-kmer` parameter. Issue #20 solved.
26
45
- Record version info. in `db.parameter`
27
46
47
+
---
28
48
## Installation
29
49
### Precompiled binaries
30
50
```
@@ -40,8 +60,9 @@ wget https://mmseqs.com/metabuli/metabuli-linux-sse2.tar.gz; tar xvzf metabuli-l
40
60
41
61
# MacOS (Universal, works on Apple Silicon and Intel Macs)
42
62
wget https://mmseqs.com/metabuli/metabuli-osx-universal.tar.gz; tar xvzf metabuli-osx-universal.tar.gz; export PATH=$(pwd)/metabuli/bin/:$PATH
63
+
43
64
```
44
-
Metabuli also works on Linux ARM64 systems. Please check [https://mmseqs.com/metabuli](https://mmseqs.com/metabuli) for static builds for other architectures.
65
+
Metabuli also works on Linux ARM64 and Windows systems. Please check [https://mmseqs.com/metabuli](https://mmseqs.com/metabuli) for static builds for other architectures.
45
66
46
67
### Compile from source code
47
68
To compile Metabuli from source code use the following commands:
Metabuli can classify reads against a database of any size as long as the database is fits in the hard disk, regardless of the machine's RAM size.
169
-
We tested it with a MacBook Air (2020, M1, 8 GiB), where we classified about 1.5 M paired-end 150 bp reads (~5 GiB in size) against a database built with ~23K prokaryotic genomes (~69 GiB in size)
191
+
We tested it with a MacBook Air (2020, M1, 8 GiB), where we classified about 15 M paired-end 150 bp reads (~5 GiB in size) against a database built with ~23K prokaryotic genomes (~69 GiB in size).
170
192
171
193
## Custom database
172
194
To build a custom database, you need three things:
0 commit comments