Skip to content

Commit 0e09758

Browse files
Updates in preparation for the next release
1 parent c0d056e commit 0e09758

File tree

2 files changed

+60
-45
lines changed

2 files changed

+60
-45
lines changed

README.md

Lines changed: 57 additions & 42 deletions
Original file line numberDiff line numberDiff line change
@@ -6,18 +6,17 @@
66
[![Install on conda](https://anaconda.org/bioconda/falco/badges/license.svg)](https://anaconda.org/bioconda/falco)
77
[![Install on conda](https://anaconda.org/bioconda/falco/badges/downloads.svg)](https://anaconda.org/bioconda/falco)
88

9-
10-
119
This program is an emulation of the popular
12-
[FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc) software to
13-
check large sequencing reads for common problems.
10+
[FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc)
11+
software to check large sequencing reads for common problems.
1412

1513
Installing falco
1614
================
1715

1816
## Installing through conda
19-
If you use [anaconda](https://anaconda.org) to manage your packages, and the `conda` binary
20-
is in your path, you can install the most recent release of `falco` by running
17+
If you use [anaconda](https://anaconda.org) to manage your packages,
18+
and the `conda` binary is in your path, you can install the most
19+
recent release of `falco` by running
2120
```
2221
$ conda install -c bioconda falco
2322
```
@@ -27,12 +26,13 @@ installer.
2726

2827
## Installing from source (code release)
2928

30-
Compilation from source can be done by downloading a `falco` release from the
31-
[releases](https://github.com/smithlabcode/falco/releases)
32-
section above. Upon downloading the release (in `.tar.gz` or `.zip` format),
33-
and inflating the downloaded file to a directory (e.g. `falco`), move to the
34-
target directory the file was inflated (e.g. `cd falco`). You should see a
35-
`configure` file in it. In this directory, run
29+
Compilation from source can be done by downloading a `falco` release
30+
from the [releases](https://github.com/smithlabcode/falco/releases)
31+
section above. Upon downloading the release (in `.tar.gz` or `.zip`
32+
format), and inflating the downloaded file to a directory
33+
(e.g. `falco`), move to the target directory the file was inflated
34+
(e.g. `cd falco`). You should see a `configure` file in it. In this
35+
directory, run
3636

3737
```
3838
$ ./configure CXXFLAGS="-O3 -Wall"
@@ -50,10 +50,11 @@ The `falco` binary will be found in the `bin` directory inside the
5050
specified prefix.
5151

5252
## Installing from a cloned repository
53-
We strongly recommend using `falco` through stable releases as described above,
54-
as the latest commits might contain undocumented bugs. For the more
55-
advanced users who wish to test the most recent code, `falco` can be
56-
installed by first cloning the repository
53+
54+
We strongly recommend using `falco` through stable releases as
55+
described above, as the latest commits might contain undocumented
56+
bugs. For the more advanced users who wish to test the most recent
57+
code, `falco` can be installed by first cloning the repository
5758

5859
```
5960
$ git clone https://github.com/smithlabcode/falco.git
@@ -71,10 +72,10 @@ containing `falco`.
7172

7273
### Required C++ dependencies
7374

74-
[zlib](https://zlib.net) is required to read gzip compressed FASTQ files. It is
75-
usually installed by default in most UNIX computers and is part of the htslib
76-
setup, but it can also be installed with standard package managers like
77-
apt, brew or conda.
75+
[zlib](https://zlib.net) is required to read gzip compressed FASTQ
76+
files. It is usually installed by default in most UNIX computers and
77+
is part of the htslib setup, but it can also be installed with
78+
standard package managers like apt, brew or conda.
7879

7980
On Ubuntu, zlib C++ libraries can be installed with `apt`:
8081
```
@@ -83,9 +84,9 @@ $ sudo apt install zlib1g zlib1g-dev
8384

8485
### Optional C++ dependencies
8586

86-
[htslib](https://github.com/samtools/htslib) is required to process bam
87-
files. If not provided, bam files will be treated as unrecognized file
88-
formats.
87+
[htslib](https://github.com/samtools/htslib) is required to process
88+
bam files. If not provided, bam files will be treated as unrecognized
89+
file formats.
8990

9091
If htslib is installed, falco can be compiled with it by simply replacing the
9192
configure command above with the `--enable-hts` flag:
@@ -116,19 +117,23 @@ $ falco example.fq
116117
```
117118

118119
This will generate three files in the same directory as the input fastq file:
119-
* ``fastqc_data.txt`` is a text file with a summary of the QC
120-
metrics
121-
* ``fastqc_report.html`` is the visual HTML report showing plots of the
120+
121+
* `fastqc_data.txt` is a text file with a summary of the QC metrics
122+
123+
* `fastqc_report.html` is the visual HTML report showing plots of the
122124
QC metrics summarized in the text summary.
123-
* ``summary.txt``: A tab-separated file describing whether the pass/warn/fail
124-
result for each module. If multiple files are provided, only one summary file
125-
is generated, with one of the columns being the file name associated to each
126-
module result.
127125

128-
the full list of arguments and options can be seen by running `falco` without any arguments, as well as `falco -?` or `falco --help`. This will print the following list:
126+
* `summary.txt`: A tab-separated file describing whether the
127+
pass/warn/fail result for each module. If multiple files are
128+
provided, only one summary file is generated, with one of the
129+
columns being the file name associated to each module result.
130+
131+
The full list of arguments and options can be seen by running `falco`
132+
without any arguments, as well as `falco -?` or `falco --help`. This
133+
will print the following list:
129134

130135
```
131-
Usage: ./bin/falco [OPTIONS] <seqfile1> <seqfile2> ...
136+
Usage: falco [OPTIONS] <seqfile1> <seqfile2> ...
132137
133138
Options:
134139
-h, --help print this help file and exit
@@ -149,38 +154,48 @@ Options:
149154
-t, --threads Specifies number of threads to process
150155
simultaneos files in parallel (currently
151156
set for compatibility with fastqc. Not yet
152-
supported!)
157+
supported!) [1]
153158
-c, --contaminants Non-default filer with a list of
154159
contaminants
160+
/path/to/falco/Configuration/contaminant_list.txt
155161
-a, --adapters Non-default file with a list of adapters
162+
/path/to/falco/Configuration/adapter_list.txt
156163
-l, --limits Non-default file with limits and warn/fail
157164
criteria
165+
/path/to/falco/Configuration/limits.txt
158166
-T, --skip-text Skip generating text file (Default = false)
167+
159168
-H, --skip-html Skip generating HTML file (Default = false)
160-
-S, --skip-short-summary Skip short summary(Default = false)
169+
170+
-S, --skip-short-summary Skip short summary (Default = false)
161171
-q, --quiet print more run info
162172
-d, --dir directory in which to create temp files
163-
-A, --advanced-mode advanced mode: adds more information to the
164-
FastQC output depending on non-fastqc user
165-
flags
173+
-s, --subsample makes falco faster (but possibly less
174+
accurate) by only processing reads that are
175+
multiple of this value [1]
166176
-B, --bisulfite reads are whole genome bisulfite
167177
sequencing, and more Ts and fewer Cs are
168178
therefore expected and will be accounted
169179
for in base content (advanced mode)
170180
-R, --reverse-complement The input is a reverse-complement. All
171181
modules will be tested by swapping A/T and
172182
C/G
183+
-K, --add-call add the function call to fastqc_data.txt
184+
and fastqc-report.html (this may break the
185+
parse of fastqc_data.txt in programs that
186+
require rigorous FastQC format
173187
174188
Help options:
175189
-?, -help print this help message
176190
-about print about message
177191
178-
PROGRAM: ./bin/falco
192+
PROGRAM: ./falco
179193
A high throughput sequence QC analysis tool
180194
```
181195

182196
Citing falco
183-
=============
197+
============
198+
184199
If `falco` was helpful for your research, you can cite us as follows:
185200

186201
*de Sena Brandine G and Smith AD. Falco: high-speed FastQC emulation for
@@ -192,10 +207,10 @@ quality control of sequencing data. F1000Research 2021, 8:1874
192207
Copyright and License Information
193208
=================================
194209

195-
Copyright (C) 2019
196-
University of Southern California,
210+
Copyright (C) 2019-2022 Guilherme de Sena Brandine and
211+
Andrew D. Smith
197212

198-
Current Authors: Guilherme de Sena Brandine, Andrew D. Smith
213+
Authors: Guilherme de Sena Brandine and Andrew D. Smith
199214

200215
This is free software: you can redistribute it and/or modify it under
201216
the terms of the GNU General Public License as published by the Free

configure.ac

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
dnl falco: quality control for sequencing read files
22
dnl
3-
dnl Copyright (C) 2019 Guilherme De Sena Brandine and
4-
dnl Andrew D. Smith
3+
dnl Copyright (C) 2019-2022 Guilherme De Sena Brandine and
4+
dnl Andrew D. Smith
55
dnl Authors: Guilherme De Sena Brandine, Andrew Smith
66
dnl
77
dnl This program is free software: you can redistribute it and/or
@@ -14,7 +14,7 @@ dnl WITHOUT ANY WARRANTY; without even the implied warranty of
1414
dnl MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
1515
dnl General Public License for more details.
1616

17-
AC_INIT([falco], [1.0.0], [desenabr@usc.edu],
17+
AC_INIT([falco], [1.1.0], [desenabr@usc.edu],
1818
[falco], [https://github.com/smithlabcode/falco])
1919
dnl the config.h is not currently #included in the source, and only
2020
dnl used to keep command lines short.

0 commit comments

Comments
 (0)