You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Support for GEO ids has been dropped in this release due to breaking changes introduced in the NCBI API. For more detailed information please see [this PR](https://github.com/nf-core/fetchngs/pull/102).
11
+
12
+
As a workaround, if you have a GEO accession you can directly download a text file containing the appropriate SRA ids to pass to the pipeline:
13
+
14
+
- Search for your GEO accession on [GEO](https://www.ncbi.nlm.nih.gov/geo)
15
+
- Click `SRA Run Selector` at the bottom of the GEO accession page
16
+
- Select the desired samples in the `SRA Run Selector` and then download the `Accession List`
17
+
18
+
This downloads a text file called `SRR_Acc_List.txt` that can be directly provided to the pipeline e.g. `--input SRR_Acc_List.txt`.
7
19
8
20
### Enhancements & fixes
9
21
10
22
-[#97](https://github.com/nf-core/fetchngs/pull/97) - Add support for generating nf-core/taxprofiler compatible samplesheets.
23
+
-[#99](https://github.com/nf-core/fetchngs/issues/99) - SRA_IDS_TO_RUNINFO fails due to bad request
11
24
- Add `enum` field for `--nf_core_pipeline` to parameter schema so only accept supported pipelines are accepted
Copy file name to clipboardExpand all lines: README.md
+14-2Lines changed: 14 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,7 +17,7 @@
17
17
18
18
## Introduction
19
19
20
-
**nf-core/fetchngs** is a bioinformatics pipeline to fetch metadata and raw FastQ files from both public and private databases. At present, the pipeline supports SRA / ENA / DDBJ / GEO / Synapse ids (see [usage docs](https://nf-co.re/fetchngs/usage#introduction)).
20
+
**nf-core/fetchngs** is a bioinformatics pipeline to fetch metadata and raw FastQ files from both public and private databases. At present, the pipeline supports SRA / ENA / DDBJ / Synapse ids (see [usage docs](https://nf-co.re/fetchngs/usage#introduction)).
21
21
22
22
The pipeline is built using [Nextflow](https://www.nextflow.io), a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It uses Docker/Singularity containers making installation trivial and results highly reproducible. The [Nextflow DSL2](https://www.nextflow.io/docs/latest/dsl2.html) implementation of this pipeline uses one container per process which makes it much easier to maintain and update software dependencies.
23
23
@@ -27,7 +27,7 @@ On release, automated continuous integration tests run the pipeline on a full-si
27
27
28
28
Via a single file of ids, provided one-per-line (see [example input file](https://raw.githubusercontent.com/nf-core/test-datasets/fetchngs/sra_ids_test.txt)) the pipeline performs the following steps:
29
29
30
-
### SRA / ENA / DDBJ / GEO ids
30
+
### SRA / ENA / DDBJ ids
31
31
32
32
1. Resolve database ids back to appropriate experiment-level ids and to be compatible with the [ENA API](https://ena-docs.readthedocs.io/en/latest/retrieval/programmatic-access.html)
33
33
2. Fetch extensive id metadata via ENA API
@@ -36,6 +36,18 @@ Via a single file of ids, provided one-per-line (see [example input file](https:
36
36
- Otherwise use [`sra-tools`](https://github.com/ncbi/sra-tools) to download `.sra` files and convert them to FastQ
37
37
4. Collate id metadata and paths to FastQ files in a single samplesheet
38
38
39
+
### GEO ids
40
+
41
+
Support for GEO ids was dropped in [[v1.7](https://github.com/nf-core/fetchngs/releases/tag/1.7)] due to breaking changes introduced in the NCBI API. For more detailed information please see [this PR](https://github.com/nf-core/fetchngs/pull/102).
42
+
43
+
As a workaround, if you have a GEO accession you can directly download a text file containing the appropriate SRA ids to pass to the pipeline instead:
44
+
45
+
- Search for your GEO accession on [GEO](https://www.ncbi.nlm.nih.gov/geo)
46
+
- Click `SRA Run Selector` at the bottom of the GEO accession page
47
+
- Select the desired samples in the `SRA Run Selector` and then download the `Accession List`
48
+
49
+
This downloads a text file called `SRR_Acc_List.txt` that can be directly provided to the pipeline e.g. `--input SRR_Acc_List.txt`.
50
+
39
51
### Synapse ids
40
52
41
53
1. Resolve Synapse directory ids to their corresponding FastQ files ids via the `synapse list` command.
Copy file name to clipboardExpand all lines: docs/output.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,19 +9,19 @@ This document describes the output produced by the pipeline. The directories lis
9
9
The pipeline is built using [Nextflow](https://www.nextflow.io/) and processes data depending on the type of ids provided:
10
10
11
11
- Download FastQ files and create samplesheet from:
12
-
1.[SRA / ENA / DDBJ / GEO ids](#sra--ena--ddbj--geo-ids)
12
+
1.[SRA / ENA / DDBJ ids](#sra--ena--ddbj-ids)
13
13
2.[Synapse ids](#synapse-ids)
14
14
-[Pipeline information](#pipeline-information) - Report metrics generated during the workflow execution
15
15
16
16
Please see the [usage documentation](https://nf-co.re/fetchngs/usage#introduction) for a list of supported public repository identifiers and how to provide them to the pipeline.
17
17
18
-
### SRA / ENA / DDBJ / GEO ids
18
+
### SRA / ENA / DDBJ ids
19
19
20
20
<detailsmarkdown="1">
21
21
<summary>Output files</summary>
22
22
23
23
-`fastq/`
24
-
-`*.fastq.gz`: Paired-end/single-end reads downloaded from the SRA / ENA / DDBJ / GEO.
24
+
-`*.fastq.gz`: Paired-end/single-end reads downloaded from the SRA / ENA / DDBJ.
25
25
-`fastq/md5/`
26
26
-`*.md5`: Files containing `md5` sum for FastQ files downloaded from the ENA.
Copy file name to clipboardExpand all lines: docs/usage.md
+9-9Lines changed: 9 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,15 +8,15 @@
8
8
9
9
The pipeline has been set-up to automatically download and process the raw FastQ files from both public and private repositories. Identifiers can be provided in a file, one-per-line via the `--input` parameter. Currently, the following types of example identifiers are supported:
0 commit comments