You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Download links are currently unavailable, we work on restoring them as soon as possible. In the meantime, the unprocessed data (fastq) can be downloaded from ENA: https://www.ebi.ac.uk/ena/browser/view/PRJEB44348
3
+
The current data release consists of 93 files that include long read and short read RNA-Seq data from all 5 cell lines. The sample description and download links can be found [here](docs/Sample_information.txt).
6
4
7
-
---
8
-
9
-
As the core datasets, we have in total 72 runs for core cell lines using three different Nanopore RNA-Sequencing prototocols.
10
-
11
-
As an initial release, we are providing fastq and bam files. You can sign up for the sg-nex-updates email list to receive notifications about upcoming data releases:
Detailed information on sample ids and corresponding sample attributes can be found [here](docs/Sample_information.txt).
20
-
21
-
Notes on data usage: This site provides early access to the SG-NEx data for research. Please note that the data is under publication embargo until the SG-NEx project is published.
5
+
**_Notes on data usage_**: This site provides early access to the SG-NEx data. These data can be used in research and publications, but we ask data users to refrain from publishing a systematic comparison that is described in the pre-print until the final manuscript is published. If you are uncertain, please feel free to reach out (https://github.com/GoekeLab/sg-nex-data/#contact). You can sign up for the sg-nex-updates email list to receive notifications about upcoming data releases: https://groups.google.com/forum/#!forum/sg-nex-updates/join. If you use the SG-NEx data in your research, please specify the [release version](https://github.com/GoekeLab/sg-nex-data/#data-download) and cite the pre-print (see [citation](https://github.com/GoekeLab/sg-nex-data/#citing-the-SG-NEx-project)).
The SG-NEx project is an international collaboration that was initiated at the [Genome Institute of Singapore](https://www.a-star.edu.sg/gis/). The aim of the SG-NEx Project is to generate reference transcriptomes for 5 of the most commonly used cancer cell lines using Nanopore long read RNA-Seq data:
Transcriptome profiling is done using PCR-cDNA sequencing ("PCR-cDNA"), amplification-free cDNA sequencing ("direct cDNA"), direct sequencing of native RNA (“direct RNA”), and short read RNA-Seq. All samples are sequenced with at least 3 high quality replicates. For a subset of samples, we used sequin spike-in RNAs.
Notes on data usage: This site provides early access to the SG-NEx data for research. Please note that the data is under publication embargo until the SG-NEx project is published.
36
+
37
+
_**Notes on data usage**_: This site provides early access to the SG-NEx data. These data can be used in research and publications, but we ask data users to refrain from publishing a systematic comparison that is described in the pre-print until the final manuscript is published. If you are uncertain, please feel free to reach out ([Contact](#contact)).
38
+
39
+
**Release History**
40
+
41
+
You can find previous releases here in the [release history](https://github.com/GoekeLab/sg-nex-data/releases)
23
42
24
43
## Data Processing
25
44
26
45
We collaborated with [nf-core](https://github.com/nf-core) to develop [nanoseq](https://github.com/nf-core/nanoseq), a standardardized pipeline for Nanopore RNA-Seq data processing.
27
46
47
+
**Reference files**
28
48
29
-
## Reference files
30
49
Details on reference files can be found [here](ANNOTATIONS.md).
31
50
51
+
## Use Cases and Applications
52
+
53
+
You can browse a list of articles using the SG-NEx data in research [here](SGNEx_usecases.md)
54
+
55
+
## Data Access Tutorials
56
+
57
+
Coming soon! Please refer to [Data Download](#data-download) in the meantime.
58
+
32
59
## Contributors
33
60
34
61
**GIS Sequencing Platform and Data Generation**
@@ -38,10 +65,18 @@ Hwee Meng Low, Yao Fei, Sarah Ng, Wendy Soon, CC Khor
Chen Ying, Nadia M. Davidson, Harshil Patel, Yuk Kei Wan, Naruemon Pratanwanich, Christopher Hendra, Laura Watten, Chelsea Sawyer, Dominik Stanojevic, Philip Andrew Ewels, Andreas Wilm, Mile Sikic, Alexandre Thiery, Michael I. Love, Alicia Oshlak, Jonathan Göke
68
+
Ying Chen, Nadia M. Davidson, Harshil Patel, Yuk Kei Wan, Naruemon Pratanwanich, Christopher Hendra, Laura Watten, Chelsea Sawyer, Dominik Stanojevic, Philip Andrew Ewels, Andreas Wilm, Mile Sikic, Alexandre Thiery, Michael I. Love, Alicia Oshlak, Jonathan Göke
69
+
70
+
## Citing the SG-NEx project
71
+
72
+
If you use the SG-NEx data in your research, please specify the [release version](#data-download) and cite the pre-print that describes this data resource:
73
+
74
+
Chen, Ying, et al. "A systematic benchmark of Nanopore long read RNA sequencing for transcript level analysis in human cell lines." _bioRxiv_ (2021). doi: https://doi.org/10.1101/2021.04.21.440736
75
+
76
+
Please see the note on data usage (under [Data Download](#data-download)).
42
77
43
78
## Contact
44
79
45
-
Questions about SG-NEx? Please contact [Jonathan Göke](https://www.a-star.edu.sg/gis/our-people/faculty-staff)
80
+
Questions about SG-NEx? Please add an entry in the [Discussions Forum](https://github.com/GoekeLab/sg-nex-data/discussions). You can also contact [Jonathan Göke](https://www.a-star.edu.sg/gis/our-people/faculty-staff)
### Use Cases and Applications: Research articles using the SG-NEx data (pre-release versions)
2
+
3
+
This site lists some examples how the SG-NEx data resource is used in research:
4
+
5
+
#### Transcript discovery/quantification
6
+
7
+
- Schulz, Laura, et al. "Direct long-read RNA sequencing identifies a subset of questionable exitrons likely arising from reverse transcription artifacts." _Genome Biology_ 22.1 (2021): 1-12. https://doi.org/10.1186/s13059-021-02411-1
8
+
- Annaldasula, Siddharth, Martyna Gajos, and Andreas Mayer. "IsoTV: processing and visualizing functional features of translated transcript isoforms." _Bioinformatics_ (2021). https://doi.org/10.1093/bioinformatics/btab103
9
+
10
+
#### RNA modifications
11
+
12
+
- Pratanwanich, Ploy N., et al. "Identification of differential RNA modifications from nanopore direct RNA sequencing with xPore." _Nature Biotechnology_ (2021): 1-9. https://doi.org/10.1038/s41587-021-00949-w
13
+
- Hendra, Christopher, et al. "Detection of m6A from direct RNA sequencing using a Multiple Instance Learning framework." _bioRxiv_ (2021). https://doi.org/10.1101/2021.09.20.461055
14
+
- Campos, João H., et al. "Direct RNA sequencing reveals SARS-CoV-2 m6A sites and possible differential DRACH motif methylation among variants." _bioRxiv_ (2021). https://doi.org/10.1101/2021.08.24.457397
15
+
16
+
#### Fusion detection
17
+
18
+
- Davidson, Nadia M., et al. "JAFFAL: Detecting fusion genes with long read transcriptome sequencing." _bioRxiv_ (2021). https://doi.org/10.1101/2021.04.26.441398
19
+
20
+
#### Reviews and other use cases
21
+
22
+
- De Paoli-Iseppi, Ricardo, Josie Gleeson, and Michael B. Clark. "Isoform age-splice isoform profiling using long-read technologies." Frontiers in Molecular Biosciences 8 (2021). https://doi.org/10.3389/fmolb.2021.711733
23
+
24
+
Please feel free to add more examples by creating a pull request.
0 commit comments