Skip to content

Commit 610dc24

Browse files
authored
Merge pull request #14 from GoekeLab/update_links
update download links
2 parents 0039f2e + ce3fe61 commit 610dc24

File tree

4 files changed

+161
-97
lines changed

4 files changed

+161
-97
lines changed

DATA.md

Lines changed: 2 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -1,21 +1,5 @@
11
### Datasets
22

3-
---
4-
#### Update (15-07-2021)
5-
Download links are currently unavailable, we work on restoring them as soon as possible. In the meantime, the unprocessed data (fastq) can be downloaded from ENA: https://www.ebi.ac.uk/ena/browser/view/PRJEB44348
3+
The current data release consists of 93 files that include long read and short read RNA-Seq data from all 5 cell lines. The sample description and download links can be found [here](docs/Sample_information.txt).
64

7-
---
8-
9-
As the core datasets, we have in total 72 runs for core cell lines using three different Nanopore RNA-Sequencing prototocols.
10-
11-
As an initial release, we are providing fastq and bam files. You can sign up for the sg-nex-updates email list to receive notifications about upcoming data releases:
12-
13-
https://groups.google.com/forum/#!forum/sg-nex-updates/join
14-
15-
Please see below for the downloading links:
16-
- fastq: [fastq](https://www.dropbox.com/sh/q098af3xdzfqc72/AAA-UhZGSvmez5pOdZIN2mpRa?dl=0)
17-
- bam: [genomeBam](https://www.dropbox.com/sh/mjzbtp31cgtxato/AACPTouVgMztbArwTP9Yt0zCa?dl=0), [transcriptomeBam](https://www.dropbox.com/sh/cuyicuormo809fx/AAA9ndo8BWvGRjaByWKvrALIa?dl=0)
18-
19-
Detailed information on sample ids and corresponding sample attributes can be found [here](docs/Sample_information.txt).
20-
21-
Notes on data usage: This site provides early access to the SG-NEx data for research. Please note that the data is under publication embargo until the SG-NEx project is published.
5+
**_Notes on data usage_**: This site provides early access to the SG-NEx data. These data can be used in research and publications, but we ask data users to refrain from publishing a systematic comparison that is described in the pre-print until the final manuscript is published. If you are uncertain, please feel free to reach out (https://github.com/GoekeLab/sg-nex-data/#contact). You can sign up for the sg-nex-updates email list to receive notifications about upcoming data releases: https://groups.google.com/forum/#!forum/sg-nex-updates/join. If you use the SG-NEx data in your research, please specify the [release version](https://github.com/GoekeLab/sg-nex-data/#data-download) and cite the pre-print (see [citation](https://github.com/GoekeLab/sg-nex-data/#citing-the-SG-NEx-project)).

README.md

Lines changed: 41 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,7 @@
1-
# SG-NEx - The Singapore Nanopore-Expression Project
1+
![The Singapore Nanopore-Expression Project\!](
2+
https://jglaborg.files.wordpress.com/2021/10/sg_nex_textlogo.png)
3+
4+
[![GitHub release (latest SemVer)](https://img.shields.io/github/v/release/GoekeLab/sg-nex-data?color=blue&include_prereleases)](#data-download)
25

36
The SG-NEx project is an international collaboration that was initiated at the [Genome Institute of Singapore](https://www.a-star.edu.sg/gis/). The aim of the SG-NEx Project is to generate reference transcriptomes for 5 of the most commonly used cancer cell lines using Nanopore long read RNA-Seq data:
47

@@ -7,28 +10,52 @@ https://jglaborg.files.wordpress.com/2020/10/sg_nex_design-1.png)
710

811
Transcriptome profiling is done using PCR-cDNA sequencing ("PCR-cDNA"), amplification-free cDNA sequencing ("direct cDNA"), direct sequencing of native RNA (“direct RNA”), and short read RNA-Seq. All samples are sequenced with at least 3 high quality replicates. For a subset of samples, we used sequin spike-in RNAs.
912

13+
## Content
14+
15+
- [Email list](#sign-up-for-data-release-notifications-and-updates)
16+
- [Data Download and Release History](#data-download)
17+
- [Data Processing](#data-processing)
18+
- [Use Cases and Applications](#use-cases-and-applications)
19+
- [Data Access Tutorials](#data-access-tutorials)
20+
- [Contributors](#contributors)
21+
- [Citing the SG-NEx project](#citing-the-sg-nex-project)
22+
- [Contact](#contact)
23+
1024
## Sign up for data release notifications and updates
1125
You can sign up for the sg-nex-updates email list to receive notifications about upcoming data releases:
1226

1327
https://groups.google.com/forum/#!forum/sg-nex-updates/join
1428

15-
## Data Releases
29+
## Data Download
1630

1731
**Pre-Release (v0.1)**
1832

1933
[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.4159715.svg)](https://doi.org/10.5281/zenodo.4159715)
2034

2135
Data can be downloaded [here](DATA.md)
22-
Notes on data usage: This site provides early access to the SG-NEx data for research. Please note that the data is under publication embargo until the SG-NEx project is published.
36+
37+
_**Notes on data usage**_: This site provides early access to the SG-NEx data. These data can be used in research and publications, but we ask data users to refrain from publishing a systematic comparison that is described in the pre-print until the final manuscript is published. If you are uncertain, please feel free to reach out ([Contact](#contact)).
38+
39+
**Release History**
40+
41+
You can find previous releases here in the [release history](https://github.com/GoekeLab/sg-nex-data/releases)
2342

2443
## Data Processing
2544

2645
We collaborated with [nf-core](https://github.com/nf-core) to develop [nanoseq](https://github.com/nf-core/nanoseq), a standardardized pipeline for Nanopore RNA-Seq data processing.
2746

47+
**Reference files**
2848

29-
## Reference files
3049
Details on reference files can be found [here](ANNOTATIONS.md).
3150

51+
## Use Cases and Applications
52+
53+
You can browse a list of articles using the SG-NEx data in research [here](SGNEx_usecases.md)
54+
55+
## Data Access Tutorials
56+
57+
Coming soon! Please refer to [Data Download](#data-download) in the meantime.
58+
3259
## Contributors
3360

3461
**GIS Sequencing Platform and Data Generation**
@@ -38,10 +65,18 @@ Hwee Meng Low, Yao Fei, Sarah Ng, Wendy Soon, CC Khor
3865
Viktoriia Iakovleva, Puay Leng Lee, Lixia Xin, Hui En Vanessa Ng, Jia Min Loo, Xuewen Ong, Hui Qi Amanda Ng, Suk Yeah Polly Poon, Hoang-Dai Tran, Kok Hao Edwin Lim, Huck Hui Ng, Boon Ooi Patrick Tan, Huck-Hui Ng, N.Gopalakrishna Iyer, Wai Leong Tam, Wee Joo Chng, Leilei Chen, Ramanuj DasGupta, Yun Shen Winston Chan, Qiang Yu, Torsten Wüstefeld, Wee Siong Sho Goh
3966

4067
**Statistical Modeling and Data Analytics**
41-
Chen Ying, Nadia M. Davidson, Harshil Patel, Yuk Kei Wan, Naruemon Pratanwanich, Christopher Hendra, Laura Watten, Chelsea Sawyer, Dominik Stanojevic, Philip Andrew Ewels, Andreas Wilm, Mile Sikic, Alexandre Thiery, Michael I. Love, Alicia Oshlak, Jonathan Göke
68+
Ying Chen, Nadia M. Davidson, Harshil Patel, Yuk Kei Wan, Naruemon Pratanwanich, Christopher Hendra, Laura Watten, Chelsea Sawyer, Dominik Stanojevic, Philip Andrew Ewels, Andreas Wilm, Mile Sikic, Alexandre Thiery, Michael I. Love, Alicia Oshlak, Jonathan Göke
69+
70+
## Citing the SG-NEx project
71+
72+
If you use the SG-NEx data in your research, please specify the [release version](#data-download) and cite the pre-print that describes this data resource:
73+
74+
Chen, Ying, et al. "A systematic benchmark of Nanopore long read RNA sequencing for transcript level analysis in human cell lines." _bioRxiv_ (2021). doi: https://doi.org/10.1101/2021.04.21.440736
75+
76+
Please see the note on data usage (under [Data Download](#data-download)).
4277

4378
## Contact
4479

45-
Questions about SG-NEx? Please contact [Jonathan Göke](https://www.a-star.edu.sg/gis/our-people/faculty-staff)
80+
Questions about SG-NEx? Please add an entry in the [Discussions Forum](https://github.com/GoekeLab/sg-nex-data/discussions). You can also contact [Jonathan Göke](https://www.a-star.edu.sg/gis/our-people/faculty-staff)
4681

4782
![The Singapore Nanopore-Expression Project\!](https://jglaborg.files.wordpress.com/2020/10/sg_nex_logos-1.png)

SGNEx_usecases.md

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,24 @@
1+
### Use Cases and Applications: Research articles using the SG-NEx data (pre-release versions)
2+
3+
This site lists some examples how the SG-NEx data resource is used in research:
4+
5+
#### Transcript discovery/quantification
6+
7+
- Schulz, Laura, et al. "Direct long-read RNA sequencing identifies a subset of questionable exitrons likely arising from reverse transcription artifacts." _Genome Biology_ 22.1 (2021): 1-12. https://doi.org/10.1186/s13059-021-02411-1
8+
- Annaldasula, Siddharth, Martyna Gajos, and Andreas Mayer. "IsoTV: processing and visualizing functional features of translated transcript isoforms." _Bioinformatics_ (2021). https://doi.org/10.1093/bioinformatics/btab103
9+
10+
#### RNA modifications
11+
12+
- Pratanwanich, Ploy N., et al. "Identification of differential RNA modifications from nanopore direct RNA sequencing with xPore." _Nature Biotechnology_ (2021): 1-9. https://doi.org/10.1038/s41587-021-00949-w
13+
- Hendra, Christopher, et al. "Detection of m6A from direct RNA sequencing using a Multiple Instance Learning framework." _bioRxiv_ (2021). https://doi.org/10.1101/2021.09.20.461055
14+
- Campos, João H., et al. "Direct RNA sequencing reveals SARS-CoV-2 m6A sites and possible differential DRACH motif methylation among variants." _bioRxiv_ (2021). https://doi.org/10.1101/2021.08.24.457397
15+
16+
#### Fusion detection
17+
18+
- Davidson, Nadia M., et al. "JAFFAL: Detecting fusion genes with long read transcriptome sequencing." _bioRxiv_ (2021). https://doi.org/10.1101/2021.04.26.441398
19+
20+
#### Reviews and other use cases
21+
22+
- De Paoli-Iseppi, Ricardo, Josie Gleeson, and Michael B. Clark. "Isoform age-splice isoform profiling using long-read technologies." Frontiers in Molecular Biosciences 8 (2021). https://doi.org/10.3389/fmolb.2021.711733
23+
24+
Please feel free to add more examples by creating a pull request.

0 commit comments

Comments
 (0)