Commit 16fd58b

2 parents 02187d4 + daf3671 commit 16fd58b

1 file changed: +7 -1 lines changed

scripts/download_ensembl.sh

Lines changed: 7 additions & 1 deletion
@@ -1,7 +1,7 @@
 #!/bin/bash
 # Download and process data for a particular species from an Ensembl release, e.g.
 #
-# download_ensembl mouse 95 <your_email> /srv/data/ensembl
+# download_ensembl mouse 95 <your_email> /srv/data/genome/mouse/ensembl-95
 #
 # The script:
 # 1) Downloads top-level sequences for the species' genome in FASTA format
@@ -178,3 +178,9 @@ fi
 # Generating refFlat file for Picard RNA-seq metrics
 mkdir -p picard
 generate_picard_refFlat picard ${SPECIES} ${VERSION} ${gtf_file}
+
+# Generating get_gene_lengths file
+echo "Running get_gene_lengths for species ...."
+get_gene_lengths ${gtf_file} > ./gene_lengths.csv
+# Construct transcript->gene mapping file for tximport
+awk '$3=="transcript" {print $14, $10}' ${gtf_file} | sed 's/"//g;s/;//g' > ./tx2gene.tsv
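
The added awk line picks the transcript and gene IDs by position: with whitespace splitting, `$10` and `$14` hold the `gene_id` and `transcript_id` values only when the attribute column starts with `gene_id`, `gene_version`, `transcript_id` in that order. Because `print $14, $10` uses awk's default output separator, the resulting file is space-separated despite its `.tsv` extension. The sketch below runs the commit's command on a made-up two-record GTF and compares it with a key-based lookup that does not depend on attribute order. The sample records and the names `sample_gtf`, `positional`, and `by_key` are assumptions for illustration and are not part of the commit.

```shell
#!/bin/bash
# Hypothetical two-record GTF using the Ensembl attribute layout (illustrative data only).
sample_gtf=$(mktemp)
printf '1\thavana\tgene\t11869\t14409\t.\t+\t.\tgene_id "ENSG00000223972"; gene_version "5";\n' > "$sample_gtf"
printf '1\thavana\ttranscript\t11869\t14409\t.\t+\t.\tgene_id "ENSG00000223972"; gene_version "5"; transcript_id "ENST00000456328"; transcript_version "2";\n' >> "$sample_gtf"

# Positional extraction, exactly as in the commit: fields are split on
# whitespace, so $10 is the gene_id value and $14 the transcript_id value.
positional=$(awk '$3=="transcript" {print $14, $10}' "$sample_gtf" | sed 's/"//g;s/;//g')

# Key-based alternative (an assumption, not the commit's method): split the
# ninth tab-separated column on ";" and look each attribute up by name.
by_key=$(awk -F'\t' '$3=="transcript" {
  tx = ""; gene = ""
  n = split($9, attrs, ";")
  for (i = 1; i <= n; i++) {
    a = attrs[i]
    sub(/^ +/, "", a)                      # drop the space after each ";"
    if (a ~ /^transcript_id /) { tx = a; sub(/^transcript_id /, "", tx); gsub(/"/, "", tx) }
    else if (a ~ /^gene_id /)  { gene = a; sub(/^gene_id /, "", gene); gsub(/"/, "", gene) }
  }
  if (tx != "" && gene != "") print tx, gene
}' "$sample_gtf")

echo "$positional"
echo "$by_key"
rm -f "$sample_gtf"
```

On this sample both approaches print the same transcript/gene pair. The key-based form is longer, but it would keep working on a GTF whose attribute column omits `gene_version` or orders its keys differently.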
