Skip to content

Commit e624917

Browse files
authored
Update README.md
move data description to a separate section
1 parent d518162 commit e624917

File tree

1 file changed

+6
-3
lines changed

1 file changed

+6
-3
lines changed

Pilot1/NT3/README.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,14 @@
11
The Pilot1 Benchmark 4, commonly referred to as NT3, is a 1D convolutional network for classifying RNA-seq gene expression profiles into normal or tumor tissue categories.
22
The network follows the classic architecture of convolutional models with multiple 1D convolutional layers interleaved with pooling layers followed by final dense layers.
33
The network can optionally use 1D locally connected layers in place of convolution layers as well as dropout layers for regularization.
4-
The model is trained on the balanced 700 matched normal-tumor gene expression profile pairs available from the NCI genomic data commons.
5-
The full set of expression features contains 60,483 float columns transformed from RNA-seq FPKM-UQ values. This model achieves around 98% classification accuracy.
64
It is useful for studying the difference and transformation of latent representation between normal and tumor tissues.
75
The model also acts as a quality control check for synthetically generated gene expression profiles.
86

7+
## Data
8+
The model is trained on the balanced 700 matched normal-tumor gene expression profile pairs available from the NCI genomic data commons.
9+
The full set of expression features contains 60,483 float columns from RNA-seq [FPKM-UQ](https://docs.gdc.cancer.gov/Encyclopedia/pages/HTSeq-FPKM-UQ/) values. This model achieves around 98% classification accuracy.
10+
The associated metadata for the samples can be found [here](http://ftp.mcs.anl.gov/pub/candle/public/benchmarks/Pilot1/normal-tumor/matched_normal_samples.metadata.tsv).
11+
912
## Profile runs
1013
We have run the same configuration across multiple machines and compared the resource utilization.
1114
```
@@ -19,4 +22,4 @@ python uno_baseline_keras2.py --conf nt3_perf_benchmark.txt
1922
| Tesla (K20) | 1:29:06 | 103 | 0.96 | 18.1 | 25.8 | 6.8 | 97.3 | 45.3 | |
2023
| Titan | 1:07:31 | 99 | 1.00 | 5.1 | 15.5 | 3.8 | | | |
2124
1. MKL-DNN does not support Conv1D. need tf1.11
22-
* Time per epoch on the machine divided by time per epoch of Titan (or Tesla)
25+
* Time per epoch on the machine divided by time per epoch of Titan (or Tesla)

0 commit comments

Comments
 (0)