Skip to content

Commit 70b50ff

Browse files
committed
Removed previous mapping files because vast dataset changes and renaming
1 parent 57b9fe0 commit 70b50ff

File tree

4 files changed

+3
-531418
lines changed

4 files changed

+3
-531418
lines changed

coderbuild/hcmi/02-getHCMIData.py

Lines changed: 0 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -802,11 +802,8 @@ def deduplicate_final_csv(csv_path, subset=None):
802802
-------
803803
None (file is overwritten in-place)
804804
"""
805-
# 1. build a temp file with .csv.gz suffix
806805
fd, tmp = tempfile.mkstemp(suffix=".csv.gz")
807806
os.close(fd)
808-
809-
# 2. lazy-scan original CSV, drop dupes, write compressed
810807
(
811808
pl.scan_csv(csv_path)
812809
.unique(subset=subset, maintain_order=True)
@@ -816,17 +813,10 @@ def deduplicate_final_csv(csv_path, subset=None):
816813
separator=",")
817814
)
818815

819-
# 3. define the final output name
820816
out_path = csv_path + ".gz"
821-
822-
# 4. atomically move temp → final
823817
os.replace(tmp, out_path)
824-
825-
# 5. remove the old uncompressed CSV
826818
os.remove(csv_path)
827-
828819
print(f"De-duplicated and gzipped file written to: {out_path}")
829-
830820
# 6. free memory
831821
gc.collect()
832822

coderbuild/hcmi/build_omics.sh

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,10 +4,10 @@ set -euo pipefail
44
trap 'echo "Error on or near line $LINENO while executing: $BASH_COMMAND"; exit 1' ERR
55

66
echo "Running 02-getHCMIData.py for transcriptomics."
7-
python 02-getHCMIData.py -m full_manifest.txt -t transcriptomics -o /tmp/hcmi_transcriptomics.csv.gz -g $1 -s $2
7+
python 02-getHCMIData.py -m full_manifest.txt -t transcriptomics -o /tmp/hcmi_transcriptomics.csv -g $1 -s $2
88

99
echo "Running 02-getHCMIData.py for copy_number."
10-
python 02-getHCMIData.py -m full_manifest.txt -t copy_number -o /tmp/hcmi_copy_number.csv.gz -g $1 -s $2
10+
python 02-getHCMIData.py -m full_manifest.txt -t copy_number -o /tmp/hcmi_copy_number.csv -g $1 -s $2
1111

1212
echo "Running 02-getHCMIData.py for mutations."
13-
python 02-getHCMIData.py -m full_manifest.txt -t mutations -o /tmp/hcmi_mutations.csv.gz -g $1 -s $2
13+
python 02-getHCMIData.py -m full_manifest.txt -t mutations -o /tmp/hcmi_mutations.csv -g $1 -s $2

0 commit comments

Comments
 (0)