-
extract_scores: Summarise a metrics output tsv. -
dataset_concatenate: Concatenate N AnnData datasets. -
Created test data
resources_test/pancreaswithsrc/common/resources_test_scripts/pancreas.sh. -
list_git_shas: create list of latest commit hashes of all files in repo. -
get_api_info: extract api info from tasks -
get_method_info: extract method info vrom config yaml -
get_metric_info: extract metric info vrom config yaml -
check_migration_status: compare git shas from methods with v1 -
get_results: extract benchmark scores
-
api/anndata_*: Created a file format specifications for the h5ad files throughout the pipeline. -
api/comp_*: Created an api definition for the split, method and metric components. -
split_dataset: Added a component for splitting raw datasets into task-ready dataset objects. -
resources_test/label_projection/pancreaswithsrc/label_projection/resources_test_scripts/pancreas.sh.
-
methods/knn: Migrated from v1. -
methods/logistic_regression: Migrated from v1. -
methods/mlp: Migrated from v1. -
methods/scanvi: Migrated and adapted from v1. -
methods/seurat_transferdata: Migrated and adapted from v1. -
methods/xgboost: Migrated from v1. -
control_methods/majority_vote: Migrated from v1. -
control_methods/random_labels: Migrated from v1. -
control_methods/true_labels: Migrated from v1. -
metric/accuracy: Migrated from v1. -
metric/f1: Migrated from v1.
-
workflows/process_openproblems_v1: Fetch and process legacy OpenProblems v1 datasets -
normalization/log_cpm: A log CPM normalization method. -
normalization/log_scran_pooling: A log scran pooling normalization method. -
normalization/sqrt_cpm: A sqrt CPM normalization method. -
normalization/l1_sqrt: A scaled L1 sqrt normalization. extracted from Alra method in the denoising task from v1 -
subsample: Subsample an h5ad file.
loaders/openproblems_v1: Fetch a dataset from OpenProblems v1
-
api/anndata_*: Created a file format specifications for the h5ad files throughout the pipeline. -
api/comp_*: Created an api definition for the split, method and metric components. -
split_dataset: Added a component for splitting raw datasets into task-ready dataset objects. -
resources_test/denoising/pancreaswithsrc/denoising/resources_test_scripts/pancreas.sh.
-
control_methods/no_denoising: Migrated from v1. Extracted from baseline method -
control_methods/perfect_denoising: Migrated from v1.Extracted from baseline method -
methods/alra: Migrated from v1. Changed from python to R and uses lg_cpm normalised data instead of L1 sqrt -
methods/dca: Migrated and adapted from v1. -
methods/knn_smoothing: Migrated and adapted from v1. -
methods/magic: Migrated from v1. -
metrics/mse: Migrated from v1. -
metrics/poisson: Migrated from v1.
-
Anndata layers are used to store data instead of obsm
-
extended the use of sparse data in methods unless it was not possible
-
split_dataset also removes unnecessary data from train and test datasets not needed by the methods and metrics.
-
api/anndata_*: Created a file format specifications for the h5ad files throughout the pipeline. -
api/comp_*: Created an api definition for the split, control method, method and metric components. -
split_dataset: Added a component for splitting raw datasets into task-ready dataset objects. -
control_methods: Added a component for baseline methods specifically. -
resources_test/dimensionality_reduction/pancreaswithsrc/dimensionality_reduction/resources_test_scripts/pancreas.sh.
-
control_methods/high_dim_pca: Migrated from v1. Extracted from baseline methodHigh-dimensional PCA. -
control_methods/random_features: Migrated from v1. Extracted from baseline methodRandom Features. -
methods/umap: Migrated from v1. -
methods/tsne: Migrated and adapted from v1. -
methods/densmap: Migrated and adapted from v1. -
methods/phate: Migrated from v1. -
metrics/rmse: Migrated from v1. -
metrics/trustworthiness: Migrated from v1. -
metrics/density: Migrated from v1.
-
Anndata layers are used to store normalized and raw counts instead of
.X. -
Metrics are stored in
.unsdata. -
split_datasetremoves nonessential data from train and test datasets for the methods and metrics. -
Higher dimensional data used to obtain the metrics is calculated from test data instead of the whole dataset. So far test and train data contain the same counts values, but this may change eventually.
-
Test data is used instead of the whole dataset in control (baseline) methods.
-
api/anndata_*: Created a file format specifications for the h5ad files throughout the pipeline. -
api/comp_*: Created an api definition for the mask, method and metric components. -
mask_dataset: Added a component for masking raw datasets into task-ready dataset objects. -
resources_test/joint_embedding/pancreaswithsrc/joint_embedding/resources_test_scripts/pancreas.sh.
-
control_methods/random_embed: Migrated from neurips 2021. Extracted from baseline methoddummy_random. -
control_methods/zeros_embed: Migrated from neurips 2021. Extracted from baseline methoddummy_zeros. -
methods/lmds: Migrated from neurips 2021. -
methods/mnn: Migrated and adapted from neurips 2021. -
methods/newwave: Migrated and adapted from neurips 2021. -
methods/pca: Migrated from neurips 2021. -
methods/totalvi: Migrated from neurips 2021. -
methods/umap: Migrated from neurips 2021. -
metrics/ari: Migrated from neurips 2021. -
metrics/asw_batch: Migrated from neurips 2021. -
metrics/asw_label: Migrated from neurips 2021. -
metrics/cc_cons: Migrated from neurips 2021. -
metrics/check_format: Migrated from neurips 2021. -
metrics/graph_connectivity: Migrated from neurips 2021. -
metrics/latent_mixing: Migrated from neurips 2021. -
metrics/nmi: Migrated from neurips 2021. -
metrics/rfoob: Migrated from neurips 2021. -
metrics/ti_cons: Migrated from neurips 2021. -
metrics/ti_cons_batch: Migrated from neurips 2021.
-
Updated docker config from R script. Was using an old
anndatapackage which was giving warnings -
stores the output from the methods in
.obsm["X_emb"]instead of.Xin theanndata -
X_embdata is stored as aSparse Matrix -
updated configs to latest
viash