openproblems-v2 0.1.0

common

NEW FUNCTIONALITY

extract_scores: Summarise a metrics output tsv.
dataset_concatenate: Concatenate N AnnData datasets.
Created test data resources_test/pancreas with src/common/resources_test_scripts/pancreas.sh.
list_git_shas: create list of latest commit hashes of all files in repo.
get_api_info: extract api info from tasks
get_method_info: extract method info vrom config yaml
get_metric_info: extract metric info vrom config yaml
check_migration_status: compare git shas from methods with v1
get_results: extract benchmark scores

label_projection

NEW FUNCTIONALITY

api/anndata_*: Created a file format specifications for the h5ad files throughout the pipeline.
api/comp_*: Created an api definition for the split, method and metric components.
split_dataset: Added a component for splitting raw datasets into task-ready dataset objects.
resources_test/label_projection/pancreas with src/label_projection/resources_test_scripts/pancreas.sh.

V1 MIGRATION

methods/knn: Migrated from v1.
methods/logistic_regression: Migrated from v1.
methods/mlp: Migrated from v1.
methods/scanvi: Migrated and adapted from v1.
methods/seurat_transferdata: Migrated and adapted from v1.
methods/xgboost: Migrated from v1.
control_methods/majority_vote: Migrated from v1.
control_methods/random_labels: Migrated from v1.
control_methods/true_labels: Migrated from v1.
metric/accuracy: Migrated from v1.
metric/f1: Migrated from v1.

datasets

NEW FUNCTIONALITY

workflows/process_openproblems_v1: Fetch and process legacy OpenProblems v1 datasets
normalization/log_cpm: A log CPM normalization method.
normalization/log_scran_pooling: A log scran pooling normalization method.
normalization/sqrt_cpm: A sqrt CPM normalization method.
normalization/l1_sqrt: A scaled L1 sqrt normalization. extracted from Alra method in the denoising task from v1
subsample: Subsample an h5ad file.

V1 MIGRATION

loaders/openproblems_v1: Fetch a dataset from OpenProblems v1

denoising

NEW FUNCTIONALITY

api/anndata_*: Created a file format specifications for the h5ad files throughout the pipeline.
api/comp_*: Created an api definition for the split, method and metric components.
split_dataset: Added a component for splitting raw datasets into task-ready dataset objects.
resources_test/denoising/pancreas with src/denoising/resources_test_scripts/pancreas.sh.

V1 MIGRATION

control_methods/no_denoising: Migrated from v1. Extracted from baseline method
control_methods/perfect_denoising: Migrated from v1.Extracted from baseline method
methods/alra: Migrated from v1. Changed from python to R and uses lg_cpm normalised data instead of L1 sqrt
methods/dca: Migrated and adapted from v1.
methods/knn_smoothing: Migrated and adapted from v1.
methods/magic: Migrated from v1.
metrics/mse: Migrated from v1.
metrics/poisson: Migrated from v1.

Changes from V1

Anndata layers are used to store data instead of obsm
extended the use of sparse data in methods unless it was not possible
split_dataset also removes unnecessary data from train and test datasets not needed by the methods and metrics.

Dimensionality reduction

New functionality

api/anndata_*: Created a file format specifications for the h5ad files throughout the pipeline.
api/comp_*: Created an api definition for the split, control method, method and metric components.
split_dataset: Added a component for splitting raw datasets into task-ready dataset objects.
control_methods: Added a component for baseline methods specifically.
resources_test/dimensionality_reduction/pancreas with src/dimensionality_reduction/resources_test_scripts/pancreas.sh.

V1 migration

control_methods/high_dim_pca: Migrated from v1. Extracted from baseline method High-dimensional PCA.
control_methods/random_features: Migrated from v1. Extracted from baseline method Random Features.
methods/umap: Migrated from v1.
methods/tsne: Migrated and adapted from v1.
methods/densmap: Migrated and adapted from v1.
methods/phate: Migrated from v1.
metrics/rmse: Migrated from v1.
metrics/trustworthiness: Migrated from v1.
metrics/density: Migrated from v1.

Changes from V1

Anndata layers are used to store normalized and raw counts instead of .X.
Metrics are stored in .uns data.
split_dataset removes nonessential data from train and test datasets for the methods and metrics.
Higher dimensional data used to obtain the metrics is calculated from test data instead of the whole dataset. So far test and train data contain the same counts values, but this may change eventually.
Test data is used instead of the whole dataset in control (baseline) methods.

Multi modality - Joint Embedding

New functionality

api/anndata_*: Created a file format specifications for the h5ad files throughout the pipeline.
api/comp_*: Created an api definition for the mask, method and metric components.
mask_dataset: Added a component for masking raw datasets into task-ready dataset objects.
resources_test/joint_embedding/pancreas with src/joint_embedding/resources_test_scripts/pancreas.sh.

neurips 2021 migration

control_methods/random_embed: Migrated from neurips 2021. Extracted from baseline method dummy_random.
control_methods/zeros_embed: Migrated from neurips 2021. Extracted from baseline method dummy_zeros.
methods/lmds: Migrated from neurips 2021.
methods/mnn: Migrated and adapted from neurips 2021.
methods/newwave: Migrated and adapted from neurips 2021.
methods/pca: Migrated from neurips 2021.
methods/totalvi: Migrated from neurips 2021.
methods/umap: Migrated from neurips 2021.
metrics/ari: Migrated from neurips 2021.
metrics/asw_batch: Migrated from neurips 2021.
metrics/asw_label: Migrated from neurips 2021.
metrics/cc_cons: Migrated from neurips 2021.
metrics/check_format: Migrated from neurips 2021.
metrics/graph_connectivity: Migrated from neurips 2021.
metrics/latent_mixing: Migrated from neurips 2021.
metrics/nmi: Migrated from neurips 2021.
metrics/rfoob: Migrated from neurips 2021.
metrics/ti_cons: Migrated from neurips 2021.
metrics/ti_cons_batch: Migrated from neurips 2021.

changes from neurips 2021

Updated docker config from R script. Was using an old anndata package which was giving warnings
stores the output from the methods in .obsm["X_emb"] instead of .X in the anndata
X_emb data is stored as a Sparse Matrix
updated configs to latest viash

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

openproblems-v2 0.1.0

common

NEW FUNCTIONALITY

label_projection

NEW FUNCTIONALITY

V1 MIGRATION

datasets

NEW FUNCTIONALITY

V1 MIGRATION

denoising

NEW FUNCTIONALITY

V1 MIGRATION

Changes from V1

Dimensionality reduction

New functionality

V1 migration

Changes from V1

Multi modality - Joint Embedding

New functionality

neurips 2021 migration

changes from neurips 2021

FilesExpand file tree

CHANGELOG.md

Latest commit

History

CHANGELOG.md

File metadata and controls

openproblems-v2 0.1.0

common

NEW FUNCTIONALITY

label_projection

NEW FUNCTIONALITY

V1 MIGRATION

datasets

NEW FUNCTIONALITY

V1 MIGRATION

denoising

NEW FUNCTIONALITY

V1 MIGRATION

Changes from V1

Dimensionality reduction

New functionality

V1 migration

Changes from V1

Multi modality - Joint Embedding

New functionality

neurips 2021 migration

changes from neurips 2021