SpuriousFL

Repository structure

project
│   README.md
│   .gitignore
│   flower_train.py # Run Flower training
│   centralized_training.py # Train centralized models
│   biased_training.py # Test N matrix prediction
│
├───src # Source code
│   │   flower_client.py # Client Flower class: client optimizers, client info
│   │   flower_strategy.py # Server Flower class: server optimizers, client weighting
│   │   flower_manager.py # Flower manager for client selection
│   │   corr.py # N matrix metrics SC, CI, AI
│   │   matrix_inference.py # N matrix inference
│   │   weighting_strategy.py # Helpers for imported selection and weighting strategies
│   ├───datasets # Dataset loaders, splits
│   ├───models # PyTorch models
│   └───optimizers # Subpopulation shift optimizers
├───conf # Hydra config files
├───datasets # Dataset files or links
└───checkpoints # Experiment files

Important config parameters:

FedProx is a client optimizer method: change client_opt.subpop_optimizer from ERM to Prox to enable it.
The server aggregation is set in server_opt.optimizer, which can be FedAvg or FedAvgM.
Client participation can be controlled by weighting or by selection: server_opt.participation can be weighting or selection.
- If weighting is used, server_opt.weight_clients sets the specific weighting method. The weighting is applied even when selection is set (this is needed for FedPNS); set server_opt.weight_clients=same to disable it.
- If selection is used, server_opt.selection_method sets the specific selection method. It can be random, groupweights (original matrix with Oracle ReWeight), or triplets_stochasticmatrix for triplets.
Some methods use information from the clients. server_opt.client_info specifies which data is shared.
- If groupweights, triplets or nova is in the string, the corresponding information is passed to the server.
- If Npredicted is in the string, the numbers are generated with the N matrix prediction algorithm.
The named data splits are controlled with dataset_options.split_mode. Most options are defined in src.datasets.data_splits.py
- dataset_options.num_clients must be set together with split_mode.
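
The substring rule for server_opt.client_info can be sketched as below. This is a minimal illustration, not the repository's code: the function name requested_infos and the KNOWN_KEYWORDS tuple are assumptions, and only the keywords named above are included.

```python
# Hypothetical helper illustrating the "keyword in string" rule described above.
# It is not taken from the repository.
KNOWN_KEYWORDS = ("groupweights", "triplets", "nova", "Npredicted")


def requested_infos(client_info: str) -> list[str]:
    """Return the known keywords that occur as substrings of client_info."""
    return [kw for kw in KNOWN_KEYWORDS if kw in client_info]


if __name__ == "__main__":
    # Both "triplets" and "Npredicted" are substrings of this value.
    print(requested_infos("triplets_Npredicted"))
```

Because the check is a plain substring match, several keywords can be combined in one string, as in triplets_Npredicted.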

Methods

FedProx: client_opt.subpop_optimizer=Prox
FedAvgM: server_opt.optimizer=FedAvgM
FedDiverse: server_opt.participation=selection, server_opt.selection_method=triplets_stochasticmatrix, triplets_Npredicted in server_opt.client_info
- as weighting: server_opt.weight_clients=server_post_triplets_stochasticmatrix_noreplacement, server_opt.participation=weighting
FedNova: nova in server_opt.client_info, server_opt.participation=weighting, server_opt.weight_clients=server_post_nova, server_opt.optimizer=FedAvgM
pow-d: gloss in server_opt.client_info, server_opt.participation=weighting, server_opt.weight_clients=server_post_powd
round robin: server_opt.participation=selection, server_opt.selection_method=roundrobin
FedPNS: server_opt.participation=selection, server_opt.selection_method=fedpns, server_opt.weight_clients=server_post_fedpns, gloss in server_opt.client_info
FedPNS w/o weights: server_opt.participation=selection, server_opt.selection_method=fedpns, server_opt.weight_clients=same, gloss in server_opt.client_info
Oort: oort in server_opt.client_info, server_opt.selection_method=oort, server_opt.participation=selection
HCSFed: compgrad in server_opt.client_info, server_opt.selection_method=hcsfed
FairFed: groupacc in server_opt.client_info, server_opt.selection_method=fairfed or server_opt.weight_clients=server_post_FairFed
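
A few of the method settings above can be turned into a command line as sketched below. The dictionary METHOD_OVERRIDES and the function build_command are hypothetical helpers, not part of the repository. For FedDiverse the list only says that triplets_Npredicted must appear in server_opt.client_info, so using it as the whole value is an assumption.

```python
import shlex

# Overrides copied from the method list above (hypothetical helper, not
# repository code). Setting client_info to exactly "triplets_Npredicted" is an
# assumption: the list only requires that the string contains it.
METHOD_OVERRIDES = {
    "FedProx": ["client_opt.subpop_optimizer=Prox"],
    "FedAvgM": ["server_opt.optimizer=FedAvgM"],
    "FedDiverse": [
        "server_opt.participation=selection",
        "server_opt.selection_method=triplets_stochasticmatrix",
        "server_opt.client_info=triplets_Npredicted",
    ],
    "round_robin": [
        "server_opt.participation=selection",
        "server_opt.selection_method=roundrobin",
    ],
}


def build_command(method: str, script: str = "flower_train.py") -> list[str]:
    """Return the argv list that runs `script` with the overrides of `method`."""
    return ["python", script, *METHOD_OVERRIDES[method]]


if __name__ == "__main__":
    # Print the command instead of running it, so it can be inspected first.
    print(shlex.join(build_command("FedDiverse")))
```

The command is returned as a list so it can be passed to subprocess.run without shell quoting issues.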

Dataset distributions

Spawrious_GSC: dataset_options.split_mode=spawrious2, dataset_options.num_clients=24
Spawrious_GCI: dataset_options.split_mode=spawrious_GCI, dataset_options.num_clients=24
Spawrious_GAI: dataset_options.split_mode=spawrious_GAI_2, dataset_options.num_clients=25
Waterbirds_dist: dataset_options.split_mode=waterbirds_dist, dataset_options.num_clients=30
Spawrious_4: dataset_options.split_mode=spawrious4, dataset_options.num_clients=25, dataset_options.num_targets=4
CMNIST_GSC: dataset_options.split_mode=spawrious2, dataset_options.num_clients=24, dataset_options.name=CMNIST, dataset_options.input_size=28
Spawrious_GCI_100: dataset_options.split_mode=spawrious_GCI_100, dataset_options.num_clients=100, server_opt.num_active_clients=12
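
Since dataset_options.num_clients has to be set together with dataset_options.split_mode, a small pre-flight check on the override list can catch a missing value before a run starts. The function check_split_overrides below is a hypothetical sketch and does not exist in the repository.

```python
# Hypothetical pre-flight check, not part of the repository: split_mode and
# num_clients should be overridden together.
REQUIRED_TOGETHER = ("dataset_options.split_mode", "dataset_options.num_clients")


def check_split_overrides(overrides: list[str]) -> None:
    """Raise ValueError if only one of split_mode / num_clients is overridden."""
    keys = {item.split("=", 1)[0] for item in overrides}
    present = [key for key in REQUIRED_TOGETHER if key in keys]
    if len(present) == 1:
        missing = next(key for key in REQUIRED_TOGETHER if key not in keys)
        raise ValueError(f"{present[0]} is set but {missing} is missing")


if __name__ == "__main__":
    # Waterbirds_dist from the list above: both keys are present, so no error.
    check_split_overrides([
        "dataset_options.split_mode=waterbirds_dist",
        "dataset_options.num_clients=30",
    ])
```

An empty override list passes the check, because the defaults in conf are then used for both values.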

Run

Example run: python flower_train.py

Requirements

python 3.10.11
flwr[simulation]==1.8.0
torch
torchvision
matplotlib
wilds
hydra-core
timm
cvxopt
wandb
git+https://github.com/aengusl/spawrious.git
submitit
hydra-submitit-launcher
datasets[vision]
aif360
