Baseline functions for aggregator & collaborator #23
-
What are the aggregator & collaborator functions that you are going to use as a baseline?
Replies: 12 comments
-
Hi @dskhanirfan, I am not sure what you mean by "aggregator & collaborator functions". Could you please elaborate?
-
There are options for aggregation_function (e.g., weighted_average_aggregation and clipped_aggregation), options for choose_training_collaborators (e.g., all_collaborators_train and one_collaborator_on_odd_rounds), and options for training_hyper_parameters_for_round (e.g., constant_hyper_parameter, train_less_each_round, and fixed_number_of_batches). Which functions will be used as the baseline for performance evaluation?
-
There is no baseline per se for participant ranking and final selection; rather, participants will be ranked against each other based on their performance scores using whatever customizations they choose. If you are asking what could be considered a baseline to compare against your own customizations, appropriate functions would be weighted_average_aggregation, all_collaborators_train, and constant_hyper_parameter, as these represent simple yet reasonable first passes at running a federation.
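For anyone looking for a concrete starting point, here is a rough sketch of what those baseline customizations might look like. The parameter lists and hyper-parameter values below are assumptions for illustration, not the official signatures from the challenge notebook, so check the notebook itself before copying anything.

```python
import numpy as np

# Illustrative sketches only -- parameter lists and default values are
# assumptions, not the official FeTS Challenge signatures.

def all_collaborators_train(collaborators, db_iterator, fl_round,
                            collaborators_chosen_each_round,
                            collaborator_times_per_round):
    """Baseline collaborator selection: every collaborator trains every round."""
    return collaborators

def constant_hyper_parameter(collaborators, db_iterator, fl_round,
                             collaborators_chosen_each_round,
                             collaborator_times_per_round):
    """Baseline hyper-parameter schedule: the same values every round."""
    learning_rate = 5e-5      # assumed value, for illustration only
    epochs_per_round = 1.0
    batches_per_round = None  # None -> train by epochs rather than a batch count
    return learning_rate, epochs_per_round, batches_per_round

def weighted_average_aggregation(local_tensors, db_iterator, tensor_name,
                                 fl_round, collaborators_chosen_each_round,
                                 collaborator_times_per_round):
    """Baseline aggregation: FedAvg-style weighted mean of collaborator tensors."""
    tensors = [t.tensor for t in local_tensors]
    weights = [t.weight for t in local_tensors]
    return np.average(tensors, weights=weights, axis=0)
```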
-
Thanks. How many iterations (rounds) are considered the baseline? The default is 5 rounds with 1 epoch each.
-
The total number of rounds was set to 5 only as a value for a short test. To run a complete test, set it to something very large (like 1000 rounds); the experiment.py script will exit once the simulated time exceeds 1 week and will return a dataframe of your results that you can use for a plot (a simulated week corresponds to a complete run; see the top-level README to find out more about simulated time). If you wish to run a shorter test, you can keep the number of rounds small. In that case the results of the experiment are calculated by projecting your training curve out to one simulated week, using the maximum performance metric value over the rounds you completed. In general, stopping your experiment short of the week of simulated training time will under-estimate the final score achievable by your method.
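To make the projection idea concrete, here is a toy sketch. The dataframe columns ("round", "simulated_time", "val_dice") and the numbers are made up for illustration; inspect the dataframe actually returned by experiment.py to see its real contents and the official scoring logic.

```python
import pandas as pd

# Hypothetical results dataframe -- column names and values are invented
# purely to illustrate the projection described above.
results = pd.DataFrame({
    "round": [1, 2, 3, 4, 5],
    "simulated_time": [12.0, 24.5, 36.1, 49.0, 61.7],  # hours, made-up values
    "val_dice": [0.51, 0.62, 0.68, 0.71, 0.73],
})

one_week_hours = 7 * 24
if results["simulated_time"].max() >= one_week_hours:
    # Complete run: the experiment reached a full simulated week.
    projected_score = results["val_dice"].iloc[-1]
else:
    # Short run: project using the best metric seen so far, which will
    # generally under-estimate what a full simulated week would achieve.
    projected_score = results["val_dice"].max()

print(f"Projected score: {projected_score:.3f}")
```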
-
Hello @brandon-edwards, I ran 5 rounds and it took me 30-40 hours; running 1000 seems like mission impossible. Currently it seems that the framework does not support multi-GPU and multi-CPU?
-
Hi @zhanghaoyue, please see the words following my suggestion to set 1000 rounds: early exit will occur, though 1 week of simulated time will indeed take a good deal of time. The OpenFL framework allows a model writer to train their model in whatever way they wish, including multi-GPU, etc. However, for the challenge, yes, the model we are using does not have data-parallel support. The primary issue is that the code producing collaborator model updates needs to result in exactly the same collaborator updates (for a given setting of the training parameters) for all participants in the challenge. Holding the model code constant (as far as what collaborator-side model updates are produced for a given setting of the training parameters) is a critical feature of this challenge, and data-parallel training (for example) generally changes the data science (it results in different collaborator-trained updates). What participants are supposed to demonstrate is improved FL logic (holding the collaborator model update creation constant but changing the four functions in the notebook). Every participant faces the same difficulty of long experiment completion times.
-
Which ML model is used in Task 1, for example a U-Net? And what do WT, ET, and TC stand for in DICE WT, DICE ET, and DICE TC?
-
It is a U-Net with residual connections.
WT, ET, and TC follow the BraTS convention and stand for the following: WT = whole tumor, ET = enhancing tumor, and TC = tumor core.
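For reference, these regions are composed from the BraTS labels, and a Dice score over a binary region mask can be computed along the following lines (a generic sketch, not code from the challenge repository):

```python
import numpy as np

def brats_region_masks(label_map: np.ndarray) -> dict:
    """Compose the standard BraTS regions from a label map containing labels 1, 2, 4.

    WT (whole tumor)     = labels 1 + 2 + 4
    TC (tumor core)      = labels 1 + 4
    ET (enhancing tumor) = label 4
    """
    return {
        "WT": np.isin(label_map, [1, 2, 4]),
        "TC": np.isin(label_map, [1, 4]),
        "ET": label_map == 4,
    }

def dice(pred: np.ndarray, target: np.ndarray, eps: float = 1e-7) -> float:
    """Plain Dice coefficient between two binary masks."""
    intersection = np.logical_and(pred, target).sum()
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)
```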
-
What is meant by label 1, label 2, and label 4? Can you also confirm that the data for the following 28 patients out of 369 is missing: 149, 248, 249, 252, 254, 255, 256, 258, 259, 262, 263, 267, 268, 271, 281, 284, 287, 289, 292, 305, 307, 314, 316, 317, 318, 320, 324, 335?
-
We follow the BraTS convention: label 1 = necrotic and non-enhancing tumor core (NCR/NET), label 2 = peritumoral edema (ED), and label 4 = GD-enhancing tumor (ET).
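If you want to double-check which labels are present in your local copy of a segmentation volume, something like the following works (the file path is hypothetical and nibabel is assumed to be installed):

```python
import numpy as np
import nibabel as nib  # assumes nibabel is installed

# Hypothetical path -- adjust to wherever your BraTS-style data lives.
seg_path = "Patient_001/Patient_001_seg.nii.gz"

seg = nib.load(seg_path).get_fdata()
labels, counts = np.unique(seg.astype(int), return_counts=True)

# Expect a subset of {0, 1, 2, 4}: background, NCR/NET, edema, enhancing tumor.
for label, count in zip(labels, counts):
    print(f"label {label}: {count} voxels")
```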
-
I believe the original query has been addressed. If you have any further questions, please comment and/or open a new discussion.