You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Master does not delete and relaunch OOM workers. (#2107)
* Fix version
* Retry to get table size to avoid ReadTimeout
* fix extras in setup
* Calculate logit using DNN output
* Extend tf.keras.layers.Layer
* Modify the tile of building models with structured data using ElasticDL
* Don't relaunch OOM pod
* Don't relaunch OOM pod
* Don't remove the timeout pod
* Only remove timeout worker
* Restore mistakes
* Add a annotation
* Format codes
* Don't relaunch OOM pod
* Recover tasks when the worker failed
* Remove unused imports
* Format codes
* Print log for unit test
* Recover task for failed workers
* Remove log
0 commit comments