I attempted fine-tuning on a dataset of solo and group portrait images (33k images with corresponding 33k alpha masks). However, I observed that after each epoch, the quality of the predicted masks progressively degrades.
I am using a matting task setup with pretrained weights (model.safetensor). All images in the dataset are at 1K resolution.
Could you help identify possible reasons for this issue?