Hi again,
I have started using PyTorch with my RTX 3080, which seems fast enough to experiment with (I'd been put off in the past by training times).
I'm tinkering with denoising autoencoders (and eventually want to try using parts of the Stable Diffusion model that's all the rage now, but initially I'm experimenting with my own smaller examples). My current intention is to make something that enhances lo-res retro/indie game art with neural nets, so I'm getting the pieces in place: a simple runtime, and something in PyTorch for training.
What I've got in mind:
I'd like to grab the ImageMonkey database to set up training for the following in particular:
Road+Pavement
left/man, right/man, left/woman, right/woman
... plus the entire label list
These labels and a lot of the examples are locked. Would you be able to approve them and make them accessible to me somehow?
There are 3000 person outlines in the format I'm after (1500 x left/right man, 1500 x left/right woman).
I'll try to set up multi-task training:
- an autoencoder that uses all images, regardless of annotations
- prediction of the full label list, regardless of annotation (e.g. using scene labels, etc.)
- per-pixel segmentation of the annotations
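To make the three tasks above concrete, here is a rough sketch of how they might share one encoder in PyTorch. Everything in it is an assumption for illustration, not the code in the repo: the names `MultiTaskNet`, `multitask_loss` and `IGNORE_INDEX`, the tiny layer sizes, and the loss weights are all made up. Unannotated pixels are marked with `IGNORE_INDEX` so images without outlines still contribute to the autoencoder and label losses.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

IGNORE_INDEX = 255  # hypothetical marker for pixels that have no annotation


class MultiTaskNet(nn.Module):
    """Shared encoder with three heads: reconstruction, image labels, per-pixel classes."""

    def __init__(self, num_labels: int, num_seg_classes: int, width: int = 16):
        super().__init__()
        # Two stride-2 convolutions: H x W -> H/4 x W/4
        self.encoder = nn.Sequential(
            nn.Conv2d(3, width, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(width, width * 2, 3, stride=2, padding=1), nn.ReLU(),
        )
        # Two stride-2 transposed convolutions: back up to H x W
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(width * 2, width, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(width, width, 4, stride=2, padding=1), nn.ReLU(),
        )
        self.recon_head = nn.Conv2d(width, 3, 1)               # autoencoder output
        self.seg_head = nn.Conv2d(width, num_seg_classes, 1)   # per-pixel class logits
        self.label_head = nn.Linear(width * 2, num_labels)     # whole-image label logits

    def forward(self, x):
        z = self.encoder(x)
        d = self.decoder(z)
        pooled = z.mean(dim=(2, 3))  # global average pool over the bottleneck
        return self.recon_head(d), self.label_head(pooled), self.seg_head(d)


def multitask_loss(model, noisy, clean, labels, seg_target, weights=(1.0, 1.0, 1.0)):
    """Weighted sum of the three task losses; returns (total, dict of parts)."""
    recon, label_logits, seg_logits = model(noisy)
    loss_recon = F.mse_loss(recon, clean)  # uses every image
    loss_label = F.binary_cross_entropy_with_logits(label_logits, labels)  # multi-label
    if (seg_target != IGNORE_INDEX).any():
        loss_seg = F.cross_entropy(seg_logits, seg_target, ignore_index=IGNORE_INDEX)
    else:
        # No annotated pixels in this batch: contribute zero but keep the graph intact
        loss_seg = seg_logits.sum() * 0.0
    total = weights[0] * loss_recon + weights[1] * loss_label + weights[2] * loss_seg
    return total, {"recon": loss_recon, "label": loss_label, "seg": loss_seg}
```

The input height and width need to be multiples of 4 for the decoder to land back on the input resolution. The explicit check before the segmentation loss is there because averaging cross-entropy over a batch where every pixel is ignored would give NaN.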
I've managed to adapt my own "DataLoader" in PyTorch for a denoising autoencoder, which I'll extend to do all this.
I'm putting the code in this repo, and could easily make a dedicated "ImageMonkey DataLoader" and "ImageMonkey trainer". Since I know my way around the annotations here, I can use the graph and so on.
(The repo has a simple OpenCL inference test that's intended to grow into a little library to integrate with game engines, plus my PyTorch training setup.)
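For reference, the denoising data loading mentioned above looks roughly like the following. This is a simplified sketch rather than the exact code in the repo: `DenoisingDataset` and `noise_std` are illustrative names, the images are assumed to be already loaded as a float tensor in [0, 1], and Gaussian noise stands in for whatever corruption ends up being used.

```python
import torch
from torch.utils.data import Dataset, DataLoader


class DenoisingDataset(Dataset):
    """Yields (noisy, clean) pairs from an in-memory tensor of images in [0, 1]."""

    def __init__(self, images: torch.Tensor, noise_std: float = 0.1):
        self.images = images        # shape (N, C, H, W)
        self.noise_std = noise_std  # standard deviation of the added Gaussian noise

    def __len__(self):
        return self.images.shape[0]

    def __getitem__(self, idx):
        clean = self.images[idx]
        # Fresh noise on every access, so each epoch sees a different corruption
        noisy = (clean + self.noise_std * torch.randn_like(clean)).clamp(0.0, 1.0)
        return noisy, clean


# Random tensors stand in for real images here
loader = DataLoader(DenoisingDataset(torch.rand(8, 3, 32, 32)), batch_size=4, shuffle=True)
noisy_batch, clean_batch = next(iter(loader))
```

Extending `__getitem__` to also return a label vector and a per-pixel mask would be the natural next step for the multi-task setup.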