I’m currently using the CNN implementation from [ #74 ] to preprocess depth-camera frames. Previously, I was feeding all observations (including images) into the ActorCriticRecurrent module.
I’d like to feed the raw camera observation into the CNN encoder, then concatenate its output embedding with my other (non-image) observations, and finally pass that combined vector through the rest of the ActorCriticRecurrent network. Is this straightforward?