Skip to content
Discussion options

You must be logged in to vote

So I took a look at what the issue is and I'm fairly confident that the issue is inside of your loss function where you're never actually using anything that is going to touch the forwards distribution of the variational GP.

For the GridVariationalStrategy, your forwards pass is using solely the variational strategy (for a code ref see here ). This is pointed out in the appendix of the svdkl paper that describes the strategy (although really buried), see appendix b around eq. 12: https://arxiv.org/pdf/1611.00336.pdf.

Thus, to train the GP hypers you need to either a) use a different variational strategy or b) to refactor your loss to include the KL(q||p) term between the approximate GP po…

Replies: 3 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@nicob15
Comment options

Comment options

You must be logged in to vote
1 reply
@nicob15
Comment options

Answer selected by nicob15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants