A t-test with FDR correction would probably work fine as a first pass at this. We should also compare the signal against a shuffled-prediction baseline.
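
Rough sketch of what that could look like, with a lot of assumptions on my part: one test per feature (columns of a samples-by-features array), Welch's t-test, Benjamini-Hochberg as the FDR procedure, and Pearson correlation as the prediction score. The names `bh_adjust`, `ttest_fdr`, and `shuffled_null` are made up for illustration, and the shuffle here breaks the pairing between predictions and targets, which is only one way to read "shuffled prediction".

```python
import numpy as np
from scipy import stats


def bh_adjust(pvals):
    """Benjamini-Hochberg adjusted p-values (assumed FDR procedure)."""
    p = np.asarray(pvals, dtype=float)
    m = p.size
    order = np.argsort(p)
    # p_(i) * m / i for the i-th smallest p-value
    scaled = p[order] * m / np.arange(1, m + 1)
    # Enforce monotonicity, working down from the largest p-value
    scaled = np.minimum.accumulate(scaled[::-1])[::-1]
    q = np.empty(m)
    q[order] = np.clip(scaled, 0.0, 1.0)
    return q


def ttest_fdr(group_a, group_b, alpha=0.05):
    """Welch t-test per column, then BH correction across columns."""
    t, p = stats.ttest_ind(group_a, group_b, axis=0, equal_var=False)
    q = bh_adjust(p)
    return t, p, q, q <= alpha


def shuffled_null(pred, target, n_shuffles=1000, seed=0):
    """Observed score vs. the same score with predictions shuffled.

    The score is Pearson correlation (an assumption); shuffling the
    predictions destroys their pairing with the targets.
    """
    rng = np.random.default_rng(seed)
    observed = np.corrcoef(pred, target)[0, 1]
    null = np.array([
        np.corrcoef(rng.permutation(pred), target)[0, 1]
        for _ in range(n_shuffles)
    ])
    # One-sided permutation p-value with the usual +1 correction
    p_perm = (1 + np.sum(null >= observed)) / (1 + n_shuffles)
    return observed, null, p_perm
```

If the conditions are measured on the same units, `stats.ttest_rel` would be the paired alternative to `ttest_ind`. The shuffled null also gives a sanity check that doesn't lean on the t-test's distributional assumptions.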