tempered posterior experiment (Temperature) with log scores by leokim-l · Pull Request #30 · P2GX/BOQA

leokim-l · 2025-11-21T09:00:54Z

The base branch is the "add parameters" branch, because I needed that function.

leokim-l · 2025-11-21T13:34:34Z

This method will be implemented after the first submission as an alternative method if it ends up yielding better results. We agree that @hansenp will implement log scores in BOQA and a prioritiser for his method in Exomiser (boqa branch).

Please leave this branch open as a draft; I will port it where appropriate after Xmas.

leokim-l · 2025-12-16T15:21:40Z

After careful consideration, the tempering of the posterior has been placed inside the BOQA repository, namely in computeBoqaResults, since the default temperature of 1.0 is exactly standard BOQA. The only not-so-pretty part is the if statement that checks the temperature: if it is 1.0, the standard normalization is applied; otherwise we divide by the largest score (careful, I really mean score, not log(score)). We changed our minds multiple times about where things should live, in Exomiser or in BOQA. I do not believe that it would have made sense to rewrite some of the logic present here in Exomiser's BoqaPrioritiser. Note that calling computeBoqaResults with a temperature different from 1.0 now changes the normalization.
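The branching described above can be sketched roughly as follows. This is a minimal illustration, not the actual computeBoqaResults code: the class `TemperedNormalizationSketch`, the method `normalize`, and the use of a plain `double[]` of unnormalized log scores are hypothetical, and tempering is assumed here to mean dividing each log score by the temperature.

```java
import java.util.Arrays;

final class TemperedNormalizationSketch {

    /**
     * Hypothetical helper: converts unnormalized log scores into final scores.
     * Temperature 1.0: scores are normalized so that they sum to 1 (standard posterior).
     * Any other temperature: each tempered score is divided by the largest tempered score.
     */
    static double[] normalize(double[] logScores, double temperature) {
        if (!(temperature > 0.0)) {
            throw new IllegalArgumentException("temperature must be positive");
        }
        if (logScores.length == 0) {
            return new double[0];
        }
        // Assumed form of tempering: divide every log score by the temperature.
        double[] tempered = Arrays.stream(logScores).map(l -> l / temperature).toArray();
        double maxLog = Arrays.stream(tempered).max().orElseThrow();
        // exp(l - maxLog) equals exp(l) / exp(maxLog): the score (not the log score)
        // divided by the largest score. Subtracting first avoids underflow.
        double[] relative = Arrays.stream(tempered).map(l -> Math.exp(l - maxLog)).toArray();
        if (temperature == 1.0) {
            double sum = Arrays.stream(relative).sum();
            return Arrays.stream(relative).map(s -> s / sum).toArray();
        }
        return relative;
    }

    public static void main(String[] args) {
        double[] logScores = {-1.0, -2.0, -3.0};
        System.out.println(Arrays.toString(normalize(logScores, 1.0)));
        System.out.println(Arrays.toString(normalize(logScores, 2.0)));
    }
}
```

With temperature 1.0 the returned values sum to 1, whereas for any other temperature the best candidate receives exactly 1.0 and the values no longer sum to 1, which is the change in normalization mentioned above.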

Still to do: update the Javadoc, which is probably no longer correct.

hansenp · 2025-12-22T13:49:17Z

The re-scaling of the BOQA scores is motivated by the fact that in Exomiser the BOQA scores cannot be combined well with the variant score (and presumably in the future ACMG score), because in most of the cases the BOQA score for the gene/disease at rank 1 is almost 1 and close to zero at all subsequent ranks. Therefore, I believe that such re-scaling methods should be implemented in BoqaPrioritiser.java within Exomiser (where the re-scaling methods we already have are also implemented).

In this PR you created an additional parameter for BOQA (temperature) and embedded it deep within the BOQA code in various places (e.g., AlgorithmParameters, computeUnnormalizedProbability, etc.). This complicates the code without any apparent benefit. So far, we have no evidence that the temperature re-scaling method performs better in the boqa/exomiser context than the re-scaling methods we already have, nor do we have any reason to believe that the temperature re-scaled score offers any advantages over the conventional BOQA probabilities in other contexts. Finally, we still have no idea how to optimally set the additional temperature parameter.

As an alternative to all the changes to BOQA in this PR, one could implement the following function in BoqaPrioritiser.java in Exomiser:

private static List<BoqaResult> reScaledRawLogTempBoqaExomiserScores(List<BoqaResult> boqaResults, double temperature) {

    // Divide each raw BOQA log score by the temperature and find the maximum tempered log score
    double maxScore =
            boqaResults.stream()
                    .mapToDouble(r -> r.boqaScore() / temperature)
                    .max()
                    .orElse(Double.NEGATIVE_INFINITY);

    // Re-scale: exponentiate relative to the maximum, so that the best result gets a score of 1
    return boqaResults.stream()
            .map(br -> {
                double boqaExomiserScore = Math.exp(br.boqaScore() / temperature - maxScore);
                return new BoqaResult(br.counts(), boqaExomiserScore);
            })
            .toList();
}
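
To make the effect of the temperature concrete, here is a small self-contained sketch of the same re-scaling applied to made-up log scores. The record `Scored` is a hypothetical, simplified stand-in for BoqaResult (a label instead of counts), and the input values are invented for illustration only.

```java
import java.util.List;

final class TemperatureRescalingDemo {

    // Hypothetical stand-in for BoqaResult: a label and a score.
    record Scored(String label, double score) {}

    // Same logic as the function above, applied to the simplified record.
    static List<Scored> rescale(List<Scored> results, double temperature) {
        double maxScaled = results.stream()
                .mapToDouble(r -> r.score() / temperature)
                .max()
                .orElse(Double.NEGATIVE_INFINITY);
        return results.stream()
                .map(r -> new Scored(r.label(), Math.exp(r.score() / temperature - maxScaled)))
                .toList();
    }

    public static void main(String[] args) {
        // Invented raw log scores, not taken from any real BOQA run.
        List<Scored> raw = List.of(
                new Scored("disease A", -10.0),
                new Scored("disease B", -20.0),
                new Scored("disease C", -30.0));
        for (double t : new double[] {1.0, 10.0}) {
            System.out.println("temperature = " + t);
            rescale(raw, t).forEach(r ->
                    System.out.printf("  %s: %.6f%n", r.label(), r.score()));
        }
    }
}
```

In this toy input the top result always gets 1.0. At temperature 1.0 the second result is exp(-10), roughly 4.5e-5, while at temperature 10.0 it is exp(-1), roughly 0.37, so a larger temperature spreads the scores out instead of leaving everything below rank 1 close to zero.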

Let's discuss this next year.

leokim-l · 2025-12-22T16:53:56Z

Hi, I find myself once more somewhat in disagreement with what you say, sorry! In any case, thanks for the input. I think more scientific questions and "where should the code live" could be handled offline, but since we started, here is my take:

The re-scaling of the BOQA scores is motivated by the fact that in Exomiser the BOQA scores cannot be combined well with the variant score

I agree partially. I think in general, beyond what Exomiser does, there is some interest in having a "decent" distribution of scores. What that means is not easy to define. In this sense, tempering of posteriors is a standard Bayesian approach, though there is still discussion in the literature about it and I have not yet made up my mind about how mathematically solid it really is. My gut feeling is that it is a good tool to use.

Therefore, I believe that such re-scaling methods should be implemented in BoqaPrioritiser.java within Exomiser (where the re-scaling methods we already have are also implemented).

Same as the answer above, but I would like to further stress that this is not just a pure normalization, but a transformation of the distribution following some principles. It has been picked to be easily merged with Exomiser, but if it turns out to work I would not want it to depend on Exomiser, but to be in BOQA where it belongs.

In this PR you created an additional parameter for BOQA (temperature) and embedded it deep within the BOQA code in various places

I don't see an issue with embedding a parameter deep in the codebase, especially if we are in develop. This is git and we are in a development phase, and if it turns out that we do not want it at all, we can still go back. In this regard, I also believe having an extra parameter in AlgorithmParameters is exactly correct and where it should be. We pass AlgorithmParameters as an object and if it contains 2 or 3 parameters, the code using it does not change.

This complicates the code without any apparent benefit. So far, we have no evidence that the temperature re-scaling method performs better in the boqa/exomiser context than the re-scaling methods we already have, nor do we have any reason to believe that the temperature re-scaled score offers any advantages over the conventional BOQA probabilities in other contexts.

I strongly disagree and find this somewhat uncalled for. Temperature is an extension. Temperature equal to 1 behaves exactly in the same way as previous BOQA. Saying we have no evidence that the temperature re-scaling performs better seems a bit unfair. We are in a development phase and the method that is already there is there simply because you put it there, while I had not had the time to put my proposal in until last week. I could even go as far as saying that, so far, we have no evidence that the re-scaling methods we already have perform better in the boqa/exomiser context than the temperature re-scaling.

I have tried on multiple occasions to explain to you where I think there might be issues with the current method, and it seems to me like you did not even try to understand. I need to put the code somewhere in order to test it, and I would prefer to avoid having the two options, what is already there vs temperature, in two different branches; otherwise, with the current way to run exomiser-boqa (recompilation, moving to lib etc.), the testing will be really impractical to carry out.

Finally, we still have no idea how to optimally set the additional temperature parameter.

This I pointed out myself elsewhere. This is the usual conundrum: adding a parameter makes the model richer, but one then also has to find a reasonably robust and detail-independent methodology to tune it. I think, though, that a static, frozen re-scaling like the one we already have has unpredictable issues, depending on the set of diseases one uses and on whether some specific patient-disease pair happens to have a very low log-score (simply stated: if a given patient happens to have a really low score among all of the existing diseases). What we really need in order to have solid, robust results is a tunable parameter that can be flexibly adapted if needed. If you don't understand this, I already said I am happy to explain it to you once more, but please refrain from rejecting ideas without having taken the time to try to understand them.

leokim-l · 2025-12-23T10:31:45Z

The two good points about embedding deep into the codebase, which I will fix next year, are:

AlgorithmParameters.create() should work not only without any parameters or with all of them, but also with alpha and beta only.
Methods using AlgorithmParameters should not contain explicit parameters in the signature, such as here:

BOQA/boqa-core/src/main/java/org/p2gx/boqa/core/analysis/BoqaPatientAnalyzer.java

Line 146 in 82bee1b

static double computeUnnormalizedProbability(double alpha, double beta, double temperature, BoqaCounts counts){

but should rather use accessor methods such as getAlpha(), like here:

BOQA/boqa-core/src/main/java/org/p2gx/boqa/core/analysis/BoqaPatientAnalyzer.java

Line 126 in 82bee1b

static double computeUnnormalizedLogProbability(AlgorithmParameters params, BoqaCounts counts){

I am not even sure why computeUnnormalizedLogProbability is still here...
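
The two points above could look roughly like the sketch below. Everything in it is hypothetical: `Params` and `Counts` are simplified stand-ins for AlgorithmParameters and BoqaCounts, the default values are placeholders rather than BOQA's real defaults, and the log-probability formula (alpha as false-positive rate, beta as false-negative rate, log score divided by the temperature) is an assumption made only to have something to compute.

```java
final class AlgorithmParametersSketch {

    /** Hypothetical stand-in for AlgorithmParameters. */
    static final class Params {
        private static final double DEFAULT_ALPHA = 0.01;       // placeholder value
        private static final double DEFAULT_BETA = 0.1;         // placeholder value
        private static final double DEFAULT_TEMPERATURE = 1.0;  // 1.0 means no tempering

        private final double alpha;
        private final double beta;
        private final double temperature;

        private Params(double alpha, double beta, double temperature) {
            if (!(temperature > 0.0)) {
                throw new IllegalArgumentException("temperature must be positive");
            }
            this.alpha = alpha;
            this.beta = beta;
            this.temperature = temperature;
        }

        // No arguments: all defaults.
        static Params create() {
            return new Params(DEFAULT_ALPHA, DEFAULT_BETA, DEFAULT_TEMPERATURE);
        }

        // Alpha and beta only: temperature falls back to its default.
        static Params create(double alpha, double beta) {
            return new Params(alpha, beta, DEFAULT_TEMPERATURE);
        }

        // All three parameters given explicitly.
        static Params create(double alpha, double beta, double temperature) {
            return new Params(alpha, beta, temperature);
        }

        double getAlpha() { return alpha; }
        double getBeta() { return beta; }
        double getTemperature() { return temperature; }
    }

    /** Hypothetical stand-in for BoqaCounts. */
    record Counts(int tp, int fp, int tn, int fn) {}

    // The signature takes the parameter object, so adding a parameter does not change it.
    static double computeUnnormalizedLogProbability(Params params, Counts counts) {
        double logP = counts.fp() * Math.log(params.getAlpha())
                + counts.fn() * Math.log(params.getBeta())
                + counts.tn() * Math.log(1.0 - params.getAlpha())
                + counts.tp() * Math.log(1.0 - params.getBeta());
        return logP / params.getTemperature(); // assumed form of tempering
    }

    public static void main(String[] args) {
        Counts counts = new Counts(3, 1, 10, 2);
        System.out.println(computeUnnormalizedLogProbability(Params.create(0.05, 0.2), counts));
        System.out.println(computeUnnormalizedLogProbability(Params.create(0.05, 0.2, 2.0), counts));
    }
}
```

With this shape, callers that only care about alpha and beta never mention the temperature, and a method such as computeUnnormalizedLogProbability keeps the same signature regardless of how many parameters the object carries.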

leokim-l added 3 commits November 20, 2025 11:17

added tempering, see e.g. arxiv.org/abs/1209.3198

97bdf49

compute intermediate results with log for numerical stability

228944c

log scores with temperature and max rescaling

b4fd581

extend standard boqa score with temperature

ccada3b

leokim-l changed the base branch from lc/readd-pars to develop December 16, 2025 15:21

cleaned up

82bee1b

leokim-l marked this pull request as ready for review December 18, 2025 15:33

leokim-l requested a review from hansenp December 18, 2025 15:33

leokim-l merged commit 122bb59 into develop Dec 22, 2025
3 checks passed

leokim-l mentioned this pull request Jan 11, 2026

Revert "tempered posterior experiment (Temperature) with log scores" #35

Merged

leokim-l deleted the lc/tempered_posterior_experiment branch February 2, 2026 12:09

leokim-l merged 5 commits into develop from lc/tempered_posterior_experiment
