Skip to content

Conversation

@Aurashk
Copy link
Collaborator

@Aurashk Aurashk commented Dec 10, 2025

Description

If annual fixed costs (AFC) are zero, this currently makes the metric in appraisal comparisons NaN. This PR modifies the behaviour so that we use the total annualised surplus to appraise assets in this case instead. Since there are two different possible metrics in this case, with one strictly better (AFC == 0 always better than AFC > 0). I've added a metric_precedence to AppraisalOutput which ranks the metrics by the order which they can be used. Then, select_best_assets will disregard all appraisals which have a precedence higher that the minimum. Another way of doing this may be to make the metric itself a struct and implement more sophisticated comparison logic, but since lcox uses the same struct it might end up getting too involved

Fixes #1012

Type of change

  • Bug fix (non-breaking change to fix an issue)
  • New feature (non-breaking change to add functionality)
  • Refactoring (non-breaking, non-functional change to improve maintainability)
  • Optimization (non-breaking change to speed up the code)
  • Breaking change (whatever its nature)
  • Documentation (improve or add documentation)

Key checklist

  • All tests pass: $ cargo test
  • The documentation builds and looks OK: $ cargo doc

Further checks

  • Code is commented, particularly in hard-to-understand areas
  • Tests added that prove fix is effective or that feature works

@Aurashk Aurashk requested review from dalonsoa and tsmbland December 10, 2025 16:42
@codecov
Copy link

codecov bot commented Dec 10, 2025

Codecov Report

❌ Patch coverage is 89.15663% with 9 lines in your changes missing coverage. Please review.
✅ Project coverage is 82.18%. Comparing base (5ddf353) to head (7f602b5).
⚠️ Report is 52 commits behind head on main.

Files with missing lines Patch % Lines
src/simulation/investment/appraisal.rs 85.48% 9 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1027      +/-   ##
==========================================
+ Coverage   82.09%   82.18%   +0.09%     
==========================================
  Files          53       53              
  Lines        7310     7383      +73     
  Branches     7310     7383      +73     
==========================================
+ Hits         6001     6068      +67     
- Misses       1019     1025       +6     
  Partials      290      290              

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Comment on lines 208 to 212
let (metric_precedence, metric) = match annual_fixed_cost.value() {
// If AFC is zero, use total surplus as the metric (strictly better than nonzero AFC)
0.0 => (0, -profitability_index.total_annualised_surplus.value()),
// If AFC is non-zero, use profitability index as the metric
_ => (1, -profitability_index.value().value()),
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not fully convinced by this. the profitability index is dimensionless, but the annualised surplus is Money. Even though it does not matter from the typing perspective since you are getting the underlying value in both cases, which is float, I wonder if this choice makes sense from a logic perspective.

Not that I've a better suggestion.


// calculate metric and precedence depending on asset parameters
// note that metric will be minimised so if larger is better, we negate the value
let (metric_precedence, metric) = match annual_fixed_cost.value() {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thinking again on this, I think this logic (whatever it becomes, see my other comment) should be within the ProfitabilityIndex.value, also adding a ProfitabilityIndex.precedence method that returns 0 or 1 depending on the value of AFC.

Copy link
Collaborator

@tsmbland tsmbland left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is ok, I have an alternative suggestion though.

The idea would be introduce a new trait for appraisal metrics:

pub trait MetricTrait {
    fn value(&self) -> f64; 
    fn compare(&self, other: &Self) -> Ordering;
}

pub struct AppraisalOutput {
    pub metric: Box<dyn MetricTrait>,
    // ...
}

You could add this trait to your ProfitabilityIndex struct, and add a custom compare method here. You'd also have to make an equivalent struct for LCOX - it would probably be very simple, although there may be some edge cases we haven't thought of yet. I think this would help to contain the comparison logic and make the code cleaner. We'd also no longer have to make the profitability index negative as the custom compare method could be written to look for the maximum - I always found this a bit hacky and it makes the output files confusing

@tsmbland
Copy link
Collaborator

pub struct AppraisalOutput {
    pub metric: Box<dyn MetricTrait>,
    // ...
}

Or this:

pub struct AppraisalOutput<M: MetricTrait> {
    pub metric: M,
    // ...
}

I think there are various pros and cons of each option which I don't fully understand. I think possibly the latter is better if you don't need to store AppraisalOutputs with mixed metrics in the same Vec, which I don't think we need to/should do

Copy link
Collaborator

@alexdewar alexdewar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've taken the liberty of jumping in for a review here @Aurashk 🙃. Hope that's ok!

I agree with @tsmbland's suggestion that it would be better to use traits for this instead though -- I just think it will make it a bit clearer and more maintainable.

I'm wondering if it might be best to define a supertrait instead (that's just an alias for a combination of traits). In our case, we just need things which can be compared (Ord) and written to file (Serialize). We did talk about having an Ord implementation for unit types (#717) and I think I've actually done that somewhere, but didn't open a PR as we didn't need it, but I can do if that would be useful! That unit types would automatically define the supertrait.

I think the problem with having a value() method returning f64, as @tsmbland suggested, is that it wouldn't be obvious which value was being returned for the NPV case.

E.g.:

trait ComparisonMetric: Ord + Serialize {}

pub struct AppraisalOutput {
    pub metric: Box<dyn ComparisonMetric>,
    // ...
}

What do people think?

/// Where there is more than one possible metric for comparing appraisals, this integer
/// indicates the precedence of the metric (lower values have higher precedence).
/// Only metrics with the same precedence should be compared.
pub metric_precedence: u8,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd probably make this a u32 instead. I know we won't ever need more than 256 different values here (if we did, that would be horrible!), but I think it's best to use 32-bit integers as a default, unless there's a good reason not to.

// Calculate profitability index for the hypothetical investment
let annual_fixed_cost = annual_fixed_cost(asset);
if annual_fixed_cost.value() < 0.0 {
bail!("The current NPV calculation does not support negative annual fixed costs");
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can this actually happen? I'm struggling to think... @tsmbland?

If it's more of a sanity check instead (still worth doing!) then I'd change this to an assert! instead.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If it is then something has gone badly wrong! Agree, better to change to assert!

Copy link
Collaborator Author

@Aurashk Aurashk Jan 5, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

From my understanding of this comment from Adam, it may be possible in the future. And in that case a negative AFC isn't necessarily wrong, just the resulting profitability index will be wrong.

#716 (comment)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah ok. I'd think I'd still make it a panic! for now though, because if it happens it's a coding bug, not a user-facing error.

@alexdewar
Copy link
Collaborator

PS -- this isn't super important, but for the NPV metric, I'd include the numerator and denominator in the output CSV file (e.g. "1.0/2.0" or something) rather than just saving one of the two values. We want users to be able to see why the algo has made whatever decisions it has

@alexdewar
Copy link
Collaborator

Actually, on reflection, my suggestion won't work after all 😞. I still think we should use traits, but it's not as simple as adding a single supertrait.

The problem is that Ord is a trait for comparing a struct with another struct of the same type, but we need to be able to compare a generic AppraisalMetric (or whatever we call it) with another. We know that we will only be comparing things of the same type, but the compiler doesn't!

I think what you want is the Any type, which lets you downcast to a specific type. Then you can do something like this:

pub trait AppraisalMetric {
    fn compare(&self, other: &dyn Any) -> Ordering;
    // ...
}

impl AppraisalMetric for LCOXMetric {
    fn compare(&self, other: &dyn Any) -> Ordering {
        let other = other.downcast_ref::<Self>().expect("Cannot compare metrics of different types");
        // ...
    }
}

(This assumes you have an LCOXMetric struct defined.) Does this make sense?

I still think it might make sense to have a supertrait which is AppraisalMetric + Serialize, as we'll need to be able to do both of these things with our tool outputs. You'll need to define Serialize for the structs for it to work.

It's a little convoluted but that's the downside of using a statically typed language 😛

@alexdewar
Copy link
Collaborator

Btw, I know I'm a bit late to the party on this one, but I'm not sure about the compare_with_equal_metrics stuff (and apologies for the many messages...).

I'm not sure why we need to go ensure consistent ordering for assets with identical appraisal outputs. If they're identical -- or approximately identical -- then, by definition, we should just return cmp::Equal, right? And in that case, it doesn't matter what order we choose, as long as we don't pick a different one every time MUSE2 runs. Bear in mind that this is an unlikely situation to happen in the first place and users shouldn't rely on us appraising assets in a particular order anyway!

I'm only raising it because I think the added complexity will make @Aurashk's life harder here and I can't really see what benefit it brings. Is there something I'm missing?

@tsmbland
Copy link
Collaborator

Btw, I know I'm a bit late to the party on this one, but I'm not sure about the compare_with_equal_metrics stuff (and apologies for the many messages...).

I'm not sure why we need to go ensure consistent ordering for assets with identical appraisal outputs. If they're identical -- or approximately identical -- then, by definition, we should just return cmp::Equal, right? And in that case, it doesn't matter what order we choose, as long as we don't pick a different one every time MUSE2 runs. Bear in mind that this is an unlikely situation to happen in the first place and users shouldn't rely on us appraising assets in a particular order anyway!

I'm only raising it because I think the added complexity will make @Aurashk's life harder here and I can't really see what benefit it brings. Is there something I'm missing?

It's not actually that unlikely:

  • You could have multiple existing assets with identical metrics. For example, an asset from 2025 and an asset from 2030 which have equal metrics because none of the technology parameters have changed. In this case I think we do need a consistent rule (e.g. favouring the newer one).
  • You could have a candidate asset equal to an existing asset because capital costs are zero. Not likely for "real" tangible processes, but people can use processes to represent all sorts of conversions which will not always be associated with physical infrastructure. Again, we need a consistent rule here such as favouring the existing asset.
  • Identical assets due to parent asset division (i.e. Make assets divisible #1030). Not something that users should have to be concerned with, but these warnings are a useful reminder to us that our investment algorithm is wasteful and needlessly appraising multiple identical assets.
  • The case of two different processes with identical parameters. In this case, I think it's fair to raise a warning as we currently do. Users may very well wonder why one particular process was selected/retained over the other, and I think it's helpful to clarify that it's an arbitrary choice

I agree with you that it feels a little hacky for identical metrics not to return cmp::Equal. I think we did it this way because it was easiest, but by all means suggest a better approach. I'm pretty adamant that we do need something though!

@alexdewar
Copy link
Collaborator

It's not actually that unlikely:

  • You could have multiple existing assets with identical metrics. For example, an asset from 2025 and an asset from 2030 which have equal metrics because none of the technology parameters have changed. In this case I think we do need a consistent rule (e.g. favouring the newer one).
  • You could have a candidate asset equal to an existing asset because capital costs are zero. Not likely for "real" tangible processes, but people can use processes to represent all sorts of conversions which will not always be associated with physical infrastructure. Again, we need a consistent rule here such as favouring the existing asset.
  • Identical assets due to parent asset division (i.e. Make assets divisible #1030). Not something that users should have to be concerned with, but these warnings are a useful reminder to us that our investment algorithm is wasteful and needlessly appraising multiple identical assets.
  • The case of two different processes with identical parameters. In this case, I think it's fair to raise a warning as we currently do. Users may very well wonder why one particular process was selected/retained over the other, and I think it's helpful to clarify that it's an arbitrary choice

Ok, good point.

I agree with you that it feels a little hacky for identical metrics not to return cmp::Equal. I think we did it this way because it was easiest, but by all means suggest a better approach. I'm pretty adamant that we do need something though!

Can I just check what the motivation for this is? On reflection, I'm guessing that the idea was that it would make it easier to figure out why a particular asset was chosen over another. Is that right? If so, that seems reasonable.

Initially I was thinking that it was to make choosing between two assets with identical metrics less arbitrary which I'm less convinced about. A lot of things in MUSE2 are arbitrary, e.g. how HiGHS distributes activity across time slices, and the results fluctuate as we change the code anyway, so it seemed overkill to try to make guarantees to users about this when we can't do the same for so much of the rest of the model.

Anyway, in terms of the code, I think the problem is that it's not a good separation of concerns. It would be better if the compare_metric method just compared the metrics and we did the fallback check for asset properties somewhere else, e.g. in a compare_assets_fallback function in investment.rs. Then where you sort the appraisal outputs, you could try these one at a time, which would make it clearer what's going on. If we do this, it'll make the refactoring for this PR easier.

@tsmbland
Copy link
Collaborator

It's not actually that unlikely:

  • You could have multiple existing assets with identical metrics. For example, an asset from 2025 and an asset from 2030 which have equal metrics because none of the technology parameters have changed. In this case I think we do need a consistent rule (e.g. favouring the newer one).
  • You could have a candidate asset equal to an existing asset because capital costs are zero. Not likely for "real" tangible processes, but people can use processes to represent all sorts of conversions which will not always be associated with physical infrastructure. Again, we need a consistent rule here such as favouring the existing asset.
  • Identical assets due to parent asset division (i.e. Make assets divisible #1030). Not something that users should have to be concerned with, but these warnings are a useful reminder to us that our investment algorithm is wasteful and needlessly appraising multiple identical assets.
  • The case of two different processes with identical parameters. In this case, I think it's fair to raise a warning as we currently do. Users may very well wonder why one particular process was selected/retained over the other, and I think it's helpful to clarify that it's an arbitrary choice

Ok, good point.

I agree with you that it feels a little hacky for identical metrics not to return cmp::Equal. I think we did it this way because it was easiest, but by all means suggest a better approach. I'm pretty adamant that we do need something though!

Can I just check what the motivation for this is? On reflection, I'm guessing that the idea was that it would make it easier to figure out why a particular asset was chosen over another. Is that right? If so, that seems reasonable.

That's part of it at least. E.g. Before working on this I didn't previously consider that multiple existing assets from different commission years might have the same metric. If it's going to have to pick one over the other, I'd at least like some consistency so that decision is explainable.

Initially I was thinking that it was to make choosing between two assets with identical metrics less arbitrary which I'm less convinced about. A lot of things in MUSE2 are arbitrary, e.g. how HiGHS distributes activity across time slices, and the results fluctuate as we change the code anyway, so it seemed overkill to try to make guarantees to users about this when we can't do the same for so much of the rest of the model.

I don't think we can/should try guarantee to users that the model is completely unarbitrary, but we can still do our best and if there's an easy way to make certain behaviours just a little bit more predictable/explainable then I don't think there's an excuse not to.

Anyway, in terms of the code, I think the problem is that it's not a good separation of concerns. It would be better if the compare_metric method just compared the metrics and we did the fallback check for asset properties somewhere else, e.g. in a compare_assets_fallback function in investment.rs. Then where you sort the appraisal outputs, you could try these one at a time, which would make it clearer what's going on. If we do this, it'll make the refactoring for this PR easier.

I think that's a good idea, do you want to give it a try?

@tsmbland
Copy link
Collaborator

I think your other point is that the warning that we're currently raising if it ultimately does have to make an arbitrary decision isn't really worthy of a warning that users should have to be concerned about. I think that's fair, so happy if you'd rather change that to a debug message

@alexdewar
Copy link
Collaborator

@tsmbland Ok cool. Seems like we're on the same page now.

I'll have a go at the refactoring.

@alexdewar
Copy link
Collaborator

I've had a go at the refactoring in #1039. @Aurashk it probably makes sense to merge that before you have another go at this.

@Aurashk
Copy link
Collaborator Author

Aurashk commented Jan 5, 2026

Sorry just catching up with the discussion. I could do with a bit more convincing on the the traits approach to help me understand the benefits. I do see the elegance of it (particularly over the existing approach), but in the end we have just have three possible comparable metrics - the LCOX one and the two NPV ones (profitability index and total annualised surplus) - and adding others would be a rare occasion. It seems like a metric for our purposes is always going to be a number and one of three (maybe more in the future) labels, we only need to compare like-for-like labels on the same logical path and we are always comparing an f64.

Based on the requirements above, it feels overly-general to make the metric an arbitrary data type that has the property of being comparable. If the implementation was really simple I might feel differently, but it feels like you're having to navigate through too much abstraction for a relatively simple bit of logic. I realise I'm outvoted on this and happy to go with the majority, just want to understand the reasoning better.

@alexdewar
Copy link
Collaborator

Sorry just catching up with the discussion. I could do with a bit more convincing on the the traits approach to help me understand the benefits. I do see the elegance of it (particularly over the existing approach), but in the end we have just have three possible comparable metrics - the LCOX one and the two NPV ones (profitability index and total annualised surplus) - and adding others would be a rare occasion. It seems like a metric for our purposes is always going to be a number and one of three (maybe more in the future) labels, we only need to compare like-for-like labels on the same logical path and we are always comparing an f64.

Based on the requirements above, it feels overly-general to make the metric an arbitrary data type that has the property of being comparable. If the implementation was really simple I might feel differently, but it feels like you're having to navigate through too much abstraction for a relatively simple bit of logic. I realise I'm outvoted on this and happy to go with the majority, just want to understand the reasoning better.

You're right that we won't be adding new metrics all that often. There is an open issue to add support for "equivalent annual cost" (#524), but I guess the output of that will be similar to the other two.

I think the main rationale for using traits was to have clearer code with better separation of concerns, so you'd have a dedicated function to handle the logic for comparing NPV outputs (for example). Another upside is that we could represent these metrics in more intuitive ways in the output file (currently we output negative NPV, which is potentially pretty confusing), but I don't think this is super important. I hear what you're saying about it being clunky and overcomplex though.

Another way to do this would be to keep your current approach, but have a generic Metric struct (not trait) with named fields for precedence and value rather than using a tuple. You could add a #[derive(PartialOrd)] and then you wouldn't need to write custom logic for comparing the values. I'd be happy with this too, as long as the code's clear. What do you think @tsmbland?

@tsmbland
Copy link
Collaborator

tsmbland commented Jan 5, 2026

What do you think @tsmbland?

I'd probably still favour the traits approach, for the reasons that @alexdewar mentioned, but if it's too much work then don't worry about it (I didn't think it would be too difficult when I suggested it here, but maybe I'm missing something).

We're also going to have to revisit this when we come to allow multiple objectives, so no point over-engineering things right now

@alexdewar
Copy link
Collaborator

I'd probably still favour the traits approach, for the reasons that @alexdewar mentioned, but if it's too much work then don't worry about it (I didn't think it would be too difficult when I suggested it here, but maybe I'm missing something).

I don't think it's actually too difficult -- there's just slightly more boilerplate. Maybe try it and see how it looks @Aurashk? If it's a pain, let me know and I can try to help or we can go with the other approach.

We're also going to have to revisit this when we come to allow multiple objectives, so no point over-engineering things right now

Good point!

@alexdewar
Copy link
Collaborator

@Aurashk Actually, would you be able to sort the serialisation stuff in this PR too 👼? You just need to update the code in write_appraisal_results to use the serialise method of the metrics rather than calling value().

Copilot AI review requested due to automatic review settings January 13, 2026 17:22
@Aurashk
Copy link
Collaborator Author

Aurashk commented Jan 13, 2026

? You just need to update the code in write_appraisal_results to use the serialise method of the metrics rather than calling value().

I've had a go at this but I'm a bit confused about what argument to apply to the serialize(..) method of the metric, it needs a Serializer but we're just loading the data into the AppraisalWriterRow struct so that we can write them row-by-row so that doesn't seem right. It also doesn't seem like the csv writer has the flexibility for a dynamic number of columns. E.g you can't do some form of:

self.appraisal_results_writer.serialize(row)?;
self.appraisal_results_writer.serialize(&result.metric)?;

because the writer works row by row. Am I missing something obvious here?

In the latest commit I tried what seems like the simplest solution - you can serialize them to json, then just write the json as a string. It requires another dependency but wasn't able to find anything more straightforward. We could alternatively explicitly add the fields (profitability index etc) to the results row and downcast but it seems like that defeats the object of doing the Serialize stuff in the first place

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Comment on lines 180 to 206
fn compare(&self, other: &dyn ComparableMetric) -> Ordering {
let other = other
.as_any()
.downcast_ref::<Self>()
.expect("Cannot compare metrics of different types");

// Handle comparison based on fixed cost status
match (self.is_zero_fixed_cost(), other.is_zero_fixed_cost()) {
// Both have zero fixed cost: compare total surplus (higher is better)
(true, true) => {
let self_surplus = self.0.total_annualised_surplus;
let other_surplus = other.0.total_annualised_surplus;

compare_approx(other_surplus, self_surplus)
}
// Both have non-zero fixed cost: compare profitability index (higher is better)
(false, false) => {
let self_pi = self.0.value();
let other_pi = other.0.value();

compare_approx(other_pi, self_pi)
}
// Zero fixed cost is always better than non-zero fixed cost
(true, false) => Ordering::Less,
(false, true) => Ordering::Greater,
}
}
Copy link

Copilot AI Jan 13, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The new comparison logic in NPVMetric::compare handles an important edge case (zero fixed costs) with different comparison semantics than normal NPV comparison. However, there are no tests added to verify this behavior works correctly. Consider adding tests that verify:

  1. Two NPVMetrics with zero fixed cost are compared correctly by surplus
  2. Two NPVMetrics with non-zero fixed cost are compared correctly by profitability index
  3. An NPVMetric with zero fixed cost is always considered better than one with non-zero fixed cost (lines 203-204)
  4. The comparison correctly returns Ordering::Equal for approximately equal values

Copilot uses AI. Check for mistakes.
ensure!(
number_of_years[&(commodity_id.clone(), region_id.clone())]
== required_years.len().try_into().unwrap(),
== u32::try_from(required_years.len()).unwrap(),
Copy link

Copilot AI Jan 13, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This refactoring from .try_into().unwrap() to u32::try_from(...).unwrap() improves code clarity but appears unrelated to the PR's main purpose of allowing different metrics in NPV calculation. Consider keeping such unrelated refactorings in separate commits or PRs to maintain a clear change history.

Suggested change
== u32::try_from(required_years.len()).unwrap(),
== required_years.len().try_into().unwrap(),

Copilot uses AI. Check for mistakes.
Comment on lines +285 to +288
assert!(
annual_fixed_cost >= MoneyPerCapacity(0.0),
"The current NPV calculation does not support negative annual fixed costs"
);
Copy link

Copilot AI Jan 13, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The assertion that checks annual fixed cost is non-negative only runs in debug builds. If negative annual fixed costs could occur in production (even if they shouldn't), this would silently pass in release builds and could lead to incorrect behavior. Consider using ensure! or returning a Result error instead of assert! to enforce this constraint in all builds.

Copilot uses AI. Check for mistakes.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Again, this is fine.

Comment on lines +45 to +48
assert!(
self.annualised_fixed_cost != Money(0.0),
"Annualised fixed cost cannot be zero when calculating profitability index."
);
Copy link

Copilot AI Jan 13, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The assertion that checks annualised fixed cost is non-zero only runs in debug builds. In release builds, if this condition is violated, the division on line 49 would produce infinity or NaN, leading to incorrect behavior. Consider using a runtime check (like ensure!) that returns an error instead of assert! to enforce this constraint in all builds.

Copilot uses AI. Check for mistakes.
Comment on lines 78 to 81
assert!(
!(self.metric.is_nan() || other.metric.is_nan()),
!(self.metric.value().is_nan() || other.metric.value().is_nan()),
"Appraisal metric cannot be NaN"
);
Copy link

Copilot AI Jan 13, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The assertion checking that metric values are not NaN only runs in debug builds. In release builds, if NaN values occur, the comparison logic could produce unexpected results. Consider using a runtime check that returns an error or handles NaN values explicitly to ensure correct behavior in all builds.

Copilot uses AI. Check for mistakes.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's... not true. assert! also works in release builds in Rust (unlike assert in other languages). There is a debug_assert! macro for checks that you really only want in debug builds

@alexdewar
Copy link
Collaborator

? You just need to update the code in write_appraisal_results to use the serialise method of the metrics rather than calling value().

I've had a go at this but I'm a bit confused about what argument to apply to the serialize(..) method of the metric, it needs a Serializer but we're just loading the data into the AppraisalWriterRow struct so that we can write them row-by-row so that doesn't seem right. It also doesn't seem like the csv writer has the flexibility for a dynamic number of columns. E.g you can't do some form of:

self.appraisal_results_writer.serialize(row)?;
self.appraisal_results_writer.serialize(&result.metric)?;

because the writer works row by row. Am I missing something obvious here?

In the latest commit I tried what seems like the simplest solution - you can serialize them to json, then just write the json as a string. It requires another dependency but wasn't able to find anything more straightforward. We could alternatively explicitly add the fields (profitability index etc) to the results row and downcast but it seems like that defeats the object of doing the Serialize stuff in the first place

I don't think you're missing anything obvious -- just seems it's more of a faff than I anticipated 🙃. Sorry for dragging this out! In the interests of getting this merged, how about just reverting that last commit and then we can do a re-review? We can come back to the serialisation stuff later.

Copilot AI review requested due to automatic review settings January 14, 2026 09:34
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 5 out of 6 changed files in this pull request and generated 7 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment on lines +181 to +184
let other = other
.as_any()
.downcast_ref::<Self>()
.expect("Cannot compare metrics of different types");
Copy link

Copilot AI Jan 14, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The downcast operation with expect will panic at runtime if metrics of different types are compared. While this shouldn't happen in practice (as all appraisals in a round use the same objective type), consider adding a debug_assert or a more descriptive error message that includes the actual types being compared to aid debugging if this invariant is ever violated.

Copilot uses AI. Check for mistakes.
///
/// # Returns
///
/// An `AppraisalOutput` containing the hypothetical capacity, activity profile and unmet demand.
Copy link

Copilot AI Jan 14, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The documentation comment has been removed but was actually helpful in explaining the metric transformation. Consider keeping documentation that explains the relationship between metrics and comparison semantics, particularly noting that NPVMetric handles the "higher is better" logic internally so external code doesn't need to negate values.

Suggested change
/// An `AppraisalOutput` containing the hypothetical capacity, activity profile and unmet demand.
/// An `AppraisalOutput` containing the hypothetical capacity, activity profile and unmet demand.
/// The returned `metric` encapsulates the NPV value for this appraisal. For NPV, higher values
/// are better, and the associated `NPVMetric` type encodes this "higher is better" comparison
/// semantics internally, so callers can compare `metric` values directly without negating them.

Copilot uses AI. Check for mistakes.
Comment on lines +209 to 219
#[test]
#[should_panic(expected = "Annualised fixed cost cannot be zero")]
fn profitability_index_panics_on_zero_cost() {
let result = profitability_index(
Capacity(0.0),
MoneyPerCapacity(100.0),
&indexmap! {},
&indexmap! {},
);
result.value();
}
Copy link

Copilot AI Jan 14, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The removed test case for zero capacity with infinite profitability index has been replaced with a panic test, but there's no test coverage for the new NPVMetric behavior when annualised_fixed_cost is zero. Consider adding integration tests that verify the entire appraisal flow correctly handles assets with zero annual fixed costs and compares them properly against assets with non-zero fixed costs.

Copilot uses AI. Check for mistakes.
Comment on lines 180 to 209
fn compare(&self, other: &dyn ComparableMetric) -> Ordering {
let other = other
.as_any()
.downcast_ref::<Self>()
.expect("Cannot compare metrics of different types");

// Handle comparison based on fixed cost status
match (self.is_zero_fixed_cost(), other.is_zero_fixed_cost()) {
// Both have zero fixed cost: compare total surplus (higher is better)
(true, true) => {
let self_surplus = self.0.total_annualised_surplus;
let other_surplus = other.0.total_annualised_surplus;
compare_approx(other_surplus, self_surplus)
}
// Both have non-zero fixed cost: compare profitability index (higher is better)
(false, false) => {
let self_pi = self.0.value();
let other_pi = other.0.value();
compare_approx(other_pi, self_pi)
}
// Zero fixed cost is always better than non-zero fixed cost
(true, false) => Ordering::Less,
(false, true) => Ordering::Greater,
}
}

fn as_any(&self) -> &dyn Any {
self
}
}
Copy link

Copilot AI Jan 14, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are no unit tests for the new NPVMetric comparison logic, particularly for the critical cases: (1) comparing two assets with zero fixed cost, (2) comparing zero fixed cost vs non-zero fixed cost, and (3) comparing two assets with non-zero fixed cost. The complex branching logic in the compare method should be thoroughly tested to ensure correct behavior.

Copilot uses AI. Check for mistakes.
Comment on lines 169 to 175
fn value(&self) -> f64 {
if self.is_zero_fixed_cost() {
self.0.total_annualised_surplus.value()
} else {
self.metric.partial_cmp(&other.metric).unwrap()
self.0.value().value()
}
}
Copy link

Copilot AI Jan 14, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The value method returns different semantics depending on whether fixed cost is zero. When fixed cost is zero, it returns total_annualised_surplus (an absolute money value). When fixed cost is non-zero, it returns the profitability index (a dimensionless ratio). This inconsistency could be confusing for code that uses the value for purposes other than comparison. Consider documenting this behavior clearly or adding a separate method that consistently returns a comparable numeric representation.

Copilot uses AI. Check for mistakes.
@dalonsoa dalonsoa requested review from dalonsoa and removed request for dalonsoa January 14, 2026 10:05
Copilot AI review requested due to automatic review settings January 14, 2026 10:31
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 5 out of 6 changed files in this pull request and generated 1 comment.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment on lines +192 to +198
compare_approx(other_surplus, self_surplus)
}
// Both have non-zero fixed cost: compare profitability index (higher is better)
(false, false) => {
let self_pi = self.0.value();
let other_pi = other.0.value();
compare_approx(other_pi, self_pi)
Copy link

Copilot AI Jan 14, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The order of arguments to compare_approx is reversed (other, self) compared to the pattern used in LCOXMetric::compare at line 141 which uses (self, other). For consistency across metric implementations, consider using the same argument order and applying .reverse() where needed to achieve the desired ordering semantics.

Suggested change
compare_approx(other_surplus, self_surplus)
}
// Both have non-zero fixed cost: compare profitability index (higher is better)
(false, false) => {
let self_pi = self.0.value();
let other_pi = other.0.value();
compare_approx(other_pi, self_pi)
compare_approx(self_surplus, other_surplus).reverse()
}
// Both have non-zero fixed cost: compare profitability index (higher is better)
(false, false) => {
let self_pi = self.0.value();
let other_pi = other.0.value();
compare_approx(self_pi, other_pi).reverse()

Copilot uses AI. Check for mistakes.
@Aurashk
Copy link
Collaborator Author

Aurashk commented Jan 14, 2026

@alexdewar I've reverted the commit and added a few tests for the compare methods.

@alexdewar
Copy link
Collaborator

Cool. Is it ready for re-review?

@Aurashk
Copy link
Collaborator Author

Aurashk commented Jan 14, 2026

Cool. Is it ready for re-review?

Yes

@alexdewar alexdewar self-requested a review January 14, 2026 13:48
@alexdewar
Copy link
Collaborator

@Aurashk I'll review now. For future reference, you can re-request review by clicking the button with the arrows by a reviewer's name (I've already done it for myself):

image

That way it'll appear in my list of things to review

Copy link
Collaborator

@alexdewar alexdewar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just one small thing, then I think this is good to merge. Thanks for bearing with -- sorry it ended up dragging out.

I've just suggested adding additional test cases for the NPV comparison code; should be easy if you use a parametrised test with rstest (see comment)

if a.approx_eq(b, F64Margin::default()) {
Ordering::Equal
} else {
a.partial_cmp(&b).unwrap()
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

An error message might be good for debugging:

Suggested change
a.partial_cmp(&b).unwrap()
a.partial_cmp(&b).expect("Cannot compare NaN values")

Comment on lines 78 to 81
assert!(
!(self.metric.is_nan() || other.metric.is_nan()),
!(self.metric.value().is_nan() || other.metric.value().is_nan()),
"Appraisal metric cannot be NaN"
);
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's... not true. assert! also works in release builds in Rust (unlike assert in other languages). There is a debug_assert! macro for checks that you really only want in debug builds

Comment on lines +112 to +115
let other = other
.as_any()
.downcast_ref::<Self>()
.expect("Cannot compare metrics of different types");
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems a bit verbose to me...

Comment on lines +285 to +288
assert!(
annual_fixed_cost >= MoneyPerCapacity(0.0),
"The current NPV calculation does not support negative annual fixed costs"
);
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Again, this is fine.

}

#[test]
fn npv_compare_both_zero_fixed_cost() {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are some test cases missing here, as Copilot has pointed out. It would be good to have total coverage.

There are 11 branches:

  • AFC is non-zero for both: less than, greater than, equal
  • AFC is zero for both: less than, greater than, equal
  • AFC is zero for self but not other and vice versa

You could also add a test or two for the case where AFC is approx zero, but not exactly.

I'd write these as parametrised tests with rstest: https://docs.rs/rstest/latest/rstest/#creating-parametrized-tests

I wouldn't normally be this picky, but this actually is a function where we can test all branches without too much hassle, so let's do it!

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @alexdewar, I've tagged you for a final check of the new tests, feel free to merge it if you're happy with them

@Aurashk Aurashk requested a review from alexdewar January 15, 2026 12:39
Copy link
Collaborator

@alexdewar alexdewar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks ace! Good work.

@alexdewar alexdewar merged commit 280c997 into main Jan 15, 2026
8 checks passed
@alexdewar alexdewar deleted the make-profitability-index-more-robust branch January 15, 2026 14:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

NPV profitability_index NPV is broken

5 participants