Unit test compile cost assumptions - Part 1 #170

finozzifa · 2025-01-24T10:37:46Z

This pull request proposes a set of unit tests for scripts/compile_cost_assumptions.py, together with some minimal code re-factoring.

This pull request does not contain any major changes to the functionality of the scripts.

Changes proposed in this Pull Request

Unit tests

The unit tests are for the functions:

add_description
annuity
clean_up_units
convert_units
get_data_from_DEA
get_excel_sheets
get_sheet_location
set_round_trip_efficiency
set_specify_assumptions

To introduce the unit tests, I have made some minor changes to the functions. For example, every snakemake.params/.config value that a function previously read from the outer scope is now passed to it explicitly as a function argument.
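A minimal sketch of why this helps testing, under stated assumptions: `select_years` and the `years` config key are hypothetical names chosen for illustration, and the `SimpleNamespace` object only stands in for the real `snakemake` object. Once the configuration value arrives as an argument, a test can call the function with plain values.

```python
from types import SimpleNamespace


def select_years(values: list[int], years: list[int]) -> list[int]:
    """Keep only the entries of ``values`` that are listed in ``years``."""
    return [v for v in values if v in years]


# In the script, the caller unpacks the configuration once and passes it on.
# SimpleNamespace is a stand-in here, not the real snakemake object.
snakemake = SimpleNamespace(config={"years": [2030, 2040]})
selected = select_years([2020, 2030, 2040, 2050], snakemake.config["years"])

# A unit test needs no snakemake object at all, only plain values.
assert select_years([2020, 2030], [2030]) == [2030]
```

The function body no longer depends on global state, so the same call works inside a workflow run and inside pytest.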

changes in function annuity

The function annuity is defined stand-alone and within the function add_home_battery_costs. The two versions are identical. I have removed the version within add_home_battery_costs.

Moreover, annuity contained an if branch for discount rates given as a pandas Series. Since this case does not seem to occur, I have removed it.
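For orientation, here is a sketch of a scalar-only annuity factor using the standard textbook formula. It is not copied from the script: the argument names `n` (lifetime in years) and `r` (discount rate) and the zero-rate fallback are assumptions made for illustration.

```python
def annuity(n: float, r: float) -> float:
    """Return the annuity factor for lifetime ``n`` and discount rate ``r``.

    Assumed textbook form: r / (1 - (1 + r)^-n), falling back to 1 / n
    when the discount rate is zero.
    """
    if r > 0:
        return r / (1.0 - 1.0 / (1.0 + r) ** n)
    return 1.0 / n
```

With only scalar inputs there is a single code path per case, which keeps a unit test for the function short.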

numpydoc strings

I have enriched the doc-strings (following the numpydoc standard) for all functions in compile_cost_assumptions.py and compile_cost_assumptions_usa.py with the list of input arguments and outputs. I have also added static types to the function definitions.

Other changes

add logging

Added a logger and replaced print(msg) with the corresponding logger.warn/.info
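A minimal sketch of the pattern, assuming a module-level logger; the helper name `report_missing_sheet` is hypothetical. The sketch uses `logger.warning` because `logger.warn` is a deprecated alias in the standard library.

```python
import logging

# One logger per module, named after the module itself.
logger = logging.getLogger(__name__)


def report_missing_sheet(tech_name: str) -> None:
    """Log, rather than print, that no sheet was found for a technology."""
    # Previously this would have been a print(...) call.
    logger.warning("excel file not found for technology: %s", tech_name)
```

Unlike print, the messages can be filtered by level or redirected by whoever configures logging for the workflow.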

shadow names from outer scopes

I have also:

proposed to rename the dictionary sheet_names to dea_sheet_names as I feel that the latter name is clearer
proposed to replace shadow names from outer scopes in the functions. For example, I have replaced

def set_specify_assumptions(tech_data):

with

def set_specify_assumptions(technology_dataframe):

AttributeError: 'NoneType' object has no attribute 'fillna' in `get_data_DEA`

The function get_data_DEA returns None if, for a given technology, no DEA excel file is found.

    excel_file = get_sheet_location(tech_name, sheet_names_dict, input_data_dict)
    if excel_file == "Sheet not found" or excel_file == "Multiple sheets found":
        logger.info(f"excel file not found for technology: {tech_name}")
        return None

The function get_data_DEA is then called in get_data_from_DEA, which replaces NaN values in the returned object with 0, as shown in the snippet below

        df = get_data_DEA(
            years,
            tech_name,
            sheet_names_dict,
            input_data_dictionary,
            offwind_no_grid_costs,
            expectation,
        ).fillna(0)

However, NoneType has no attribute fillna, so the chained call raises the AttributeError whenever no sheet is found. I therefore propose a change to fix this behavior: I have modified get_data_DEA so that it returns an empty dataframe instead

    excel_file = get_sheet_location(tech_name, sheet_names_dict, input_data_dict)
    if excel_file == "Sheet not found" or excel_file == "Multiple sheets found":
        logger.info(f"excel file not found for technology: {tech_name}")
        return pd.DataFrame()
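The effect of the change can be reproduced in isolation. This is a sketch only: `load_or_empty` is a hypothetical stand-in for get_data_DEA, and the column name and values are made up.

```python
import pandas as pd


def load_or_empty(found: bool) -> pd.DataFrame:
    """Hypothetical stand-in: return data, or an empty frame if nothing is found."""
    if not found:
        # An empty DataFrame still supports chained calls such as .fillna(0).
        return pd.DataFrame()
    return pd.DataFrame({"2030": [1.0, None]})


# Chaining .fillna(0) works in both cases; with None it would raise AttributeError.
empty_result = load_or_empty(False).fillna(0)
filled_result = load_or_empty(True).fillna(0)
```

Callers that need to distinguish the two cases can check the `empty` attribute of the result.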

Checks

I checked explicitly that the above-mentioned changes do not result in changes in the output files.

Checklist

Code changes are sufficiently documented; i.e. new functions contain docstrings and further explanations may be given in doc.
Data source for new technologies is clearly stated.
Newly introduced dependencies are added to environment.yaml (if applicable).
A note for the release notes doc/release_notes.rst of the upcoming release is included.
I consent to the release of this PR's code under the GPLv3 license.

lkstrp · 2025-02-05T07:53:35Z

@finozzifa is doing some great and needed refactoring here. I guess we do not need backwards-compatible argument names, or to be as strict as we are in PyPSA/linopy, @FabianHofmann?

FabianHofmann · 2025-02-05T08:02:32Z

no, not at all. this is rather a data processing tool to produce final reusable results than a package used in scripts. meaning, let's go :)

lkstrp

Thanks again for tons of docstrings, types and tests! I skimmed most parts, just two comments:

Could you maybe move the mocked data fixtures away from conftest? Maybe into something like mocked_data.py and import it from there. Otherwise it will bloat conftest.

And a general note on naming: especially when typed, I prefer something like years: list over list_of_years: list. The shorter name is more concise and readable, while the longer one is overly descriptive for a typed variable.

finozzifa added 23 commits January 24, 2025 11:34

code: unit test for get_excel_sheets

7fbfb6c

Merge branch 'master' of https://github.com/open-energy-transition/te…

2ec34f1

…chnology-data into unit_test_compile_cost_assumptions

pre-commit

8f40db1

code: move sheet_names to _helpers.py

b2efc33

code: modify years_list

9cf87d5

output: cost outputs

be66811

Merge branch 'master' of https://github.com/open-energy-transition/te…

074e1e0

…chnology-data into unit_test_compile_cost_assumptions

code: merge from master

2b43e07

code: new unit tests

5516b47

code: revert changes to _helpers.py

11543fe

code:fix quick issues

d339277

code: add logger

c31a02f

code: add docstrings

a9aa5df

add pre-commit changes

f07979d

code: modify sheet location method

8af30ca

code: add docstring

d020522

code: unit test clean_up_units

98b92b8

code: add docstring for get_dea_martime_data

45e33c3

code: add unit test for set_specify_assumptions

299220f

code: add docstring for set_specify_assumptions

4afab7d

code: remove assert False

eb2a7ad

code: unit test for set_round_trip_efficiency

01c398d

include pre-commit

a3af10d

finozzifa added 5 commits February 5, 2025 11:46

code: switch to numpydoc and introduce static types

8dadf6c

code:include pre-commit

43f3391

code: numpydoc and static types for order_data

c5898bf

code:replace print with logger statements

6d20ebb

code: add logger

1e6e000

finozzifa added 11 commits February 10, 2025 16:34

modify logger msg

ac97bdf

code: fix logger messages

ce3706d

code: add unit test for convert_units

f843db9

code: add numpydoc for add_gas_storage

58d72a6

code: new changes

b2fc9e9

Merge branch 'master' of https://github.com/PyPSA/technology-data int…

5a61847

…o unit_test_compile_cost_assumptions

code: add numpydoc

6a11f07

code: re-factor and numpydoc'

0709e06

code: add new numpydoc

12be7dc

code: add new numpydoc - 2

933ac90

code: add new numpydoc - 3

6adb0bf

finozzifa marked this pull request as ready for review February 17, 2025 14:12

finozzifa added 4 commits February 17, 2025 15:13

doc: update release_notes.rst

9b9b40e

code: remove unit test for dea_maritime

76beb53

code: add numpydoc for carbon_flow

c3af440

code: pre-commit for carbon_flow

6dffb71

finozzifa changed the title ~~Unit test compile cost assumptions~~ Unit test compile cost assumptions - Part 1 Feb 17, 2025

finozzifa added 3 commits February 17, 2025 16:43

code: numpydoc for add_egs_data

9864097

code: add unit test for annuity

450caec

code: merge from master

3502f7a

lkstrp approved these changes Feb 18, 2025

finozzifa added 7 commits February 18, 2025 16:07

code: update numpydoc and naming

044a1b7

code: add numpydoc to compile_cost_assumptions_usa.py

6bc4579

code: replace list_of_years to years

189f4e5

code: numpydoc for final batch of functions

b3fc382

code: replace NoneType with empty DataFrame

6893e65

code: pre-commit run all

38d6be2

code: new changes

6ffd02b

lkstrp merged commit ee37d71 into PyPSA:master Feb 19, 2025
3 checks passed

finozzifa mentioned this pull request Feb 19, 2025

Add proper logging to repository #165

Closed
