diff --git a/.gitattributes b/.gitattributes
Lines changed: 1 addition & 1 deletion
diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
Lines changed: 2 additions & 1 deletion
diff --git a/.pre-commit-config.yaml b/.pre-commit-config.yaml
Lines changed: 2 additions & 2 deletions
diff --git a/.readthedocs.yaml b/.readthedocs.yaml
Lines changed: 1 addition & 1 deletion
diff --git a/README.md b/README.md
Lines changed: 47 additions & 67 deletions
diff --git a/docs/hat_extract-timeseries.md b/docs/hat_extract-timeseries.md
Lines changed: 2 additions & 2 deletions
diff --git a/docs/hat_hydrostats.md b/docs/hat_hydrostats.md
Lines changed: 2 additions & 2 deletions
diff --git a/docs/requirements.txt b/docs/requirements.txt
Lines changed: 1 addition & 1 deletion
diff --git a/docs/station_mapping.md b/docs/station_mapping.md
Lines changed: 11 additions & 11 deletions
@@ -1 +1 @@
-*.ipynb filter=strip-notebook-output
+*.ipynb filter=strip-notebook-output
@@ -13,7 +13,8 @@ jobs:
     steps:
       - uses: actions/checkout@v2
       - run: pip install ruff
-      - run: ruff check
+      - run: ruff check # check linting violations
+      - run: ruff format --check --exclude *.ipynb # check formatting
 
   docs:
     runs-on: ubuntu-latest
 
@@ -7,9 +7,9 @@ repos:
     - repo: https://github.com/charliermarsh/ruff-pre-commit
       rev: v0.5.6
       hooks:
-          - id: ruff
+          - id: ruff # fix linting violations
             args: [ --fix ]
-          - id: ruff-format
+          - id: ruff-format # fix formatting
     - repo: https://github.com/pre-commit/pre-commit-hooks
       rev: v4.4.0
       hooks:
 
@@ -16,4 +16,4 @@ mkdocs:
 # Optionally declare the Python requirements required to build your docs
 python:
   install:
-  - requirements: docs/requirements.txt
+  - requirements: docs/requirements.txt
@@ -1,75 +1,59 @@
-# HAT - Hydrological Analysis Toolkit
+# Hydrological Analysis Toolkit (HAT)
+
+<p align="center">
+  <a href="https://github.com/ecmwf/codex/raw/refs/heads/main/ESEE">
+    <img src="https://github.com/ecmwf/codex/raw/refs/heads/main/ESEE/foundation_badge.svg" alt="ECMWF Software EnginE">
+  </a>
+  <a href="https://github.com/ecmwf/codex/raw/refs/heads/main/Project Maturity">
+    <img src="https://github.com/ecmwf/codex/raw/refs/heads/main/Project Maturity/emerging_badge.svg" alt="Maturity Level">
+  </a>
+  <a href="https://opensource.org/licenses/apache-2-0">
+    <img src="https://img.shields.io/badge/Licence-Apache 2.0-blue.svg" alt="Licence">
+  </a>
+  <a href="https://github.com/ecmwf/hat/releases">
+    <img src="https://img.shields.io/github/v/release/ecmwf/hat?color=purple&label=Release" alt="Latest Release">
+  </a>
+</p>
+
+<p align="center">
+  <!-- <a href="#quick-start">Quick Start</a>
+  • -->
+  <a href="#installation">Installation</a>
+  •
+  <a href="https://hydro-analysis-toolkit.readthedocs.io">Documentation</a>
+</p>
+
+> \[!IMPORTANT\]
+> This software is **Emerging** and subject to ECMWF's guidelines on [Software Maturity](https://github.com/ecmwf/codex/raw/refs/heads/main/Project%20Maturity).
 
 The Hydrological Analysis Toolkit (HAT) is a software suite for hydrologists working with simulated and observed river discharge. HAT performs data analysis on hydrological datasets, with its main features being:
 - mapping station locations into hydrological model grids
-- extraction of timeseries
+- extraction of timeseries at station locations from gridded model outputs
 - statistical analysis of hydrological timeseries
 
-The documentation can be found at https://hydro-analysis-toolkit.readthedocs.io
-
-**DISCLAIMER**
-This project is **BETA** and will be **Experimental** for the foreseeable future.
-Interfaces and functionality are likely to change, and the project itself may be scrapped.
-**DO NOT** use this software in any project/software that is operational.
-
 ### Installation
 
-Clone source code repository
-
-    $ git clone https://github.com/ecmwf/hat.git
-    $ cd hat
-
-Create and activate conda environment
-
-    $ conda create -n hat python=3.10
-    $ conda activate hat
-
-For default installation, run
-
-    $ pip install .
-
-For a developer installation (includes linting and test libraries), run
-
-    $ pip install -e .[dev]
-    $ pre-commit install
-
-If you only plan to run the tests, instead run
-
-    $ pip install -e .[test]
-
-If you plan to build a source and a wheel distribution, it is additionally required to run
-
-    $ pip install build
+For a default installation, run
 
-### Usage
+```
+pip install hydro-analysis-toolkit
+```
 
-Run a command line tool
+For a developer setup, run
 
-    $ hat-extract-timeseries --help
+```
+conda create -n hat python=3.12
+conda activate hat
+git clone https://github.com/ecmwf/hat.git
+cd hat
+pip install -e .[dev]
+pre-commit install
+```
 
-### Running the tests
+## Licence
 
-Tests are stored in the `tests/` folder and can be run with
-
-    $ pytest
-
-### Deployment
-
-To build a source and a wheel distribution, run
-
-    $ python build
-
-### Contributing
-
-The main repository is hosted on [GitHub](https://github.com/ecmwf/hat). Testing, bug reports and contributions are highly welcomed and appreciated.
-
-Please report [bug](https://github.com/ecmwf/hat/issues) reports or [pull-requests](https://github.com/ecmwf/hat/pulls) on [GitHub](https://github.com/ecmwf/hat).
-
-We want your feedback, please e-mail: user-services@ecmwf.int
-
-### License
-
-Copyright 2023 European Centre for Medium-Range Weather Forecasts (ECMWF)
+```
+Copyright 2023, European Centre for Medium Range Weather Forecasts.
 
 Licensed under the Apache License, Version 2.0 (the "License");
 you may not use this file except in compliance with the License.
@@ -84,10 +68,6 @@ See the License for the specific language governing permissions and
 limitations under the License.
 
 In applying this licence, ECMWF does not waive the privileges and immunities
-granted to it by virtue of its status as an intergovernmental organisation nor
-does it submit to any jurisdiction.
-
-
-### Citing
-
-In publications, please use a link to this repository (https://github.com/ecmwf/hat) and its documentation (https://hydro-analysis-toolkit.readthedocs.io)
+granted to it by virtue of its status as an intergovernmental organisation
+nor does it submit to any jurisdiction.
+```
@@ -1,7 +1,7 @@
 `hat-extract-timeseries` documentation
 ===============================
 
-Extract timeseries from a collection of simulation raster files. 
+Extract timeseries from a collection of simulation raster files.
 
 How to use
 -----
@@ -27,4 +27,4 @@ To create your own configuration json file you might want to start with the defa
         "station_id_column_name": "obsid",
         "station_filters":"",
         "station_coordinates": ["Lon", "Lat"]
-    }
+    }
@@ -1,7 +1,7 @@
 `hat-hydrostats` documentation
 ==============================
 
-Command line tool to calculate hydrological statistics on timeseries. 
+Command line tool to calculate hydrological statistics on timeseries.
 
 How to use
 -----
@@ -39,4 +39,4 @@ You can calculate more than one function at once using commas with the `--functi
 
 (Optionally) define the minimum percentage of observations required for timeseries to be valid using the `--obs_threshold` option (default is 80%)
 
-`hat-hydrostats --functions kge --sims $SIMS --obs $OBS --obs_threshold 70`
+`hat-hydrostats --functions kge --sims $SIMS --obs $OBS --obs_threshold 70`
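For readers unfamiliar with the `kge` function and the `--obs_threshold` option used in the command above, the following is a minimal sketch of how the two could interact. It is not HAT's implementation: the function name `kge`, its signature, and the way missing values are handled are assumptions made for illustration, and it uses the original Kling-Gupta efficiency formulation (correlation, variability ratio, bias ratio).

```python
import math


def kge(sim, obs, obs_threshold=80.0):
    """Kling-Gupta efficiency of sim against obs (hypothetical helper, not HAT's API).

    Returns NaN when fewer than obs_threshold percent of the time steps
    carry an observation, or when the statistics are undefined.
    """
    # Share of time steps that have an observation, in percent.
    n_obs = sum(1 for o in obs if not math.isnan(o))
    if not obs or 100.0 * n_obs / len(obs) < obs_threshold:
        return math.nan

    # Use only time steps where both series have a value.
    pairs = [(s, o) for s, o in zip(sim, obs)
             if not (math.isnan(s) or math.isnan(o))]
    n = len(pairs)
    if n < 2:
        return math.nan

    mean_s = sum(s for s, _ in pairs) / n
    mean_o = sum(o for _, o in pairs) / n
    var_s = sum((s - mean_s) ** 2 for s, _ in pairs) / n
    var_o = sum((o - mean_o) ** 2 for _, o in pairs) / n
    if var_s == 0 or var_o == 0 or mean_o == 0:
        return math.nan

    cov = sum((s - mean_s) * (o - mean_o) for s, o in pairs) / n
    r = cov / math.sqrt(var_s * var_o)   # linear correlation
    alpha = math.sqrt(var_s / var_o)     # variability ratio
    beta = mean_s / mean_o               # bias ratio
    return 1.0 - math.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)


obs = [1.0, 2.0, 3.0, 4.0, 5.0]
gappy = [1.0, math.nan, 3.0, math.nan, 5.0]  # only 60% of steps observed
perfect_score = kge(obs, obs)
rejected_score = kge(gappy, gappy, obs_threshold=70)
```

With a threshold of 70, the gappy series is rejected because only 60% of its time steps are observed. Lowering the threshold to 50 would let it through.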
@@ -1 +1 @@
-mkdocs
+mkdocs
@@ -5,9 +5,9 @@
 The `station_mapping` library is designed for mapping the location of hydrological station data onto the optimum location of a hydrological model grid (netcdf).
 This tool is available as both [command line](#station-mapping-with-command-line) and [Python API](#station-mapping-as-python-script-eg-called-within-jupyter-notebook-or-python-file)
 
-The optimum grid cell location is searched through optimising the upstream area error and the cell distance(s) from the station nearest grid cells. In this tool, users can define their <b>acceptable area difference/ error</b> using the parameter `max_area_difference` (%) and <b>the maximum cell radius</b> parameter: `max_neighboring_cell` (number of cells) to search for this optimum grid. The tool can also be parameterised to ignore further searching of optimum cells when upstream area difference of a station nearest grid is below, i.e. when the uspteam area of the nearest cell to the station is already deemed acceptable by defining `min_area_diff`(%). 
+The optimum grid cell location is searched through optimising the upstream area error and the cell distance(s) from the station nearest grid cells. In this tool, users can define their <b>acceptable area difference/ error</b> using the parameter `max_area_difference` (%) and <b>the maximum cell radius</b> parameter: `max_neighboring_cell` (number of cells) to search for this optimum grid. The tool can also be parameterised to ignore further searching of optimum cells when upstream area difference of a station nearest grid is below, i.e. when the uspteam area of the nearest cell to the station is already deemed acceptable by defining `min_area_diff`(%).
 
-For instance, refer to illustration example below, if the specified `max_area_difference` is 10%, then the optimum grid to be returned when specified `max_neighboring_cell` = 1 cell, is the one with 7% upstream area difference (blue). While if the `max_neighboring_cell` = 2 cell, then the cell with 5% upstream area difference will be returned as the optimum grid instead. 
+For instance, refer to illustration example below, if the specified `max_area_difference` is 10%, then the optimum grid to be returned when specified `max_neighboring_cell` = 1 cell, is the one with 7% upstream area difference (blue). While if the `max_neighboring_cell` = 2 cell, then the cell with 5% upstream area difference will be returned as the optimum grid instead.
 
 <img src="station_mapping_search_algo.svg" alt="illustration of optimum grid search algorithm" width="450"/>
 
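The search described in the paragraphs above can be sketched as follows. This is an illustration under stated assumptions, not the `station_mapping` implementation: the function name `find_best_cell`, the square search window, and the tie-breaking by distance are hypothetical, and the parameter names follow the configuration keys (`max_neighboring_cells`, `max_area_diff`, `min_area_diff`).

```python
def find_best_cell(upstream_area, row, col, station_area,
                   max_neighboring_cells=1, max_area_diff=10.0,
                   min_area_diff=0.0):
    """Pick a cell near (row, col) whose upstream area best matches the station.

    Returns (row, col, area_diff_percent), or None when no cell inside the
    search window meets max_area_diff. Hypothetical helper, not HAT's API.
    """

    def diff_percent(r, c):
        return abs(upstream_area[r][c] - station_area) / station_area * 100.0

    # Keep the nearest cell if its area difference is already acceptable.
    nearest_diff = diff_percent(row, col)
    if nearest_diff <= min_area_diff:
        return row, col, nearest_diff

    n_rows, n_cols = len(upstream_area), len(upstream_area[0])
    best = None
    for r in range(max(0, row - max_neighboring_cells),
                   min(n_rows, row + max_neighboring_cells + 1)):
        for c in range(max(0, col - max_neighboring_cells),
                       min(n_cols, col + max_neighboring_cells + 1)):
            diff = diff_percent(r, c)
            if diff > max_area_diff:
                continue
            # Assumption: smallest area difference wins, distance breaks ties.
            distance = max(abs(r - row), abs(c - col))
            candidate = (diff, distance, r, c)
            if best is None or candidate < best:
                best = candidate

    if best is None:
        return None
    diff, _, r, c = best
    return r, c, diff


# Toy grid mirroring the illustration: the station's nearest cell is (2, 2),
# a cell one step away differs by 7%, and a cell two steps away by 5%.
grid = [[200.0] * 5 for _ in range(5)]
grid[2][2] = 120.0
grid[2][3] = 107.0
grid[2][4] = 105.0

one_cell = find_best_cell(grid, 2, 2, 100.0, max_neighboring_cells=1)
two_cells = find_best_cell(grid, 2, 2, 100.0, max_neighboring_cells=2)
```

With a radius of one cell, the sketch selects the 7% cell at `(2, 3)`. With a radius of two, it selects the 5% cell at `(2, 4)`, matching the behaviour described for the illustration.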
@@ -21,11 +21,11 @@ To use the `station_mapping` as command line, follow these steps:
 
 1.  Prepare your data input: station data and grid data in the appropriate format. Station data should be in a CSV file, and grid data should be in a NetCDF file.
     PLease ensure all lattitudes and longitude values in [<b> decimal degree format/ DD</b>](https://en.wikipedia.org/wiki/Decimal_degrees).
-    
+
 2.  Create a [JSON configuration](https://github.com/ecmwf/hat/tree/main/notebooks/examples/station_mapping_config_example.json) file specifying the paths to your data files, column names, and other relevant parameters.
-    
+
 3.  Run the `station_mapping.py` script with the path to your configuration file:
-    
+
 `./station_mapping.py path/to/your/config.json`
 
 
@@ -48,7 +48,7 @@ config = {
     "csv_ups_col": "DrainingArea.km2.Provider", # column name for metadata of upstream  (string)
 
     # Mapping parameters (3x)
-    "max_neighboring_cells": 5, # Parameter 1: maximum radius to search for best cells (no. of cells)  
+    "max_neighboring_cells": 5, # Parameter 1: maximum radius to search for best cells (no. of cells)
     "max_area_diff": 20, # Parameter 2: acceptable/ optimum upstream area difference (%)
     "min_area_diff": 0, # Parameter 3: minimum upstream area difference (%) between nearest grid and the station metadata
 
@@ -59,15 +59,15 @@ config = {
 
     # if Output directory is provided, it will save the geodataframe outputs to geojson and csv readable by GIS or jupyter interactive
     # "out_directory": None # put none if you don't want to save the output
-    "out_directory": "output"    
+    "out_directory": "output"
 }
 ```
 3. Run the `station_mapping` function with the config dictionary input and store result as dataframe (df)
 Since in the above example, the out_directory is not empty/ None, i.e. hence geojson and csv output of the station mapping tool will be saved in the specified directory.
 
 ```
-# import station mapping 
-from hat.mapping.station_mapping import station_mapping 
+# import station mapping
+from hat.mapping.station_mapping import station_mapping
 # call station_mapping function and apply on the created config dictionary
 df = station_mapping(config)
 ```
@@ -87,7 +87,7 @@ Outputs
 ------
 
 The following elements (column) will be written as dataframe as the expected `station_mapping` output.
-Note: `_lat` and `_lon` refer to the actual lattitude and longitude of the location, while `_lat_idx` and `_lon_idx` refer to the lat and lon grid ID. 
+Note: `_lat` and `_lon` refer to the actual lattitude and longitude of the location, while `_lat_idx` and `_lon_idx` refer to the lat and lon grid ID.
 
 * Station data
 `station_name`, `station_lat`, `station_lon`, `station_area`
@@ -101,7 +101,7 @@ Note: `_lat` and `_lon` refer to the actual lattitude and longitude of the locat
 * Manually mapped variable
 `manual_lat`, `manual_lon`, `manual_lat_idx`, `manual_lon_idx`, `manual_area`
 
-* GIS compatble output files (optional) 
+* GIS compatble output files (optional)
 if the "out_directory" in the `configuration` is specified, then the following files will be written in the directory:
 
     1. `stations.geojson`: stations point vector in geojson (readable in GIS)