
Commit 1f5250c

docs: integration of more detailed documentation
1 parent 58bd13b commit 1f5250c

8 files changed: +355 -73 lines changed

.github/workflows/docs.yml

Lines changed: 28 additions & 0 deletions

@@ -0,0 +1,28 @@
name: Deploy docs to GitHub Pages

on:
  push:
    branches:
      - main # or 'master' if that's your default branch

jobs:
  deploy:
    runs-on: ubuntu-latest

    steps:
      - name: Checkout repository
        uses: actions/checkout@v3

      - name: Set up Python
        uses: actions/setup-python@v4
        with:
          python-version: '3.10'

      - name: Install dependencies
        run: |
          python -m pip install --upgrade pip
          pip install mkdocs mkdocs-material mkdocs-jupyter mkdocs-mermaid2-plugin

      - name: Deploy to GitHub Pages
        run: |
          mkdocs gh-deploy --force

README.md

Lines changed: 8 additions & 70 deletions

@@ -1,76 +1,14 @@
-# APEx Dispatch API (FastAPI)
 
-This repository contains the implementation of the APEx Upscaling Service API using FastAPI.
+# APEx Dispatch API
 
-## Getting Started: Running the API Locally
+The **APEx Dispatch API** provides a service for **executing and upscaling EO-based services** across multiple platforms.
+It is built with **FastAPI** and is designed for scalable, asynchronous processing.
 
-1. **Install dependencies:**
+## 📦 Getting Started
 
-   ```bash
-   pip install -r requirements.txt
-   ```
+Learn how to get started [here](docs/getting_started.md).
 
-2. **Configure environment variables:**
+## 🤝 Contributing
 
-   Create a `.env` file and set your environment variables accordingly (e.g., `DATABASE_URL`).
-
-3. **Set up the database:**
-
-   Follow the [Database Setup](#database-setup) instructions below to prepare your local PostgreSQL instance.
-
-4. **Run the FastAPI application:**
-
-   ```bash
-   uvicorn app.main:app --reload
-   ```
-
-## Running Tests
-
-Execute the test suite using:
-
-```bash
-pytest
-```
-
-## Database Setup
-
-1. **(Optional) Create a Docker volume to persist PostgreSQL data:**
-
-   ```bash
-   docker volume create local-postgres-data
-   ```
-
-2. **(Optional) Inspect the volume mount point:**
-
-   ```bash
-   docker volume inspect local-postgres-data
-   ```
-
-   This shows the physical location of your data on the host machine.
-
-3. **Start a PostgreSQL container linked to the volume:**
-
-   ```bash
-   docker run -d --name postgres -p 5432:5432 \
-     -e POSTGRES_USER=testuser \
-     -e POSTGRES_PASSWORD=secret \
-     -e POSTGRES_DB=testdb \
-     -v local-postgres-data:/var/lib/docker/volumes/local-postgres-data \
-     postgres:latest
-   ```
-
-4. **Set your database connection string:**
-
-   Add the following to your `.env.local` (or `.env`) file:
-
-   ```env
-   DATABASE_URL=postgresql+psycopg2://testuser:secret@localhost:5432/testdb
-   ```
-
-5. **Apply database migrations:**
-
-   Make sure your database schema is up-to-date by running:
-
-   ```bash
-   alembic upgrade head
-   ```
+We welcome contributions!
+Please read our [contributing guidelines](docs/contributing.md) before submitting pull requests.

docs/architecture.md

Lines changed: 159 additions & 0 deletions

@@ -0,0 +1,159 @@
# Architecture Overview

The **APEx Dispatch API** acts as a **broker service** that allows clients to trigger job executions on external Earth Observation platforms.
Instead of interacting directly with platform-specific APIs, clients can use the **uniform Dispatch API interface**, while the dispatcher handles the translation and job management.

## Key Concepts

### Dispatch API

The **Dispatch API** is the core component of the system. It acts as the entry point for clients who want to execute jobs or perform upscaling tasks. When a job request is submitted, the dispatcher translates it into a **platform-specific request**, based on standards such as [openEO](https://openeo.org/) or [OGC API – Processes](https://ogcapi.ogc.org/processes/), and sends it to an existing EO platform such as CDSE or the Geohazards Exploitation Platform. Beyond handling the translation, the dispatcher is also responsible for storing all relevant job information, including metadata and references to the external platform where the job is ultimately executed.

```mermaid
flowchart LR
    C["Client"] --> D["APEx Dispatch API"]
    D -.-> C
    D --> P1["Platform 1"]
    P1 -.-> D
    D --> P2["Platform 2"]
    P2 -.-> D
    D --> P3["Platform 3"]
    P3 -.-> D
```
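
The brokering and record keeping described above can be pictured with a small sketch. This is illustrative only: the `Dispatcher` and `JobRecord` names, their fields, and the in-memory registry are assumptions made for this example, not the project's actual classes.

```python
import itertools
import uuid
from dataclasses import dataclass, field


@dataclass
class JobRecord:
    """Internal record the dispatcher could keep for every forwarded job (illustrative)."""
    internal_id: str
    service: str
    platform: str
    platform_job_id: str
    status: str = "submitted"
    parameters: dict = field(default_factory=dict)


class Dispatcher:
    """Toy broker: forwards uniform job requests and tracks the external references."""

    def __init__(self) -> None:
        self._records: dict[str, JobRecord] = {}
        self._counter = itertools.count(1)

    def submit(self, platform: str, service: str, parameters: dict) -> JobRecord:
        # In the real service, this step would translate the uniform request into an
        # openEO or OGC API – Processes call against the chosen platform.
        platform_job_id = f"{platform}-job-{next(self._counter)}"
        record = JobRecord(
            internal_id=str(uuid.uuid4()),
            service=service,
            platform=platform,
            platform_job_id=platform_job_id,
            parameters=parameters,
        )
        # The internal ID is the client's single point of reference; the platform
        # job ID is only used when talking to the external platform.
        self._records[record.internal_id] = record
        return record


dispatcher = Dispatcher()
job = dispatcher.submit("CDSE", "sentinel2-ndvi", {"bbox": [5.0, 51.0, 5.5, 51.5]})
print(job.internal_id, "->", job.platform_job_id)
```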

### Processing Job Execution

When a client wants to perform a task, it submits a job to the Dispatch API. A job request typically contains two main pieces of information: the service that needs to be executed and the parameters required for that service. Once received, the dispatcher forwards this request to the chosen external platform, which carries out the execution. In response, the platform provides a job identifier, which the dispatcher records internally to keep track of the execution.

After a job has been submitted and forwarded to an external platform, the dispatcher maintains an internal record of it. This record includes a unique internal job identifier, which the client can use for reference, as well as the mapping to the external platform’s job ID. Additional metadata, such as the job status, the creation timestamp, and the parameters used during submission, is also stored. This internal tracking mechanism ensures that the client has a single point of reference for all jobs, regardless of where they are executed.

```mermaid
sequenceDiagram
    participant UI as Client
    box APEx
        participant API as APEx Dispatch API
    end
    box Platform
        participant Platform as API (openEO / OGC API Process)
    end

    UI->>API: POST /unit_jobs

    API->>API: Create processing job
    API->>Platform: Submit processing job
    Platform-->>API: Return platform job ID
    API->>API: Store platform job ID
    API->>API: Set job status as "submitted"

    API-->>UI: Return processing job summary
```
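
As an illustration of the flow above, a client submission could look like the sketch below. Only the `/unit_jobs` path comes from the diagram; the base URL, payload fields, and response shape are assumptions, not the documented API contract.

```python
import httpx

# Hypothetical payload: the exact field names are not defined in this document.
job_request = {
    "service": "sentinel2-ndvi",   # service to execute (illustrative)
    "process_type": "openeo",      # target platform type (illustrative)
    "parameters": {
        "bbox": [5.0, 51.0, 5.5, 51.5],
        "start_date": "2024-01-01",
        "end_date": "2024-01-31",
    },
}

response = httpx.post("http://localhost:8000/unit_jobs", json=job_request)
response.raise_for_status()

# The dispatcher returns a processing job summary, including its internal job ID
# that maps to the platform job ID it stored.
print(response.json())
```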

### Upscaling Task Execution

In addition to individual job submissions, the dispatcher also supports **upscaling** activities. In this case, a client submits a request that includes not just the target service and execution parameters, but also a **parameter dimension with multiple values**. The dispatcher uses this information to generate multiple job requests, each corresponding to one value in the parameter dimension, and forwards them to the external platform. From the client’s perspective, however, this entire batch of jobs is managed as a single **upscaling task**. The dispatcher keeps track of the execution of all related jobs and exposes them as part of one unified task, simplifying monitoring and retrieval for the user.

```mermaid
sequenceDiagram
    participant UI as Client
    box APEx
        participant API as APEx Dispatch API
    end
    box Platform
        participant Platform as API (openEO / OGC API Process)
    end

    UI->>API: POST /upscale_tasks

    API->>API: Create upscaling task

    loop For each job of upscaling task
        API->>API: Create processing job
        API->>Platform: Submit processing job
        Platform-->>API: Return platform job ID
        API->>API: Store platform job ID
        API->>API: Set job status as "submitted"
    end

    API-->>UI: Return upscaling task summary
```
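
Continuing the same illustration, an upscaling request would additionally carry a parameter dimension with multiple values. As before, only the `/upscale_tasks` path comes from the diagram; the field names below, including the `dimension` object, are assumptions made for the example.

```python
import httpx

# Hypothetical payload: one parameter dimension ("tile") with several values, which
# the dispatcher would expand into one processing job per value.
upscale_request = {
    "service": "sentinel2-ndvi",
    "process_type": "openeo",
    "parameters": {"start_date": "2024-01-01", "end_date": "2024-01-31"},
    "dimension": {"name": "tile", "values": ["31UFS", "31UGS", "31UFT"]},
}

response = httpx.post("http://localhost:8000/upscale_tasks", json=upscale_request)
response.raise_for_status()

# The response summarises a single upscaling task that groups the generated jobs.
print(response.json())
```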

### Status Retrieval

To check the progress of their jobs and upscaling tasks, clients use a single status endpoint exposed by the Dispatch API. When such a request arrives, the dispatcher looks up the corresponding external job reference stored in its internal records. It then queries the external platform to obtain the most up-to-date status. This status information is returned to the client, allowing them to monitor their job execution transparently through the dispatcher without needing to interact with the external platform directly.

```mermaid
sequenceDiagram
    participant UI as Client
    box APEx
        participant API as APEx Dispatch API
    end
    box Platform
        participant Platform as API (openEO / OGC API Process)
    end

    UI->>+API: Set up websocket to /job_status

    loop Every X minutes
        loop For each running processing job
            API->>Platform: Request job status
            Platform-->>API: Send job status
            API->>API: Update job status
        end

        loop For each running upscaling task
            loop For each running processing job in upscaling task
                API->>Platform: Request job status
                Platform-->>API: Send job status
                API->>API: Update job status
            end
            API->>API: Compute upscaling task status
        end
    end
    API-->>-UI: Return summary list of processing jobs and upscaling tasks
```
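
A client might consume these updates over the websocket roughly as sketched below. This uses the third-party `websockets` package purely as an example; the URL, the polling behaviour, and the message format (assumed here to be a JSON summary list) are not specified by this document.

```python
import asyncio
import json

import websockets  # third-party package: pip install websockets


async def watch_status() -> None:
    # Hypothetical URL; the document only names the /job_status websocket endpoint.
    async with websockets.connect("ws://localhost:8000/job_status") as ws:
        while True:
            message = await ws.recv()
            summary = json.loads(message)
            # Assumed message shape: a summary list of processing jobs and upscaling tasks.
            for item in summary:
                print(item.get("id"), item.get("status"))


if __name__ == "__main__":
    asyncio.run(watch_status())
```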

## Authentication and Authorization

Authentication and authorization are critical components of the APEx Dispatch API, as jobs launched through the API result in resource consumption on external platforms. To support remote job execution and manage this resource usage effectively, the project has identified two distinct scenarios:

### APEx Service Account (Current Implementation)

In this scenario, all jobs are executed on the external platforms using a generic APEx service account that has access to them. This means that each job or upscaling task triggered through the API is executed on the platform under the APEx account, rather than under the actual user’s identity. However, the Dispatch API maintains the link between the platform job ID and the user who initiated the request in its database.

```mermaid
flowchart LR
    C["Alice"] -- Request job --> D["APEx Dispatch API"]
    D -- Launch job as user APEx --> P1["Platform"]
```

**Pros:**

* Provides a seamless user experience: users do not need to create or manage platform-specific accounts.
* Simplifies integration for clients using the Dispatch API.

**Cons:**

* For each new platform, a dedicated APEx account must be created and funded appropriately. Estimating the required funding in advance is challenging, especially since the account is shared across all users.
* No user-level auditing or accounting is available. Users can continue triggering jobs as long as the APEx account has sufficient funds. This poses a risk of misuse, potentially leading to service disruption for all users if the APEx account is depleted.
* Implementing safeguards would require advanced accounting features within the Dispatch API, including translating existing platform business models into a uniform business logic. This adds complexity and may introduce additional costs by layering over existing platform models.

### User Impersonation (Preferred Approach)

The preferred solution is to execute jobs on behalf of the user who initiates the request via the APEx Dispatch API. In this model, all accounting and access control are handled directly by the platform, and users are responsible for maintaining sufficient access and funding, potentially supported through the ESA Network of Resources (NoR).

```mermaid
flowchart LR
    C["Alice"] -- Request job --> D["APEx Dispatch API"]
    D -- Launch job as user Alice --> P1["Platform"]
```

**Pros:**

* No need for custom accounting logic in the APEx Dispatch API, as platforms handle this natively.
* APEx avoids introducing a layer over the platform’s existing business model, preserving operational simplicity.

**Cons:**

* Propagating user identity across platforms is a technical challenge and currently lacks a proven, ready-to-use solution.
* May require modifications on the target platform to support user impersonation, depending on the chosen implementation strategy.

CONTRIBUTE.md renamed to docs/contributing.md

Lines changed: 3 additions & 3 deletions

@@ -1,4 +1,4 @@
-# Contributing to the APEx Dispatch API
+# Contributing
 
 ## Making Contributions
 
@@ -27,7 +27,7 @@ Contributions to the APEx Dispatch API are welcome! If you have suggestions for
 
 ## Registration of a new Platform Implementation
 
-To add a new platform implementation, you will need to create a new class that inherits from the `BaseProcessingPlatform` class located at [`app/platforms/base.py`](app/platforms/base.py). In this new class, you will need to implement all the abstract methods defined in the [`BaseProcessingPlatform`](app/platforms/base.py) class. This will ensure that your new platform implementation adheres to the expected interface and functionality.
+To add a new platform implementation, you will need to create a new class that inherits from the `BaseProcessingPlatform` class located at `app/platforms/base.py`. In this new class, you will need to implement all the abstract methods defined in the `BaseProcessingPlatform` class. This will ensure that your new platform implementation adheres to the expected interface and functionality.
 
 To register the new implementation, it is important to add the following directive right above the class definition:
 
@@ -40,6 +40,6 @@ class OGCAPIProcessPlatform(BaseProcessingPlatform):
     ...
 ```
 
-The processing type, defined by `ProcessTypeEnum`, is the unique identifier for the platform implementation. It is used to distinguish between different platform implementations in the system. This value is used by the different request endpoints to determine which platform implementation to use for processing the request. To add a new platform implementation, you will need to define a new `ProcessTypeEnum` value in the [`app/schemas/enum.py`](app/schemas/enum.py) file. This value should be unique and descriptive of the platform you are implementing.
+The processing type, defined by `ProcessTypeEnum`, is the unique identifier for the platform implementation. It is used to distinguish between different platform implementations in the system. This value is used by the different request endpoints to determine which platform implementation to use for processing the request. To add a new platform implementation, you will need to define a new `ProcessTypeEnum` value in the `app/schemas/enum.py` file. This value should be unique and descriptive of the platform you are implementing.
 
 Once you have completed the above steps, the new platform implementation will be registered automatically and made available for use in the APEx Dispatch API. You can then proceed to implement the specific functionality required for your platform.
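
As a rough, standalone illustration of these registration steps, the sketch below mimics the pattern with stand-in definitions. The `register_platform` decorator, the registry dict, the `MY_PLATFORM` enum member, and the `submit_job` signature are all assumptions made for this example; the real directive, abstract methods, and enum values are the ones defined in `app/platforms/base.py` and `app/schemas/enum.py`.

```python
from abc import ABC, abstractmethod
from enum import Enum


class ProcessTypeEnum(str, Enum):
    # Hypothetical new value; the real members live in app/schemas/enum.py.
    MY_PLATFORM = "my_platform"


class BaseProcessingPlatform(ABC):
    """Stand-in for the base class in app/platforms/base.py."""

    @abstractmethod
    def submit_job(self, service: str, parameters: dict) -> str:
        """Abstract method name and signature are assumptions for this sketch."""


# Stand-in registry mirroring the automatic registration described above.
PLATFORM_REGISTRY: dict[ProcessTypeEnum, type[BaseProcessingPlatform]] = {}


def register_platform(process_type: ProcessTypeEnum):
    """Stand-in for the registration directive placed right above the class definition."""
    def decorator(cls: type[BaseProcessingPlatform]) -> type[BaseProcessingPlatform]:
        PLATFORM_REGISTRY[process_type] = cls
        return cls
    return decorator


@register_platform(ProcessTypeEnum.MY_PLATFORM)
class MyPlatform(BaseProcessingPlatform):
    def submit_job(self, service: str, parameters: dict) -> str:
        # Translate the uniform request into the new platform's own API call here.
        return "my-platform-job-001"


print(PLATFORM_REGISTRY)
```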
