CogStack
diff --git a/docs/conf.py b/docs/conf.py
Lines changed: 2 additions & 2 deletions
diff --git a/docs/index.md b/docs/index.md
Lines changed: 11 additions & 9 deletions
diff --git a/docs/overview/CogStack Product Documentation.md b/docs/overview/CogStack Product Documentation.md
Lines changed: 0 additions & 18 deletions
diff --git a/docs/overview/CogStack ecosystem (v1).md b/docs/overview/CogStack ecosystem (v1).md
Lines changed: 9 additions & 15 deletions
diff --git a/docs/overview/CogStack Documentation.md renamed to docs/overview/CogStack-Documentation.md
Lines changed: 2 additions & 2 deletions
@@ -10,11 +10,11 @@
 # -- Project information -----------------------------------------------------
 # https://www.sphinx-doc.org/en/master/usage/configuration.html#project-information
 
-project = 'CogStack Platform Toolkit'
+project = 'CogStack Documentation'
 copyright = '2025, CogStack Org'
 author = 'CogStack Org'
 release = 'latest'
-html_title = "CogStack Platform Toolkit"
+html_title = "CogStack Documentation"
 
 # -- General configuration ---------------------------------------------------
 # https://www.sphinx-doc.org/en/master/usage/configuration.html#general-configuration
 
@@ -1,26 +1,28 @@
 
 # CogStack Documentation
 
+Welcome to the CogStack Documentation site.
 
-CogStack is composed of a range of adaptable modular interoperable tools which introduce tiered functionality which can be used for a variety of use-technologies:
+Get started by looking at the [CogStack Overview](overview/CogStack-Documentation.md).
 
-Centralise and lake clinical data including structured data i.e. observations, results, and unstructured data i.e. clinical narratives such as clinic letters, discharge and admission summaries and radiology reports also varying formats e.g. binary word docs, PDFs, images.
+If you have any broad questions, please reach out in our community space [here](https://discourse.cogstack.org/).
 
-Search and visualise millions of distinct data points in near-real-time – ‘unlocking’ capabilities that would otherwise have taken days or months previously.
+Further in-development projects are listed [here](https://github.com/orgs/CogStack/repositories).
 
-Natural Language Processing of clinical text to standardised clinical terminologies (SNOMED-CT) for interoperable clinical data combined with semantic context. This allows cohorting based on “find all patients with a heart attack”, regardless of how this has been referred to in the clinical text, such as “patient had myocardial infarct”, “MI“, “infarct of heart”, “cardiac infarct” and distinguishing “the patient’s father had a MI”.
+![](./overview/attachments/43c14755-e565-4ae0-a0a3-ec6dc18a691c.png)
 
-Deep phenotyping using NLP allows accelerated NHS clinical coding, disease registry submissions and advanced cohorting for observational studies.
+| Tool | Description |
+|:-----|:------------|
+| <img src="./overview/attachments/36c0d23f-a632-4fbf-9f7c-6669e88bbd39.png" width="100"/> <br/> [**CogStack-Nifi**](https://cogstack-nifi.readthedocs.io/en/latest/main.html) | Data flow orchestration using Apache NiFi |
+| <img src="./overview/attachments/09a8bb60-9864-41fa-be7b-cf9a9dc04498.png" width="100"/> <br/> [**MedCAT**](https://medcat.readthedocs.io/en/latest/) | Medical Concept Annotation Toolkit |
+| <img src="./overview/attachments/09a8bb60-9864-41fa-be7b-cf9a9dc04498.png" width="100"/> <br/> [**MedCATTrainer**](https://medcattrainer.readthedocs.io/en/latest/) | Web-based annotation and training interface for MedCAT |
 
-Population health dashboards for combining data from structured and text components of the electronic health record to track patient outcomes, enhance patient safety and improve patient care.
-
-Advanced analytics using generative AI for virtual trial emulation, high-dimensional patient or disease modelling and digital patient twins.
 
 ```{toctree}
 :hidden:
 
 overview/_index
-observability/_index
+toolkit/_index
 
 ```
 
@@ -3,15 +3,9 @@
 
 # CogStack ecosystem (v1)
 
-|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
-|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| Overview  <br/> In this part are covered the available services that can be running in an example CogStack deployment. To such deployment with many running services we refer as an  *ecosystem* or a *platform*. Below is presented a high-level perspective of CogStack platform with the possibilities it enables through many components and services. <br/> []() <br/>  <br/> In practice, many of the functionalities that CogStack platform enables are implemented as separate, but interconnected services working inside the ecosystem. <br/>  <br/> | > [!NOTE] **On this page :**  <br/> <ul class="toc-indentation"><br/><li><a href="#CogStackecosystem(v1)-Overview">Overview</a></li><br/><li><a href="#CogStackecosystem(v1)-platform-coreCoreservices">Core services</a></li><br/><li><a href="#CogStackecosystem(v1)-platform-pipelineCogStackPipeline">CogStack Pipeline</a></li><br/><li><a href="#CogStackecosystem(v1)-platform-postgres"></a></li><br/><li><a href="#CogStackecosystem(v1)-PostgreSQL">PostgreSQL</a></li><br/><li><a href="#CogStackecosystem(v1)-platform-esElasticSearch">ElasticSearch</a></li><br/><li><a href="#CogStackecosystem(v1)-platform-kibanaKibana">Kibana</a></li><br/><li><a href="#CogStackecosystem(v1)-platform-nginxNGINX">NGINX</a></li><br/><li><a href="#CogStackecosystem(v1)-platform-fluentdFluentd">Fluentd</a></li><br/></ul>   <br/> |
+This part covers the services that can run in an example CogStack deployment. We refer to such a deployment, with many running services, as an *ecosystem* or a *platform*. Below is a high-level perspective of the CogStack platform and the possibilities it enables through its many components and services. In practice, many of the functionalities that the CogStack platform enables are implemented as separate but interconnected services working inside the ecosystem.
 
----
-
----
-
-# Core services
+## Core services
 
 In most scenarios the CogStack platform will consist of *core* services tailored to specific use-cases. Additional applications and services can be run on top of it, such as [SemEHR](../../CogStack%20General/CogStack%20Wiki/CogStack%20projects/SemEHR.md), [Patient Timeline](../../CogStack%20General/CogStack%20Wiki/CogStack%20projects/Patient%20Timeline.md), Live Alerting (through ElasticSearch plugins) or any other custom-developed applications. For ease of use, when deploying a sample CogStack platform, we always recommend using Docker Compose (see: [Running CogStack](Running%20CogStack.md)).
 
@@ -41,7 +35,7 @@ It is essential to note that presented is a very simplified scenario, which can
 
 ---
 
-# CogStack Pipeline
+### CogStack Pipeline
 
 CogStack Pipeline is the main data processing service used inside the CogStack platform. Within the ecosystem its main responsibility is to ingest the EHR data from a specified data source, process the data (e.g. by applying text extraction methods, de-identifying records or extracting NLP annotations) and store the resulting data in the specified sink.
 
@@ -60,9 +54,9 @@ The information about available data processing components offered by CogStack P
 
 ---
 
-# 
 
-# PostgreSQL
+
+### PostgreSQL
 
 [PostgreSQL](https://www.postgresql.org/) is a widely used object-relational database management system. In the CogStack platform it is primarily used as a job repository, storing the job execution status of running CogStack Pipeline instances. However, there may be cases where partial results need to be stored, treating the PostgreSQL DB either as a data cache (see: [Examples](Examples.md)) or as an auxiliary data sink.
 
@@ -88,7 +82,7 @@ When used as a job repository, it requires defining appropriate tables with a us
 
 ---
 
-# ElasticSearch
+### ElasticSearch
 
 [ElasticSearch](https://www.elastic.co/guide/) is a popular NoSQL search engine based on the Lucene library. It provides distributed full-text search and stores data as schema-free JSON documents. Inside the CogStack platform it is usually used as the primary data store for EHR data processed by CogStack Pipeline.
 
@@ -151,7 +145,7 @@ Depending on the use-case, the processed EHR data is usually stored in indices a
 
 ---
 
-# Kibana
+### Kibana
 
 [Kibana](https://www.elastic.co/products/kibana) is a data visualisation module for ElasticSearch that can easily be used to explore and query the data. In sample CogStack platform deployments it can be used as a ready-to-use data exploration tool.
 
@@ -168,7 +162,7 @@ Apart from providing exploratory data analysis functionality it also offers admi
 
 ---
 
-# NGINX
+### NGINX
 
 NGINX is a popular, open-source web server that can also be used as a reverse proxy, load balancer, HTTP cache and more. In CogStack platform deployments, it can be used as a reverse proxy that provides basic access security for the exposed data stores and service endpoints. This functionality may include general user-based authentication, IP filtering and selective service access. A more detailed description of the security features offered by NGINX can be found in the [official documentation](https://docs.nginx.com/nginx/admin-guide/security-controls/).
 
@@ -185,7 +179,7 @@ NGINX is a popular, open-source web server that can also be used as a reverse pr
 
 ---
 
-# Fluentd
+### Fluentd
 
 [Fluentd](https://www.fluentd.org/) is an open source data collector providing a unified logging layer. In sample CogStack platform deployments it can run as a service that collects logs from all the running services, which can then be used for auditing.
 
 
@@ -3,7 +3,7 @@
 
 # CogStack Documentation
 
-# What is CogStack?
+## What is CogStack?
 
 CogStack is a lightweight, distributed, fault-tolerant database processing architecture and ecosystem, intended to make NLP processing and preprocessing easier in resource-constrained environments. It comprises multiple components and has been designed to provide configurable data processing pipelines for working with EHR data. For the moment it mainly uses databases and files as the primary source of EHR data, with the possibility of adding custom data connectors in the near future. It makes use of the [Apache NiFi](https://nifi.apache.org/) framework to provide a fully configurable data processing pipeline, with the goal of generating annotated JSON files in a standardised schema that can be readily indexed into [ElasticSearch](https://www.elastic.co/), stored as files or pushed back to a database.
 
@@ -16,7 +16,7 @@ The CogStack ecosystem has been developed as an open source project with the cod
 >
 > Starting from version 1.2 CogStack is preferably being run as an ecosystem using a set of different microservices and deployed using [Docker Compose](https://docs.docker.com/compose/). The ready-to-use CogStack images are available to pull directly from the official Docker Hub under [cogstacksystems](https://hub.docker.com/u/cogstacksystems/) organisation. We’re actively pursuing running the stack in a K8s cluster also.
 
-# Why does this project exist?
+## Why does this project exist?
 
 CogStack consists of a range of technologies designed to support modern, open source healthcare analytics within the NHS. It chiefly comprises the Elastic stack ([ElasticSearch](https://www.elastic.co/products/elasticsearch), [Kibana](https://www.elastic.co/products/kibana), etc.), [MedCAT](https://github.com/CogStack/MedCAT) (clinical natural language processing for named entity extraction and linking), clinical text [OCR](https://github.com/CogStack/ocr-service) and clinical text de-identification. Since the processed EHR data can be represented and stored in databases or ElasticSearch, CogStack can be used as one of the solutions for integrating EHR data with other types of biomedical data (-omics, wearables, etc.).
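
Because this change renames `CogStack Documentation.md` to `CogStack-Documentation.md` and adds a link to the new name from `docs/index.md`, it can be useful to check that relative Markdown links still resolve. The following is a minimal sketch of such a check, not part of this change: the helper names `relative_targets` and `broken_links` are hypothetical, the link pattern only handles simple inline `[text](target)` links, and the set of known files is passed in by the caller rather than read from disk.

```python
import posixpath
import re
from urllib.parse import unquote

# Simple inline Markdown links and images: [text](target) or ![alt](target).
# Assumption: targets contain no spaces or nested parentheses.
LINK_RE = re.compile(r"!?\[[^\]]*\]\(([^)\s]+)\)")


def relative_targets(markdown: str) -> list[str]:
    """Return link targets that point at local files (no URLs, no pure anchors)."""
    targets = []
    for target in LINK_RE.findall(markdown):
        if "://" in target or target.startswith(("#", "mailto:")):
            continue
        # Drop any "#anchor" suffix and decode %20-style escapes.
        targets.append(unquote(target.split("#", 1)[0]))
    return targets


def broken_links(source: str, markdown: str, known_files: set[str]) -> list[str]:
    """List the local targets in `markdown` (stored at `source`) missing from `known_files`."""
    base = posixpath.dirname(source)
    missing = []
    for target in relative_targets(markdown):
        resolved = posixpath.normpath(posixpath.join(base, target))
        if resolved not in known_files:
            missing.append(target)
    return missing
```

Calling `broken_links("docs/index.md", text, files)` with `files` holding the repository's documentation paths returns the targets that would 404 in the built site; an empty list means every local link in that file resolves.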