Feature: Option: Add regions to graphs

Minimal illustration of the problem

![image](https://github.com/root-11/graph-theory/assets/4048422/079ae3cf-279b-47d2-a4bb-3919712214d5)
(classic graph vs multi-graph)

Assume G is a binary tree with a root and 2 levels of bifurcation, resulting in $2^{2}$ leaves, with randomized weights on the edges.

Assume that every search starts at the root and ends by identifying the route to a leaf, using BFS to determine the shortest path.

**Problem**: Due to the symmetric nature of the graph, a shortest-path BFS will visit practically every node every time a search is performed.
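To make the problem concrete, here is a minimal sketch in plain Python (not the library's API). It assumes a complete binary tree with heap-style numbering (root is 1, children of `n` are `2n` and `2n + 1`) and an illustrative `depth` of 10. Edge weights are left out because the root-to-leaf path in a tree is unique, so they do not change which nodes BFS has to visit. The helper names `build_binary_tree` and `bfs_visits` are made up for this example.

```python
from collections import deque

def build_binary_tree(depth):
    """Return {node: [children]} for a complete binary tree.

    Heap-style numbering is an assumption of this sketch:
    the root is 1 and the children of n are 2n and 2n + 1.
    """
    return {n: [2 * n, 2 * n + 1] for n in range(1, 2 ** depth)}

def bfs_visits(tree, start, target):
    """Plain BFS; return how many nodes were dequeued up to and including target."""
    queue, seen, visited = deque([start]), {start}, 0
    while queue:
        node = queue.popleft()
        visited += 1
        if node == target:
            return visited
        for child in tree.get(node, []):   # leaves have no entry in `tree`
            if child not in seen:
                seen.add(child)
                queue.append(child)
    return None

depth = 10                                   # illustrative choice
tree = build_binary_tree(depth)
total_nodes = 2 ** (depth + 1) - 1           # 2047 nodes in total
leaves = range(2 ** depth, 2 ** (depth + 1))
counts = [bfs_visits(tree, 1, leaf) for leaf in leaves]

print(total_nodes, min(counts), max(counts))  # 2047 1024 2047
```

Even the cheapest search dequeues every internal node plus one leaf, and the most expensive one dequeues the whole tree.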

**Proposition 1**: If (!) G is redesigned such that the graph holds information about what can be found below each bifurcation point, only 10 nodes need to be visited. This is *ideal* from a search perspective, but the memory overhead is problematic, as it requires the graph to store all leaves at all bifurcation levels: ~10x more memory. A second problem with this approach is that it only works for DAGs.
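A rough sketch of Proposition 1, assuming a complete binary tree with heap-style numbering (root is 1, children of `n` are `2n` and `2n + 1`) and `depth = 10`, so that the counts line up with the "10 nodes" and "~10x" figures above. `annotate` and `route` are hypothetical helpers written for this example, not functions of the library.

```python
def annotate(depth):
    """For every internal node, map each child to the frozenset of leaves below it."""
    first_leaf = 2 ** depth
    below = {leaf: frozenset([leaf]) for leaf in range(first_leaf, 2 * first_leaf)}
    index = {}
    for n in range(first_leaf - 1, 0, -1):        # build bottom-up
        left, right = 2 * n, 2 * n + 1
        index[n] = {left: below[left], right: below[right]}
        below[n] = below[left] | below[right]
    return index

def route(index, root, target):
    """Follow the annotations from root to the target leaf; no backtracking needed."""
    node, path = root, [root]
    while node in index:                          # stop once a leaf is reached
        for child, leaves in index[node].items():
            if target in leaves:
                node = child
                break
        else:
            return None                           # target is not a leaf of this tree
        path.append(node)
    return path if node == target else None

depth = 10
index = annotate(depth)
path = route(index, 1, 1500)
steps = len(path) - 1                             # nodes visited after the root
stored = sum(len(leaves) for kids in index.values() for leaves in kids.values())

print(steps, stored, 2 ** depth)                  # 10 10240 1024
```

Every leaf is stored once per ancestor level, so the index holds `depth * 2**depth` entries: ten times the number of leaves in this setting.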

**Proposition 2**: If a partition of G can be declared as another graph G', and BFS and shortest-path search can query G' as to whether it **contains** or **has a route to the target** node, then the search can be accelerated:

1. If the target node is in G' and BFS sees G' as a single node in G, then the destination node has been found.
2. If the target node is NOT in G', BFS can eliminate the search through G' altogether.

For the binary tree this means that, with G defined as $G = G_{1}' + G_{2}' = G_{1.1}' + G_{1.2}' + G_{2.1}' + G_{2.2}' + \dots$, a BFS or shortest-path search will require only $2 \cdot 10$ recursive queries akin to "is target in G'".

The reason for $2 \cdot 10$ is that at each recursive step up to two partitions are queried, and at least one of them will fail.
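The query count can be checked with a small sketch. It assumes a complete binary tree with heap-style numbering (root is 1, children of `n` are `2n` and `2n + 1`) and `depth = 10`, and it models each region G' as the subtree below one node. `region_contains` and `search` are hypothetical names; here the "is target in G'" query is answered from the numbering alone, which is specific to this toy layout.

```python
def region_contains(region_root, target, depth):
    """Answer "is target in G'?" where G' is the subtree rooted at region_root."""
    level = region_root.bit_length() - 1          # root is at level 0
    span = 2 ** (depth - level)                   # number of leaves below region_root
    return region_root * span <= target < (region_root + 1) * span

def search(depth, target):
    """Descend from the root, querying child regions; return (path, query count)."""
    node, path, queries = 1, [1], 0
    for _ in range(depth):
        for child in (2 * node, 2 * node + 1):
            queries += 1
            if region_contains(child, target, depth):
                node = child
                path.append(node)
                break                             # the sibling region is never entered
        else:
            return None, queries                  # neither region holds the target
    return path, queries

depth = 10
best = search(depth, 2 ** depth)                  # leftmost leaf: first query always hits
worst = search(depth, 2 ** (depth + 1) - 1)       # rightmost leaf: first query always misses
most = max(search(depth, t)[1] for t in range(2 ** depth, 2 ** (depth + 1)))

print(best[1], worst[1], most)                    # 10 20 20
```

The upper bound of `2 * depth` queries is reached when the first region asked at every level is the wrong one.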

Edge cases:

For non-trees, such as road networks, which may be partitioned using the "AA", "A", "B", ... road network classification, each branch will lead to a $G_{n}'$. Knowing the probability of reaching the target (for example using (lat, lon) distance) will help to accelerate the search. If such information isn't available, for example in information networks, the better method is to partition by proximity, e.g. into clusters of $G/2$ nodes. The search must therefore treat each G' as a node that either has been visited or not.
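One possible reading of "treat G' as nodes", sketched in plain Python: collapse each region into a single node, run BFS over the regions with an ordinary visited set, and only then search inside the regions on the resulting route. The toy edge list, the `region_of` mapping and the helper names `region_graph` and `region_route` are all invented for this example. Restricting the fine-grained search to those regions is a heuristic and may miss the globally shortest path in a weighted graph.

```python
from collections import deque

def region_graph(edges, region_of):
    """Collapse each region to one node; keep only edges that cross regions."""
    coarse = {}
    for a, b in edges:
        ra, rb = region_of[a], region_of[b]
        coarse.setdefault(ra, set())
        coarse.setdefault(rb, set())
        if ra != rb:
            coarse[ra].add(rb)
            coarse[rb].add(ra)
    return coarse

def region_route(coarse, start_region, target_region):
    """BFS over regions; a region is marked visited just like an ordinary node."""
    queue, parent = deque([start_region]), {start_region: None}
    while queue:
        region = queue.popleft()
        if region == target_region:
            found = []
            while region is not None:             # walk parents back to the start
                found.append(region)
                region = parent[region]
            return found[::-1]
        for nxt in sorted(coarse[region]):
            if nxt not in parent:
                parent[nxt] = region
                queue.append(nxt)
    return None

# Toy undirected graph with four regions (illustrative data only).
edges = [("a1", "a2"), ("a2", "b1"), ("b1", "b2"),
         ("b2", "c1"), ("c1", "c2"), ("a1", "d1")]
region_of = {"a1": "A", "a2": "A", "b1": "B", "b2": "B",
             "c1": "C", "c2": "C", "d1": "D"}

coarse = region_graph(edges, region_of)
route = region_route(coarse, region_of["a1"], region_of["c2"])
allowed = {n for n, r in region_of.items() if r in route}

print(route)                                      # ['A', 'B', 'C']
```

Region `D` is looked at once as a whole and then dropped, so none of its nodes take part in the detailed search over `allowed`.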











Feature: Option: Add regions to graphs #36
