Skip to content

Commit 5ef74b8

Browse files
committed
11_datasets.ipynb: Add original tables
Add tables from FELINE, FERRARI and PReaCH papers as images, together with description of columns.
1 parent 54d35f2 commit 5ef74b8

6 files changed

+42
-1
lines changed

11_datasets.ipynb

Lines changed: 42 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -77,6 +77,22 @@
7777
"http://openprocedings.org/EDBT/2014/paper_166.pdf"
7878
]
7979
},
80+
{
81+
"cell_type": "markdown",
82+
"metadata": {},
83+
"source": [
84+
"![Table 1: Datasets](images/feline/FELINE_paper-table_1-datasets-with_caption.png)\n",
85+
"\n",
86+
"- _vertices_ - $|V|$, number of vertices / nodes in the graph\n",
87+
"- _edges_ - $|E|$, number of edges in the graph\n",
88+
"- _Cluster-coeff_ - clustering coefficient\n",
89+
"- _Eff-diameter_ - effective diameter (or effective eccentricity) is an estimated size of the path in which 90% of all pairs of vertices connected are reachable from each other\n",
90+
"- _roots_ - number of roots, nodes with no incoming edges, vertices with no predecessors (sources)\n",
91+
"- _leafs_ - number of leafs, nodes with no outgoing edges (sinks)\n",
92+
"\n",
93+
"The FELINE paper authors used the SNAP software <snap.stanford.edu/snap/> to compute these values"
94+
]
95+
},
8096
{
8197
"cell_type": "code",
8298
"execution_count": null,
@@ -396,7 +412,20 @@
396412
"cell_type": "markdown",
397413
"metadata": {},
398414
"source": [
399-
"\"FERRARI: Flexible and Efficient Reachability Range Assignment for Graph Indexing\" (2013)"
415+
"\"FERRARI: Flexible and Efficient Reachability Range Assignment for Graph Indexing\" (2013) \n",
416+
"http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.365.2894&rep=rep1&type=pdf"
417+
]
418+
},
419+
{
420+
"cell_type": "markdown",
421+
"metadata": {},
422+
"source": [
423+
"<!-- ![Table 2. Datasets used](images/ferrari/FERRARI_arXiv-table_2-datasets_used-with_caption.png) -->\n",
424+
"![Table 1. Datasets used](images/ferrari/FERRARI_paper-table_1-datasets_used-with_caption.png)\n",
425+
"\n",
426+
"- $|V|$ - number of vertices (nodes)\n",
427+
"- $|E|$ - number of edges\n",
428+
"- $|V_C|$, $|E_C|$ - number of, respectively, vertices and edges in the _condensed graph_, where maximial strongly connected components were collapsed into \"supernodes\", i.e. directed acyclic graph (DAG) created from given graph"
400429
]
401430
},
402431
{
@@ -630,6 +659,18 @@
630659
"https://arxiv.org/abs/1404.4465"
631660
]
632661
},
662+
{
663+
"cell_type": "markdown",
664+
"metadata": {},
665+
"source": [
666+
"![Table 1: Headers](images/preach/PReaCH_arXiv-table_1-datasets_used-header.png) \n",
667+
"![Table 1: Instances used for our experiments. _d_ is the maximal path length.](images/preach/PReaCH_arXiv-table_1-datasets_used-real_graphs-no_caption.png)\n",
668+
"\n",
669+
"- $m/n$ - edge density, number of edges divided by number of nodes (vertices)\n",
670+
"- $d$ - length of the longest path, or maximal path length\n",
671+
"- % pos - the fraction of positive queries in a random sample of 100000 queries"
672+
]
673+
},
633674
{
634675
"cell_type": "code",
635676
"execution_count": null,
62.1 KB
Loading
91.9 KB
Loading
84.6 KB
Loading
4.42 KB
Loading
79.8 KB
Loading

0 commit comments

Comments
 (0)