You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+5-4Lines changed: 5 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -11,7 +11,8 @@ Datagen is part of the [LDBC project](https://ldbcouncil.org/).
11
11
:warning: There are two different versions of the Datagen:
12
12
13
13
* The [Hadoop-based Datagen](https://github.com/ldbc/ldbc_snb_datagen_hadoop/) generates the Interactive SF1-1000 data sets
14
-
* For the Interactive workload's larger data sets (up to SF30k) and for the BI workload, use the Spark-based Datagen (in this repository). This is an experimental repository and breaking changes (e.g. changes in the directory layout and CSV schema) are expected
14
+
* For the BI workload, use the Spark-based Datagen (in this repository).
15
+
* For the Interactive workloads's larger data sets, there is no out-of-the-box solution (see [this issue](https://github.com/ldbc/ldbc_snb_interactive/issues/173)).
15
16
16
17
The LDBC SNB Data Generator (Datagen) is the responsible for providing the datasets used by all the LDBC benchmarks. This data generator is designed to produce directed labelled graphs that mimic the characteristics of those graphs of real data. A detailed description of the schema produced by Datagen, as well as the format of the output files, can be found in the latest version of official [LDBC SNB specification document](https://github.com/ldbc/ldbc_snb_docs).
17
18
@@ -127,19 +128,19 @@ To get a complete list of the arguments, pass `--help` to the JAR file:
0 commit comments