You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
./tools/run.py -- --format csv --scale-factor 0.003 --mode raw --output-dir sf0.003-raw
131
132
```
132
133
133
134
* Generating Parquet files:
134
135
135
136
```bash
136
-
./tools/run.py ./target/ldbc_snb_datagen_${PLATFORM_VERSION}-${DATAGEN_VERSION}.jar -- --format parquet --scale-factor 0.003 --mode bi
137
+
./tools/run.py -- --format parquet --scale-factor 0.003 --mode bi
137
138
```
138
139
139
140
* Use epoch milliseconds encoded as longs (née `LongDateFormatter`) for serializing date and datetime values:
140
141
141
142
```bash
142
-
./tools/run.py ./target/ldbc_snb_datagen_${PLATFORM_VERSION}-${DATAGEN_VERSION}.jar -- --format csv --scale-factor 0.003 --mode bi --epoch-millis
143
+
./tools/run.py -- --format csv --scale-factor 0.003 --mode bi --epoch-millis
143
144
```
144
145
145
146
* For the `interactive` and `bi` formats, the `--format-options` argument allows passing formatting options such as timestamp/date formats, the presence/abscence of headers (see the [Spark formatting options](https://spark.apache.org/docs/2.4.8/api/scala/index.html#org.apache.spark.sql.DataFrameWriter) fordetails), and whether quoting the fieldsin the CSV required:
[ !-f target/ldbc_snb_datagen_${PLATFORM_VERSION}-${DATAGEN_VERSION}-jar-with-dependencies.jar ] &&echo"target/ldbc_snb_datagen_${PLATFORM_VERSION}-${DATAGEN_VERSION}-jar-with-dependencies.jar does not exist, exiting"&&exit 1
4
-
5
3
# make sure that out directory exists and clean previously generated data
0 commit comments