DLA Sequence Pattern Mining
Download the reference paper: https://github.com/andreagulino/DLA/raw/master/DLA.pdf
Usage
java -jar dla.jar [OPTIONS]
The following options are available:
| Option | Value |
|---|---|
| -minLen | integer representing the minimum length of patterns that should be found in the sequence database |
| -maxLen | integer representing the maximum length of patterns that should be found in the sequence database |
| -input | path tho the input sequence database (text file having as first column the sequence-id, second column the sequence string). HDFS paths are supported. |
| -output | path to the output folder |
| -minPartitions | number of partitions to split the input file |
| -sparkMaster | address of the spark master (default: local) |