forked from thammegowda/autoextractor
    
        
Build Instructions
Thamme Gowda edited this page May 12, 2016 · 3 revisions
Building the project requires:
- JDK 1.7 or newer (tested on JDK 7)
- Maven 3.x or newer (tested on Maven 3.3.x)
- A working Internet connection
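The JDK and Maven requirements can be checked before building. The following is a minimal sketch, not part of the project: `check_tool` is a hypothetical helper name, and it only reports whether each tool is on the PATH and prints the first line of its version output, leaving the comparison against the versions listed above to the reader.

```shell
#!/usr/bin/env bash
# check_tool is a hypothetical helper (not part of the project).
# It reports whether a tool is on the PATH and prints the first line
# of the output of "<tool> <version flag>".
check_tool() {
  local tool="$1" version_flag="$2"
  if command -v "$tool" >/dev/null 2>&1; then
    echo "found: $tool"
    # java prints its version on stderr, so merge stderr into stdout.
    "$tool" "$version_flag" 2>&1 | head -n 1
  else
    echo "missing: $tool"
    return 1
  fi
}

# "|| true" keeps the script going so that both tools are reported.
check_tool java -version || true
check_tool mvn -version || true
```

A line starting with `missing:` means the tool has to be installed or added to the PATH before running the build.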
This project contains the following modules:
- autoext - The core module
- autoext-spark - The module for performing distributed clustering on Apache Spark
- apted - A faster Tree Edit Distance (TED) implementation
- visuals - Web interface for visualizations
The build supports two packaging profiles:
- Executable jar with all dependencies (default)
- spark-submit - Profile for packaging a jar to submit to Spark via the spark-submit command
- Go to the root of the project with cd.
- To build the default executable jar, run:
  mvn clean compile package
  This produces autoext-spark/target/autoext-spark-xx.jar
  To run this jar: java -jar autoext-spark/target/autoext-spark-xx.jar
- To build the jar for spark-submit, run:
  mvn clean compile package -Pspark-submit
  This produces autoext-spark/target/autoext-spark-xx-SNAPSHOT-submit-{spark.version}_{scala.version}.jar
  To run this jar, use the spark-submit command.
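Because the submit jar's file name embeds the project, Spark and Scala versions, it can be easier to locate it with a glob than to type it out. The sketch below rests on several assumptions: `find_submit_jar` is a hypothetical helper, `--master "local[2]"` is only an example master URL, and the command is printed rather than executed. If the jar's manifest does not declare a main class, spark-submit also needs a `--class` argument, which this page does not specify.

```shell
#!/usr/bin/env bash
# find_submit_jar is a hypothetical helper (not part of the project).
# $1: project root. Prints the first jar matching the submit naming pattern,
# or nothing if no such jar has been built yet.
find_submit_jar() {
  ls "$1"/autoext-spark/target/autoext-spark-*-submit-*.jar 2>/dev/null | head -n 1
}

jar="$(find_submit_jar .)"
if [ -n "$jar" ]; then
  # Dry run: print the command instead of running it.
  # The master URL is an assumption; replace it with your cluster's master.
  echo spark-submit --master "local[2]" "$jar"
else
  echo "submit jar not found; build with -Pspark-submit first"
fi
```

Removing the `echo` in front of `spark-submit` turns the dry run into an actual submission.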
For a quick-start tutorial, see the Clustering tutorial.
NOTE: If you have trouble compiling the project, make sure your versions of JDK, Scala, and Spark are compatible.
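One way to see which versions the build is actually using is to ask Maven for them. This sketch assumes that `spark.version` and `scala.version` (the names that appear in the submit jar's file name) are Maven properties defined in the project's pom.xml, and that the maven-help-plugin resolved by Maven is version 3.1.0 or newer, which is needed for `-DforceStdout`. `print_build_property` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# print_build_property is a hypothetical helper (not part of the project).
# It prints one Maven property as "name=value" using help:evaluate.
print_build_property() {
  printf '%s=%s\n' "$1" "$(mvn -q help:evaluate -Dexpression="$1" -DforceStdout)"
}

# Only query the build when run from a Maven project root with mvn available.
if [ -f pom.xml ] && command -v mvn >/dev/null 2>&1; then
  java -version 2>&1 | head -n 1       # JDK picked up from the PATH
  print_build_property spark.version   # assumed property name
  print_build_property scala.version   # assumed property name
fi
```

The printed values can then be compared against the compatibility information published for the Spark release in use.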