* SparkML POC; unit tests pass for all 4 operators
* Added Pipeline logic+tests; Digressed Pipeline Conversion from sklearn; all unit tests pass
* Adding documentation; Changing locations of some files
* fixes after relocating file; Added Profiling test to sparkml
* fixing broken link
* added verification step to the profiling test for sparkml
* removed the zipmap step from LogisticRegression conversion; conversion has sped up already
* fixing profile sparkml to include all columns
* removing individual shape_calculator files and merging their code with the converters
* adding Binarizer to SparkML
* removing individual shape_calculator files and merging their code with the converters
* Adding MLeap to sparkml profiler; generating plot graphics
* fixing the SPARK_HOME detection code
* fixing sparkml test base for start_spark() args
* Adding Normalizer to Spark ML converter
* updating gitignore
* fixing unit tests for sparkml
* adding cmake exclusions to gitignore
* cherry-picking changes related to profile pipeline from the sparkml branch, with slight modifications
* fixing imports based on review comments
* fixing unit tests
* restructuring code
* Adding LinearSVC operator
* Adding Scaler (MinMax, MaxAbs, Standard) converters to Spark ML
* adding sys_platform qualifiers to requirements
* adding Imputer to sparkML
* adding Imputer to sparkML
* removing sklearn from this project's requirements
* adding LinearRegressor to Spark ML
* code cleanup
* proper renaming of files
* adding GeneralizedLinearRegression to Spark ML
* moved sparkml utils from where it caused dependency errors
* skipping tests for py2.7; skipping failed pipeline tests for now until more investigation is done
* disabling all pipeline tests in spark since they cause issues on the build machine
* disabling all pipeline tests in spark since they cause issues on the build machine
* removing onnxruntime from requirements-dev
* formatting
* interim checkin
* Adding VectorIndexer conversion; Allowing Spark Session to be passed to the parser/converter code; untested DecisionTree code (needs more work)
* adding proposal for missing ops
* first seemingly working version of DecisionTree; needs more testing
* fixing the testcase for DecisionTree; DecisionTree conversion is complete
* merging from additional-ops with squash
* fixing randomforest regressor unit test
* Leaving out Imputer tests because of bug in pyspark
* fixing import issues
* skipping tests under python2
* fixing missing import
* fixing imports
* skipping tests that require opset 9
* fixing the environment marker for py2.7
* enabling coremltools in the requirements again
* Adding bucketizer converter to Sparkml
* adding converters for StopWordsRemover and VectorSlicer
* Adding converter for Spark ML NaiveBayes
* fixing build issues for my PR
* Adding Tokenizer and PolynomialExpansion operator converters
* adding PCA converter to Sparkml
* removing comments
* adding DCT converter to sparkml
* Adding GBTClassifier converter to Spark ML; some reorganization of files
* Adding converter for GBTRegressor; some code cleanup
* Adding converter for OneVsRest in Spark ML; some formatting changes
* RandomForests were missing from the init file
* Adding script to read pipeline as saved by R on Spark, and run in runtime
* Adding converter for ChiSqSelector in Spark ML
* formatting
* Added converter for IndexToString; Currently cannot handle cases where Labels are not specified in the Spark model
* Adding converter for Word2Vec in Spark ML
* cleanup
* removing a hardcoded value
* Adding converters for ElementwiseProduct and MinHashLSH in Spark ML
* cleanup
* cleanup
* Adding converter for AFTSurvivalRegression in Spark ML
* Adding converter for BucketedRandomProjectionLSH in Spark ML
* using separators for Tokenizer conversion in Spark ML
* path cleanup
* excluding test from py2.7
* fixing failures in LinearClassifier tests
* excluding word2vec test from automatic run due to pyspark issues
* fixing unit test errors
* removing verbose prints
* Changing the test run and validation routines to account for output names; Retrofitting all tests cases accordingly
* fixing an issue in MinHashLSH which is related to Spark version
* reducing the ReduceMin version to 1
* formatting
* fixing ReduceMin issue in min_hash_lsh
* formatting and cleanup
* copyright headers
* fixed accidental line deletion