Commit 0fa5f1c
feat(datasets): SparkDataset Rewrite (#1185)
* rework spark in pyproject.toml
Signed-off-by: Sajid Alam <[email protected]>
* Update pyproject.toml
Signed-off-by: Sajid Alam <[email protected]>
* Update spark_dataset.py
Signed-off-by: Sajid Alam <[email protected]>
* Update spark_dataset.py
Signed-off-by: Sajid Alam <[email protected]>
* lint
Signed-off-by: Sajid Alam <[email protected]>
* Update spark_dataset.py
Signed-off-by: Sajid Alam <[email protected]>
* Update test_spark_dataset.py
Signed-off-by: Sajid Alam <[email protected]>
* lint
Signed-off-by: Sajid Alam <[email protected]>
* revert and split sparkdataset rewrite into v2
Signed-off-by: Sajid Alam <[email protected]>
* Update test_spark_dataset_v2.py
Signed-off-by: Sajid Alam <[email protected]>
* changes based on feedback
Signed-off-by: Sajid Alam <[email protected]>
* lint
Signed-off-by: Sajid Alam <[email protected]>
* Update __init__.py
Signed-off-by: Sajid Alam <[email protected]>
* fix tests
Signed-off-by: Sajid Alam <[email protected]>
* fix docstring and lint
Signed-off-by: Sajid Alam <[email protected]>
* Update spark_dataset_v2.py
Signed-off-by: Sajid Alam <[email protected]>
* Update spark_dataset_v2.py
Signed-off-by: Sajid Alam <[email protected]>
* fix unity catalog
Signed-off-by: Sajid Alam <[email protected]>
* Update spark_dataset_v2.py
Signed-off-by: Sajid Alam <[email protected]>
* Update spark_dataset_v2.py
Signed-off-by: Sajid Alam <[email protected]>
* revert
Signed-off-by: Sajid Alam <[email protected]>
* clean-up SparkDatasetV2
Signed-off-by: Sajid Alam <[email protected]>
* type check fix
Signed-off-by: Sajid Alam <[email protected]>
* Update databricks_utils.py
Signed-off-by: Sajid Alam <[email protected]>
* remove duplicate
Signed-off-by: Sajid Alam <[email protected]>
* lint
Signed-off-by: Sajid Alam <[email protected]>
* Delete .idea/workspace.xml
Signed-off-by: Sajid Alam <[email protected]>
* Update pyproject.toml
Signed-off-by: Sajid Alam <[email protected]>
* changes based on review
Signed-off-by: Sajid Alam <[email protected]>
* changes based on review
Signed-off-by: Sajid Alam <[email protected]>
* fix test 1
Signed-off-by: Sajid Alam <[email protected]>
* fix test 2
Signed-off-by: Sajid Alam <[email protected]>
* address review comments
Signed-off-by: Sajid Alam <[email protected]>
* secret fix
Signed-off-by: Sajid Alam <[email protected]>
* coverage
Signed-off-by: Sajid Alam <[email protected]>
* Update test_spark_dataset_v2.py
Signed-off-by: Sajid Alam <[email protected]>
* add SparkDatasetV2 windows tests
Signed-off-by: Sajid Alam <[email protected]>
* clean tests and coverage
Signed-off-by: Sajid Alam <[email protected]>
* coverage pt 2
Signed-off-by: Sajid Alam <[email protected]>
* coverage
Signed-off-by: Sajid Alam <[email protected]>
* skip coverage for unreachable
Signed-off-by: Sajid Alam <[email protected]>
* enable spark windows tests
Signed-off-by: Sajid Alam <[email protected]>
* windows ci test with spark 4.0.1
Signed-off-by: Sajid Alam <[email protected]>
* Update unit-tests.yml
Signed-off-by: Sajid Alam <[email protected]>
* Update unit-tests.yml
Signed-off-by: Sajid Alam <[email protected]>
* use java 11
Signed-off-by: Sajid Alam <[email protected]>
* Update unit-tests.yml
Signed-off-by: Sajid Alam <[email protected]>
* Update unit-tests.yml
Signed-off-by: Sajid Alam <[email protected]>
* please spark tests work!!
Signed-off-by: Sajid Alam <[email protected]>
* try fix spark test 2
Signed-off-by: Sajid Alam <[email protected]>
* attempt 3
Signed-off-by: Sajid Alam <[email protected]>
* please work spark windows
Signed-off-by: Sajid Alam <[email protected]>
* Update unit-tests.yml
Signed-off-by: Sajid Alam <[email protected]>
* Update unit-tests.yml
Signed-off-by: Sajid Alam <[email protected]>
* spark potential fix
Signed-off-by: Sajid Alam <[email protected]>
* update makefile
Signed-off-by: Sajid Alam <[email protected]>
* Update conftest.py
Signed-off-by: Sajid Alam <[email protected]>
* Update conftest.py
Signed-off-by: Sajid Alam <[email protected]>
* revert spark windows tests
Signed-off-by: Sajid Alam <[email protected]>
* Modify Databrics connect
Signed-off-by: Dmitry Sorokin <[email protected]>
* check if DatabricksSession is None
Signed-off-by: Sajid Alam <[email protected]>
* coverage
Signed-off-by: Sajid Alam <[email protected]>
* Update spark_dataset_v2.py
Signed-off-by: Sajid Alam <[email protected]>
* auto convert pandas to spark dataframe
Signed-off-by: Sajid Alam <[email protected]>
* revert conftest
Signed-off-by: Sajid Alam <[email protected]>
* lint
Signed-off-by: Sajid Alam <[email protected]>
* add docs and release notes
Signed-off-by: Sajid Alam <[email protected]>
* pin mkdocsstrings 2.0 has breaking changes
Signed-off-by: Sajid Alam <[email protected]>
---------
Signed-off-by: Sajid Alam <[email protected]>
Signed-off-by: Sajid Alam <[email protected]>
Signed-off-by: Dmitry Sorokin <[email protected]>
Co-authored-by: Ravi Kumar Pilla <[email protected]>
Co-authored-by: Dmitry Sorokin <[email protected]>1 parent 7028b65 commit 0fa5f1c
File tree
13 files changed
+1956
-24
lines changed- kedro-datasets
- docs
- api/kedro_datasets
- kedro_datasets
- _utils
- spark
- tests
- _utils
- spark
13 files changed
+1956
-24
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
3 | 3 | | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
4 | 11 | | |
5 | 12 | | |
6 | 13 | | |
| |||
Lines changed: 8 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
58 | 58 | | |
59 | 59 | | |
60 | 60 | | |
| 61 | + | |
61 | 62 | | |
62 | 63 | | |
63 | 64 | | |
| |||
Lines changed: 133 additions & 7 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
1 | 5 | | |
2 | 6 | | |
3 | 7 | | |
4 | 8 | | |
5 | 9 | | |
| 10 | + | |
6 | 11 | | |
7 | 12 | | |
8 | 13 | | |
9 | 14 | | |
10 | 15 | | |
11 | 16 | | |
| 17 | + | |
| 18 | + | |
12 | 19 | | |
13 | 20 | | |
14 | 21 | | |
| |||
31 | 38 | | |
32 | 39 | | |
33 | 40 | | |
34 | | - | |
| 41 | + | |
35 | 42 | | |
36 | 43 | | |
37 | 44 | | |
| |||
40 | 47 | | |
41 | 48 | | |
42 | 49 | | |
43 | | - | |
| 50 | + | |
44 | 51 | | |
45 | 52 | | |
46 | 53 | | |
| |||
58 | 65 | | |
59 | 66 | | |
60 | 67 | | |
61 | | - | |
| 68 | + | |
62 | 69 | | |
63 | 70 | | |
64 | 71 | | |
| |||
71 | 78 | | |
72 | 79 | | |
73 | 80 | | |
74 | | - | |
75 | | - | |
| 81 | + | |
| 82 | + | |
76 | 83 | | |
77 | 84 | | |
78 | | - | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
79 | 88 | | |
80 | 89 | | |
81 | 90 | | |
82 | 91 | | |
83 | | - | |
| 92 | + | |
84 | 93 | | |
85 | 94 | | |
86 | 95 | | |
| |||
103 | 112 | | |
104 | 113 | | |
105 | 114 | | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
| 208 | + | |
| 209 | + | |
| 210 | + | |
| 211 | + | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
| 220 | + | |
| 221 | + | |
| 222 | + | |
| 223 | + | |
| 224 | + | |
| 225 | + | |
| 226 | + | |
| 227 | + | |
| 228 | + | |
| 229 | + | |
| 230 | + | |
| 231 | + | |
0 commit comments