Skip to content

Commit f9d2f42

Browse files
committed
[SPARK-50394][PYTHON][INFRA] Reduce parallelism in Pure Python library builds
### What changes were proposed in this pull request? This PR proposes to decrease parallelism in Pure Python library builds ### Why are the changes needed? In order to make the tests more robust: https://github.com/apache/spark/actions/workflows/build_python_connect.yml https://github.com/apache/spark/actions/workflows/build_python_connect35.yml Now they fail because of OOM. ### Does this PR introduce _any_ user-facing change? No, test-only ### How was this patch tested? Will monitor the build: https://github.com/apache/spark/actions/workflows/build_python_connect.yml https://github.com/apache/spark/actions/workflows/build_python_connect35.yml ### Was this patch authored or co-authored using generative AI tooling? No. Closes #48932 from HyukjinKwon/reduce-parallelism. Authored-by: Hyukjin Kwon <[email protected]> Signed-off-by: Hyukjin Kwon <[email protected]>
1 parent 5e076ef commit f9d2f42

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

.github/workflows/build_python_connect.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -93,7 +93,7 @@ jobs:
9393
# Several tests related to catalog requires to run them sequencially, e.g., writing a table in a listener.
9494
./python/run-tests --parallelism=1 --python-executables=python3 --modules pyspark-connect,pyspark-ml-connect
9595
# None of tests are dependent on each other in Pandas API on Spark so run them in parallel
96-
./python/run-tests --parallelism=4 --python-executables=python3 --modules pyspark-pandas-connect-part0,pyspark-pandas-connect-part1,pyspark-pandas-connect-part2,pyspark-pandas-connect-part3
96+
./python/run-tests --parallelism=2 --python-executables=python3 --modules pyspark-pandas-connect-part0,pyspark-pandas-connect-part1,pyspark-pandas-connect-part2,pyspark-pandas-connect-part3
9797
9898
# Stop Spark Connect server.
9999
./sbin/stop-connect-server.sh

.github/workflows/build_python_connect35.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -98,7 +98,7 @@ jobs:
9898
# Run branch-3.5 tests
9999
./python/run-tests --parallelism=1 --python-executables=python3 --modules pyspark-connect
100100
# None of tests are dependent on each other in Pandas API on Spark so run them in parallel
101-
./python/run-tests --parallelism=4 --python-executables=python3 --modules pyspark-pandas-connect,pyspark-pandas-slow-connect
101+
./python/run-tests --parallelism=2 --python-executables=python3 --modules pyspark-pandas-connect,pyspark-pandas-slow-connect
102102
- name: Upload test results to report
103103
if: always()
104104
uses: actions/upload-artifact@v4

0 commit comments

Comments
 (0)