I have installed spark-sql-perf using:
- sudo yum install -y gcc make flex bison byacc git
- cd /tmp/
- git clone https://github.com/databricks/tpcds-kit.git
- cd tpcds-kit/tools
- make OS=LINUX
- curl https://bintray.com/sbt/rpm/rpm | sudo tee /etc/yum.repos.d/bintray-sbt-rpm.repo
- sudo yum install sbt
- cd /home/hadoop/
- git clone https://github.com/databricks/spark-sql-perf
- mkdir -p /home/hadoop/.sbt/preloaded/org/spark-packages/sbt-spark-package_2.10_0.13/0.1.1/
- cd /home/hadoop/.sbt/preloaded/org/spark-packages/sbt-spark-package_2.10_0.13/0.1.1/
- wget https://repos.spark-packages.org/org/spark-packages/sbt-spark-package/0.1.1/sbt-spark-package-0.1.1.pom
- wget https://repos.spark-packages.org/org/spark-packages/sbt-spark-package/0.1.1/sbt-spark-package-0.1.1.jar
- cd /home/hadoop/spark-sql-perf
- sbt +package
The Spark configuration is set in /usr/lib/spark/conf/spark-defaults.conf, with spark.executor.memory = 19650M, spark.executor.cores = 5 and spark.executor.memoryOverhead = 2184.
In another run, I also tried setting the cores per executor at run time by passing --executor-cores to spark-submit...
Yet in the YARN UI, I see this for a container:
Container State: COMPLETE
Mon Jun 21 06:12:54 +0000 2021
Elapsed Time: 7mins, 16sec
Resource: 21856 Memory, 1 VCores
So each container is reported with 1 vCore instead of the 5 I configured. Also, there are 5 executors on each node, even though there are 32 vCores.
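As a sanity check on the numbers above, here is a minimal Python sketch of the arithmetic. It assumes YARN rounds each container request up to a multiple of 32 MB. That increment is not stated anywhere in this report and is used only because it reproduces the reported figure. The helper name `container_size_mb` is hypothetical.

```python
# Values quoted in this report (spark-defaults.conf and the YARN UI).
EXECUTOR_MEMORY_MB = 19650     # spark.executor.memory
MEMORY_OVERHEAD_MB = 2184      # spark.executor.memoryOverhead
EXECUTOR_CORES = 5             # spark.executor.cores
EXECUTORS_PER_NODE = 5         # executors observed on each node
NODE_VCORES = 32               # vCores quoted in the report
REPORTED_CONTAINER_MB = 21856  # "Resource: 21856 Memory" in the YARN UI

# ASSUMPTION: YARN rounds requests up to a multiple of 32 MB.
# This increment is not given in the report.
ASSUMED_YARN_INCREMENT_MB = 32


def container_size_mb(heap_mb: int, overhead_mb: int, increment_mb: int) -> int:
    """Round heap + overhead up to the next multiple of increment_mb."""
    requested = heap_mb + overhead_mb
    return (requested + increment_mb - 1) // increment_mb * increment_mb


expected_mb = container_size_mb(
    EXECUTOR_MEMORY_MB, MEMORY_OVERHEAD_MB, ASSUMED_YARN_INCREMENT_MB
)
requested_cores_per_node = EXECUTORS_PER_NODE * EXECUTOR_CORES
memory_per_node_mb = EXECUTORS_PER_NODE * REPORTED_CONTAINER_MB

print("expected container size (MB):", expected_mb)
print("matches YARN UI:", expected_mb == REPORTED_CONTAINER_MB)
print("cores requested per node:", requested_cores_per_node, "of", NODE_VCORES)
print("executor memory per node (MB):", memory_per_node_mb)
```

Under that assumption, the container memory matches 19650M + 2184 rounded up, so the memory settings appear to be applied. With 5 executors of 5 cores each, 25 of the 32 vCores would be requested per node. The sketch does not explain the "1 VCores" value. One possible cause, not confirmed here, is that the scheduler's resource calculator (`yarn.scheduler.capacity.resource-calculator`) only accounts for memory.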