add Alibaba Cloud Hologres #87
@rschu1ze Hello, could you help us review it?
echo "[$(date '+%Y-%m-%d %H:%M:%S')] Vacuum analyze the table"
$HOLOGRES_PSQL -d "$DB_NAME" -c '\timing' -c "VACUUM $TABLE_NAME"
$HOLOGRES_PSQL -d "$DB_NAME" -c '\timing' -c "ANALYZE $TABLE_NAME"
$HOLOGRES_PSQL -d "$DB_NAME" -c '\timing' -c "select hologres.hg_full_compact_table('$TABLE_NAME')"
Hey, extra commands such as VACUUM, ANALYZE, and compact should not be used.
It is allowed; see https://github.com/ClickHouse/JSONBench/blob/main/postgresql/create_and_load.sh
Also, in ClickBench many Postgres-based systems use commands like "vacuum" and "analyze".
By the way, hg_full_compact_table performs essentially the same function as VACUUM, with the added benefit of ensuring that all files are fully compacted and compressed using ZSTD. Without this step, some files might be compressed with ZSTD while others are not, which could lead to inconsistencies in performance stability and overall storage size. That said, if @rschu1ze strongly prefers that we remove it, we can do so—there is no significant impact on the results.
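One way to settle whether these post-load steps affect the results is to make them switchable, so the benchmark can be run with and without them. The following is only a sketch: `post_load_maintenance` and `RUN_MAINTENANCE` are hypothetical names that do not appear in the pull request, while `HOLOGRES_PSQL` and the three SQL statements are taken from the diff above.

```shell
#!/usr/bin/env bash
# Sketch only: post_load_maintenance and RUN_MAINTENANCE are hypothetical names,
# not part of the pull request.

# psql invocation used by the PR's scripts; falls back to plain psql if unset.
HOLOGRES_PSQL="${HOLOGRES_PSQL:-psql}"

post_load_maintenance() {
    local db_name="$1" table_name="$2"

    # RUN_MAINTENANCE=0 skips the steps so both variants can be measured.
    if [ "${RUN_MAINTENANCE:-1}" != "1" ]; then
        echo "Skipping post-load maintenance for $table_name"
        return 0
    fi

    # $HOLOGRES_PSQL is left unquoted on purpose so it can carry connection flags.
    $HOLOGRES_PSQL -d "$db_name" -c '\timing' -c "VACUUM $table_name"
    $HOLOGRES_PSQL -d "$db_name" -c '\timing' -c "ANALYZE $table_name"
    $HOLOGRES_PSQL -d "$db_name" -c '\timing' -c "select hologres.hg_full_compact_table('$table_name')"
}
```

Calling `post_load_maintenance "$DB_NAME" "$TABLE_NAME"` runs the same three statements as the diff, and `RUN_MAINTENANCE=0 post_load_maintenance "$DB_NAME" "$TABLE_NAME"` skips them.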
# https://www.postgresql.org/download/linux/ubuntu/

sudo apt-get update
sudo apt-get install -y postgresql-common postgresql-16
I would suggest printing all settings of the database after installation, so that everyone can reproduce the test results of this SaaS product.
I don't understand what you are saying. This step just installs the standard PostgreSQL client; it has nothing to do with settings. The scripts in the pull request already provide everything needed to reproduce the results.
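For reference, the reviewer's suggestion could be approximated with a small helper that records the server version and settings next to the results. This is a sketch under assumptions: `dump_settings` is a hypothetical name, and whether Hologres accepts the standard PostgreSQL statements `SELECT version()` and `SHOW ALL` is assumed here, not confirmed by the pull request.

```shell
#!/usr/bin/env bash
# Sketch only: dump_settings is a hypothetical helper, not part of the pull request.
# Assumes the server accepts the standard PostgreSQL statements used below.

HOLOGRES_PSQL="${HOLOGRES_PSQL:-psql}"

dump_settings() {
    local db_name="$1" out_file="$2"
    {
        # Record the client version; fall back to a note if psql is missing.
        echo "# client: $(psql --version 2>/dev/null || echo 'psql not found')"
        # -A: unaligned output, -t: rows only, which keeps the file easy to diff.
        $HOLOGRES_PSQL -d "$db_name" -At -c "SELECT version()" -c "SHOW ALL"
    } > "$out_file"
}
```

A call such as `dump_settings "$DB_NAME" settings.txt` would write one file per run that can be attached to the submitted results.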
ALTER TABLE bluesky ALTER COLUMN data SET (enable_columnar_type = ON);
CALL set_table_property('bluesky', 'dictionary_encoding_columns', 'data:auto');
CALL set_table_property('bluesky', 'bitmap_columns', 'data:auto');
An extra index, such as bitmap_columns, would be considered a form of manual tuning.
The rule is very clear: "It is allowed to apply various indexing methods whenever appropriate." Bitmap is a very common indexing method.
Clarified here: #95
I will need some help to reproduce the measurements:
Sorry for the delay; we recently had a national holiday and a long vacation. My colleague replied to you with the detailed steps in the ClickBench PR. The steps for JSONBench are similar; the main difference is choosing 32CU when buying the Hologres instance.
1. Create an Alibaba Cloud Account and Provide Your UID
Please first create an Alibaba Cloud account. After registration, kindly provide us with your UID (Account ID), which you can find by:
2. Purchase an Alibaba Cloud Hologres Instance
When creating the Hologres instance, please use the following configuration:
Once everything is configured and you’ve received the coupon, click Buy Now to proceed.
3. Purchase an ECS Instance (as Client Machine)
This ECS instance acts as a client to download data, run queries, and load data into Hologres.
Click Create Order to launch the instance.
4. Connect to the ECS and Run the Benchmark
After the ECS instance is ready:
✅ That’s it! You’re all set to run the benchmark. Thank you again for your valuable feedback and contribution!
This pull request introduces benchmark results for Hologres.
Hologres is a commercial real-time data warehouse product developed by Alibaba Cloud. Unfortunately, we cannot provide deployment options on other machines. We apologize that verifying the performance requires purchasing a Hologres instance on Alibaba Cloud. Testing methods are described in README.md, and we can offer vouchers to assist with performance testing.