Replies: 15 comments
-
@sib19 sorry for the delay. Can you provide the failing query? Is it
-
Hi Pio, thanks for debugging. I usually try with
-
I did:
It didn't fail for me.
-
Hi Pio, thanks for your quick reply and for testing the data. This is very interesting. I set up presto-server 326 from the tar.gz, which I hope included all the dependent jar files. I am not able to build from your commit, so could you kindly provide the installation in tar.gz format? I will then check again with the select query. Also, many Hive tables have multiple active Parquet files under a single table directory. A tar.gz installation would be the most helpful ....
-
@sib19 you can build a .tar.gz from source by running this:
Sure, this is typical. E.g. you can remove half of the files and see if the problem persists.
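The halving idea can be sketched as a small bisection. This is only an illustration, not something from the thread: `find_bad_file` and the `fails` callback are hypothetical names, and the sketch assumes that exactly one file triggers the failure and that the failure reproduces on any subset containing that file.

```python
from typing import Callable, Sequence


def find_bad_file(files: Sequence[str], fails: Callable[[Sequence[str]], bool]) -> str:
    """Narrow a failing set of files down to one by repeated halving.

    Assumes exactly one file causes the failure and that `fails(subset)`
    returns True whenever that file is part of `subset`.
    """
    candidates = list(files)
    if not candidates or not fails(candidates):
        raise ValueError("the full set of files does not reproduce the failure")
    while len(candidates) > 1:
        half = candidates[: len(candidates) // 2]
        # Keep whichever half still reproduces the failure.
        candidates = half if fails(half) else candidates[len(half):]
    return candidates[0]
```

In practice `fails` would be whatever check you run by hand, for example copying the subset into a scratch table location and running the select against it.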
-
Pio, due to my restricted environment I am not able to download the Maven jars for this build, which I would need before I could make the tar.gz with your command. Please provide an installation file like the ones at prestosql.io/download.html. Yes, sure, I will try the select against each file. Thanks
-
Sure, I made the .tar.gz snapshot build temporarily available for you: https://www.dropbox.com/sh/ilz4yoqg7wtwg45/AAD_-SPuzPlX8bXvHc43M8eMa?dl=0
-
Thanks Pio, you are so great. Please keep the file available for another 5 or 6 hours; I will download it from Dropbox on my home network. Because of the tighter restrictions in my company environment, Dropbox access is not allowed there.
-
Unfortunately, I couldn't reproduce this on 326. If you still face the issue on the above .tar.gz, could you share the DDL from Spark (Hive) to confirm the full table properties? Also, the Hadoop and Hive versions may be helpful.
-
@sib19 I deleted the file you attached previously, in case it contains anything sensitive. In the meantime, I believe the issue is not Presto's fault but a corrupted Parquet file.
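A quick first check for an obviously damaged file is to look at the Parquet magic bytes: a plain Parquet file begins and ends with the 4 bytes `PAR1`. The sketch below is an assumption-laden illustration rather than anything from the thread: `looks_like_parquet` is a hypothetical helper, it expects a local copy of the file, and it assumes unencrypted footers. It only catches truncated or overwritten files, so a file that passes can still be corrupt.

```python
import os

PARQUET_MAGIC = b"PAR1"


def looks_like_parquet(path: str) -> bool:
    """Return True if the file starts and ends with the Parquet magic bytes.

    A passing file may still have corrupt pages or metadata; this only
    detects gross damage such as truncation.
    """
    # Smallest possible layout: magic + 4-byte footer length + magic.
    if os.path.getsize(path) < 12:
        return False
    with open(path, "rb") as f:
        head = f.read(4)
        f.seek(-4, os.SEEK_END)
        tail = f.read(4)
    return head == PARQUET_MAGIC and tail == PARQUET_MAGIC
```

Running this over each file in the table directory is a cheap way to rule out truncated files before moving on to a deeper inspection.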
-
@sib19 If you're still here, can you please confirm whether the data source you were querying has some ETL job running on it that rewrites files? Or whether you were using some kind of caching layer (e.g. Rubix)? I ran into this issue today, and in my case it happened (most probably) because of Rubix. After dropping the cached files for that table, the error went away. I ran into this with Presto 333.
-
Hi hashhar
-
I'm experiencing the same problem on Trino 381 (on Kubernetes, using the community container image). Everything works fine until I activate caching for the Hive connector. Then, after a few successful queries, they start failing with:
Here is my hive catalog config WITHOUT caching enabled (queries work just fine):
And WITH caching enabled (causes the exception):
CREATE EXTERNAL TABLE `master_email`(
`email` string COMMENT '',
`sha256_lower` string COMMENT '')
ROW FORMAT SERDE
'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe'
STORED AS INPUTFORMAT
'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'
LOCATION
's3://***********/master-db/master-email-20201002/'
TBLPROPERTIES (
'COLUMN_STATS_ACCURATE'='false',
'STATS_GENERATED_VIA_STATS_TASK'='workaround for potential lack of HIVE-12730',
'has_encrypted_data'='false',
'last_modified_by'='hadoop',
'last_modified_time'='1601685042',
'numFiles'='0',
'numRows'='1700455799',
'spark.sql.create.version'='2.2 or prior',
'spark.sql.sources.schema.numParts'='1',
'spark.sql.sources.schema.part.0'='{\"type\":\"struct\",\"fields\":[{\"name\":\"email\",\"type\":\"string\",\"nullable\":true,\"metadata\":{\"comment\":\"\"}},{\"name\":\"sha256_lower\",\"type\":\"string\",\"nullable\":true,\"metadata\":{\"comment\":\"\"}}]}',
'totalSize'='0',
'transient_lastDdlTime'='1601685042')
-
Facing the same issue here. Is there any known resolution for this apart from deleting the cache every time?
-
I am also seeing the same error in Trino 389, even when the Hive cache is not enabled. @hashhar, do we have any resolution for this?
-
Hi, please see the error below when accessing Parquet; kindly help. I have enclosed a sample file downloaded from Kaggle, and when I tested it the same error below appears in the Presto CLI.
I am using version 326 ...