Experimented again.

docker-compose.yml

    version: '3.9'
    services:
      nessie:
        image: "ghcr.io/projectnessie/nessie"
        ports:
          - 19120:19120
      trino:
        image: trinodb/trino
        volumes:
          - ./trino/nessie.properties:/etc/trino/catalog/nessie.properties:ro
        ports:
          - 8080:8080

nessie.properties

    connector.name=iceberg
    iceberg.catalog.type=nessie
    iceberg.nessie-catalog.uri=http://nessie:19120/api/v1
    iceberg.nessie-catalog.default-warehouse-dir=/tmp
    hive.azure.abfs-storage-account=<Storage account>
    hive.azure.abfs-access-key=<Secret key>

I connect to Trino and run:

    create schema nessie.tabletest;
    use nessie.tabletest;
    create table test1(id int) with (location = 'abfs://[email protected]/landing-zone');
    insert into test1 (id) values (2);

And this works; I can see the Iceberg tables created in my data lake.

    create table meters (_id varchar, name varchar) with (external_location = 'abfs://[email protected]/metry/landing-zone/meters.json', format = 'json');

Which gives me

So I try:

    create table meters (_id varchar, name varchar) with (location = 'abfs://[email protected]ows.net/landing-zone.json', format = 'json');

Which returns

So I convert that JSON to Parquet and upload it.

    create table meters (_id varchar, name varchar) with (location = 'abfs://[email protected]/landing-zone/meters.parquet', format = 'parquet');

With the result
-
I'm trying to set up Trino with Nessie to build a data lakehouse with Iceberg tables. I want to ingest raw data (CSV, JSON, etc.) with Trino and load it into the Iceberg tables (ELT with Trino).
I've played around with a Docker Compose file:
And I mount a volume with properties:
However, I cannot figure out how to query a raw file and load its contents into a table.
I've tried creating schemas like
But I realize this might not work since I'm using Nessie.
I want to avoid setting up a Hive metastore for the sole purpose of accessing the data before loading it into Iceberg tables + Nessie.
Is there any way of doing that, or am I thinking about Trino the wrong way?
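
As a stop-gap that relies only on what is already shown to work in this thread (a plain `insert into` against an Iceberg table), a small JSON file can be turned into an `INSERT` statement outside Trino and then pasted into the Trino CLI. The following is a minimal sketch, not a recommended ingestion path. It assumes `meters` was first created as a regular Iceberg table with the two `varchar` columns, and that the input is either a JSON array or newline-delimited JSON objects. The helper names `sql_literal`, `build_insert` and `load_records` are made up for this illustration.

```python
import json


def sql_literal(value):
    """Render a value as a SQL string literal, or NULL when it is missing."""
    if value is None:
        return "NULL"
    # Double single quotes so the value cannot end the literal early.
    return "'" + str(value).replace("'", "''") + "'"


def load_records(text):
    """Parse either a JSON array or newline-delimited JSON objects."""
    text = text.strip()
    if text.startswith("["):
        return json.loads(text)
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def build_insert(table, columns, records):
    """Build one multi-row INSERT statement covering all records."""
    if not records:
        raise ValueError("no records to insert")
    rows = [
        "(" + ", ".join(sql_literal(rec.get(col)) for col in columns) + ")"
        for rec in records
    ]
    # Quote column names so identifiers such as _id are always accepted.
    column_list = ", ".join('"' + col + '"' for col in columns)
    return f"INSERT INTO {table} ({column_list}) VALUES\n  " + ",\n  ".join(rows)


if __name__ == "__main__":
    # Hypothetical sample data standing in for the contents of meters.json.
    sample = '{"_id": "m1", "name": "Main meter"}\n{"_id": "m2", "name": "O\'Brien"}'
    records = load_records(sample)
    print(build_insert("nessie.tabletest.meters", ["_id", "name"], records))
```

This only scales to small files, since every row travels through a single SQL statement; it is meant to show that the load step itself does not need a Hive metastore, not to replace a proper file-based ingestion.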