Problems with connection to AWS Glue Catalog Iceberg table

Ritik · August 11, 2022, 3:05pm

Hi,

I have gone through the steps here:

And have managed to create a physical dataset, with everything, including the glue job working fine. In the relevant S3 bucket, I can also see the data and metadata files. However on trying to query that dataset, I am getting this error:

dataset at [/dataset/###.db.###] not found.

Can anyone help here?

hope · September 9, 2022, 4:35pm

Can you check that you have set the connection property: hive.metastore.warehouse.dir to your s3 bucket location

More information here: Dremio

Ritik · September 12, 2022, 6:49am

Hi, thanks for your reply.

I have done it now, by pointing the data source in Dremio to the S3 bucket location where the AWS Glue job created the iceberg table with the data. However, I still get the same error:

dataset at [/dataset/###.db.###] not found.

My Dremio version is 21.3. Also, the AWS instance on which I am running Dremio is in Ireland, not in N. Virginia. Could these have anything to do with it?

hope · September 12, 2022, 4:24pm

Your VPC and s3 bucket location need to be in the same aws region. As long as all of your resources are in Ireland, you should be fine.

Ritik · September 13, 2022, 6:38am

Ah. So my Dremio instance is deployed in Ireland, but my S3 bucket and Glue job are in Virginia (I did it that way since the Iceberg connector for the Glue job works only in Virginia).

However, the issue just seems to be with iceberg tables. I created other glue databases in Virginia pointing to normal flat files in S3 and they seem to work fine on Dremio.

Could the version have something to do with it? Or is there a configuration which I may have missed?

Topic		Replies	Views
Iceberg Dataset not found	3	1383	August 17, 2022
Unable to access iceberg data created through AWS glue and stored in s3 Dremio Cloud	7	958	September 14, 2023
Error creating Iceberg Table in S3 Dremio Cloud	1	1246	June 17, 2022
Trouble adding Glue Catalog?	5	57	December 5, 2024
I don't see support Apache Iceberg metadata tables using AWS Glue catalogs in Release 20.0	0	987	February 9, 2022

Problems with connection to AWS Glue Catalog Iceberg table

Related topics