Hi All!
I was able to configure Azure Data Lake as a data source but when I try to add some folders as tables I get this “504 - gateway - time out”.
Does anyone knows how to handle that?
Thanks,
Marcus
Hi All!
I was able to configure Azure Data Lake as a data source but when I try to add some folders as tables I get this “504 - gateway - time out”.
Does anyone knows how to handle that?
Thanks,
Marcus
Hi,
From the Dremio coordinator are you able to read or download the folders outside of Dremio using “az download”?
Is there anything in the server.log?
Thanks
@balaji.ramaswamy
Hi! Thanks for replying.
I’m able to register data lake as source and some paths as tables when the data volume is small, but for large tables I get this error.
Actually I’m using Dremio’s image from DockerHub, just a single container. I search in server log available in /var/log/dremio/server.out but there is only it:
core file size (blocks, -c) unlimited
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 27748
max locked memory (kbytes, -l) 64
max memory size (kbytes, -m) unlimited
open files (-n) 1048576
pipe size (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
real-time priority (-r) 0
stack size (kbytes, -s) 8192
cpu time (seconds, -t) unlimited
max user processes (-u) 1048576
virtual memory (kbytes, -v) unlimited
file locks (-x) unlimited
Thanks.
Hi @marcusrehm
Couple of things
The large file that times out, are you able to download that file from the coordinator outside of Dremio using az cli?
There should be another file called server.log under the same folder where you found server.out. Can you please look into that? In case you have configured systemctl then there is a chance it might be writing to journal. In that case you can try doing the below commnd
journalctl -u service.dremio
(or)
journalctl _PID=<DREMIO_PID>
Thanks
@balaji.ramaswamy
About the large files it’s not just one file but actually a folder contains partitions and lot of files.
I couldn’t find the server.log neither run journalctl, it says command not found. I’m using the docker image dremio/dremio-oss.
Hi @marcusrehm
The logs files should go to Standard out and you should be able to use “docker logs” to view the messages
Kindly let us know if you are still not able to view the Dremio server messages
Thanks
@balaji.ramaswamy
Hi @marcusrehm
I do not see anything in the logs. It seems from your description you are timing out from the server on larger datasets. Not sure if you can run az on the docker and try to download the folder you are trying to access,
time az dls fs download --account --output table --source-path