Hi,
I first set up MinIO and Iceberg together using git:
After that, I tried to set up Hadoop and integrate it with MinIO using the following steps:
"""
sudo apt update
sudo apt install openjdk-8-jdk
java -version
sudo apt install ssh
ssh-keygen -t rsa
cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
chmod 640 ~/.ssh/authorized_keys
ssh (external IP)
wget https://dlcdn.apache.org/hadoop/common/hadoop-3.4.0/hadoop-3.4.0.tar.gz
tar -xvzf hadoop-3.4.0.tar.gz
mv hadoop-3.4.0 hadoop
nano ~/.bashrc
cd hadoop/
mkdir -p ~/hadoopdata/hdfs/namenode
mkdir -p ~/hadoopdata/hdfs/datanode
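The `nano ~/.bashrc` step does not show what was added to the file. As a rough sketch only, a single-node setup usually needs something like the lines below. The `JAVA_HOME` path assumes the apt OpenJDK 8 package on 64-bit x86, and `HADOOP_HOME` assumes the renamed `hadoop` directory sits in the home directory. Both are assumptions, not values taken from this setup.

```shell
# Assumption: apt's OpenJDK 8 on 64-bit x86 installs here;
# check with `ls /usr/lib/jvm` and adjust if the path differs.
export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64

# Assumption: the extracted and renamed "hadoop" directory is in $HOME.
export HADOOP_HOME="$HOME/hadoop"
export HADOOP_CONF_DIR="$HADOOP_HOME/etc/hadoop"

# Put the hadoop/hdfs commands and the start/stop scripts on PATH.
export PATH="$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin"
```

After editing `~/.bashrc`, run `source ~/.bashrc` so the current shell picks up the changes.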
After that I set up core-site.xml and installed the packages mentioned in "Modern Data Lake with MinIO : Part 2".
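The core-site.xml contents are not shown here. For comparison, a minimal sketch of the S3A properties that a MinIO setup typically needs is given below. The endpoint, access key, and secret key are placeholders, not the values from this installation, and the scratch file path is only an example.

```shell
# Write an example S3A section to a scratch file for review. Copy the
# <property> elements into $HADOOP_HOME/etc/hadoop/core-site.xml by hand.
# All values are placeholders (assumptions), not this setup's settings.
OUT="${OUT:-/tmp/core-site-s3a-example.xml}"
cat > "$OUT" <<'EOF'
<configuration>
  <property>
    <name>fs.s3a.endpoint</name>
    <value>http://127.0.0.1:9000</value>
  </property>
  <property>
    <name>fs.s3a.access.key</name>
    <value>YOUR_MINIO_ACCESS_KEY</value>
  </property>
  <property>
    <name>fs.s3a.secret.key</name>
    <value>YOUR_MINIO_SECRET_KEY</value>
  </property>
  <property>
    <name>fs.s3a.path.style.access</name>
    <value>true</value>
  </property>
  <property>
    <name>fs.s3a.connection.ssl.enabled</name>
    <value>false</value>
  </property>
</configuration>
EOF
echo "wrote $OUT"
```

Path-style access is included because MinIO is normally addressed as `http://host:9000/bucket` rather than through a per-bucket hostname.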
Then I started Hadoop:
hadoop/bin/hdfs namenode -format
hadoop/sbin/start-all.sh
"""
When I ran jps, it showed that NodeManager is running.
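The full jps listing is not included, so it is unclear whether the HDFS daemons came up as well. A small sketch for checking this: the helper below (`missing_daemons` is a made-up name) reads jps-style output and prints any of the five daemons that `start-all.sh` normally launches on a single node but that are absent from the listing.

```shell
# Print the expected daemons that are missing from `jps`-style output.
# Reads the listing on stdin so it can be tried without a running cluster.
missing_daemons() {
  listing=$(cat)
  for daemon in NameNode DataNode SecondaryNameNode ResourceManager NodeManager; do
    # jps prints "<pid> <MainClass>", so compare the second field exactly.
    if ! printf '%s\n' "$listing" | awk '{print $2}' | grep -qx "$daemon"; then
      echo "$daemon"
    fi
  done
}

# Usage on a live node (needs the JDK's jps on PATH):
#   jps | missing_daemons
```

If NameNode or DataNode show up as missing, the logs under the Hadoop `logs` directory are the next place to look.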
After that, I ran the following command to read from MinIO:
hadoop fs -ls s3a://mydemobucket/
But it throws the following error:
java.lang.NoClassDefFoundError: software/amazon/awssdk/transfer/s3/progress/TransferListener
Note: mydemobucket already exists in MinIO.
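One way to narrow the error down, as a sketch rather than a confirmed fix: the missing class is in the `software.amazon.awssdk` namespace, which is the AWS SDK for Java v2, and the S3A connector in Hadoop 3.4.x is built against that SDK. The helper below (`find_class_jar` is a made-up name) lists any jar under the Hadoop distribution that contains the class. It assumes `unzip` is installed and that `HADOOP_HOME` points at the extracted distribution, falling back to `~/hadoop`.

```shell
# Print every jar under a directory that contains the given class.
# $1: directory to search
# $2: class name in path form (slashes, without the .class suffix)
find_class_jar() {
  find "$1" -name '*.jar' 2>/dev/null | while read -r jar; do
    # List the jar's entries and look for the exact class file name.
    if unzip -l "$jar" 2>/dev/null | grep -qF "$2.class"; then
      echo "$jar"
    fi
  done
}

# Assumption: HADOOP_HOME is set; otherwise fall back to ~/hadoop.
find_class_jar "${HADOOP_HOME:-$HOME/hadoop}/share/hadoop" \
  software/amazon/awssdk/transfer/s3/progress/TransferListener
```

If a jar is printed, compare its directory against the output of `hadoop classpath`. A jar that exists on disk but sits in a directory not on the classpath would produce this same NoClassDefFoundError. If nothing is printed, no jar under that directory provides the class.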