We are joining multiple hive tables together and these tables are backed by parquet data. Still we see it is not performing great. Could you please advise how we can tune hive queries to run optimally apart from hive table statistics?
Would you be able to send us the profile for one of these jobs?
Share a Query Profile
239a7a4d-cabd-4a95-9763-debd4a11ceac.zip (209.6 KB)
Have uploaded the profile, please suggest optimization avenues