Dremio become slow when using where clause

Carillpower · February 27, 2019, 7:04am

Hi expert,

I would like to check with you, currently our dremio connected directly to ElasticSearch Cluster as datasource.

Currently im testing the performance by executing 1 of reporting query as per below :-

SELECT _
_ to_char(logins.date_time_iso8601,‘yyyy-MM-dd HH:MI:SS’) AS transformed_date, _
_ logins.data.school_code AS school_code, _
_ logins.data.auth.role AS user_profile, _
_ COUNT(DISTINCT(logins.data.session)) AS total_logins, _
_ COUNT(DISTINCT(logins.data.auth.name)) AS unique_logins _
_ FROM
_ “ES Cluster Dev”.“auth_login-2018_06_06”.index_typ logins_
_ WHERE 1=1_
– AND logins.data.auth.role = ‘Student’
_ GROUP BY_
_ to_char(logins.date_time_iso8601,‘yyyy-MM-dd HH:MI:SS’) ,_
_ logins.data.school_code ,_
_ logins.data.auth.role_

I’ve notice that the query runs in acceptable speed (less than 5 second) when I disable the WHERE clause filter. Yet when I enable it to make the filtration for only Student the query runs in unacceptable speed (longer than 30 second) .

Is there any way that I could improve on this issue?

If the datasource are connected to RDBMS then it’s easy for me to tackle the performance from the source directly but as the data are NoSQL under ElasticSearch so im quite clueless where could I improve on it.

Below are the profiler if you guys want to have a deep look on it.
873b63f5-3257-402e-944c-bd1740604849.zip (17.1 KB)

Thanks and appreciate the effort and thought on this.

kelly · February 27, 2019, 12:20pm

This profile has the WHERE clause commented out. You can try attaching profile with the WHERE clause enabled.

Also, you can try adding a Data Reflection to speed up performance in general: https://docs.dremio.com/acceleration/reflections.html

Topic		Replies	Views
Speed of execution of a query for oracle dataset	5	1769	November 11, 2019
How to speed up dremio	8	3420	August 1, 2018
Simple Query with in clauses running extremely slow	3	1818	May 20, 2020
Dremio on Elasticsearch too slow	1	1085	October 22, 2018
Dremio performance on Elasticsearch cluster	10	1416	September 21, 2018

Dremio become slow when using where clause

Related topics