Hi
We use Dremio 2.05 on Elasticsearch 5.6.9
Dremio stands on its own server with 4 cpus et 8G of RAM
ES is distributed on a 3 servers cluster each of them with 4 cpus and 16Go of RAM
The 3 ES nodes are data nodes and one of them is a master node as well
I built a dataset that runs in 12 minutes from dremio
I retrieved the DSL query (see below) and ran it as CURL command:
- From Dremio server itself it took 3’ to send results
- From ES master node it took as well 3’ to send results
I then deduce i have no network problems between dremio and ES machines
When doing all these tests i was focussing ressource usage on each ES nodes and what i can tell is
- When querying from dremio app cpus are scarcely used during the refresh period
- When doing the CURL tests cpus working hard
In my ES source definition i mentionned the ES master node only. I tried two list all the nodes but refresh time is the same
Can someone help me understanding this gap of performance when using dremio app?
Please don’t start with using reflexion for i think i want to make dremio app work properly before going further with features.
Here is the DSL query:
=[{
“size” : 0,
“query” : {
“bool” : {
“must” : [ {
“bool” : {
“should” : [ {
“match” : {
“UC” : {
“query” : “DOMUS-ACADEMY”,
“type” : “boolean”
}
}
}, {
“match” : {
“UC” : {
“query” : “DOMUS-ACADEMY-LANDING”,
“type” : “boolean”
}
}
}, {
“match” : {
“UC” : {
“query” : “NABA-LP”,
“type” : “boolean”
}
}
}, {
“match” : {
“UC” : {
“query” : “NABA”,
“type” : “boolean”
}
}
}, {
“match” : {
“UC” : {
“query” : “ISTITUTO-MARANGONI”,
“type” : “boolean”
}
}
} ]
}
}, {
“range” : {
“Date” : {
“from” : “2016-09-06T00:00:00.000Z”,
“to” : null,
“format” : “date_time”,
“include_lower” : true,
“include_upper” : true
}
}
} ]
}
},
“aggregations” : {
“UC” : {
“terms” : {
“field” : “UC”,
“missing” : “NULL_STRING_TAG”,
“size” : 2147483647
},
“aggregations” : {
“Year” : {
“terms” : {
“field” : “Year”,
“missing” : -2147483648,
“size” : 2147483647
},
“aggregations” : {
“Month” : {
“terms” : {
“field” : “Month”,
“missing” : -2147483648,
“size” : 2147483647
},
“aggregations” : {
“Monthlabel” : {
“terms” : {
“field” : “MonthLabel”,
“missing” : “NULL_STRING_TAG”,
“size” : 2147483647
},
“aggregations” : {
“week” : {
“terms” : {
“field” : “Week”,
“missing” : -2147483648,
“size” : 2147483647
},
“aggregations” : {
“continent” : {
“terms” : {
“field” : “Continent”,
“missing” : “NULL_STRING_TAG”,
“size” : 2147483647
},
“aggregations” : {
“country” : {
“terms” : {
“field” : “Country”,
“missing” : “NULL_STRING_TAG”,
“size” : 2147483647
},
“aggregations” : {
“device” : {
“terms” : {
“field” : “Device”,
“missing” : “NULL_STRING_TAG”,
“size” : 2147483647
},
“aggregations” : {
“Query without accent” : {
“terms” : {
“field” : “Query without accent”,
“missing” : “NULL_STRING_TAG”,
“size” : 2147483647
},
“aggregations” : {
“Clicks” : {
“sum” : {
“field” : “Clicks”
}
},
“Position” : {
“sum” : {
“field” : “Position”
}
},
“impressions” : {
“sum” : {
“field” : “Impressions”
}
}
}
}
}
}
}
}
}
}
}
}
}
}
}
}
}
}
}
}
}
}]