Some virtual datasets are out of date and need to be manually updated

oluis · April 2, 2020, 6:53pm

I get this error on try make a reflection:
Some virtual datasets are out of date and need to be manually updated. These include: "@oluis".vwTempoDosCalculo. Please correct any issues and rerun this query.
What can I do to solve?

dacopan · April 2, 2020, 7:19pm

@oluis
where this error are visualized? on query time?
on server logs?
can you share query profile?
also can you share print screen of reflection status in VDS?

oluis · April 2, 2020, 7:32pm

thanks for reply

51408420-f45e-4c35-b260-7c54577746f4.zip (4,8,KB)

dacopan · April 2, 2020, 10:53pm

apparently you have a mongo datasource, defined a VDS with some fields, for example field1, field2, etc
then you active raw reflection, but now some of this fields are altered from origin, for example, mixin datatype, or remove some field

How are defined your VDS (the current VDS that throw error and the vwTempoDosCalculovds`)?
also can you share a print screen of reflection definition of both vds?

some as

oluis · April 3, 2020, 12:39pm

This is the failed VDS when creating a reflection

This is the origin

My origin comes from an aggregation of a table in mongodb. The table has 2.5 million documents, with aggregation reaching up to 20x more

dacopan · April 3, 2020, 3:37pm

please share the SQL of the VDS,
apparently some fields are inconsistent could be some of the mixin data types (fields with A# mark)

oluis · April 3, 2020, 4:13pm

The selec is only this

 SELECT *
    FROM "mongo-prod".statistics.vwTempoCalculo
    WHERE DATA = CURRENT_DATE

dacopan · April 4, 2020, 1:21am

please avoid use * instead always list fields, this can cause inconsistent metadata.
rewrite your query, then retry then execute

alter "mongo-prod".statistics.vwTempoCalculo refresh metadata

then retry to create reflection

balaji.ramaswamy · April 6, 2020, 4:33pm

@oluis

We have fixed this behavior in 4.1.8. In 4.1.6, did you run a query on the VDS and did that work?

oluis · April 7, 2020, 1:37pm

This helped me to resolve. thanks

datocrats-org · February 16, 2021, 11:45pm

Can someone describe how to generalize this for other data sources, is it

 ALTER PDS "Source-Name"."FolderSchemaEtc"."TableFileEtc"

Also how does this manual command compare to the Dataset Handling Metadata settings at the source level for refresh for example with Dataset Discovery Fetch every 1 Hour(s) - can I set it to a minute then wait for it to complete and have the same effect, without running a query against any specific PDS?

balaji.ramaswamy · February 20, 2021, 6:42am

@datocrats-org

You should not be running background refresh every minute as that can cause continuous load on the coordinator. Best is to call “alter pds refresh metadata” for affected tables at the end of the ETL pipeline. The background refresh and alter pds refresh metadata are the same where only changed (modified timestamp since last refresh) are refreshed. If we add “FORCE UPDATE” at the end, Dremio will refresh all folder irrespective of timestamp changed or not

Thanks
Bali

Topic		Replies	Views
Virtual Dataset shows stale data	15	3761	September 24, 2018
Refreshing reflection of VDS never triggered	3	2073	July 14, 2021
Bug in the process of refreshing physical dataset : dependent reflections refreshed too soon?	6	1677	November 28, 2018
External_query() reflection refresh schedule - what is it?	2	150	June 27, 2024
Refreshing a VDS reflection	1	1975	September 10, 2018

Some virtual datasets are out of date and need to be manually updated

Related topics