full-refresh on `insert_overwrite` strategy #211

ghost · 2023-08-16T03:48:03Z

Describe the feature

When creating a model with the `insert_overwrite` strategy, `--full-refresh` does not seem to work.

Describe alternatives you've considered

From the code, it seems full refresh is only supported for views in the incremental materialization:

https://github.com/aws-samples/dbt-glue/blob/main/dbt/include/glue/macros/materializations/incremental/incremental.sql#L63

Additional context

Steps to reproduce:

1. Create a model containing 25 columns and build it with `dbt run`.
2. Add a new column to the model and run `dbt run` again.

The second run fails with an error like the one below:

`xxx`.`xxxx` requires that the data to be inserted have the same number of columns as the target table: target table has 25 column(s) but the inserted data has 26 column(s), including 0 partition column(s) having constant value(s).
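
The mismatch can be illustrated with a small Python sketch. This is not dbt-glue or Spark code: `insert_is_compatible` and the column names are hypothetical, and the function only mimics the column-count rule that the error message describes.

```python
def insert_is_compatible(target_columns, incoming_columns):
    """Mimic the rule in the error above: an insert into an existing
    table needs the same number of columns as the target table."""
    return len(target_columns) == len(incoming_columns)


# Hypothetical schemas: the first run creates 25 columns, the second adds one.
target = [f"col_{i}" for i in range(25)]
incoming = target + ["new_col"]

print(insert_is_compatible(target, target))    # shape of the first run
print(insert_is_compatible(target, incoming))  # shape after adding a column
```

Because the existing table keeps its 25-column definition, the 26-column insert is rejected unless the table is rebuilt first.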

Who will this benefit?

When the schema changes and users want to rebuild the table, `--full-refresh` will be helpful.

Are you interested in contributing this feature?

Yes.

ghost · 2023-08-16T03:58:49Z

My understanding is that in the original dbt, a full refresh backs up the old table by renaming it, builds the new table for the model, and then drops the backup.

https://github.com/dbt-labs/dbt-core/blob/main/core/dbt/include/global_project/macros/materializations/models/incremental/incremental.sql#L64-L68

However, dbt-glue (and dbt-spark) use external tables. Even if we can rename the table, the data stays in the same location, so the backup doesn't make sense.

How about just dropping the table, and only when `--full-refresh` is passed?
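
A minimal Python sketch of that proposal, modelling only the decision flow. The function `plan_incremental_run` and the step names are hypothetical and are not part of dbt-glue; the real change would live in the adapter's incremental materialization macro.

```python
def plan_incremental_run(table_exists, full_refresh):
    """Return the ordered steps for one run of an insert_overwrite model.

    Hypothetical step names; this models the proposal, not dbt-glue itself.
    """
    if not table_exists:
        # First run: nothing to refresh, just build the table.
        return ["create_table"]
    if full_refresh:
        # Proposal: skip the backup/rename, since an external table's data
        # would stay at the same location anyway. Drop, then rebuild.
        return ["drop_table", "create_table"]
    # Normal incremental run against the existing table.
    return ["insert_overwrite"]


print(plan_incremental_run(table_exists=True, full_refresh=True))
```

Under this plan, a schema change followed by `--full-refresh` would recreate the table with the new column set instead of inserting into the old definition.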

HaykManukyanAvetiky · 2024-04-04T10:19:17Z

Is there any update on this? I am having a hard time using Hudi with full refresh, and this functionality is very important. It would allow Hudi indexes to be used for full-load tables as well, and improve performance.

ghost added the enhancement New feature or request label Aug 16, 2023

Jeremynadal33 mentioned this issue Aug 5, 2024

Allow adding write options to model configuration #415

Open
