Fix Errata: lastModified on Delta table properties is the last "metadata" modification, not the last update to the table (added rows) #27

Open
newfront opened this issue Mar 24, 2024 · 0 comments
Labels: bug (Something isn't working)

Comments

@newfront
Owner

https://github.com/newfront/hitchhikers_guide_to_deltalake_streaming/blob/main/hitchhikers_guide/notebooks/101-first-steps/101-dl-streaming.ipynb

Under:

Calculate Table Freshness: abs(current_time()-{table.lastModified}): To answer the universal question of - "How Fresh Is It?".

  • I was using the table.lastModified timestamp. This works on a new table, but you quickly realize that the timestamp only changes when the table's structure or its table properties are modified. Adding rows does not update it, so it is not useful for calculating the freshness of a table.

  • Instead, you can use dt.history() to find the last operation that added data or otherwise "effectively" modified the table.
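
A minimal sketch of the history-based approach, not the notebook's actual fix. It assumes the rows come from `dt.history()` collected into dictionaries with the `operation` and `timestamp` columns. The helper names (`last_data_change`, `table_freshness`) and the set of operation names treated as data-changing are assumptions for illustration; adjust the set to the operations your tables actually record.

```python
from datetime import datetime, timedelta
from typing import Iterable, Mapping, Optional

# Assumption: these operation names count as changes to the table's data.
# Metadata-only operations (for example SET TBLPROPERTIES) are deliberately left out.
DATA_OPERATIONS = frozenset({"WRITE", "STREAMING UPDATE", "MERGE", "DELETE", "UPDATE"})


def last_data_change(
    history_rows: Iterable[Mapping],
    data_operations: frozenset = DATA_OPERATIONS,
) -> Optional[datetime]:
    """Return the newest timestamp among data-changing operations, or None."""
    timestamps = [
        row["timestamp"]
        for row in history_rows
        if row["operation"] in data_operations
    ]
    return max(timestamps) if timestamps else None


def table_freshness(
    history_rows: Iterable[Mapping],
    now: Optional[datetime] = None,
) -> Optional[timedelta]:
    """Return abs(now - last data change), or None if no data change is recorded."""
    latest = last_data_change(history_rows)
    if latest is None:
        return None
    if now is None:
        # Match the timezone-awareness of the history timestamp so the
        # subtraction does not mix naive and aware datetimes.
        now = datetime.now(latest.tzinfo)
    return abs(now - latest)


# Hypothetical usage with a live Spark session (the path is a placeholder):
# from delta.tables import DeltaTable
# dt = DeltaTable.forPath(spark, "/path/to/table")
# rows = [r.asDict() for r in dt.history().select("timestamp", "operation").collect()]
# print(table_freshness(rows))
```

Filtering on the operation name is what separates this from lastModified: a later property change appears in the history but is skipped when computing freshness.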

@newfront newfront self-assigned this Mar 24, 2024
@newfront newfront added this to the Data AI Summit 2024 milestone Jun 3, 2024
@newfront newfront added the baby-steps (Gentle Introductions to Concepts. This is like the First Steps idea or Gentle Introductions) and bug (Something isn't working) labels and removed the baby-steps label Jun 3, 2024