Releases: dlt-hub/dlt

0.3.16

03 Sep 22:50

Core Library

  • add default user agent header to dlt requests client by @sh-rp in #595

  • Add issue templates by @burnash in #537

  • Add pydantic support by @steinitzu in #589
    You can use pydantic models to define table schemas, and you can load pydantic instances the same way you load dictionaries.

  • retain schema hints order by @sh-rp in #600

  • NormalizerInfo: item counts in table present in trace by @sh-rp in #582
    Get counts of items added to each table from the normalization stage

  • Add azure blob storage filesystem/staging destination by @steinitzu in #592
    Also includes Snowflake stage support

  • Feat: add time to pua serde by @z3z1ma in #604

  • general state sync interface by @sh-rp in #564
    You can restore state and schemas from Weaviate now (filesystem comes later)

  • uses botocore instead of boto3 in AwsCredentials by @rudolfix in #590

Docs

New Verified Sources

New Contributors

Full Changelog: 0.3.13...0.3.16

0.3.13

27 Aug 19:40
fe4ceee

Core Library

  • Feat: don't require AWS credentials for external Snowflake stage by @codingcyclist in #587
  • connecting to local Weaviate made easy by @rudolfix in #591
  • allows setting table name via property on DltResource by @rudolfix in #593
  • destination tests refactored by @sh-rp in #572

Docs

  • docs snippets and examples are now linted and tested by @sh-rp in #559
  • several blog posts and verified sources docs updates by @adrianbr and @dat-a-man

New Verified Sources

New Contributors

Full Changelog: 0.3.12...0.3.13

0.3.12

20 Aug 19:13

Core Library

This version adds two new types of destinations:

  • Add a Weaviate destination by @burnash in #479
    A vector data store: load and query vectorized text data
  • Basic AWS Athena support by @sh-rp in #522
    A data lake destination that works together with filesystem as staging

Apart from that bug fixes:

Docs

New Contributors

Full Changelog: 0.3.10...0.3.12

0.3.10

13 Aug 18:23
f765eb8

Core Library

Full Changelog: 0.3.9...0.3.10

0.3.9

11 Aug 20:36

Bugfix Release

When replace with a staging dataset was used in version 0.3.8, tables with other write dispositions were also truncated (in other words, all the tables in the schema could be truncated). Note that the default replace strategy does not use a staging dataset, so if you did not explicitly change it, you were not affected.
This release fixes that bug. If you use the replace strategy above, update the library.
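
The intended behavior described above can be illustrated with a toy model. This is only a sketch, not dlt's implementation: the function name `tables_to_truncate` and the example schema are hypothetical, and the only assumption taken from dlt is that each table carries one of the write dispositions `append`, `replace`, or `merge`.

```python
from typing import Dict, List


def tables_to_truncate(write_dispositions: Dict[str, str]) -> List[str]:
    """Return only the tables that a replace load is allowed to truncate."""
    # Tables with "append" or "merge" must keep their existing rows.
    return sorted(
        name
        for name, disposition in write_dispositions.items()
        if disposition == "replace"
    )


# Hypothetical schema: table name -> write disposition.
schema = {"users": "replace", "events": "append", "orders": "merge"}
affected = tables_to_truncate(schema)
```

In this model the 0.3.8 bug corresponds to returning every table in `schema` instead of just the `replace` ones.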

Full Changelog: 0.3.8...0.3.9

0.3.8

06 Aug 18:14
5e903b2

Core Library

Support for unstructured data!

A really cool data source that lets you ask questions about your PDF documents and stores the answers in any of our destinations. It goes from binary blobs through unstructured.io, vector databases, and LLM queries to destinations such as duckdb and bigquery. Blobs can come from the filesystem, Google Drive, or your inbox (also incrementally). By @AstrakhantsevaAA

0.3.6

30 Jul 20:54
6a9c322

Core Library

  • fixes lost data and incorrect handling of child tables during truncate-and-insert replace by @sh-rp in #499
    This is an important improvement that fixes a few holes in the truncate-and-insert replace mode (which has been there from the beginning of dlt). We now truncate all the tables before the multithreaded append process starts. We also truncate child tables that previously could be left with stale data.
    details: #263 #271

  • fixes deploy airflow secrets and makes toml the default layout by @rudolfix in #513

  • check the required verified source dlt version during dlt init and warn users by @steinitzu in #514

  • add schema version to _dlt_loads table by @codingcyclist in #466
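
The ordering described in the truncate-and-insert fix (#499) can be sketched with in-memory tables. This is a toy model under stated assumptions, not dlt's loader: `replace_load`, `storage`, and `children` are hypothetical names, and only one level of child tables is handled.

```python
from concurrent.futures import ThreadPoolExecutor
from threading import Lock
from typing import Dict, List, Tuple

Row = Dict[str, int]


def replace_load(
    storage: Dict[str, List[Row]],
    children: Dict[str, List[str]],
    jobs: List[Tuple[str, List[Row]]],
) -> None:
    """Truncate all affected tables first, then append job rows in parallel."""
    # Step 1: collect every table with a load job plus its child tables.
    to_truncate = set()
    for table, _ in jobs:
        to_truncate.add(table)
        to_truncate.update(children.get(table, []))

    # Step 2: truncate everything before any append job starts.
    for table in to_truncate:
        storage[table] = []

    # Step 3: run the append jobs concurrently; the lock guards the shared lists.
    lock = Lock()

    def append(job: Tuple[str, List[Row]]) -> None:
        table, rows = job
        with lock:
            storage[table].extend(rows)

    with ThreadPoolExecutor(max_workers=4) as pool:
        list(pool.map(append, jobs))


storage = {"orders": [{"id": 0}], "orders__items": [{"id": 0}]}
children = {"orders": ["orders__items"]}
replace_load(storage, children, [("orders", [{"id": 1}]), ("orders", [{"id": 2}])])
```

Because truncation finishes before the thread pool starts, no append job can have its rows wiped by a late truncate, and the child table `orders__items` is emptied even though no job targets it.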

Docs

New Contributors

Full Changelog: 0.3.5...0.3.6

0.3.5

23 Jul 22:11
95644ec

Core Library

  • Fix incremental hitting end_value throwing out whole batches by @steinitzu in #495

  • replace with staging tables by @sh-rp in #488
    A staging dataset may now be used to replace tables. You can choose from several replace strategies (https://dlthub.com/docs/general-usage/full-loading), including a fully transactional, atomic replacement of the parent table and all its child tables, or an optimized strategy that uses, e.g., the ability to clone tables and copy-on-write in BigQuery and Snowflake.

  • detect serverless aws_lambda by @muppinesh in #490

  • fixes credentials instantiation by @rudolfix in #507

  • fixes extract schema update by @rudolfix in #509

  • adds destination fingerprint by @rudolfix in #511
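
For the replace strategies introduced in #488, a strategy could be selected through the environment roughly as follows. This is a sketch under assumptions: the helper `set_replace_strategy` is hypothetical, the strategy names are the ones listed in the linked full-loading docs, and the variable name `DESTINATION__REPLACE_STRATEGY` assumes dlt's usual convention of upper-cased config sections joined by a double underscore. Check the docs for the authoritative setting.

```python
import os

# Strategy names as listed in the linked full-loading docs.
REPLACE_STRATEGIES = ("truncate-and-insert", "insert-from-staging", "staging-optimized")


def set_replace_strategy(strategy: str) -> None:
    """Select a replace strategy for later pipeline runs via the environment."""
    if strategy not in REPLACE_STRATEGIES:
        raise ValueError(f"unknown replace strategy: {strategy!r}")
    # Assumed variable name, derived from the [destination] config section.
    os.environ["DESTINATION__REPLACE_STRATEGY"] = strategy


set_replace_strategy("insert-from-staging")
```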

Docs

New Contributors

Full Changelog: 0.3.4...0.3.5

0.3.4

17 Jul 05:30
15c32da

Core Library

  • staging for loader files implemented by @sh-rp in #451
  • staging for redshift on s3 bucket and json + parquet by @sh-rp in #451
  • staging for bigquery on gs bucket and json + parquet by @sh-rp in #451
  • staging for snowflake on s3+gs buckets and json + parquet by @sh-rp in #451
  • improvements and bugfixes for parquet generation by @rudolfix in #451
  • tracks helpers usage and source names by @rudolfix in #497
  • Fix: use sets to prevent unnecessary truncate calls by @z3z1ma in #481

Docs

New Contributors

Full Changelog: 0.3.3...0.3.4

0.3.3

10 Jul 06:44
306baf1

Core Library

  • supports motherduck as a destination by @rudolfix in #460
  • dbt 1.5 compatibility, enabled motherduck dbt support by @sh-rp in #475
  • adds more retry conditions and makes timeouts configurable in the dlt requests drop-in replacement by @steinitzu in #477
  • end_value support to incremental: backloading in parallel chunks now possible by @steinitzu in #467
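
Backloading in parallel chunks (#467) relies on splitting a range into bounded pieces. A minimal sketch of that split, assuming half-open date ranges: `chunk_ranges` is a hypothetical helper, and each yielded pair would be passed as `initial_value` and `end_value` to `dlt.sources.incremental` in a separate pipeline run.

```python
from datetime import date, timedelta
from typing import Iterator, Tuple


def chunk_ranges(start: date, end: date, days: int) -> Iterator[Tuple[date, date]]:
    """Yield consecutive (initial_value, end_value) pairs covering start..end."""
    if days <= 0:
        raise ValueError("days must be positive")
    cursor = start
    while cursor < end:
        # Clamp the last chunk so it never runs past the overall end.
        upper = min(cursor + timedelta(days=days), end)
        yield cursor, upper
        cursor = upper


chunks = list(chunk_ranges(date(2023, 1, 1), date(2023, 1, 31), days=10))
```

Each chunk starts where the previous one ends, so the chunks can be loaded independently without gaps or overlap as long as the upper bound is treated as exclusive.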

Docs

Full Changelog: 0.3.2...0.3.3