[WIP] Try to unblock payments backfill #881

yuli-han · 2024-11-20T18:46:12Z

Summary

Why / Goal

Test Plan

Added Unit Tests
Covered by existing CI
Integration tested

Checklist

Documentation update

Reviewers

hzding621 · 2024-11-20T19:30:22Z

spark/src/main/scala/ai/chronon/spark/TableUtils.scala

-      repartitionAndWriteInternal(df, tableName, saveMode, stats, sortByCols)
-    }.get
+      //repartitionAndWriteInternal(df, tableName, saveMode, stats, sortByCols)
+    }.get*/


Ideas for incorporating this change long term:

if (df.sparkSession.conf.getOption(SparkConstants.ChrononEnableRowCountBasedRepartitin).contains("true")) {
  // Opt-in path: keep the existing row-count-based repartition, which caches the DataFrame.
  wrapWithCache(s"repartition & write to $tableName", df) {
    logger.info(s"Repartitioning before writing...")
    repartitionAndWriteInternal(df, tableName, saveMode, stats, sortByCols)
  }.get
} else {
  // Default path: skip caching and row counting; repartition only if a positive override is set.
  val shuffleParallelism = df.sparkSession.conf
    .getOption(SparkConstants.ChrononOutputParallelismOverride)
    .map(_.toInt)
    .flatMap(value => if (value > 0) Some(value) else None)
  val dfRepartitioned = if (shuffleParallelism.isDefined) {
    df.repartition(shuffleParallelism.get)
  } else {
    df
  }
  dfRepartitioned
    .write
    .mode(saveMode)
    .insertInto(tableName)
}
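
The branching in the suggestion above can be checked in isolation, without a Spark session. The following is only a sketch under assumptions: a plain `Map[String, String]` stands in for `df.sparkSession.conf`, the two key names and the `WritePlan` object are hypothetical and not part of `SparkConstants` or Chronon, and a non-numeric override is ignored here, whereas `.map(_.toInt)` in the suggestion would throw.

```scala
import scala.util.Try

// Hypothetical stand-in for the decision logic only; it performs no writes.
object WritePlan {
  sealed trait Plan
  case object RowCountBasedRepartition extends Plan              // cache + repartitionAndWriteInternal path
  final case class FixedRepartition(numPartitions: Int) extends Plan // df.repartition(n) then insertInto
  case object WriteAsIs extends Plan                              // insertInto with no repartition

  // Assumed key names for illustration; the real constants live in SparkConstants.
  val EnableRowCountKey = "example.row_count_based_repartition.enabled"
  val ParallelismOverrideKey = "example.output_parallelism_override"

  def choose(conf: Map[String, String]): Plan =
    if (conf.get(EnableRowCountKey).contains("true")) {
      RowCountBasedRepartition
    } else {
      conf.get(ParallelismOverrideKey)
        .flatMap(v => Try(v.trim.toInt).toOption) // unparsable values are treated as unset
        .filter(_ > 0)                            // zero or negative means "no override"
        .map(FixedRepartition(_))
        .getOrElse(WriteAsIs)
    }
}
```

With this shape, a job that sets neither key falls through to `WriteAsIs`, which matches the default branch of the suggestion: no cache, no row count, and a direct `insertInto`.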

yuli_han added 3 commits November 18, 2024 13:24

try to unblock

95de059

remove wrapWithCache

718960f

test

f5b1393

hzding621 reviewed Nov 20, 2024
