Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Support filter pushdown for Log Tables #197

Open
1 of 2 tasks
wuchong opened this issue Dec 16, 2024 · 0 comments
Open
1 of 2 tasks

[Feature] Support filter pushdown for Log Tables #197

wuchong opened this issue Dec 16, 2024 · 0 comments
Labels
component=log feature New feature or request
Milestone

Comments

@wuchong
Copy link
Member

wuchong commented Dec 16, 2024

Search before asking

  • I searched in the issues and found nothing similar.

Motivation

Currently, we have supported projection pushdown for Log Tables. Meanwhile, we want to support filter pushdown as well, which can improve query performance and resource efficiency a lot.

This feature is very similar to Kafka Header Filter, but header is not directly accessible in SQL. In SQL language, this is filter pushdown, that automatically optimizes users' where col1 = 'something' condition into a pushdown operation to avoid transferring the unnecessary data.

Solution

We may need to add some metadata/statistics (e.g., min/max) for each Arrow log batch. This allows TabletServers can apply the pushdowned predicate on the metadata without reading and deserializing the Arrow log.

Anything else?

No response

Willingness to contribute

  • I'm willing to submit a PR!
@wuchong wuchong added feature New feature or request component=log labels Dec 16, 2024
@wuchong wuchong added this to the v0.6 milestone Dec 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component=log feature New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant