Arrow Expressions and Vortex DataSets #5801
Unanswered
paultiq
asked this question in
Issue Triage
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Issue Description
Vortex's compute differs significantly from Arrow compute in terms of supported predicates. #5781 provides one list where Vortex doesn't support things that Arrow supports... and #5765 is an example where Vortex's compute supports things Arrow doesn't.
Assuming that 5781 will take some time to resolve (to achieve parity with Arrow), this leads to a number of problems... especially with duckdb which will attempt to pushdown unsupported expressions. For example, the following query will raise an exception in duckdb:
* duckdb's CanPushdown assumes all Arrow sources support the same expressions.
I'm unclear on whether this is an Arrow problem, a Vortex problem or a duckdb problem:
One thing Vortex can do is add test cases for:
DuckDB Example
Arrow Example
All this does is demonstrate that certain kernels aren't implemented. The point here is not that the kernel isn't implement, but just to demonstrate why the above (duckdb) example occurs.
Parquet Works
Vortex Fails
Expected Behavior
.
Actual Behavior
.
Reproduction Steps
.
OS Version Information
Ubuntu 24.04
I acknowledge that:
```) on separate lines.Beta Was this translation helpful? Give feedback.
All reactions