I am unsure why the Series version of `n_unique` fails whereas the DataFrame version works. I looked through the documentation but could not find anything that explains the difference.
Log output
No response
Issue description
Inconsistent behavior of `n_unique` for a Series of list-of-structs.
Expected behavior
I believe that `df.select('list-of-structs').n_unique()` and `df['list-of-structs'].n_unique()` should both work and return the same answer. Currently the Series version raises an `InvalidOperationError`.
Checks
Reproducible example
Installed versions