v1.3.0
Arrow v1.3.0
Closed issues:
- Attempting to serialize
DataType
s induces segfault (#74) - tables containing
Set
values are serializable but corresponding deserializedArrow.Table
s are inaccessible (#75) - support for heterogeneously typed tuples (#85)
- Difficult to read the code (#91)
- Arrow.write hangs on Tables.partitioner (#108)
- Unsafe conversion to signed integer types (#121)
- Arrow.write in v1.4.2 can create an invalid arrow file (#126)
- Arrow dataset imported as DataFrames are not pure DataFrames? (#127)
- Arrow.jl issue with struct types (#128)
unsupported ARROW:extension:name type: "JuliaLang.Nothing"
(#132)- Loss of parametric type information for custom types (#134)
- Avoid assuming field values can be used in constructors (#135)
- Help (#137)
- Losing type in unnamed column (#138)
- How to handle parametric Unitful types (#139)
- Can't serialize structs that contain
::Type{T}
fields (#140) - Cannot iterate Arrow.Stream (#141)
- Arrow.write("my.arrow", CategoricalArray([1,2,3])) hangs (#143)
- Arrow Table conversion to DataFrame throws DimensionMismatch Error (#144)
- copying Arrow.Table does not always copy columns (#146)
- Hang with multithreaded reading (#155)
Merged pull requests:
- Add ntasks keyword to limit # of tasks allowed to write at a time (#106) (@quinnj)
- Fix typo (#130) (@Sov-trotter)
- implement Base.IteratorSize for Stream, fixes #141 (#142) (@damiendr)
- Introduce new
maxdepth
keyword argument for setting a limit on nesting (#147) (@quinnj) - Ensure dict encoded index types match from record batch to record batch (#148) (@quinnj)
- Ensure serializing Arrow.DictEncoded writes dictionary messages (#149) (@quinnj)
- revert setting Arrow.write debug message threshold to -1 (#152) (@jrevels)
- add unexported
tobuffer
utility for interactive testing/development (#153) (@jrevels) - Better handle errors when something goes wrong writing partitions (#154) (@quinnj)
- Overhaul type serialization/deserialization machinery (#156) (@quinnj)