Releases: apache/arrow-julia
Releases · apache/arrow-julia
v1.5.0
Arrow v1.5.0
Closed issues:
- Better fallback when registered types don't match (#133)
- Issue with
Tables.rowtable
when entries contain a vector of strings (#167) - VersionNumbers don't serialize (#200)
- Error in "arrow json integration tests" (#202)
- Automated conversion fails for
Date{MILLISECOND, Int64}
(#206)
Merged pull requests:
- Ensure requested List type is requested on List getindex (#182) (@quinnj)
- Add global metadata lock to ensure thread safety of global metadata (#183) (@quinnj)
- Handle empty tuple (#201) (@omus)
- Support
VersionNumber
(#205) (@omus) - support Date with type parameter ms (#207) (@piever)
- bump Project.toml from v1.4.1 to v1.5.0 (#208) (@jrevels)
v1.4.1
Arrow v1.4.1
Closed issues:
- maxdepth argument not fully forwarded to inner functions from Arrow.write? (#180)
Merged pull requests:
v1.4.0
Arrow v1.4.0
Closed issues:
- reconsidering the current type registration/serialization mechanism (and its internal usage) (#88)
- provide mechanism to free metadata stored in OBJ_METADATA? (#90)
- Arrow.write slow perf with ZonedDateTime (#95)
- Implement DataAPI pool/dict encoding methods for DictEncoded (#120)
- Slower materialization Feather vs Arrow (#131)
- Usage with MPI (#151)
- Reading CSV (#157)
- Reading an Arrow file with no message batches after the schema seems to produce a partly initialized Table? (#158)
- DictEncoded methods for refpool, refarray and levels (#159)
- MethodError
Int64(::Arrow.Timestamp...
when reading arrow file saved bypandas
. (#166) - Improve printing? (#168)
Merged pull requests:
- Add refpool, refarray and levels for DictEncoded (#161) (@dmbates)
- Tweak promoteunion to always avoid abstract types (#162) (@quinnj)
- Restructure ArrowTypes so it can be registered as its own package (#163) (@quinnj)
- DataAPI methods (#164) (@quinnj)
- Don't store table metadata globally (#165) (@quinnj)
- document guarantee that
getmetadata
returns alias not copy (#169) (@jrevels) - add missing setmedata! method for Arrow.Table (#170) (@jrevels)
- use actual deprecation for
registertype!
(#171) (@ericphanson) - Warn when converting Arrow.Timestamps to Dates.DateTime or ZonedDateTime (#172) (@quinnj)
- Introduce Arrow.ToTimestamp for performant ZonedDateTime encoding (#173) (@quinnj)
- Fix () -> {} typo in docs (#174) (@etpinard)
- Fix case when ipc stream has no record batches, only schema (#175) (@quinnj)
- Fix slight perf hit when checking validity bitmap (#176) (@quinnj)
v1.3.0
Arrow v1.3.0
Closed issues:
- Attempting to serialize
DataType
s induces segfault (#74) - tables containing
Set
values are serializable but corresponding deserializedArrow.Table
s are inaccessible (#75) - support for heterogeneously typed tuples (#85)
- Difficult to read the code (#91)
- Arrow.write hangs on Tables.partitioner (#108)
- Unsafe conversion to signed integer types (#121)
- Arrow.write in v1.4.2 can create an invalid arrow file (#126)
- Arrow dataset imported as DataFrames are not pure DataFrames? (#127)
- Arrow.jl issue with struct types (#128)
unsupported ARROW:extension:name type: "JuliaLang.Nothing"
(#132)- Loss of parametric type information for custom types (#134)
- Avoid assuming field values can be used in constructors (#135)
- Help (#137)
- Losing type in unnamed column (#138)
- How to handle parametric Unitful types (#139)
- Can't serialize structs that contain
::Type{T}
fields (#140) - Cannot iterate Arrow.Stream (#141)
- Arrow.write("my.arrow", CategoricalArray([1,2,3])) hangs (#143)
- Arrow Table conversion to DataFrame throws DimensionMismatch Error (#144)
- copying Arrow.Table does not always copy columns (#146)
- Hang with multithreaded reading (#155)
Merged pull requests:
- Add ntasks keyword to limit # of tasks allowed to write at a time (#106) (@quinnj)
- Fix typo (#130) (@Sov-trotter)
- implement Base.IteratorSize for Stream, fixes #141 (#142) (@damiendr)
- Introduce new
maxdepth
keyword argument for setting a limit on nesting (#147) (@quinnj) - Ensure dict encoded index types match from record batch to record batch (#148) (@quinnj)
- Ensure serializing Arrow.DictEncoded writes dictionary messages (#149) (@quinnj)
- revert setting Arrow.write debug message threshold to -1 (#152) (@jrevels)
- add unexported
tobuffer
utility for interactive testing/development (#153) (@jrevels) - Better handle errors when something goes wrong writing partitions (#154) (@quinnj)
- Overhaul type serialization/deserialization machinery (#156) (@quinnj)
v1.2.4
Arrow v1.2.4
Merged pull requests:
v1.2.3
Arrow v1.2.3
Merged pull requests:
v1.2.2
Arrow v1.2.2
Closed issues:
- Segmentation Fault with Threads.@Spawn + Tables.partitioner + write with compression (#82)
- Types deserialize differently during session in which they were written (#88)
- Producing unsigned dict encoding indices; should be signed (#112)
- Unsigned integers as indices in DictEncoded type (#113)
- DictEncoded doesn't write as DictEncoded (#116)
- Errors writing file with missing in categorical (#117)
Merged pull requests:
v1.2.1
v1.2.0
v1.1.0
Arrow v1.1.0
Closed issues:
- memory leaking when reading compressed arrow files (#80)
- writing column with missing / struct data errors (#84)
- downstream packages need to put
Arrow.ArrowTypes.registertype!
statements in__init__
(#87)
Merged pull requests:
- Support new Decimal256 type (#79) (@quinnj)
- fix typo (#83) (@ericphanson)
- add
ArrowTypes.default
methods and tests for dates (#86) (@ericphanson) - add default UUID <-> UInt128 Arrow type mapping (#89) (@jrevels)
- bump Project.toml to v1.1.0 (#94) (@jrevels)
- Add warning for
Arrow.ArrowTypes.registertype!
(#96) (@ericphanson) - Fix
deploydocs
(#97) (@ericphanson) - convert Arrow-flavored eltypes to Julia-flavored eltypes on copy (#98) (@jrevels)
- Fix copy on DictEncoding arrays with missing values (#99) (@quinnj)
- Add BitIntegers compat (#100) (@quinnj)