Apache Arrow 15.0.0 (21 January 2024)
This is a major release covering more than 1 months of development.
Download
- Source Artifacts
- Binary Artifacts
- Git tag
Contributors
This release includes 536 commits from 101 distinct contributors.
$ git shortlog -sn apache-arrow-14.0.2..apache-arrow-15.0.0
67 dependabot[bot]
42 Sutou Kouhei
31 Raúl Cumplido
27 Josh Soref
20 Antoine Pitrou
19 Felipe Oliveira Carvalho
19 mwish
18 Dewey Dunnington
17 James Duong
16 Curt Hagenlocher
16 Dominik Moritz
16 Joris Van den Bossche
15 Jacob Wujciak-Jens
14 Hyunseok Seo
12 Alenka Frim
10 Yue
10 sgilmore10
9 Matt Topol
8 Dane Pitkin
8 Diego Fernández Giraldo
8 Thomas Newton
6 Bryce Mecum
5 Jin Shang
5 Jonathan Keane
5 Kevin Gurney
4 Benjamin Kietzman
4 Gang Wu
3 Ben Harkins
3 Ivan Chesnov
3 Judah Rand
3 Lei Hou
3 Rossi(Ruoxi) Sun
3 Tim Schaub
3 abandy
2 Alfonso Subiotto Marqués
2 David Li
2 Divyansh200102
2 Eero Lihavainen
2 Francis
2 Gavin Murrison
2 JB Onofré
2 Jeremy Aguilon
2 John
2 Junming Chen
2 Laurent Goujon
2 Nic Crane
2 Peter Andreas Entschev
2 Rok Mihevc
2 Stas Stepanov
2 Tsutomu Katsube
2 Vibhatha Lakmal Abeykoon
1 Abram Fleishman
1 Anja Kefala
1 Atheel Massalha
1 Benjamin Schmidt
1 Bob Plotts
1 Bryan Cutler
1 Carl Jackson
1 ChinYikMing
1 Chris Larsen
1 Christian Holm Christensen
1 Dan Homola
1 Dan Stone
1 Danyaal Khan
1 Diogo Teles Sant'Anna
1 Dongjoon Hyun
1 Fatemah Panahi
1 Fernando Mayer
1 Fokko Driesprong
1 Hattonuri
1 Hendrik Makait
1 Jiaxing Liang
1 Jinpeng
1 John Garland
1 Kyle Barron
1 LucasG0
1 Maximilian Muecke
1 Miguel Pragier
1 Miles
1 Neal Richardson
1 Nick Hughes
1 Paul Spangler
1 Phillip LeBlanc
1 Pierre Moulon
1 Quang Hoang
1 Ravjot Brar
1 Tammy DiPrima
1 Will Jones
1 William Ayd
1 Yifeng-Sigma
1 david dali susanibar arce
1 davidhcoe
1 frazar
1 jiawei liang
1 loicalleyne
1 orgadish
1 panbingkun
1 prmoore77
1 shibei
1 zhipeng
1 谢天
Patch Committers
The following Apache committers merged contributed patches to the repository.
$ git shortlog -sn --group=trailer:signed-off-by apache-arrow-14.0.2..apache-arrow-15.0.0
154 Sutou Kouhei
56 David Li
50 Antoine Pitrou
35 Joris Van den Bossche
30 Jacob Wujciak-Jens
24 Raúl Cumplido
22 Matt Topol
19 Curt Hagenlocher
18 Kevin Gurney
16 Benjamin Kietzman
16 Felipe Oliveira Carvalho
14 mwish
13 Dewey Dunnington
12 AlenkaF
7 Nic Crane
3 Weston Pace
2 Gang Wu
2 Jonathan Keane
1 Josh Soref
1 Rok Mihevc
1 Will Jones
1 dependabot[bot]
Changelog
Apache Arrow 15.0.0 (2024-01-21 08:00:00)
Bug Fixes
- GH-15192 - [C++] Bring back
case_when
tests for union types (#39308) - GH-32570 - [C++] Fix the issue of
ExecBatchBuilder
when appending consecutive tail rows with the same id may exceed buffer boundary (#39234) - GH-32662 - [C#] Make dictionaries in file and memory implementations work correctly and support integration tests (#39146)
- GH-33475 - [Java] Add parameter binding for Prepared Statements in JDBC driver (#38404)
- GH-34532 - [Java][FlightSQL] Change JDBC to handle multi-endpoints (#38521)
- GH-34610 - [Java] Fix valueCount and field name when loading/transferring NullVector (#38973)
- GH-34890 - [C++][Python] Add a no-op kernel for dictionary_encode(dictionary) (#38349)
- GH-35497 - [C++] Use the latest tagged version of flatbuffers (#38192)
- GH-36588 - [C#] Support blank column names and enable more integration tests. (#39167)
- GH-36594 - [C++] Don’t use MSVC_VERSION to determin -fms-compatibility-version (#36595)
- GH-36912 - [Java] JDBC driver stops consuming roots if it sees an empty root (#38590)
- GH-37055 - [C++] Optimize hash kernels for Dictionary ChunkedArrays (#38394)
- GH-37657 - [JS] Run bin scripts with ts-node (#38500)
- GH-37726 - [Swift][FlightSQL] Update behavior to be similar to existing impls (#37764)
- GH-37751 - [C++][Gandiva] Avoid registering exported functions multiple times in gandiva (#37752)
- GH-37796 - [C++][Acero] Fix race condition caused by straggling input in the as-of-join node (#37839)
- GH-37884 - [Swift] allow reading of unaligned FlatBuffers buffers (#38635)
- GH-37969 - [C++][Parquet] add more closed file checks for ParquetFileWriter (#38390)
- GH-38096 - [Java] FlightStream with metadata can cause error when closing (#38110)
- GH-38198 - [Go] Fix AuthenticateBasicToken to be reliable behind proxies (#38199)
- GH-38210 - [C++][FlightRPC] Add missing app_metadata arguments (#38231)
- GH-38216 - [R] open_dataset(format = “json”) not documented (#38258)
- GH-38242 - [Java] Fix incorrect internal struct accounting for DenseUnionVector#getBufferSizeFor (#38305)
- GH-38254 - [Java] Add reusable buffer getters to char/binary vectors (#38266)
- GH-38268 - [Java] Disable flaky TestFlightSqlStreams (#38319)
- GH-38281 - [Go] Ensure CData imported arrays are freed on release (#38314)
- GH-38297 - [C#] Fix build for .NET 4.7.2 (#38299)
- GH-38304 - [C++][Parquet] Fix Valgrind memory leak in arrow-dataset-file-parquet-encryption-test (#38306)
- GH-38307 - [CI] Remove gemfury_clean.rb (#38308)
- GH-38318 - [Java][FlightRPC] Enable tests that leaked (#38719)
- GH-38323 - [CI][Python] Use system gdb on test-conda-python (#38324)
- GH-38363 - [Release][CI] Omit tests for main/maintenance branches on RC branch (#38365)
- GH-38366 - [Java] Fix Murmur hash on buffers less than 4 bytes (#38368)
- GH-38378 - [C++][Parquet] Don’t initialize OpenSSL explicitly with OpenSSL 1.1 (#38379)
- GH-38382 - [R] Explicitly clean up
arrow_duck_connection()
on exit (#38495) - GH-38387 - [Java] Fix JDK8 compilation issue with TestAllTypes (#38388)
- GH-38395 - [Go] fix rounding errors in decimal256 string functions (#38426)
- GH-38399 - [Go][Parquet] DeltaBitPack decoder reset usedFirst after SetData (#38413)
- GH-38401 - [C++] Re-generate flatbuffers C++ for Skyhook (#38405)
- GH-38436 - [R] Test segfault on reading CSVs with non-UTF-8 encoding
- GH-38439 - [Java][CI] Use Eclipse Temurin for all Java CI linux jobs (#38440)
-
GH-38447 - [CI][Release] Don’t use “ {exit,continue}” (#38486) - GH-38458 - [Go] Add ValueLen to BinaryLike interface (#39242)
- GH-38470 - [CI][Integration] Install jpype and build JNI c-data to run integration tests (#39502)
- GH-38477 - [Go] Fixing decimal 128 rounding issue (#38478)
- GH-38479 - [C++] Avoid passing null pointer to LZ4 frame decompressor (#39125)
- GH-38503 - [Go][Parquet] Make the arrow column writer internal (#38727)
- GH-38503 - [Go][Parquet] Style improvement for using ArrowColumnWriter (#38581)
- GH-38516 - [Go][Parquet] Increment the number of rows written when appending a new row group (#38517)
- GH-38535 - [Python] Fix S3FileSystem equals None segfault (#39276)
- GH-38554 - [Release][Website] post-03-website.sh doesn’t quote current.date (#38555)
- GH-38556 - [C++] Add missing explicit size_t cast for i386 (#38557)
- GH-38594 - [Docs][C++][Gandiva] Document how to register Gandiva external functions (#38763)
- GH-38599 - [Docs] Update Headers (#38696)
- GH-38614 - [Java] Add VarBinary and VarCharWriter helper methods to more writers (#38631)
- GH-38624 - [C++] Fix: add TestingEqualOptions for gtest functions. (#38642)
- GH-38630 - [MATLAB]
arrow.array.BooleanArray
’stoMATLAB
method does not take slice offsets into account (#38636) - GH-38653 - [Packaging][Java][Python][Ruby] Raise the minimum macOS version to 10.15 catalina to allow using new APIs in C++17 (#38677)
- GH-38683 - [Python][Docs] Update docstrings for Time32Type and Time64Type (#39059)
- GH-38684 - [Integration] Try to strengthen C Data Interface testing (#38846)
- GH-38697 - [C++][Gandiva] Use arrow io util to replace std::filesystem::path in gandiva (#38698)
- GH-38709 - [C++] Protect against PREALLOCATE preprocessor defined on macOS (#38760)
- GH-38711 - [CI] Rollback aws-cli for preview documentation (#38723)
- GH-38725 - [Java] decompression in Lz4CompressionCodec.java does not set writer index (#38840)
- GH-38728 - [Go] ipc: put lz4 decompression buffers back into sync.Pool (#38729)
- GH-38737 - [Java] Fix JDBC caching of SqlInfo values (#38739)
- GH-38738 - [C++] Check variadic buffer counts in bounds (#38740)
- GH-38745 - [Integration] Fix huge integration test (#38746)
- GH-38762 - [R] Versions of R and RTools in CI config are no longer current
- GH-38764 - [Java] Clarify warning about
--add-opens=java.base/java.nio=ALL-UNNAMED
(#38765) - GH-38782 - [C++][FS][Azure] Do nothing for CreateDir(“/container”, true) (#38783)
- GH-38795 - [Go] Fix race GetToTimeFunc for Timestamp (#38797)
- GH-38811 - [R] Actually use fetched cmake on macos (#39453)
- GH-38816 - [C#] Fix IArrowRecord implementation on StructArray (#38827)
- GH-38823 - Fix TestArrowReaderAdHoc.ReadFloat16Files to use new uncompressed files (#38825)
- GH-38832 - [Java] Avoid building twice in
ci/scripts/java_build.sh
(#38829) - GH-38844 - [C++] S3FileSystem export s3 sdk config “use_virtual_addressing” to arrow::fs::S3Options (#38858)
- GH-38851 - Website: Contributing link is not working
- GH-38879 - [C++][Gandiva] Fix Gandiva to_date function’s validation for supress errors parameter (#38987)
- GH-38883 - [Docs] Fix struct example to show hiding a child’s entry (#38898)
- GH-38906 - [R] Improve Windows CI configuration (#38927)
- GH-38921 - [CI] Fix spelling (#38922)
- GH-38925 - [CI] Fix spelling (#38926)
- GH-38928 - [R] Fix spelling (#38929)
- GH-38930 - [Java] Fix spelling (#38931)
- GH-38932 - [GO] Fix spelling (#38933)
- GH-38938 - [FlightRPC] Fix spelling (#38939)
- GH-38940 - [Ruby] Fix spelling (#38941)
- GH-38942 - [C#] Fix spelling (#38943)
- GH-38944 - [Python] Fix spelling (#38945)
- GH-38946 - [MATLAB] Fix spelling (#38947)
- GH-38948 - [Swift] Fix spelling (#38949)
- GH-38950 - [Docs] Fix spelling (#38951)
- GH-38952 - [Format] Fix spelling (#38953)
- GH-38954 - [Dev] Fix spelling (#38955)
- GH-38956 - [Gandiva] Fix spelling (#38957)
- GH-38960 - [C++] Fix spelling (acero) (#38961)
- GH-38964 - [C++] Fix spelling (compute) (#38965)
- GH-38966 - [C++] Fix spelling (util) (#38967)
- GH-38968 - [C++] Fix spelling (dataset) (#38969)
- GH-38971 - [C++] Fix spelling (filesystem) (#38972)
- GH-38975 - [Dev] Fix spelling (#38976)
- GH-38977 - [C++] Fix spelling (#38978)
- GH-38979 - [C++] Fix spelling (#38980)
- GH-38981 - [R][Release] Don’t update version.json on compatible version release (#38982)
- GH-39014 - [Java] Add default truststore along with KeychainStore when on Mac system (#39235)
- GH-39031 - [Docs] Remove misspelled rule from contrib css (#39032)
- GH-39045 - [C++][Acero] union node output batches should be unordered (#39046)
- GH-39113 - [Integration][Flight][Java] Fix occasional failure starting Java server (#39115)
- GH-39116 - [Go] Fix CI Staticcheck (#39117)
- GH-39126 - [C++][CI] Fix Valgrind failures (#39127)
- GH-39130 - [CI][GLib][Windows] Use old Ruby as workaround for load error (#39168)
- GH-39136 - [C++] Remove needless system Protobuf dependency with -DARROW_HDFS=ON (#39137)
- GH-39138 - [R] Fix implicit conversion warnings (#39250)
- GH-39156 - [C++][Compute] Fix negative duration division (#39158)
- GH-39163 - [C++] Add missing data copy in StreamDecoder::Consume(data) (#39164)
- GH-39185 - [C++] Remove compiler warnings with
-Wconversion -Wno-sign-conversion
in public headers (#39186) - GH-39191 - [R] throw error when
string_replace
is passed vector of values inpattern
(#39219) - GH-39238 - [Go] PATCH Prevents empty record to be appended to empty resultset (#39239)
- GH-39288 - [Java][FlightSQL] Update Apache Avatica to version 1.24.0 (#39325)
- GH-39306 - [C++][Benchmarking] Remove hardcoded min times (#39307)
- GH-39327 - [Java] define assemble descriptor for new custom maven plugin project (#39331)
- GH-39333 - [C++] Don’t use “if constexpr” in lambda (#39334)
- GH-39359 - [CI][C++] Remove MinGW MINGW32 C++ job (#39376)
- GH-39384 - [C++] Disable -Werror=attributes for Azure SDK’s identity.hpp (#39448)
- GH-39387 - [C++] Fix compile warning (#39389)
- GH-39421 - [CI][Ruby] Update to using Ubuntu 22.04 on test-ruby and test-c-glib nightly jobs (#39422)
- GH-39423 - [CI][JS] TypeScript: Compilation failed on yarn build for several CI jobs
- GH-39425 - [CI] Fix import to match new substrait repo structure (#39426)
- GH-39433 - [Ruby] Add support for Table.load(format: json) options (#39464)
- GH-39437 - [CI][Python] Update pandas tests failing on pandas nightly CI build (#39498)
- GH-39468 - [Java] Fix site build for docs (#39471)
- GH-39469 - [CI][JS] Force node 20 on JS build on arm64 to fix build issues (#39499)
- GH-39488 - [Ruby] Add support for ChunkedArray in Ractor (#39490)
- GH-39517 - [C++] Disable parallelism for jemalloc external project (#39522)
- GH-39562 - [C++][Parquet] Fix crash in test_parquet_dataset_lazy_filtering (#39632)
- GH-39564 - [CI][Java] Set correct version on Java BOM (#39580)
- GH-39584 - [R] fallback to source gracefully (#39587)
- GH-39588 - [CI][Go] Add CGO_ENABLED=1 to cdata_integration build to fix macOS build with conda (#39589)
- GH-39598 - [C#] Fix verification script (#39605)
- GH-39604 - [JS] Do not use resizable buffers yet (#39607)
- GH-39628 - [C++] Disable parallelism for all `make`-based externalProjects when CMake >= 3.28 is used
New Features and Improvements
- GH-14936 - [Java] Remove netty dependency from arrow-vector (#38493)
- GH-28994 - [C++][JSON] Change the max rows to Unlimited(int_32) (#38582)
- GH-30117 - [C++][Python] Add “Z” to the end of timestamp print string when tz defined (#39272)
- GH-30717 - [C#] Add ToString() methods to Arrow classes (#36566)
- GH-31303 - [Python] Remove the legacy ParquetDataset custom python-based implementation (#39112)
- GH-31579 - [C#] : Remove out-of-support versions of .NET and update C# README (#39165)
- GH-33500 - [Python] add
Table.to/from_struct_array
(#38520) - GH-33984 - [C++][Python] DLPack implementation for Arrow Arrays (producer) (#38472)
- GH-34316 - [Python] FixedSizeListArray.from_arrays supports mask parameter (#39396)
- GH-34569 - [C++] Diffing of Run-End Encoded arrays (#35003)
- GH-34636 - [C#] Reduce allocations when using ArrayPool (#39166)
- GH-35260 - [C++][Python][R] Allow users to adjust S3 log level by environment variable (#38267)
- GH-35331 - [Python] Expose Parquet sorting metadata (#37665)
- GH-35344 - [C++][Format] Implementation of the LIST_VIEW and LARGE_LIST_VIEW array formats (#35345)
- GH-35560 - [C++] Use Cast() instead of CastTo() for Scalar in test (#39044)
- GH-36036 - [C++][Python][Parquet] Implement Float16 logical type (#36073)
- GH-36044 - [Python][Docs] Added ParquetFileFragment to the API reference docs (#38277)
- GH-36099 - [C++] Add Utf8View and BinaryView to the c ABI (#38443)
- GH-36441 - [Python] Make
CacheOptions
configurable from Python (#36627) - GH-36760 - [Go] Add Avro OCF reader (#37115)
- GH-36815 - [C#] : Enable net472 tests under Windows (#36818)
- GH-36898 - [CI] Hashpin Sensitive GitHub Actions (#37676)
- GH-37002 - [C++][Parquet] Add api to get RecordReader from RowGroupReader (#37003)
- GH-37061 - [Docs][Format] Clarify semantics of GetSchema in FSQL (#38549)
- GH-37199 - [C++] Expose a span converter for Buffer and ArraySpan (#38027)
- GH-37242 - [Python][Parquet] Parquet Support write and validate Page CRC (#38360)
- GH-37312 - [Python][Docs] Update Python docstrings to reflect new parquet encoding option (#38070)
- GH-37359 - [C#] Add ToList() to Decimal128Array and Decimal256Array (#37383)
- GH-37378 - [C++] Add A Dictionary Compaction Function For DictionaryArray (#37418)
- GH-37429 - [C++] Add arrow::ipc::StreamDecoder::Reset() (#37970)
- GH-37511 - [C++] Implement file reads for Azure filesystem (#38269)
- GH-37582 - [Go][Parquet] Implement Float16 logical type (#37599)
- GH-37592 - [MATLAB] Add
NumRows
property toarrow.tabular.RecordBatch
(#38215) - GH-37710 - [C++][Integration] Add C++ Utf8View implementation (#37792)
- GH-37753 - [C++][Gandiva] Add external function registry support (#38116)
- GH-37812 - [MATLAB] Add
arrow.type.ListType
MATLAB class (#38189) - GH-37815 - [MATLAB] Add
arrow.array.ListArray
MATLAB class (#38357) - GH-37848 - [C++][Gandiva] Migrate LLVM JIT engine from MCJIT to ORC v2/LLJIT (#39098)
- GH-37857 - [Python][Dataset] Expose file size to python dataset (#37868)
- GH-37889 - [Java][Doc] Improve JDBC driver documentation (#38469)
- GH-37895 - [C++] Feature: support concatenate recordbatches. (#37896)
- GH-37910 - [Java][Integration] Implement C Data Interface integration testing (#38248)
- GH-37943 - [Java] Add parquet file with all supported types (#38249)
- GH-37979 - [C++] Add support for specifying custom Array opening and closing delimiters to
arrow::PrettyPrintDelimiters
(#38187) - GH-38022 - [Java][FlightRPC] Expose app_metadata on FlightInfo and FlightEndpoint (#38331)
- GH-38024 - [Java][FlightRPC] Expose appMetadata through JDBC ResultSet (#38781)
- GH-38033 - [R] Allow
code()
to return package name prefix. (#38144) - GH-38042 - [C++][Benchmark] Add non-stream Codec Compression/Decompression (#38067)
- GH-38117 - [C++][Parquet] Change DictEncoder dtor checking to warning log (#38118)
- GH-38131 - [Swift][CI] Add linting and fix linting errors (#38133)
- GH-38153 - [C#] expose ArrayDataConcatenator.Concatenate (#38154)
- GH-38164 - [MATLAB] Rename
Length
property onarrow.array.Array
andarrow.array.ChunkedArray
toNumElements
(#38190) - GH-38166 - [MATLAB] Improve tabular object display (#38482)
- GH-38246 - [JAVA] added new getTransferPair() function that takes in a Field type for Complex Type Vectors (#38261)
- GH-38264 - [Java][Packaging] Add BOM file (#38336)
- GH-38271 - [C++][Parquet] Support reading parquet files with multiple gzip members (#38272)
- GH-38300 - [Dev][Docs] Update dev/README.md for the current merge_arrow_pr.py (#38301)
- GH-38310 - [MATLAB] Create the testing guideline document for testing MATLAB interface (#38459)
- GH-38316 - [C#] Implement interval types (#39043)
- GH-38326 - [C++][Parquet] check the decompressed page size same as size in page header (#38327)
- GH-38330 - [C++][Azure] Use properties for input stream metadata (#38524)
- GH-38333 - [C++][FS][Azure] Implement file writes (#38780)
- GH-38335 - [C++] Implement
GetFileInfo
for a single file in Azure filesystem (#38505) - GH-38339 - [C++][CMake] Use transitive dependency for system GoogleTest (#38340)
- GH-38341 - [Python] Remove usage of pandas internals DatetimeTZBlock (#38321)
- GH-38346 - [C++][Parquet] Use new encrypted files for page index encryption test (#38347)
- GH-38348 - [C#] Make PrimitiveArray
support IReadOnlyList<T?> (#38680) - GH-38351 - [C#] Add SqlDecimal support to Decimal128Array (#38481)
- GH-38354 - [MATLAB] Implement
fromMATLAB
method forarrow.array.ListArray
(#38561) - GH-38361 - Add validation logic for
offsets
andvalues
toarrow.array.ListArray.fromArrays
(#38531) - GH-38376 - [R] : Add
dimnames
method toDataset
class (#38377) - GH-38381 - [C++][Acero] Create a sorted merge node (#38380)
- GH-38398 - [MATLAB] Improve array display (#38400)
- GH-38402 - [CI][Integration] Provide wrapper scripts for integration testing (#38403)
- GH-38415 - [MATLAB] Add indexing “slice” method to C++ Array Proxy class (#38674)
- GH-38417 - [MATLAB] Implement a
TableTypeValidator
class that validates a MATLABcell
array contains onlytable
s that share the same schema (#38551) - GH-38418 - [MATLAB] Add method for extracting one row of an
arrow.tabular.Table
as a string (#38463) - GH-38419 - [MATLAB] Implement a
ClassTypeValidator
class that validates a MATLABcell
array contains only values of the same class type. (#38530) - GH-38420 - [MATLAB] Implement a
DatetimeValidator
class that validates a MATLABcell
array contains only values of zoned or unzoneddatetime
s (#38533) - GH-38424 - [CI][C++] Use Fedora 38 instead of 35 (#38425)
- GH-38452 - [C++][Benchmark] Adding benchmark for LZ4/Snappy Compression (#38453)
- GH-38457 - [C++] Support LogicalNullCount for DictionaryArray (#38681)
- GH-38460 - [Java][FlightRPC] Add mTLS support for Flight SQL JDBC driver (#38461)
- GH-38462 - [Go][Parquet] Handle Boolean RLE encoding/decoding (#38367)
- GH-38483 - [C#] Add support for more decimal conversions (#38508)
- GH-38506 - [Go][Parquet] Add NumRows and RowGroupNumRows to pqarrow.FileWriter (#38507)
- GH-38511 - [Java] Add getTransferPair(Field, BufferAllocator, CallBack) for StructVector and MapVector (#38512)
- GH-38528 - [Python][Compute] Describe strptime format semantics (#38665)
- GH-38537 - [Java] upgrade to netty 4.1.100.Final (#38538)
- GH-38541 - [MATLAB] Add remaining tests for arrow tabular object display (#38564)
- GH-38542 - [C++][Parquet] Faster scalar BYTE_STREAM_SPLIT (#38529)
- GH-38562 - [Packaging] Add support for Ubuntu 23.10 (#38563)
- GH-38576 - [Java] Change JDBC driver to optionally preserve cookies and auth tokens when getting streams (#38580)
- GH-38578 - [Java][FlightSQL] Remove joda usage from flight-sql library (#38579)
- GH-38589 - [C++][Gandiva] Support registering external C functions (#38632)
- GH-38597 - [C++] Implement GetFileInfo(selector) for Azure filesystem (#39009)
- GH-38602 - [R] Add missing
prod
for summarize (#38601) - GH-38627 - [Java][FlightRPC] Handle null parameter values (#38628)
- GH-38648 - [Java] Regenerate Flatbuffers (#38650)
- GH-38652 - [MATLAB] Add tests about time precision preservation when converting MATLAB duration to
arrow.array.Time32Array
andarrow.array.Time64Array
(#38661) - GH-38662 - [Java] Add comparators (#38669)
- GH-38699 - [C++][FS][Azure] Implement
CreateDir()
(#38708) - GH-38700 - [C++][FS][Azure] Implement
DeleteDir()
(#38793) - GH-38701 - [C++][FS][Azure] Implement
DeleteDirContents()
(#38888) - GH-38702 - [C++] : Implement AzureFileSystem::DeleteRootDirContents (#39151)
- GH-38705 - [C++][FS][Azure] Implement CopyFile() (#39058)
- GH-38712 - [Python] Remove dead code in _reconstruct_block (#38714)
- GH-38718 - [Go][Format][Integration] Add StringView/BinaryView to Go implementation (#35769)
- GH-38732 - [Java][FlightRPC] Add support for Array parameter binding in JDBC (#38733)
- GH-38751 - [C++][Go][Parquet] Add tests for reading Float16 files in parquet-testing (#38753)
- GH-38757 - [C#] Implement common interfaces for structure arrays and record batches (#38759)
- GH-38758 - [C++][FS][Azure] Rename AzurePath to AzureLocation (#38773)
- GH-38772 - [C++] Implement directory semantics even when the storage account doesn’t support HNS (#39361)
- GH-38798 - [Integration] Enable C Data Interface integration testing on Rust (#38799)
- GH-38814 - [C++][Parquet] Update parquet.thrift to sync with 2.10.0 (#38815)
- GH-38824 - [Go] Enable GC checks (#38826)
- GH-38836 - [Go] Add Size() for ArrayData (#38839)
- GH-38852 - [C++] Replace “#ifdef ARROW_WITH_GZIP” in dataset test to ARROW_WITH_ZLIB (#38853)
- GH-38857 - [Python] Fix append mode for cython 2 (#39027)
- GH-38857 - [Python] Add append mode for pyarrow.OsFile (#38820)
- GH-38860 - [C++][Parquet] Using length to optimize bloom filter read (#38863)
- GH-38870 - [Documentation] Add List View and Large List View to status.rst (#38871)
- GH-38874 - [C++][Parquet] Minor: making parquet TypedComparator operation as const method (#38875)
- GH-38884 - [C++] DatasetWriter release rows_in_flight_throttle when allocate writing failed (#38885)
- GH-38887 - [C++][Parquet] Move EstimatedBufferedValueBytes from TypedColumnWriter to ColumnWriter (#39055)
- GH-38907 - [C++] Stop installing internal bpacking_simd* headers (#38908)
- GH-38909 - [Packaging] Drop support for Ubuntu 23.04 (#38910)
- GH-38918 - [Go] Avoid schema.Fields allocations in some places (#38919)
- GH-38920 - [C++][Gandiva] Refactor function holder to return arrow Result (#38873)
- GH-38958 - [C++][Parquet] Fix spelling (#38959)
- GH-38990 - [Java] Upgrade to flatc version 23.5.26 (#38991)
- GH-38996 - [Java] Update dependencies and plugins for JPMS modules (#38994)
- GH-39006 - [Python] Extract libparquet requirements out of libarrow_python.so to new libarrow_python_parquet_encryption.so (#39316)
- GH-39013 - [Go][Integration] Support cABI import/export of StringView (#39019)
- GH-39020 - [CI][Release][JS] Use Node.js 18 instead of 16 (#39021)
- GH-39028 - [Python][CI] Fix dask integration build by temporarily skipping test_categorize_info (#39029)
- GH-39037 - [Java] Remove (Contrib/Experimental) mention in Flight SQL (#39040)
- GH-39049 - [C++] Use Cast() instead of CastTo() for Dictionary Scalar in test (#39362)
- GH-39050 - [C++] Use Cast() instead of CastTo() for Timestamp Scalar in test (#39060)
- GH-39051 - [C++] Use Cast() instead of CastTo() for List Scalar in test (#39353)
- GH-39064 - [C++][Parquet] Support row group filtering for nested paths for struct fields (#39065)
- GH-39088 - [Dev][Java] Add Dependabot configuration for Java (#39089)
- GH-39096 - [Python] Release GIL in
.nbytes
(#39097) - GH-39119 - [C++] Refactor the Azure FS tests and filesystem class instantiation (#39207)
- GH-39122 - [C++][Parquet] Optimize FLBA record reader (#39124)
- GH-39134 - Create module info compiler plugin (#39135)
- GH-39159 - [C++] : Try to make Buffer::device_type_ non-optional (#39150)
- GH-39170 - [Java] Improve error message explaining why TestTls might fail (#39171)
- GH-39189 - [Java] Bump com.h2database:h2 from 1.4.196 to 2.2.224 in /java (#39188)
- GH-39196 - [Python][Docs] Document the Arrow PyCapsule protocol in the ‘extending pyarrow’ section of the Python docs (#39199)
- GH-39208 - [C++][Parquet] Remove deprecated AppendRowGroup(int64_t num_rows) (#39209)
- GH-39210 - [C++][Parquet] Avoid WriteRecordBatch from produce zero-sized RowGroup (#39211)
- GH-39217 - [Python] RecordBatchReader.from_stream constructor for objects implementing the Arrow PyCapsule protocol (#39218)
- GH-39223 - [C#] Support IReadOnlyList<T?> on remaining scalar types (#39224)
- GH-39225 - [GLib] Use Cast() instaed of CastTo (#39228)
- GH-39232 - [C++] Support binary to fixed_size_binary cast (#39236)
- GH-39243 - [R][CI] Remove r-arrow conda nightlies (#39244)
- GH-39246 - [CI][GLib][Ruby] Use Ubuntu 22.04 not 20.04 (#39247)
- GH-39262 - [C++][Azure][FS] Add default credential auth configuration (#39263)
- GH-39265 - [Java] Make it run well with the netty newest version 4.1.104 (#39266)
- GH-39268 - [C++] Don’t install bundled Azure SDK for C++ with CMake 3.28+ (#39269)
- GH-39292 - [C++][FS] : Remove the AzureBackend enum and add more flexible connection options (#39293)
- GH-39297 - [C++][FS] : Inform caller of container not-existing when checking for HNS support (#39298)
- GH-39299 - [Java] Upgrade to Avro 1.11.3 (#39300)
- GH-39303 - [Archery][Benchmarking] Allow setting C++ repetition min time (#39324)
- GH-39318 - [C++][FS][Azure] Add workload identity auth configuration (#39319)
- GH-39320 - [C++][FS][Azure] Add managed identity auth configuration (#39321)
- GH-39322 - [C++] Forward arguments to ExceptionToStatus all the way to Status::FromArgs (#39323)
- GH-39326 - [C++] Flaky DatasetWriterTestFixture.MaxRowsOneWriteBackpresure test (#39379)
- GH-39328 - [Java] Make default getConsumer public (#39329)
- GH-39335 - [C#] Support creating FlightClient with Grpc.Core.Channel (#39348)
- GH-39339 - [C++] Add ForceCachedHierarchicalNamespaceSupport to help with testing (#39340)
- GH-39341 - [C#] Support Utf8View, BinaryView and ListView (#39342)
- GH-39343 - [C++][FS][Azure] Add client secret auth configuration (#39346)
- GH-39355 - [Java] Improve JdbcConsumer exceptions (#39356)
- GH-39357 - [C++] Reduce function.h includes (#39312)
- GH-39363 - [C++] Use Cast() instead of CastTo() for Parquet (#39364)
- GH-39413 - [C++][Parquet] Vectorize decode plain on FLBA (#39414)
- GH-39419 - [C++][Parquet] Style: Using arrow::Buffer data_as api rather than reinterpret_cast (#39420)
- GH-39430 - [C++][ORC] Upgrade ORC to 1.9.2 (#39431)
- GH-39449 - [C++] Use default Azure credentials implicitly and support anonymous credentials explicitly (#39450)
- GH-39484 - [Java] Support 256 bit decimals in JdbcToArrowUtils (#39485)
- GH-39500 - [Docs] Pin pydata-sphinx-theme to 0.14 (#39501)
- GH-39515 - [Python] Pass in type to
MapType.from_arrays
(#39516) - GH-39531 - [Python][CI] Skip failing dask tests: test_describe_empty and test_view (#39534)
- GH-39533 - [Python] NumPy 2.0 compat: remove usage of np.core (#39535)
- GH-39537 - [Packaging][Python] Add a numpy<2 pin to the install requirements for the 15.x release branch (#39538)
- GH-39601 - [R] Don’t download cmake when TEST_OFFLINE_BUILD=true (#39602)
- GH-39624 - [R][CI] Add CMake to docker file and update envvars (#39625)
- GH-39626 - [Docs][R] Update NEWS.md for 15.0.0
- PARQUET-2411 - [C++][Parquet] Allow reading dictionary without reading data via ByteArrayDictionaryRecordReader (#39153)