Apache Arrow 23.0.0 (18 January 2026)
This is a major release covering more than 3 months of development.
Download
- Source Artifacts
- Binary Artifacts
- Git tag
Contributors
This release includes 417 commits from 71 distinct contributors.
$ git shortlog -sn apache-arrow-22.0.0..apache-arrow-23.0.0
54 Sutou Kouhei
52 Hyukjin Kwon
45 Raúl Cumplido
41 Antoine Pitrou
36 Sten Larsson
34 Alina (Xi) Li
32 Nic Crane
12 William Ayd
10 dependabot[bot]
7 Hiroyuki Sato
5 Rossi Sun
4 Alenka Frim
4 Arash Andishgar
4 Bogdan Romenskii
3 Adam Reeve
3 Antoine Prouvost
3 Jonathan Giannuzzi
3 Ruifeng Zheng
3 justing-bq
3 tadeja
2 Gang Wu
2 Jonathan Keane
2 Kevin Liu
2 Sarah Gilmore
2 Vishwanatha-HD
2 Zehua Zou
2 hypsakata
2 tennisleng
1 Aihua Xu
1 Andrew Lamb
1 Aokizy
1 Arnav Balyan
1 Artem Leshchev
1 Bryce Mecum
1 Connor Sanders
1 Dan Homola
1 Daniil Timižev
1 Davis Vaughan
1 Eirik
1 Enrico Minack
1 Eslam Ahmed
1 Graham Markall
1 Hendrik Makait
1 Jason Wong
1 Johan Peltenburg
1 Jonah Kelman
1 Jordan Rupprecht
1 Joris Van den Bossche
1 Julien Stephan
1 Lucas Valente
1 Michael Behrisch
1 Mo.Aamir Hokabaj
1 Nick Mitchell
1 Patrick J. Roddy
1 Patrick Parsons
1 Rok Mihevc
1 Shadab Zafar
1 Sidney Mau
1 Simon Elbaz
1 Sirui Mu
1 Smith Cruise
1 TennyZhuang
1 Will Gulian
1 Yibo Cai
1 chegoryu
1 feuillatte
1 kilavvy
1 meher745
1 paultiq
1 scott-routledge2
1 tobim
Patch Committers
The following Apache committers merged contributed patches to the repository.
$ git shortlog -sn --group=trailer:signed-off-by apache-arrow-22.0.0..apache-arrow-23.0.0
153 Sutou Kouhei
62 Antoine Pitrou
58 Raúl Cumplido
36 David Li
27 Nic Crane
19 AlenkaF
8 Adam Reeve
7 Rossi Sun
6 Rok Mihevc
4 dependabot[bot]
3 Gang Wu
2 James Duong
2 Sarah Gilmore
2 Will Ayd
1 Bryce Mecum
1 raulcd
Changelog
Bug Fixes
- GH-33473 - [Python] Fix KeyError on Pandas roundtrip with RangeIndex in MultiIndex (#39983)
- GH-35957 - [C++][Compute] Graceful error for decimal binary arithmetic and comparison instead of firing confusing assertion (#48639)
- GH-41246 - [C++][Python] Simplify nested field encryption configuration (#45462)
- GH-42173 - [R][C++] Writing partitioned dataset on S3 fails if ListBucket is not allowed for the user (#47599)
- GH-43660 - [C++][Compute] Avoid ZeroCopyCastExec when casting Binary offset -> Binary offset types (#48171)
- GH-44318 - [C++][Python] Fix RecordBatch::FromStructArray for sliced arrays with offset = 0 (#47843)
- GH-45260 - [R][Docs] Improve documentation on GCS support
-
GH-45867 - [Python] Fix
SetuptoolsDeprecationWarning(#47141) - GH-46063 - [C++][Compute] Fix the issue that MinMax kernel emits -inf/inf for all-NaN input (#48459)
- GH-46584 - [C++][FlightRPC] Iterate over endpoints in ODBC driver (#47991)
- GH-47000 - [R] concat_tables on a record_batch causes segfault (#47885)
- GH-47022 - [Python] Support unsigned dictionary indices in pandas conversion (#48451)
-
GH-47099 - [C++][Parquet] Add missing
pragma warning(pop)toparquet/platform.h(#47114) - GH-47371 - , GH-48281: [Python][CI] Fix Numba-CUDA interop (#48284)
- GH-47559 - [Python] Fix missing argument in pyarrow fs (#47497)
- GH-47564 - [C++] Update expected L2 CPU cache range to 32KiB-64MiB (#47563)
- GH-47664 - [C++][Parquet] add num_rows_ before each call to RowGroupWriter::Close in FileSerializer (#47665)
- GH-47734 - [Python] Fix hypothesis timedelta bounds for duration/interval types (#48460)
- GH-47751 - [CI] Fix check for job to ignore on reporting (#47755)
- GH-47778 - [CI][Python] Remove ORC alias timezone for US/Pacific on test_orc.py::test_timezone_absent (#47956)
- GH-47781 - [C++] Cleaned up type-limit warning in sink_node.cc (#47782)
- GH-47807 - [C++][Compute] Fix the issue that null count is not updated when setting slice on an array span (#47808)
- GH-47812 - [R][CI] Fix lint for new version of styler (#47813)
- GH-47821 - [CI][Release][R] Fix test repository path in release (#47929)
- GH-47823 - [Python] Use PyWeakref_GetRef instead of PyWeakref_GET_OBJECT (Python 3.15) (#48027)
- GH-47825 - [C++] Fix the issue that bitmap ops overriding partial leading byte (#47912)
- GH-47830 - [Release] Run RC verification source testing step in a subshell (#47831)
- GH-47836 - [C++] Fix Meson configuration after bpacking changes (#47837)
- GH-47840 - [CI][C++] Check whether the CSV module/thread sanitizer is enabled or not before building example (#47841)
- GH-47844 - [CI] Fix unconditionally running extra workflows reporting when there are jobs failing (#47917)
- GH-47859 - [C++] Fix creating union types without type_codes for fields.size() == 128 (#47815)
- GH-47861 - [Python] reduce memory usage when using to_pandas() with many extension arrays columns (#47860)
- GH-47883 - [CI] Add openssl gem explicitly to fix ceriticate validation error on test (#47884)
- GH-47909 - [C++] Fix MSVC ARM64 build (#47910)
- GH-47914 - [C++] Fix system Apache ORC/Google logging used detection (#47915)
- GH-47918 - [Format] Clarify that empty compressed buffers can omit the length header (#48541)
- GH-47919 - [C++] Update Meson config for C Data Interface changes (#47920)
- GH-47921 - [C++] Implement substrait option in Meson (#48016)
- GH-47923 - [CI] Use macos-15-intel instead of macos-13 for macOS x86 runner (#47690)
- GH-47924 - [C++] Fix issues in CSV reader with invalid inputs (#47925)
- GH-47927 - [Release] Fix APT repository metadata generation with new repository (#47928)
- GH-47932 - [Release][Python] PyPI rejects our source distribution due to missing LICENSE.txt
- GH-47933 - [Release][R] Don't upload *.sha512.{asc,sha512} (#47982)
- GH-47941 - [R] Fix codegen.R error from dplyr pipe to base pipe change (#47985)
- GH-47942 - [R] CRAN 22.0.0 R package release fails on Winbuilder due to "non-API call to R: 'Rf_lazy_duplicate'" (#47943)
- GH-47945 - [C++] Add support for Boost 1.89.0 and require Boost 1.69 or later (#47947)
- GH-47948 - [CI][Packaging][Deb] Add missing directory existent check (#47949)
-
GH-47953 - [C++] Remove Windows inclusion from
int_util_overflow.h(#47950) - GH-47955 - [C++][Parquet] Support reading INT-encoded Decimal stats as Arrow scalar (#48001)
- GH-47961 - [C++] Fix Meson's Boost process version detection (#48017)
- GH-47964 - [Docs] Add dcleblanc/SafeInt to the LICENSE.txt file (#47965)
- GH-47966 - [Python] PyArrow v22.0 assumes Pandas DataFrame attrs are serializable (#47977)
- GH-47967 - [C++] Update Meson Configuration with SafeInt Changes (#47968)
- GH-47970 - [CI][C++] Fix a bug that JNI jobs runs nothing (#47972)
- GH-47973 - [C++][Parquet] Fix invalid Parquet files written when dictionary encoded pages are large (#47998)
- GH-47981 - [C++][Parquet] Add compatibility with non-compliant RLE stream (#47992)
- GH-47983 - [CI][R] R nightly upload workflow failing for a few weeks (#47984)
- GH-48004 - [C++][Parquet] Fix hang in ColumnReader benchmark (#48005)
- GH-48010 - [C++] Update bundled RE2 from 2022-06-01 to 2023-03-01 (#48011)
- GH-48029 - [R][CI] R nightly upload workflow failing in pruning step (#48030)
-
GH-48044 - [Packaging][RPM][Parquet] Don't install
parquet-glib.pcbyparquet-devel(#48045) - GH-48046 - [Docs][C++] Clarify "Exporting Tracing Information" section in OTel docs (#48047)
- GH-48057 - [R] Slow reading performance caused by apply_arrow_r_metadata() looping through all columns, including NULL ones (#48104)
- GH-48062 - [C++] Fix null pointer dereference in MakeExecBatch (#48063)
- GH-48064 - [C++] Set ARROW_BUILD_STATIC=ON when features-flight are enabled on CMake presets (#48065)
- GH-48076 - [C++][Flight] fix GeneratorStream for Tables (#48082)
- GH-48079 - [CI] Fix a typo in util_free_space.sh (#48088)
- GH-48095 - [Python][Docs] Add missing {pyarrow,compute} functions to API docs (#48117)
- GH-48098 - [R] Fix nightly libarrow binary uploads (#48100)
- GH-48107 - [CI] Update testing submodule (#48114)
- GH-48115 - [C++] Better align Meson configuration and config.h (#48116)
- GH-48125 - [C++] Remove gnu11 standard from the Meson configuration (#48126)
- GH-48127 - [R] stringr argument deprecation - add binding for stringr::str_ilike() and remove ignore_case argument for stringr::str_like() (#48262)
- GH-48129 - [CI] Stale issues bot only looks at 30 issues at a time (#48130)
- GH-48134 - [C++] Make StructArray::field() thread-safe (#48128)
- GH-48142 - [CI] Disallow scheduled GitHub Actions run on forked repos (#48143)
- GH-48146 - [C++][Parquet] Fix undefined behavior with invalid column/offset index (#48147)
- GH-48162 - [CI] Stale issues bot hit secondary rate limit and did not complete (#48165)
- GH-48168 - [C++][Parquet] Fix setting column-specific options when writing an encrypted Dataset (#48170)
- GH-48234 - [C++][Parquet] Fix overly strict check for BIT_PACKED levels byte size (#48235)
- GH-48238 - [C++] Actually write IPC schema endianness, not host endianness (#48239)
- GH-48246 - [C++][Parquet] Fix pre-1970 INT96 timestamps roundtrip (#48247)
- GH-48263 - [CI] Stale issues workflow doesn't go through enough issues (#48264)
- GH-48268 - [C++][Acero] Enhance the type checking for hash join residual filter (#48272)
- GH-48280 - [CI] PYTHON_PATCH_VERSION docker warnings (#48282)
- GH-48283 - [R][CI] Failures on R Lint on main (#48286)
- GH-48308 - [C++][Parquet] Fix potential crash when reading invalid Parquet data (#48309)
- GH-48314 - [Python] Compat with pandas 3.0 changed default datetime unit (#48319)
-
GH-48340 - [R] respected
MAKEFLAGS(#48341) - GH-48376 - [C++] Update GoogleTest from 1.16.0 to 1.17.0 (#48377)
- GH-48416 - [Packaging][CI] Use custom orc_for_bundling when using FetchContent to avoid ar issues with + symbol on path (#48430)
- GH-48417 - [Packaging][CI] Skip downgrade testing for Debian testing (#48427)
- GH-48432 - [CI][Ruby] Don't run Red Arrow Format tests with Ruby 3.1 (#48434)
- GH-48478 - [Ruby] Fix Ruby list inference for nested non-negative integer arrays (#48584)
- GH-48481 - [Ruby] Correctly infer types for nested integer arrays (#48699)
- GH-48540 - [Python][C++][CI] test_s3_options crash on macOS
- GH-48566 - [C++][CI] Fix compilation on Valgrind job (#48567)
- GH-48570 - [C++] Add Missing Fuzz Sources to Meson configuration (#48571)
- GH-48608 - [Python] Fix interpolate actual values in Message.repr f-string (#48656)
- GH-48610 - [Ruby] Add FixedSizeListArray glue (#48609)
- GH-48625 - [Python] Add temporal unit checking in NumPyDtypeUnifier (#48626)
- GH-48641 - [CI] Multiple nightly R builds failing due to ssache errors
- GH-48725 - [C++] Fix bundled Protobuf doesn't exist in libarrow_bundled_dependencies (#48726)
- GH-48735 - [CI][Python] Fix macOS wheel builds by forcing setuptools upgrade in venv (#48739)
-
GH-48736 - [CI][Python] Restore AlmaLinux 8 support of
dev/release/setup-rhel-rebuilds.shfor wheel verification (#48748) - GH-48741 - [C++] Fix deadlock in CSV AsyncThreadedTableReader destructor (#48742)
- GH-48750 - [CI][Documentation] Disable Unity build for OpenTelemetry (#48751)
- GH-48776 - [CI][Ruby][Windows] Ensure removing temporary files (#48777)
- GH-48780 - [CI] Add missing permissions for reusable workflow calls (#48778)
- GH-48782 - [Docs][CI] Skip Markdown files with doxygen and trigger Docs job on PR when files are modified (#48786)
- GH-48784 - [GLib] Make (system) Parquet C++ is optional (#48785)
-
GH-48787 - [C++] Disable
-Werrorfor s2n-tls (#48791) - GH-48806 - [CI][Packaging] ubuntu-noble-arm64 has failes for several days due to network failure (403 Forbidden [IP: 91.189.92.19 80])
- GH-48807 - [CI] Clean up space on GitHub runner to fix manylinux wheel failure (#48790)
- GH-48809 - [CI] Fix homebrew-cpp with Mac by using formula-based dependency resolution (#48824)
-
GH-48811 - [C++][FlightRPC] ODBC: Add missing
arrow::to fix build (#48810) - GH-48827 - [CI][Python] Add required xz dependency to emscripten dockerfile (#48828)
- GH-48838 - [Release] Use gh cli to download sources for Linux packages and publish draft release before verification (#48839)
- GH-48841 - [Release][Package] Add GH_TOKEN to rake build step on Linux Packaging jobs (#48842)
New Features and Improvements
- GH-23970 - [GLib] Add support for duration (#48564)
- GH-24157 - [C++] Add tests for DayTimeIntervalBuilder (#48709)
- GH-31869 - [Python][Parquet] Implement external key material features in Python (#48009)
- GH-40735 - [Packaging][CentOS] Drop support for CentOS 7 (#48550)
- GH-41364 - [GLib][Ruby] Allow passing thread pool to ExecutePlan (#48462)
- GH-44810 - [C++][Parquet] Add arrow::Result version of parquet::arrow::FileReader::Make() (#48285)
- GH-45449 - [R][CI] Remove OpenSSL 1.x builds (#48297)
- GH-45484 - [C++] Drop support for the gold linker (#47780)
- GH-45885 - [C++] Require C++20 (#48414)
- GH-46004 - [C++][FlightRPC] Enable ODBC Build In C++ Workflows (#47689)
- GH-46096 - [C++][FlightRPC] Environment and Connection Handle Allocation (#47759)
- GH-46098 - [C++][FlightRPC] ODBC Environment Attribute Implementation (#47760)
- GH-46147 - [C++] Implement GCS support in Meson (#47568)
- GH-46411 - [C++] Implemented dataset option in Meson (#47669)
- GH-46465 - [C++][FlightRPC] Refactor ODBC namespaces and file structure (#47703)
- GH-46574 - [C++][FlightRPC] ODBC Driver Connectivity support (#47971)
- GH-46575 - [C++][FlightRPC] Add Diagnostic tests (#47764)
- GH-46575 - [C++][FlightRPC] ODBC Diagnostics Report (#47763)
- GH-46592 - [CI][Dev][R] Add Air to pre-commit (#47423)
- GH-46825 - [R] Use smallest_decimal() from C++ instead of working out which decimal type to instantiate in R (#47906)
- GH-46903 - [CI] Automatically flag stale issues (#46904)
- GH-47030 - [C++][Parquet] Add setting to limit the number of rows written per page (#47090)
- GH-47103 - [Statistics][C++] Implement Statistics specification attribute ARROW:null_count:approximate (#47969)
- GH-47105 - [Statistics][C++] Implement Statistics specification attribute ARROW:row_count:approximate (#48266)
- GH-47196 - [CI][C++] Add Windows ARM64 build (#47811)
- GH-47437 - [CI][Python] Update win wheels and free-threaded build for Python 3.14
- GH-47441 - [Python][Parquet] Allow passing write_time_adjusted_to_utc to Python's ParquetWriter (#47745)
- GH-47572 - [C++][Parquet] Uniform unpack interface (#47573)
- GH-47635 - [CI][Integration] Add new gold files (#47729)
- GH-47640 - [CI] Remove needless ci/docker/ubuntu-22.04-csharp.dockerfile (#48298)
- GH-47643 - [Python][Packaging] Enable CMAKE_INTERPROCEDURAL_OPTIMIZATION for wheels (#47733)
- GH-47677 - [C++][GPU] Allow building with CUDA 13 (#48259)
- GH-47697 - [C++][FlightRPC] Add ODBC API placeholders (#47725)
- GH-47706 - [C++][FlightRPC] ODBC SQLFreeStmt implementation (#48033)
- GH-47707 - [C++][FlightRPC] Add tests for descriptor handle allocation (#48053)
- GH-47708 - [C++][FlightRPC] Connection Attribute Support for ODBC (#47772)
- GH-47710 - [C++][FlightRPC] Statement attribute Support in ODBC (#47773)
- GH-47711 - [C++][FlightRPC] Enable ODBC query execution (#48032)
- GH-47713 - [C++][FlightRPC] ODBC SQLMoreResults implementation (#48035)
- GH-47713 - [C++][FlightRPC] ODBC return number of result columns (#48036)
- GH-47713 - [C++][FlightRPC] ODBC return number of affected rows (#48037)
- GH-47713 - [C++][FlightRPC] ODBC Basic Data Retrieval (#48034)
- GH-47714 - [C++][FlightRPC] ODBC extended fetch (#48040)
- GH-47715 - [C++][FlightRPC] ODBC scroll fetch implementation (#48041)
- GH-47716 - [C++][FlightRPC] ODBC bind column implementation (#48042)
- GH-47717 - [C++][FlightRPC] ODBC close cursor (#48043)
- GH-47719 - [C++][FlightRPC] Extract SQLTables Implementation (#48021)
- GH-47720 - [C++][FlightRPC] ODBC Columns Metadata (#48049)
- GH-47721 - [C++][FlightRPC] Followup to remove unncessary std::move to resolve compliation flakiness (#48687)
- GH-47721 - [C++][FlightRPC] Return ODBC Column Attribute from result set (#48050)
- GH-47722 - [C++][FlightRPC] ODBC Data Type Information (#48051)
- GH-47723 - [C++][FlightRPC] ODBC SQLNativeSQL implementation (#48020)
- GH-47724 - [C++][FlightRPC] ODBC: implement SQLDescribeCol (#48052)
- GH-47726 - [C++][FlightRPC] ODBC Unicode Support (#47771)
- GH-47728 - [Python] Check the source argument in parquet.read_table (#48008)
- GH-47747 - [C++] Bump Apache ORC to 2.2.1 (#47744)
- GH-47753 - [C++][Parquet] Build Thrift with OpenSSL disabled (#47754)
- GH-47756 - [C++][CI] Fuzz CSV reader (#47757)
- GH-47767 - [CI] Add date to extra CI report email subject (#47777)
- GH-47784 - [C++] Patch vendored pcg library to enable msvc arm64 intrinsics (#47779)
- GH-47786 - [C++][FlightRPC] Establish ODBC tests (#47788)
-
GH-47787 - [C++][FlightRPC] ODBC
msiWindows installer (#48054) - GH-47789 - [C++][FlightRPC] SQLGetFunctions Tests (#48031)
- GH-47797 - [CI][Python] Update Python installs for free-threaded wheel tasks (#47993)
- GH-47800 - [C++][CI] Fuzz more CSV reader types (#48398)
- GH-47806 - [CI] Rename deprecated docker-compose.yml to preferred compose.yaml file (#47954)
- GH-47833 - [C++] Add utf8proc option to Meson configuration (#47834)
- GH-47881 - [C++] Update fast_float version to 8.1.0 (#47882)
- GH-47887 - [C++][Integration] Enable extension types with C Data Interface tests (#47888)
- GH-47891 - [C++][Parquet] Generate a separate fuzz seed file for each column (#47892)
- GH-47895 - [C++][Parquet] Add prolog and epilog in unpack (#47896)
- GH-47905 - [C++][Parquet] MakeColumnStats should use user-provided memory pool (#47894)
- GH-47926 - [C++] Adopt alternative safe arithmetic library (#47958)
- GH-47936 - [R] docgen.R requires installed package instead of current working code (#47940)
- GH-47939 - [R] Update CRAN packaging checklist to update checksums and have make build call make clean (#47944)
- GH-47974 - [Docs] Remove stray documentation from Java and JS (#48006)
- GH-47975 - [Docs][Python] Remove experimental warning on PyCapsule documentation (#47976)
- GH-47978 - [C++][Parquet][CI] Add more compression codecs to fuzzing seed corpus (#47979)
- GH-48000 - [CI][Release] Publish RC GitHub Release as draft to allow immutable releases (#48059)
- GH-48013 - [R] Add CI job for musl (Alpine Linux) to replicate CRAN checks (#48014)
- GH-48025 - [C++][GLib] Replace instances where build path is being added to built artifacts (#48026)
- GH-48055 - [C++][FlightRPC] Allow spaces while parsing Table Type in ODBC (#48056)
- GH-48074 - [C++] Use FetchContent for bundled Abseil (#48075)
- GH-48084 - [C++][FlightRPC] Replace boost::optional with std::optional (#48323)
- GH-48089 - [C++][Parquet] Read statistics and other metadata when fuzzing (#48090)
- GH-48091 - [C++] Use FetchContent for bundled c-ares (#48092)
- GH-48096 - [Python][Parquet] Expose new WriterProperties::max_rows_per_page to Python bindings (#48101)
- GH-48102 - [Python] Remove deprecated Array.format method (#48324)
- GH-48105 - [C++][Parquet][IPC] Cap allocated memory when fuzzing (#48108)
- GH-48112 - [C++][Parquet] Use more accurate data length estimate when decoding PLAIN BYTE_ARRAY data (#48113)
- GH-48123 - [C++][Float16] Reimplement arrow::WithinUlp and Enable it for float16 (#48224)
- GH-48139 - [C++] Allow compilation for QNX versions up to 8 (#48140)
- GH-48152 - [CI][MATLAB] Bump MATLAB release to R2025b in the MATLAB GitHub Actions Workflow (#48153)
-
GH-48154 - [MATAB][Packaging] Update MATLAB crossbow workflow to build against MATLAB
R2025b(#48155) - GH-48163 - [CI][Docs] Update preview docs task S3 secret to use (#48164)
- GH-48167 - [Python][C++][Compute] Add python bindings for scatter, inverse_permutation (#48267)
- GH-48174 - [CI][Dev] Fix shellcheck errors in ci/scripts/util_download_apache.sh (#48175)
- GH-48176 - [C++][Parquet] Fix arrow-ipc-message-internal-test failure (#48166)
- GH-48178 - [C++] Use FetchContent for bundled RE2 (#48179)
- GH-48181 - [C++] Use FetchContent for bundled Protobuf (#48183)
- GH-48186 - [CI][Dev] Remove ci/scripts/util_wait_for_it.sh (#48189)
- GH-48218 - [C++][Parquet] Fix Util & Level Conversion logic on big-endian (#48219)
- GH-48245 - [C++][Parquet] Simplify GetVlqInt (#48237)
- GH-48248 - [C++] Use FetchContent for bundled gRPC (#48250)
- GH-48251 - [C++][CI] Add CSV fuzzing seed corpus generator (#48252)
-
GH-48256 - [Packaging][Linux] Use
closer.lua?action=downloadURL (#48257) - GH-48260 - [C++][Python][R] Move S3 bucket references to new bucket as Voltron Data ones will be removed soon (#48261)
- GH-48275 - [C++][Dev] Allow choosing verbosity when fuzzing (#48276)
- GH-48287 - [Ruby] Add minimum pure Ruby Apache Arrow reader implementation (#48288)
-
GH-48292 - [Ruby] Add
Arrow::Column#to_arrow{,_array,_chunked_array}(#48293) - GH-48295 - [Ruby] Add support for reading Int8 array (#48296)
-
GH-48303 - [CI] Remove needless
setup-dotnetfrom.github/workflows/dev.yml(#48304) - GH-48306 - [Ruby] Add support for reading binary array (#48307)
- GH-48312 - [C++][FlightRPC] Standalone ODBC MSVC CI (#48313)
- GH-48315 - [C++] Use FetchContent for bundled CRC32C (#48318)
- GH-48316 - [C++] Use FetchContent for bundled nlohmann-json (#48320)
- GH-48317 - [C++] Use FetchContent for bundled google-cloud-cpp (#48333)
-
GH-48326 - [CI] Stop specifying hash for
actions/*GitHub Actions (#48327) - GH-48328 - [Ruby] Add support for reading UTF-8 array (#48329)
- GH-48330 - [Ruby] Add support for reading null array (#48331)
- GH-48335 - [C++][Parquet] Fuzz encrypted files (#48336)
- GH-48337 - [C++][Parquet] Improve column encryption API (#48338)
- GH-48339 - [C++] Enhance functions in util/ubsan.h to support types without a default constructor (#48429)
- GH-48342 - [R] Turn off gcs by default, also if it is on, bundle. (#48343)
- GH-48346 - [Ruby] Add support for reading boolean array (#48348)
- GH-48347 - [Ruby] Add support for reading list array (#48351)
- GH-48355 - [Python] Remove obsolete snprintf workaround for Python 3.9 (#48354)
- GH-48358 - [Ruby] Add support for reading float32 array (#48359)
- GH-48360 - [Ruby] Add support for reading large binary array (#48361)
- GH-48362 - [GLib][Ruby] Add FixedSizeListArray (#48369)
- GH-48363 - [GLib][Ruby] Add AssumeTimezoneOptions (#48370)
- GH-48364 - [GLib][Ruby] Add CumulativeOptions (#48371)
- GH-48365 - [GLib][Ruby] Add DayOfWeekOptions (#48372)
- GH-48366 - [GLib][Ruby] Add DictionaryEncodeOptions (#48373)
- GH-48367 - [GLib][Ruby] Add ElementWiseAggregateOptions (#48374)
- GH-48368 - [GLib][Ruby] Add ExtractRegexOptions (#48375)
- GH-48380 - [Ruby] Add support for reading float64 array (#48381)
- GH-48382 - [Ruby] Add support for reading struct array (#48383)
- GH-48384 - [C++][Docs][Parquet] Fix broken link for parquet-format spec (#48385)
- GH-48386 - [Ruby][Dev] Enable Layout/TrailingEmptyLines: final_newline cop (#48392)
- GH-48388 - [Ruby] Add support for reading map array (#48389)
- GH-48395 - [C++][Dev] Update fuzzing CMake preset (#48396)
-
GH-48400 - [Python] Convert an old todo to a proper ticket in
test_copy_files_directory(#48401) - GH-48402 - [Python] Enable the relative path in test_write_dataset (#48403)
- GH-48404 - [Python] Add tests to to_table(filter=...) to reject a boolean expr (#48405)
- GH-48406 - [Python] Negative test for struct_field no-argument (ARROW-14853) (#48407)
- GH-48410 - [Ruby] Add support for reading large list array (#48411)
- GH-48412 - [Ruby] Add support for reading date32 array (#48413)
- GH-48419 - [Python] Fix test_parquet_file_too_small to catch only ArrowInvalid (#48420)
- GH-48421 - [Python] Enable test_orc_scan_options with batch_size (#48422)
- GH-48423 - [Ruby] Add support for reading date64 array (#48424)
- GH-48425 - [Ruby] Add support for reading dense union array (#48426)
- GH-48435 - [Ruby] Add support for reading sparse union array (#48439)
- GH-48437 - [Ruby] Add tests for large list array (#48438)
- GH-48440 - [Ruby] Add support for reading time32 array (#48441)
-
GH-48442 - [Python] Remove workaround that excluded struct types from
chunked_arrays(#48443) - GH-48444 - [Python] Remove todo of implementing requested_schema in test_roundtrip_reader_capsule (#48445)
-
GH-48446 - [Python] Remove todo of schema=name mismatch in
record_batches(#48447) - GH-48452 - [Python] Add tests for Date32 and Date64 array creation with masks (#48453)
- GH-48461 - [R][CI] Migrate Azure pipelines to GitHub actions (#48585)
- GH-48463 - [Python] Improve error message in CheckTypeExact arrow_to_pandas.cc (#48464)
- GH-48471 - [Ruby] Add support for reading Int16 and UInt16 arrays (#48472)
- GH-48475 - [Ruby] Add support for reading Int32 and UInt32 arrays (#48476)
- GH-48479 - [Ruby] Add support for reading Int64 and UInt64 arrays (#48480)
- GH-48482 - [GLib][Ruby] Add GArrowExtractRegexSpanOptions (#48483)
- GH-48484 - [GLib][Ruby] Add GArrowJoinOptions (#48485)
- GH-48486 - [GLib][Ruby] Add GArrowListFlattenOptions (#48487)
- GH-48488 - [GLib][Ruby] Add GArrowListSliceOptions (#48489)
- GH-48490 - [GLib][Ruby] Add GArrowMakeStructOptions (#48491)
- GH-48492 - [GLib][Ruby] Add MapLookupOptions (#48513)
- GH-48493 - [GLib][Ruby] Add ModeOptions (#48514)
- GH-48494 - [GLib][Ruby] Add NullOptions (#48515)
- GH-48495 - [GLib][Ruby] Add PadOptions (#48516)
- GH-48496 - [GLib][Ruby] Add PairwiseOptions (#48517)
- GH-48497 - [GLib][Ruby] Add PartitionNthOptions (#48518)
- GH-48498 - [GLib][Ruby] Add PivotWiderOptions (#48519)
- GH-48499 - [GLib][Ruby] Add RankQuantileOptions (#48520)
- GH-48500 - [GLib][Ruby] Add ReplaceSliceOptions (#48521)
- GH-48501 - [GLib][Ruby] Add ReplaceSubstringOptions (#48522)
- GH-48502 - [GLib][Ruby] Add RoundBinaryOptions (#48523)
- GH-48503 - [GLib][Ruby] Add RoundTemporalOptions (#48524)
- GH-48504 - [GLib][Ruby] Add SelectKOptions (#48525)
- GH-48505 - [GLib][Ruby] Add SkewOptions (#48526)
- GH-48506 - [GLib][Ruby] Add SliceOptions (#48527)
- GH-48507 - [GLib][Ruby] Add SplitOptions (#48528)
- GH-48508 - [GLib][Ruby] Add TDigestOptions (#48529)
- GH-48509 - [GLib][Ruby] Add TrimOptions (#48530)
- GH-48510 - [GLib][Ruby] Add WeekOptions (#48531)
- GH-48511 - [GLib][Ruby] Add WinsorizeOptions (#48532)
- GH-48512 - [GLib][Ruby] Add ZeroFillOptions (#48533)
- GH-48535 - [Ruby] Add support for reading time64 array (#48536)
- GH-48537 - [Ruby] Add support for reading fixed size binary array (#48538)
- GH-48545 - [C++][Parquet][CI] Add more encodings to fuzzing seed corpus (#48546)
- GH-48551 - [Ruby] Add support for reading large UTF-8 array (#48552)
- GH-48553 - [Ruby] Add support for reading timestamp array (#48554)
- GH-48555 - [C++] Use FetchContent for bundled opentelemetry (#48556)
- GH-48557 - [C++][Parquet][CI] Also encrypt nested columns in fuzz seed corpus (#48558)
- GH-48572 - [CI] Remove centos-7-cpp dockerfile and reference from compose (#48573)
- GH-48579 - [Ruby] Add support for reading duration array (#48580)
- GH-48582 - [CI][GPU][C++][Python] Add new CUDA jobs using the new self-hosted runners (#48583)
- GH-48592 - [C++] Use starts_with/ends_with methods (#48614)
- GH-48602 - [Ruby] Add support for reading interval arrays (#48603)
- GH-48606 - [CI][GLib] Increase NuGet timeout for vcpkg cache (#48638)
- GH-48612 - [Ruby] Add support for reading streaming format (#48613)
-
GH-48616 - [GLib] Use
Arrow-${MAJOR}.${MINOR}.typelibnotArrow-1.0.typelib(#48617) - GH-48631 - [R] Non-API calls: 'ATTRIB', 'SET_ATTRIB' (#48634)
- GH-48632 - [R] Add NEWS.md entry for 22.0.0.1 (#48633)
- GH-48642 - [Ruby] Add support for reading decimal128 array (#48643)
- GH-48654 - [Python] Test timestamp from int without pandas dependency (#48655)
-
GH-48667 - [Python] Remove unused imports from
python/pyarrow/__init__.py(#48640) -
GH-48668 - [Python][Docs] Add python examples for compute functions
min/max/min_max(#48648) - GH-48675 - [C++][FlightRPC] Document StatementAttributeId enum values in ODBC SPI (#48676)
- GH-48680 - [GLib][Ruby] Add CSVWriter (#48681)
- GH-48684 - [C++] Update MakeListArray to use ListArray::FromArrays instead of constructor (#48685)
- GH-48690 - [R] Make "Can read Parquet files from a URL" less flaky (#48693)
- GH-48703 - [Ruby] Add support for reading decimal256 array (#48704)
- GH-48705 - [Ruby] Add support for reading dictionary array (#48706)
- GH-48707 - [C++][FlightRPC] Use IRD precision/scale defaults with ARD override in SQLGetData (#48708)
- GH-48752 - [Ruby] Skip ChunkedArray test on Windows due to flakiness (#48779)
- GH-48755 - [MATLAB] Rename getArrayProxyIDs to getProxyIDs (#48756)
- GH-48757 - [CI] Update arrow/.github /CODEOWNERS (#48758)
- GH-48770 - [CI] Add missing permissions declaration to workflows (#48771)