Release Management Guide#
This page provides detailed information on the steps followed to perform a release. It can be used both as a guide to learn the Apache Arrow release process and as a comprehensive checklist for the Release Manager when performing a release. The person acting as Release Manager must at least have committer status in order to perform the tasks below. If the Release Manager is a committer but not a member of the PMC, some tasks will need to be delegated to a PMC member and these are marked below accordingly.
Principles#
The Apache Arrow Release follows the guidelines defined at the Apache Software Foundation Release Policy.
Preparing for the release#
In advance to the Release date the Release Manager communicates with the community usually via Zulip, Mailing List or the bi-weekly community call about the upcoming release and proposes a feature freeze date.
The feature freeze date is the date the maintenance branch is created and from that point no new features are allowed to be added to the release, unless there is community consensus to allow it, and only bug fixes will be accepted.
Once the feature freeze is in place issues labelled as blocker must be resolved before
the first release candidate can be created.
Before creating a release candidate, the Release Manager must ensure that any resolved GitHub issues have the appropriate milestone set so that the changelog is generated properly.
Note that pull requests without a corresponding GitHub issue won’t be detected by the cherry-pick script and must be cherry-picked manually by the release manager onto the maintenance branch. Examples include MINOR and Dependabot pull requests. For this reason, it’s encouraged to avoid the need for manual cherry-picking by creating issues for any pull requests that are merged to the default branch after the release maintenance branch has been created.
Before creating a Release Candidate#
Ensure local tags are removed, gpg-agent is set and GitHub issues are correctly assigned.
# Delete the local tag for RC1 or later
git tag -d apache-arrow-<version>
# Setup gpg agent for signing binary artifacts
source dev/release/setup-gpg-agent.sh
# Curate the release
# The end of the generated report shows any GitHub issues with the wrong
# version number assigned.
archery release curate <version>
Ensure a major version milestone for a follow up release is created on GitHub. This will automatically be used by our merge script as the new version for issues closed when the maintenance branch is created.
Patch Releases#
We usually create patch releases once a major breaking issue has been identified. Issues that are identified as major breaking issues can be security fixes, broken packages for specific builds and others.
Any developer can ask for a patch release to be generated sending an email to the Arrow development mailing-list with the reason of why a new release is necessary. If there is consensus and there is a Release Manager willing to make the effort to create the release a patch release can be created.
Committers can tag issues that should be included on the next patch release using the
backport-candidate label. Is the responsibility of the author or the committer to add the
label to the issue to help the Release Manager identify the issues that should be backported.
If a specific issue is identified as the reason to create a patch release the Release Manager should validate that, at least, this issue is correctly tagged and included in the patch release.
Be sure to go through on the following checklist:
- Create milestone 
- Create maintenance branch 
- Include issue that was requested as requiring new patch release 
- Add new milestone to issues with - backport-candidatelabel
- cherry-pick issues into maintenance branch 
Creating a Release Candidate#
These are the different steps that are required to create a Release Candidate.
For the initial Release Candidate on a major release, we will create a maintenance branch from main. This is done on the feature freeze date.
Follow up Release Candidates will update the maintenance branch by cherry-picking specific commits.
For the initial Release Candidate for a minor or a patch release we will create a maintenance branch from the previous corresponding release. For example, for a 15.0.1 patch we will create a maint-15.0.1 branch from maint-15.0.0 and for a maint-15.0.2 we will create it from maint-15.0.1. Once the maintenance branch is created we will update the created maintenance branch by cherry-picking specific commits.
Create or update the corresponding maintenance branch#
# Execute the following from an up to date main branch.
# This will create a branch locally called maint-X.Y.Z.
# X.Y.Z corresponds with the Major, Minor and Patch version number
# of the release respectively. As an example 9.0.0
archery release cherry-pick X.Y.Z --execute
# Push the maintenance branch to the remote repository
git push -u upstream maint-X.Y.Z
# First run in dry-mode to see which commits will be cherry-picked.
# If there are commits that we don't want to get applied, ensure the
# milestone on GitHub is set to the following release.
archery release cherry-pick X.Y.Z --continue
# Update the maintenance branch with the previous commits
archery release cherry-pick X.Y.Z --continue --execute
# Push the updated maintenance branch to the remote repository
git push -u upstream maint-X.Y.Z
Optional: Test Before Creating a Release Candidate#
Some release managers prefer to perform testing before creating the first release candidate to avoid the need to create multiple release candidates within a given release.
To test before creating a release candidate:
- Create a pull request from the up-to-date maint-X.Y.Z branch onto main 
- Title the pull request “WIP: Dummy PR to check maint-X.Y.Z status” 
- Comment on the pull request to trigger the relevant Crossbow jobs: - @github-actions crossbow submit --group verify-rc-source
- @github-actions crossbow submit --group packaging
 
Create the Release Candidate branch from the updated maintenance branch#
# Start from the updated maintenance branch.
git checkout maint-X.Y.Z
# The following script will create a branch for the Release Candidate,
# place the necessary commits updating the version number and then create a git tag
# on OSX use gnu-sed with homebrew: brew install gnu-sed (and export to $PATH)
#
# <rc-number> starts at 0 and increments every time the Release Candidate is created
# so for the first RC this would be: dev/release/01-prepare.sh 4.0.0 5.0.0 0
dev/release/01-prepare.sh <version> <next-version> <rc-number>
# Push the release candidate tag
git push -u upstream apache-arrow-<version>-rc<rc-number>
# Push the release candidate branch in order to trigger verification jobs later
git push -u upstream release-<version>-rc<rc-number>
Once the tag is created the GitHub Actions workflow on verify-rc.yml will be triggered to verify
the release candidate.
The release_candidate.yml workflow will also be triggered which will sign the source code
for the release and will create a GitHub pre-release with the corresponding source and signatures.
Build source and binaries and submit them#
# Waits for previous workflows to finish and uploads source and signatures to SVN.
#
# NOTE: This must be run by a PMC member
# NOTE: You need to have GitHub CLI installed to run this script.
dev/release/02-source.sh <version> <rc-number>
# Submit binary tasks using crossbow, the command will output the crossbow build id
dev/release/03-binary-submit.sh <version> <rc-number>
# Wait for the crossbow jobs to finish
archery crossbow status <crossbow-build-id>
# Download the produced binaries
# This will download packages to a directory called packages/release-<version>-rc<rc-number>
dev/release/04-binary-download.sh <version> <rc-number>
# Sign and upload the binaries
#
# NOTE: This must be run by a PMC member
#
# On macOS the only way I could get this to work was running "echo "UPDATESTARTUPTTY" | gpg-connect-agent" before running this comment
# otherwise I got errors referencing "ioctl" errors.
dev/release/05-binary-upload.sh <version> <rc-number>
# Sign and upload MATLAB artifacts to the GitHub Releases area.
#
# NOTE: This must be run by a PMC member
# NOTE: You need to have GitHub CLI installed to run this script.
dev/release/06-matlab-upload.sh <version> <rc-number>
# Start verifications for binaries and wheels
dev/release/07-binary-verify.sh <version> <rc-number>
Verify the Release#
# Once the automatic verification has passed start the vote thread
# on dev@arrow.apache.org. To regenerate the email template use
SOURCE_DEFAULT=0 SOURCE_VOTE=1 dev/release/02-source.sh <version> <rc-number>
See Release Verification Process for details.
Voting and approval#
Start the vote thread on dev@arrow.apache.org and supply instructions for verifying the integrity of the release. Approval requires a net of 3 +1 votes from PMC members. A release cannot be vetoed.
Post-release tasks#
After the release vote, we must undertake many tasks to update source artifacts, binary builds, and the Arrow website.
Be sure to go through on the following checklist:
- Update the released milestone Date and set to “Closed” on GitHub 
- Merge changes on release branch to maintenance branch for patch releases 
- Add the new release to the Apache Reporter System 
- Push release tag 
- Upload source 
- Upload binaries 
- Update website 
- Update GitHub Release Notes 
- Update Homebrew packages 
- Update MSYS2 package 
- Upload RubyGems 
- Update conda recipes 
- Upload wheels/sdist to pypi 
- Update R packages 
- Update vcpkg port 
- Update Conan recipe 
- Bump versions 
- Update docs 
- Update version in Apache Arrow Cookbook 
- Announce the new release 
- Publish release blog posts 
- Announce the release on BlueSky 
- Remove old artifacts 
 
    