xz crate by youknowone · Pull Request #8 · youknowone/xz-rs

youknowone · 2026-04-22T04:18:30Z

Summary by CodeRabbit

New Features
- Rebranded package from liblzma to xz with updated project metadata and documentation.
- Added configurable backend selection: xz-core (default), xz-sys, with fallback support for liblzma-sys.
Documentation
- Updated README with new branding, CI badges, and backend selection guidance.
- Refreshed API documentation and examples to reflect xz crate naming.
Chores
- Updated CI/CD workflows and scripts to reference new package name.
- Updated dependencies and build configurations.

coderabbitai · 2026-04-22T04:18:42Z

Walkthrough

This PR renames the primary crate from liblzma to xz, refactors allocator management into modular backend policies (Rust-only, C-backed, custom), and updates all imports, documentation, CI workflows, and build scripts accordingly. Key structural changes include reorganizing xz-core allocation logic into feature-gated separate modules and conditionally gating the lzma_stream.allocator field behind a custom_allocator feature flag.

Changes

Cohort / File(s)	Summary
Crate Renaming (Metadata) `Cargo.toml`, `Cargo.lock.msrv`, `README.md`	Updated package name from `liblzma` to `xz`, repository/homepage URLs to `youknowone/xz-rs`, and description text. Enabled `parallel` feature for `xz-sys` and `liblzma-sys` dev-dependencies in non-WASM targets.
Documentation Updates `src/lib.rs`, `src/stream.rs`, `docs/performance-workflow.md`	Changed crate-level docstrings, rustdoc examples, and HTML root URL from `liblzma` to `xz`. Updated performance workflow documentation to reference `xz-core` as default backend and add `systest` runs.
Examples and Tests `examples/bufread_trailing_probe.rs`, `examples/qc_probe.rs`, `examples/standard_files_probe.rs`, `tests/drop-incomplete.rs`, `tests/xz.rs`	Updated imports from `liblzma::` to `xz::` across examples and tests. Removed entire `tests/xz.rs` test file that previously iterated over test fixtures and validated encode/decode round-trips.
CI Workflows `.github/workflows/main.yml`, `.github/workflows/publish.yml`	Updated `cargo clean` targets from `liblzma` to `xz` in test cleanup steps. Changed publish step to target `xz` crate when tag starts with `xz` (instead of `liblzma`).
Build Scripts `scripts/compare_all_trimmed.sh`, `scripts/compare_backends.sh`, `scripts/inspect_codegen.sh`, `scripts/run_root_test_bins.sh`	Updated package names and prefixes from `liblzma`/old patterns to `xz`/`standard_files` in test binary selection and prebuilding logic.
Allocator Infrastructure `xz-core/Cargo.toml`	Added new optional `custom_allocator` Cargo feature to gate custom allocator functionality.
Allocator Refactoring (Core Module) `xz-core/src/alloc.rs`	Replaced monolithic allocator implementation with compile-time policy selector that conditionally compiles `c`, `custom`, or `rust_only` implementations and re-exports the selected allocator entrypoints.
Allocator Policies (New Modules) `xz-core/src/alloc/c.rs`, `xz-core/src/alloc/custom.rs`, `xz-core/src/alloc/rust.rs`, `xz-core/src/alloc/rust_only.rs`	Added four modular allocator implementations: C-backed (platform-specific malloc/calloc), custom (user-provided lzma_allocator), Rust-based with header tracking, and Rust-only (ignoring allocator parameter). Each provides `lzma_alloc`, `lzma_alloc_zero`, `lzma_free` entrypoints.
Stream and Type Configuration `xz-core/src/types.rs`, `xz-core/src/common/common.rs`	Gated `lzma_stream.allocator` field behind `#[cfg(feature = "custom_allocator")]`. Added `lzma_stream_allocator()` helper that conditionally extracts allocator from stream or returns null. Re-exported allocation functions from `crate::alloc` module.
Decoder Allocator Updates `xz-core/src/common/alone_decoder.rs`, `xz-core/src/common/auto_decoder.rs`, `xz-core/src/common/block_decoder.rs`, `xz-core/src/common/filter_decoder.rs`, `xz-core/src/common/file_info.rs`, `xz-core/src/common/index_decoder.rs`, `xz-core/src/common/lzip_decoder.rs`, `xz-core/src/common/microlzma_decoder.rs`, `xz-core/src/common/stream_decoder.rs`	Updated all decoder initializations to obtain allocator via `lzma_stream_allocator(strm)` instead of directly accessing `(*strm).allocator` field.
Encoder Allocator Updates `xz-core/src/common/alone_encoder.rs`, `xz-core/src/common/block_encoder.rs`, `xz-core/src/common/filter_encoder.rs`, `xz-core/src/common/index_encoder.rs`, `xz-core/src/common/microlzma_encoder.rs`, `xz-core/src/common/stream_encoder.rs`	Updated all encoder initializations to obtain allocator via `lzma_stream_allocator(strm)` instead of directly accessing `(*strm).allocator` field.
Multi-threaded Encoder Allocator `xz-core/src/common/stream_decoder_mt.rs`, `xz-core/src/common/stream_encoder_mt.rs`	Added feature-gated `allocator` field to worker thread structs with helper functions (`worker_allocator`, `set_worker_allocator`) to abstract allocator access. Updated worker cleanup and initialization to use these helpers instead of direct field access.
Backend Normalization `xz-sys/Cargo.toml`, `xz-sys/src/lib.rs`	Updated `xz-sys` to enable `custom_allocator` feature on `xz-core` dependency. Replaced module-level `c_allocator` import with local `normalize_c_allocator` and `normalize_c_stream_allocator` helpers that route through `xz_core::alloc` functions.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

Rename xz crate to xz-core #7: Related crate naming and feature wiring changes between xz and xz-core at the workspace/build level.
Add backend performance workflow and tuning #5: Related backend selection adjustments and performance workflow artifact updates with overlapping script and documentation changes.
Further optimization #6: Related CI workflow and build script modifications for package name and target adjustments.

Poem

🐰 From liblzma hops the name, to xz we now proclaim,
Allocators split in three—Rust, C, and custom's frame,
Feature gates keep things lean when backends aren't in play,
Workers wield their allocators in threads' coordinated way! ✨

🚥 Pre-merge checks | ✅ 3 | ❌ 2

❌ Failed checks (1 warning, 1 inconclusive)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 25.98% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.
Title check	❓ Inconclusive	The title 'xz crate' is vague and generic, failing to convey the specific nature of changes in this large refactoring PR that renames the package from 'liblzma' to 'xz' across the entire codebase.	Consider a more descriptive title such as 'Rename liblzma crate to xz' or 'Rebrand package from liblzma to xz' to clearly indicate the primary change.

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch xz-crate

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.github/workflows/publish.yml:
- Around line 23-24: The tag condition for the "Publish xz crate" step is too
broad (startsWith(github.ref, 'refs/tags/xz')) and will match tags like
refs/tags/xz-core-*, causing wrong publishes; update the if condition around the
step that references github.ref and startsWith to either explicitly exclude
xz-core (add && !startsWith(github.ref, 'refs/tags/xz-core')) or tighten the
match to the root crate tag pattern (e.g., use a stricter startsWith like
'refs/tags/xz-' combined with a version marker or use a delimiter such as
'refs/tags/xz@' or 'refs/tags/xz-v' to only match the intended root xz tags).

In `@xz-sys/src/lib.rs`:
- Around line 70-74: The helper unsafe fn normalize_c_stream_allocator mutates
(*strm).allocator, but three query functions (lzma_memlimit_get, lzma_memusage,
lzma_get_check) currently cast their *const lzma_stream to *mut and call
normalize_c_stream_allocator, violating FFI const-correctness; fix by removing
those casts and the calls to normalize_c_stream_allocator from each of those
three query functions so they only read from the stream, or alternatively change
the call sites to use a non-mutating/read-only normalization variant if mutation
is truly required elsewhere; locate calls to normalize_c_stream_allocator in the
implementations of lzma_memlimit_get, lzma_memusage, and lzma_get_check and
eliminate the cast-to-*mut and the normalize call.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 3eca78c4-1d87-46ab-aad5-0e66955b2ffc

📥 Commits

Reviewing files that changed from the base of the PR and between 90cff2d and d61b258.

📒 Files selected for processing (24)

.github/workflows/main.yml
.github/workflows/publish.yml
Cargo.lock.msrv
Cargo.toml
README.md
benches/backend_comparison.rs
docs/performance-workflow.md
examples/bufread_trailing_probe.rs
examples/qc_probe.rs
examples/standard_files_probe.rs
perf-probe/src/main.rs
scripts/compare_all_trimmed.sh
scripts/compare_backends.sh
scripts/inspect_codegen.sh
scripts/run_root_test_bins.sh
src/lib.rs
src/stream.rs
tests/drop-incomplete.rs
tests/xz.rs
xz-core/src/alloc.rs
xz-core/src/common/common.rs
xz-sys/Cargo.toml
xz-sys/src/c_allocator.rs
xz-sys/src/lib.rs

💤 Files with no reviewable changes (2)

tests/xz.rs
xz-sys/src/c_allocator.rs

coderabbitai · 2026-04-22T04:25:35Z

+      - if: "!startsWith(github.ref, 'refs/tags/liblzma-sys') && !startsWith(github.ref, 'refs/tags/xz-sys') && startsWith(github.ref, 'refs/tags/xz')"
+        name: Publish xz crate


⚠️ Potential issue | 🟠 Major

Tag condition leaks to xz-core (and any future xz* crate).

startsWith(github.ref, 'refs/tags/xz') matches refs/tags/xz-core-* (and e.g. xz-foo-*). Only xz-sys is explicitly excluded, so pushing an xz-core tag would erroneously trigger publication of the root xz crate from this step. Either exclude xz-core as well, or tighten the prefix to match only the root crate's tag scheme (e.g. xz-v / xz@).

🔧 Proposed fix (exclude xz-core explicitly)

- - if: "!startsWith(github.ref, 'refs/tags/liblzma-sys') && !startsWith(github.ref, 'refs/tags/xz-sys') && startsWith(github.ref, 'refs/tags/xz')" + - if: "!startsWith(github.ref, 'refs/tags/liblzma-sys') && !startsWith(github.ref, 'refs/tags/xz-sys') && !startsWith(github.ref, 'refs/tags/xz-core') && startsWith(github.ref, 'refs/tags/xz')" name: Publish xz crate

Alternatively, lock the prefix to an unambiguous root-crate tag pattern:

- - if: "!startsWith(github.ref, 'refs/tags/liblzma-sys') && !startsWith(github.ref, 'refs/tags/xz-sys') && startsWith(github.ref, 'refs/tags/xz')" + - if: "startsWith(github.ref, 'refs/tags/xz-v')"

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

- if: "!startsWith(github.ref, 'refs/tags/liblzma-sys') && !startsWith(github.ref, 'refs/tags/xz-sys') && startsWith(github.ref, 'refs/tags/xz')"

name: Publish xz crate

- if: "!startsWith(github.ref, 'refs/tags/liblzma-sys') && !startsWith(github.ref, 'refs/tags/xz-sys') && !startsWith(github.ref, 'refs/tags/xz-core') && startsWith(github.ref, 'refs/tags/xz')"

name: Publish xz crate

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In @.github/workflows/publish.yml around lines 23 - 24, The tag condition for the "Publish xz crate" step is too broad (startsWith(github.ref, 'refs/tags/xz')) and will match tags like refs/tags/xz-core-*, causing wrong publishes; update the if condition around the step that references github.ref and startsWith to either explicitly exclude xz-core (add && !startsWith(github.ref, 'refs/tags/xz-core')) or tighten the match to the root crate tag pattern (e.g., use a stricter startsWith like 'refs/tags/xz-' combined with a version marker or use a delimiter such as 'refs/tags/xz@' or 'refs/tags/xz-v' to only match the intended root xz tags).

coderabbitai · 2026-04-22T04:25:35Z

+unsafe fn normalize_c_stream_allocator(strm: *mut lzma_stream) {
+    if !strm.is_null() && (*strm).allocator.is_null() {
+        (*strm).allocator = xz_core::alloc::c_allocator_ptr().cast();
+    }
+}


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash rg -n -C3 'normalize_c_stream_allocator\(strm as \*mut lzma_stream\)|pub unsafe extern "C" fn lzma_(memlimit_get|memusage|get_check)' xz-sys/src/lib.rs

Repository: youknowone/xz-rs

Length of output: 815

🏁 Script executed:

rg -n 'normalize_c_stream_allocator' xz-sys/src/lib.rs

Repository: youknowone/xz-rs

Length of output: 1338

🏁 Script executed:

sed -n '224,230p; 268,274p; 275,280p' xz-sys/src/lib.rs

Repository: youknowone/xz-rs

Length of output: 720

Don't normalize through *const lzma_stream query wrappers.

Lines 227, 271, 277 cast *const lzma_stream to *mut before calling normalize_c_stream_allocator, which writes to (*strm).allocator. This mutates read-only caller memory and violates FFI const-correctness. The three query functions (lzma_memlimit_get, lzma_memusage, lzma_get_check) only read from the stream and don't need allocator normalization.

Suggested fix

#[unsafe(no_mangle)] pub unsafe extern "C" fn lzma_memlimit_get(strm: *const lzma_stream) -> u64 { - normalize_c_stream_allocator(strm as *mut lzma_stream); xz_core::common::common::lzma_memlimit_get(strm.cast()) } #[unsafe(no_mangle)] pub unsafe extern "C" fn lzma_memusage(strm: *const lzma_stream) -> u64 { - normalize_c_stream_allocator(strm as *mut lzma_stream); xz_core::common::common::lzma_memusage(strm.cast()) } #[unsafe(no_mangle)] pub unsafe extern "C" fn lzma_get_check(strm: *const lzma_stream) -> lzma_check { - normalize_c_stream_allocator(strm as *mut lzma_stream); xz_core::common::common::lzma_get_check(strm.cast()) }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@xz-sys/src/lib.rs` around lines 70 - 74, The helper unsafe fn normalize_c_stream_allocator mutates (*strm).allocator, but three query functions (lzma_memlimit_get, lzma_memusage, lzma_get_check) currently cast their *const lzma_stream to *mut and call normalize_c_stream_allocator, violating FFI const-correctness; fix by removing those casts and the calls to normalize_c_stream_allocator from each of those three query functions so they only read from the stream, or alternatively change the call sites to use a non-mutating/read-only normalization variant if mutation is truly required elsewhere; locate calls to normalize_c_stream_allocator in the implementations of lzma_memlimit_get, lzma_memusage, and lzma_get_check and eliminate the cast-to-*mut and the normalize call.

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

README.md (1)

13-23: ⚠️ Potential issue | 🟡 Minor

Align the migration snippet with the renamed package version.

This README now documents the xz crate whose manifest version is 0.4.6, but the migration example tells users to depend on xz = "0.1.7". Either update the snippet to the current series or explicitly mark it as legacy 0.1-only guidance.

📝 Proposed wording if the current crate line should be advertised

-**This crate is forked from [xz2](https://crates.io/crates/xz2) and `xz = "0.1.x"` is fully compatible with `xz2 = "0.1.7"`,**
-so you can migrate simply.
+**This crate is forked from [xz2](https://crates.io/crates/xz2) and provides an xz2-compatible high-level API,**
+so most users can migrate by changing the dependency and import path.
@@
 -xz2 = "0.1.7"
-+xz = "0.1.7"
++xz = "0.4"

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@README.md` around lines 13 - 23, The migration example in the README shows
adding xz = "0.1.7" which is inconsistent with this crate's current version
series; update the snippet (the Cargo.toml diff lines referencing xz2 and xz) to
advertise the current crate series (e.g., replace xz = "0.1.7" with xz =
"0.4.6") or, if you intend to show legacy guidance, add an explicit note that
the snippet is for the 0.1.x legacy compatibility only; modify the two lines in
the example that change the dependency from "xz2 = \"0.1.7\"" to the desired
current version string or add the legacy clarification text.

♻️ Duplicate comments (1)

xz-sys/src/lib.rs (1)

226-278: ⚠️ Potential issue | 🟠 Major

Don’t normalize through *const lzma_stream query wrappers.

These read-only wrappers still cast *const lzma_stream to *mut before calling normalize_c_stream_allocator, which can write strm.allocator and violates FFI const-correctness.

Suggested fix

 #[unsafe(no_mangle)]
 pub unsafe extern "C" fn lzma_memlimit_get(strm: *const lzma_stream) -> u64 {
-    normalize_c_stream_allocator(strm as *mut lzma_stream);
     xz_core::common::common::lzma_memlimit_get(strm.cast())
 }
@@
 #[unsafe(no_mangle)]
 pub unsafe extern "C" fn lzma_memusage(strm: *const lzma_stream) -> u64 {
-    normalize_c_stream_allocator(strm as *mut lzma_stream);
     xz_core::common::common::lzma_memusage(strm.cast())
 }
@@
 #[unsafe(no_mangle)]
 pub unsafe extern "C" fn lzma_get_check(strm: *const lzma_stream) -> lzma_check {
-    normalize_c_stream_allocator(strm as *mut lzma_stream);
     xz_core::common::common::lzma_get_check(strm.cast())
 }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@xz-sys/src/lib.rs` around lines 226 - 278, The read-only C wrappers
lzma_memlimit_get, lzma_memusage, and lzma_get_check (which accept *const
lzma_stream) currently cast the pointer to *mut and call
normalize_c_stream_allocator, which can mutate strm. Replace those calls with a
non-mutating normalization helper (e.g., normalize_c_stream_allocator_const that
accepts *const lzma_stream and only reads/validates without writing) or remove
the normalization if unnecessary; update lzma_memlimit_get, lzma_memusage, and
lzma_get_check to call that const-safe helper (or no-op) instead of
normalize_c_stream_allocator to preserve FFI const-correctness.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@xz-core/src/alloc.rs`:
- Around line 371-476: The allocator callbacks must be treated as an
all-or-nothing pair: only call a custom alloc or free if both alloc and free
callbacks are present (and allocator is non-null); otherwise always use the
fallback (c_alloc_*/c_alloc_zeroed_bytes or rust_alloc_impl/rust_free_impl).
Update lzma_alloc_bytes, lzma_alloc_zeroed_bytes, lzma_free_ptr,
internal_alloc_bytes, internal_alloc_object, internal_alloc_array,
internal_alloc_zeroed_array, and internal_free so their checks require allocator
!= null && let Some(alloc) = unsafe { (*allocator).alloc } && let Some(free) =
unsafe { (*allocator).free } (or equivalent) before using the custom callbacks;
if that condition fails, fall back to the existing c_/rust_ implementations.
Ensure the same paired-check logic is used everywhere so allocations and frees
never cross allocator boundaries.

---

Outside diff comments:
In `@README.md`:
- Around line 13-23: The migration example in the README shows adding xz =
"0.1.7" which is inconsistent with this crate's current version series; update
the snippet (the Cargo.toml diff lines referencing xz2 and xz) to advertise the
current crate series (e.g., replace xz = "0.1.7" with xz = "0.4.6") or, if you
intend to show legacy guidance, add an explicit note that the snippet is for the
0.1.x legacy compatibility only; modify the two lines in the example that change
the dependency from "xz2 = \"0.1.7\"" to the desired current version string or
add the legacy clarification text.

---

Duplicate comments:
In `@xz-sys/src/lib.rs`:
- Around line 226-278: The read-only C wrappers lzma_memlimit_get,
lzma_memusage, and lzma_get_check (which accept *const lzma_stream) currently
cast the pointer to *mut and call normalize_c_stream_allocator, which can mutate
strm. Replace those calls with a non-mutating normalization helper (e.g.,
normalize_c_stream_allocator_const that accepts *const lzma_stream and only
reads/validates without writing) or remove the normalization if unnecessary;
update lzma_memlimit_get, lzma_memusage, and lzma_get_check to call that
const-safe helper (or no-op) instead of normalize_c_stream_allocator to preserve
FFI const-correctness.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: d017f921-f6ef-4b36-8ca7-234d439aa108

📥 Commits

Reviewing files that changed from the base of the PR and between d61b258 and e8efc2d.

📒 Files selected for processing (12)

Cargo.lock.msrv
Cargo.toml
README.md
docs/performance-workflow.md
scripts/inspect_codegen.sh
src/lib.rs
xz-core/Cargo.toml
xz-core/src/alloc.rs
xz-core/src/common/common.rs
xz-sys/Cargo.toml
xz-sys/src/c_allocator.rs
xz-sys/src/lib.rs

💤 Files with no reviewable changes (1)

xz-sys/src/c_allocator.rs

coderabbitai · 2026-04-23T06:32:06Z

+    pub(crate) unsafe fn lzma_alloc_bytes(
+        size: size_t,
+        allocator: *const lzma_allocator,
+    ) -> *mut c_void {
+        let size = c_size(size);
+        if !allocator.is_null()
+            && let Some(alloc) = unsafe { (*allocator).alloc }
+        {
+            return unsafe { alloc((*allocator).opaque, 1, size) };
        }
-        return ptr;
+        unsafe { c_alloc_bytes(size) }
    }
-    rust_alloc_impl(size, core::mem::align_of::<T>(), true) as *mut T
-}

-pub(crate) unsafe fn internal_free(ptr: *mut c_void, allocator: *const lzma_allocator) {
-    if !allocator.is_null()
-        && let Some(free) = unsafe { (*allocator).free }
-    {
-        unsafe { free((*allocator).opaque, ptr) };
-        return;
+    pub(crate) unsafe fn lzma_alloc_zeroed_bytes(
+        size: size_t,
+        allocator: *const lzma_allocator,
+    ) -> *mut c_void {
+        let size = c_size(size);
+        if !allocator.is_null()
+            && let Some(alloc) = unsafe { (*allocator).alloc }
+        {
+            let ptr = unsafe { alloc((*allocator).opaque, 1, size) };
+            if !ptr.is_null() {
+                unsafe { core::ptr::write_bytes(ptr as *mut u8, 0, size) };
+            }
+            return ptr;
+        }
+        unsafe { c_alloc_zeroed_bytes(size) }
+    }
+
+    pub(crate) unsafe fn lzma_free_ptr(ptr: *mut c_void, allocator: *const lzma_allocator) {
+        if !allocator.is_null()
+            && let Some(free) = unsafe { (*allocator).free }
+        {
+            unsafe { free((*allocator).opaque, ptr) };
+            return;
+        }
+        unsafe { c_free_ptr(ptr) };
+    }
+
+    pub(crate) unsafe fn internal_alloc_bytes(
+        size: size_t,
+        allocator: *const lzma_allocator,
+    ) -> *mut c_void {
+        let size = c_size(size);
+        let allocator = allocator_or_rust(allocator);
+        if let Some(alloc) = unsafe { (*allocator).alloc } {
+            return unsafe { alloc((*allocator).opaque, 1, size) };
+        }
+        rust_alloc_impl(size as usize, RUST_ALLOC_ALIGN, false)
+    }
+
+    pub(crate) unsafe fn internal_alloc_object<T>(allocator: *const lzma_allocator) -> *mut T {
+        if !allocator.is_null()
+            && let Some(alloc) = unsafe { (*allocator).alloc }
+        {
+            return unsafe {
+                alloc((*allocator).opaque, 1, core::mem::size_of::<T>() as size_t) as *mut T
+            };
+        }
+        rust_alloc_impl(core::mem::size_of::<T>(), core::mem::align_of::<T>(), false) as *mut T
+    }
+
+    pub(crate) unsafe fn internal_alloc_array<T>(
+        count: size_t,
+        allocator: *const lzma_allocator,
+    ) -> *mut T {
+        let Some(size) = (count as usize).checked_mul(core::mem::size_of::<T>()) else {
+            return core::ptr::null_mut();
+        };
+        if !allocator.is_null()
+            && let Some(alloc) = unsafe { (*allocator).alloc }
+        {
+            return unsafe { alloc((*allocator).opaque, 1, size as size_t) as *mut T };
+        }
+        rust_alloc_impl(size, core::mem::align_of::<T>(), false) as *mut T
+    }
+
+    pub(crate) unsafe fn internal_alloc_zeroed_array<T>(
+        count: size_t,
+        allocator: *const lzma_allocator,
+    ) -> *mut T {
+        let Some(size) = (count as usize).checked_mul(core::mem::size_of::<T>()) else {
+            return core::ptr::null_mut();
+        };
+        if !allocator.is_null()
+            && let Some(alloc) = unsafe { (*allocator).alloc }
+        {
+            let ptr = unsafe { alloc((*allocator).opaque, 1, size as size_t) as *mut T };
+            if !ptr.is_null() {
+                unsafe { core::ptr::write_bytes(ptr as *mut u8, 0, size) };
+            }
+            return ptr;
+        }
+        rust_alloc_impl(size, core::mem::align_of::<T>(), true) as *mut T
+    }
+
+    pub(crate) unsafe fn internal_free(ptr: *mut c_void, allocator: *const lzma_allocator) {
+        if !allocator.is_null()
+            && let Some(free) = unsafe { (*allocator).free }
+        {
+            unsafe { free((*allocator).opaque, ptr) };
+            return;
+        }
+        rust_free_impl(ptr);
+    }


⚠️ Potential issue | 🔴 Critical

Treat custom allocator callbacks as an all-or-nothing pair.

The active custom path chooses alloc and free independently. With a partial allocator, memory can be allocated by a custom callback and freed by c_free/rust_free_impl, or allocated by fallback code and freed by a custom callback. That crosses allocator boundaries and can corrupt memory.

Suggested direction

- if !allocator.is_null() - && let Some(alloc) = unsafe { (*allocator).alloc } + if !allocator.is_null() + && let (Some(alloc), Some(_free)) = + unsafe { ((*allocator).alloc, (*allocator).free) } { return unsafe { alloc((*allocator).opaque, 1, size) }; } @@ - if !allocator.is_null() - && let Some(free) = unsafe { (*allocator).free } + if !allocator.is_null() + && let (Some(_alloc), Some(free)) = + unsafe { ((*allocator).alloc, (*allocator).free) } { unsafe { free((*allocator).opaque, ptr) }; return; }

Apply the same complete-pair check consistently across lzma_alloc_bytes, lzma_alloc_zeroed_bytes, lzma_free_ptr, internal_alloc_*, and internal_free.

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

pub(crate) unsafe fn lzma_alloc_bytes(

size: size_t,

allocator: *const lzma_allocator,

) -> *mut c_void {

let size = c_size(size);

if !allocator.is_null()

&& let Some(alloc) = unsafe { (*allocator).alloc }

{

return unsafe { alloc((*allocator).opaque, 1, size) };

}

return ptr;

unsafe { c_alloc_bytes(size) }

}

rust_alloc_impl(size, core::mem::align_of::<T>(), true) as *mut T

}

pub(crate) unsafe fn internal_free(ptr: *mut c_void, allocator: *const lzma_allocator) {

if !allocator.is_null()

&& let Some(free) = unsafe { (*allocator).free }

{

unsafe { free((*allocator).opaque, ptr) };

return;

pub(crate) unsafe fn lzma_alloc_zeroed_bytes(

size: size_t,

allocator: *const lzma_allocator,

) -> *mut c_void {

let size = c_size(size);

if !allocator.is_null()

&& let Some(alloc) = unsafe { (*allocator).alloc }

{

let ptr = unsafe { alloc((*allocator).opaque, 1, size) };

if !ptr.is_null() {

unsafe { core::ptr::write_bytes(ptr as *mut u8, 0, size) };

}

return ptr;

}

unsafe { c_alloc_zeroed_bytes(size) }

}

pub(crate) unsafe fn lzma_free_ptr(ptr: *mut c_void, allocator: *const lzma_allocator) {

if !allocator.is_null()

&& let Some(free) = unsafe { (*allocator).free }

{

unsafe { free((*allocator).opaque, ptr) };

return;

}

unsafe { c_free_ptr(ptr) };

}

pub(crate) unsafe fn internal_alloc_bytes(

size: size_t,

allocator: *const lzma_allocator,

) -> *mut c_void {

let size = c_size(size);

let allocator = allocator_or_rust(allocator);

if let Some(alloc) = unsafe { (*allocator).alloc } {

return unsafe { alloc((*allocator).opaque, 1, size) };

}

rust_alloc_impl(size as usize, RUST_ALLOC_ALIGN, false)

}

pub(crate) unsafe fn internal_alloc_object<T>(allocator: *const lzma_allocator) -> *mut T {

if !allocator.is_null()

&& let Some(alloc) = unsafe { (*allocator).alloc }

{

return unsafe {

alloc((*allocator).opaque, 1, core::mem::size_of::<T>() as size_t) as *mut T

};

}

rust_alloc_impl(core::mem::size_of::<T>(), core::mem::align_of::<T>(), false) as *mut T

}

pub(crate) unsafe fn internal_alloc_array<T>(

count: size_t,

allocator: *const lzma_allocator,

) -> *mut T {

let Some(size) = (count as usize).checked_mul(core::mem::size_of::<T>()) else {

return core::ptr::null_mut();

};

if !allocator.is_null()

&& let Some(alloc) = unsafe { (*allocator).alloc }

{

return unsafe { alloc((*allocator).opaque, 1, size as size_t) as *mut T };

}

rust_alloc_impl(size, core::mem::align_of::<T>(), false) as *mut T

}

pub(crate) unsafe fn internal_alloc_zeroed_array<T>(

count: size_t,

allocator: *const lzma_allocator,

) -> *mut T {

let Some(size) = (count as usize).checked_mul(core::mem::size_of::<T>()) else {

return core::ptr::null_mut();

};

if !allocator.is_null()

&& let Some(alloc) = unsafe { (*allocator).alloc }

{

let ptr = unsafe { alloc((*allocator).opaque, 1, size as size_t) as *mut T };

if !ptr.is_null() {

unsafe { core::ptr::write_bytes(ptr as *mut u8, 0, size) };

}

return ptr;

}

rust_alloc_impl(size, core::mem::align_of::<T>(), true) as *mut T

}

pub(crate) unsafe fn internal_free(ptr: *mut c_void, allocator: *const lzma_allocator) {

if !allocator.is_null()

&& let Some(free) = unsafe { (*allocator).free }

{

unsafe { free((*allocator).opaque, ptr) };

return;

}

rust_free_impl(ptr);

}

pub(crate) unsafe fn lzma_alloc_bytes(

size: size_t,

allocator: *const lzma_allocator,

) -> *mut c_void {

let size = c_size(size);

if !allocator.is_null()

&& let (Some(alloc), Some(_free)) = unsafe { ((*allocator).alloc, (*allocator).free) }

{

return unsafe { alloc((*allocator).opaque, 1, size) };

}

unsafe { c_alloc_bytes(size) }

}

pub(crate) unsafe fn lzma_alloc_zeroed_bytes(

size: size_t,

allocator: *const lzma_allocator,

) -> *mut c_void {

let size = c_size(size);

if !allocator.is_null()

&& let (Some(alloc), Some(_free)) = unsafe { ((*allocator).alloc, (*allocator).free) }

{

let ptr = unsafe { alloc((*allocator).opaque, 1, size) };

if !ptr.is_null() {

unsafe { core::ptr::write_bytes(ptr as *mut u8, 0, size) };

}

return ptr;

}

unsafe { c_alloc_zeroed_bytes(size) }

}

pub(crate) unsafe fn lzma_free_ptr(ptr: *mut c_void, allocator: *const lzma_allocator) {

if !allocator.is_null()

&& let (Some(_alloc), Some(free)) = unsafe { ((*allocator).alloc, (*allocator).free) }

{

unsafe { free((*allocator).opaque, ptr) };

return;

}

unsafe { c_free_ptr(ptr) };

}

pub(crate) unsafe fn internal_alloc_bytes(

size: size_t,

allocator: *const lzma_allocator,

) -> *mut c_void {

let size = c_size(size);

let allocator = allocator_or_rust(allocator);

if let (Some(alloc), Some(_free)) = unsafe { ((*allocator).alloc, (*allocator).free) } {

return unsafe { alloc((*allocator).opaque, 1, size) };

}

rust_alloc_impl(size as usize, RUST_ALLOC_ALIGN, false)

}

pub(crate) unsafe fn internal_alloc_object<T>(allocator: *const lzma_allocator) -> *mut T {

if !allocator.is_null()

&& let (Some(alloc), Some(_free)) = unsafe { ((*allocator).alloc, (*allocator).free) }

{

return unsafe {

alloc((*allocator).opaque, 1, core::mem::size_of::<T>() as size_t) as *mut T

};

}

rust_alloc_impl(core::mem::size_of::<T>(), core::mem::align_of::<T>(), false) as *mut T

}

pub(crate) unsafe fn internal_alloc_array<T>(

count: size_t,

allocator: *const lzma_allocator,

) -> *mut T {

let Some(size) = (count as usize).checked_mul(core::mem::size_of::<T>()) else {

return core::ptr::null_mut();

};

if !allocator.is_null()

&& let (Some(alloc), Some(_free)) = unsafe { ((*allocator).alloc, (*allocator).free) }

{

return unsafe { alloc((*allocator).opaque, 1, size as size_t) as *mut T };

}

rust_alloc_impl(size, core::mem::align_of::<T>(), false) as *mut T

}

pub(crate) unsafe fn internal_alloc_zeroed_array<T>(

count: size_t,

allocator: *const lzma_allocator,

) -> *mut T {

let Some(size) = (count as usize).checked_mul(core::mem::size_of::<T>()) else {

return core::ptr::null_mut();

};

if !allocator.is_null()

&& let (Some(alloc), Some(_free)) = unsafe { ((*allocator).alloc, (*allocator).free) }

{

let ptr = unsafe { alloc((*allocator).opaque, 1, size as size_t) as *mut T };

if !ptr.is_null() {

unsafe { core::ptr::write_bytes(ptr as *mut u8, 0, size) };

}

return ptr;

}

rust_alloc_impl(size, core::mem::align_of::<T>(), true) as *mut T

}

pub(crate) unsafe fn internal_free(ptr: *mut c_void, allocator: *const lzma_allocator) {

if !allocator.is_null()

&& let (Some(_alloc), Some(free)) = unsafe { ((*allocator).alloc, (*allocator).free) }

{

unsafe { free((*allocator).opaque, ptr) };

return;

}

rust_free_impl(ptr);

}

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@xz-core/src/alloc.rs` around lines 371 - 476, The allocator callbacks must be treated as an all-or-nothing pair: only call a custom alloc or free if both alloc and free callbacks are present (and allocator is non-null); otherwise always use the fallback (c_alloc_*/c_alloc_zeroed_bytes or rust_alloc_impl/rust_free_impl). Update lzma_alloc_bytes, lzma_alloc_zeroed_bytes, lzma_free_ptr, internal_alloc_bytes, internal_alloc_object, internal_alloc_array, internal_alloc_zeroed_array, and internal_free so their checks require allocator != null && let Some(alloc) = unsafe { (*allocator).alloc } && let Some(free) = unsafe { (*allocator).free } (or equivalent) before using the custom callbacks; if that condition fails, fall back to the existing c_/rust_ implementations. Ensure the same paired-check logic is used everywhere so allocations and frees never cross allocator boundaries.

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

docs/performance-workflow.md (1)
97-101: ⚠️ Potential issue | 🟠 Major

Remove the "mirrors tests/standard_files.rs" claim from line 97.

The file tests/standard_files.rs does not exist in the repository. The "standard_files" workload is defined and driven directly by examples/standard_files_probe.rs (as confirmed by scripts/compare_api_workloads.sh which maps the "standard-files" workload argument to this example). Update the documentation to accurately reflect that the example defines the workload, rather than mirroring a nonexistent test file.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@docs/performance-workflow.md` around lines 97 - 101, Remove the incorrect
phrase that the example "mirrors the `tests/standard_files.rs` `standard_files`
path" and update the sentence to state that `examples/standard_files_probe.rs`
defines and drives the "standard_files" workload (as mapped by
`scripts/compare_api_workloads.sh` / the "standard-files" argument) and that it
writes reports to `target/perf-results/api-standard-files.json` and
`target/perf-results/api-standard-files.md`; keep reference to
`examples/standard_files_probe.rs` and the output paths intact but eliminate any
mention of a nonexistent `tests/standard_files.rs`.

♻️ Duplicate comments (1)

xz-sys/src/lib.rs (1)

226-229: ⚠️ Potential issue | 🟠 Major

Remove mutating allocator normalization from read-only stream query APIs.

Line 227, Line 271, and Line 277 cast *const lzma_stream to mutable and may write through it via normalize_c_stream_allocator. These wrappers are read-only by contract and this breaks FFI const-correctness.

Proposed fix

 #[unsafe(no_mangle)]
 pub unsafe extern "C" fn lzma_memlimit_get(strm: *const lzma_stream) -> u64 {
-    normalize_c_stream_allocator(strm as *mut lzma_stream);
     xz_core::common::common::lzma_memlimit_get(strm.cast())
 }
@@
 #[unsafe(no_mangle)]
 pub unsafe extern "C" fn lzma_memusage(strm: *const lzma_stream) -> u64 {
-    normalize_c_stream_allocator(strm as *mut lzma_stream);
     xz_core::common::common::lzma_memusage(strm.cast())
 }
@@
 #[unsafe(no_mangle)]
 pub unsafe extern "C" fn lzma_get_check(strm: *const lzma_stream) -> lzma_check {
-    normalize_c_stream_allocator(strm as *mut lzma_stream);
     xz_core::common::common::lzma_get_check(strm.cast())
 }

Also applies to: 270-273, 276-279

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@xz-sys/src/lib.rs` around lines 226 - 229, The wrapper lzma_memlimit_get
currently casts the incoming *const lzma_stream to mutable and calls
normalize_c_stream_allocator, which mutates the stream and breaks FFI
const-correctness; remove that mutation by either calling the underlying
xz_core::common::common::lzma_memlimit_get(strm.cast()) directly (no normalize
call and no cast to *mut) or implement/use a non-mutating normalization helper
(e.g., normalize_c_stream_allocator_const) that accepts *const lzma_stream;
apply the same change pattern to the other read-only wrappers that call
normalize_c_stream_allocator (the functions referenced in the diff/comments).

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@xz-core/src/alloc.rs`:
- Around line 1-13: The build fails because alloc.rs declares submodules `mod
rust;`, `mod c;`, `mod custom;`, and `mod rust_only;` but the corresponding
files under xz-core/src/alloc/ are missing; either add the missing files
`rust.rs`, `c.rs`, `custom.rs`, and `rust_only.rs` in xz-core/src/alloc/ with
the expected public API used by this crate, or collapse the declarations into
inline modules inside alloc.rs (e.g., define `mod rust { ... }` etc.) and ensure
the conditional `use` alias `use custom as policy;` / `use rust_only as policy;`
still resolves to the same symbols; make sure the implementations exposed by
those modules match the functions/types referenced elsewhere so feature-gated
builds (with and without `custom_allocator`) compile.

In `@xz-core/src/common/stream_decoder_mt.rs`:
- Around line 71-95: The two functions worker_allocator and set_worker_allocator
duplicate the same feature-gated logic in stream_decoder_mt.rs and
stream_encoder_mt.rs for different worker_thread types; refactor to remove
copy/paste by extracting the shared behavior into a single reusable abstraction
(either a small trait like HasAllocator with methods allocator()/set_allocator()
implemented for each worker_thread type, or a macro that emits the two functions
for a given worker_thread type), then replace the per-file helper definitions
with calls to the common trait/macro implementation so worker_allocator,
set_worker_allocator, worker_thread and lzma_allocator usages remain identical
and the #[cfg(feature = "custom_allocator")] branches stay centralized.

---

Outside diff comments:
In `@docs/performance-workflow.md`:
- Around line 97-101: Remove the incorrect phrase that the example "mirrors the
`tests/standard_files.rs` `standard_files` path" and update the sentence to
state that `examples/standard_files_probe.rs` defines and drives the
"standard_files" workload (as mapped by `scripts/compare_api_workloads.sh` / the
"standard-files" argument) and that it writes reports to
`target/perf-results/api-standard-files.json` and
`target/perf-results/api-standard-files.md`; keep reference to
`examples/standard_files_probe.rs` and the output paths intact but eliminate any
mention of a nonexistent `tests/standard_files.rs`.

---

Duplicate comments:
In `@xz-sys/src/lib.rs`:
- Around line 226-229: The wrapper lzma_memlimit_get currently casts the
incoming *const lzma_stream to mutable and calls normalize_c_stream_allocator,
which mutates the stream and breaks FFI const-correctness; remove that mutation
by either calling the underlying
xz_core::common::common::lzma_memlimit_get(strm.cast()) directly (no normalize
call and no cast to *mut) or implement/use a non-mutating normalization helper
(e.g., normalize_c_stream_allocator_const) that accepts *const lzma_stream;
apply the same change pattern to the other read-only wrappers that call
normalize_c_stream_allocator (the functions referenced in the diff/comments).

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: aac9f5eb-d00f-4af6-9168-b3b4cd72d659

📥 Commits

Reviewing files that changed from the base of the PR and between e8efc2d and b430753.

📒 Files selected for processing (30)

Cargo.lock.msrv
Cargo.toml
README.md
docs/performance-workflow.md
scripts/inspect_codegen.sh
src/lib.rs
xz-core/Cargo.toml
xz-core/src/alloc.rs
xz-core/src/common/alone_decoder.rs
xz-core/src/common/alone_encoder.rs
xz-core/src/common/auto_decoder.rs
xz-core/src/common/block_decoder.rs
xz-core/src/common/block_encoder.rs
xz-core/src/common/common.rs
xz-core/src/common/file_info.rs
xz-core/src/common/filter_decoder.rs
xz-core/src/common/filter_encoder.rs
xz-core/src/common/index_decoder.rs
xz-core/src/common/index_encoder.rs
xz-core/src/common/lzip_decoder.rs
xz-core/src/common/microlzma_decoder.rs
xz-core/src/common/microlzma_encoder.rs
xz-core/src/common/stream_decoder.rs
xz-core/src/common/stream_decoder_mt.rs
xz-core/src/common/stream_encoder.rs
xz-core/src/common/stream_encoder_mt.rs
xz-core/src/types.rs
xz-sys/Cargo.toml
xz-sys/src/c_allocator.rs
xz-sys/src/lib.rs

💤 Files with no reviewable changes (1)

xz-sys/src/c_allocator.rs

coderabbitai · 2026-04-24T16:12:04Z

+#[inline]
+unsafe fn worker_allocator(thr: *const worker_thread) -> *const lzma_allocator {
+    #[cfg(feature = "custom_allocator")]
+    {
+        unsafe { (*thr).allocator }
+    }
+    #[cfg(not(feature = "custom_allocator"))]
+    {
+        let _ = thr;
+        core::ptr::null()
+    }
+}
+#[inline]
+unsafe fn set_worker_allocator(thr: *mut worker_thread, allocator: *const lzma_allocator) {
+    #[cfg(feature = "custom_allocator")]
+    {
+        unsafe {
+            (*thr).allocator = allocator;
+        }
+    }
+    #[cfg(not(feature = "custom_allocator"))]
+    {
+        let _ = (thr, allocator);
+    }
+}


🧹 Nitpick | 🔵 Trivial

Clean feature-gated indirection; consider deduping across encoder/decoder.

The worker_allocator/set_worker_allocator helpers correctly abstract the #[cfg(feature = "custom_allocator")] branches and the let _ = … absorptions avoid unused-arg warnings on the feature-off path. Usage is consistent: each worker stores its own allocator at init (Line 373) and all per-thread resources (in_0, block_decoder) are subsequently freed through worker_allocator(thr), which is strictly more robust than re-reading (*strm).allocator later.

Optional: these two helpers are duplicated verbatim in xz-core/src/common/stream_encoder_mt.rs (Lines 26–50). Only the worker_thread type differs. Factoring them through a small trait (trait HasAllocator { fn allocator(&self) -> *const lzma_allocator; … }) or a macro would remove the copy/paste without adding meaningful indirection.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@xz-core/src/common/stream_decoder_mt.rs` around lines 71 - 95, The two functions worker_allocator and set_worker_allocator duplicate the same feature-gated logic in stream_decoder_mt.rs and stream_encoder_mt.rs for different worker_thread types; refactor to remove copy/paste by extracting the shared behavior into a single reusable abstraction (either a small trait like HasAllocator with methods allocator()/set_allocator() implemented for each worker_thread type, or a macro that emits the two functions for a given worker_thread type), then replace the per-file helper definitions with calls to the common trait/macro implementation so worker_allocator, set_worker_allocator, worker_thread and lzma_allocator usages remain identical and the #[cfg(feature = "custom_allocator")] branches stay centralized.

coderabbitai

♻️ Duplicate comments (1)

xz-core/src/alloc/custom.rs (1)
26-125: ⚠️ Potential issue | 🔴 Critical

Custom alloc/free callbacks are still treated independently — cross-allocator free hazard.

Each of lzma_alloc, lzma_alloc_zero, lzma_free, internal_alloc_bytes, internal_alloc_object, internal_alloc_array, internal_alloc_zeroed_array, and internal_free checks only one of (*allocator).alloc / (*allocator).free. If a caller supplies a partial lzma_allocator (e.g. alloc=Some, free=None), allocations route through the user callback while the matching free falls back to c_free_ptr / rust_free_ptr (or vice versa), which crosses allocator boundaries and can corrupt memory.

Gate every custom path on both callbacks being present, falling back to the policy backend otherwise. For example:
🔒 Suggested pair-check pattern
 pub unsafe fn lzma_alloc(size: size_t, allocator: *const lzma_allocator) -> *mut c_void {
     let size = c_size(size);
-    if !allocator.is_null()
-        && let Some(alloc) = unsafe { (*allocator).alloc }
-    {
-        return unsafe { alloc((*allocator).opaque, 1, size) };
-    }
+    if !allocator.is_null()
+        && let (Some(alloc), Some(_free)) = unsafe { ((*allocator).alloc, (*allocator).free) }
+    {
+        return unsafe { alloc((*allocator).opaque, 1, size) };
+    }
     unsafe { c_alloc_bytes(size) }
 }
@@
 pub unsafe fn lzma_free(ptr: *mut c_void, allocator: *const lzma_allocator) {
-    if !allocator.is_null()
-        && let Some(free) = unsafe { (*allocator).free }
-    {
-        unsafe { free((*allocator).opaque, ptr) };
-        return;
-    }
+    if !allocator.is_null()
+        && let (Some(_alloc), Some(free)) = unsafe { ((*allocator).alloc, (*allocator).free) }
+    {
+        unsafe { free((*allocator).opaque, ptr) };
+        return;
+    }
     unsafe { c_free_ptr(ptr) };
 }
Apply the same paired check to lzma_alloc_zero, internal_alloc_bytes, internal_alloc_object, internal_alloc_array, internal_alloc_zeroed_array, and internal_free so allocations and frees never cross backends.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@xz-core/src/alloc/custom.rs` around lines 26 - 125, The code currently uses a
custom allocator if only one of (*allocator).alloc or (*allocator).free is
present, causing cross-allocator frees; update each function (lzma_alloc,
lzma_alloc_zero, lzma_free, internal_alloc_bytes, internal_alloc_object,
internal_alloc_array, internal_alloc_zeroed_array, internal_free) to require the
allocator to be non-null and both callbacks present before taking the custom
path (i.e., check that unsafe { (*allocator).alloc }.is_some() AND unsafe {
(*allocator).free }.is_some() for allocation and the same paired presence for
frees), otherwise fall back to the existing policy backend calls (c_alloc_bytes
/ c_alloc_zeroed_bytes / c_free_ptr or rust_alloc_impl / rust_free_ptr) so
allocations and frees always use the same backend.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@xz-core/src/alloc/custom.rs`:
- Around line 26-125: The code currently uses a custom allocator if only one of
(*allocator).alloc or (*allocator).free is present, causing cross-allocator
frees; update each function (lzma_alloc, lzma_alloc_zero, lzma_free,
internal_alloc_bytes, internal_alloc_object, internal_alloc_array,
internal_alloc_zeroed_array, internal_free) to require the allocator to be
non-null and both callbacks present before taking the custom path (i.e., check
that unsafe { (*allocator).alloc }.is_some() AND unsafe { (*allocator).free
}.is_some() for allocation and the same paired presence for frees), otherwise
fall back to the existing policy backend calls (c_alloc_bytes /
c_alloc_zeroed_bytes / c_free_ptr or rust_alloc_impl / rust_free_ptr) so
allocations and frees always use the same backend.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 6e4bf12e-359f-47a6-9d76-69e342c769d4

📥 Commits

Reviewing files that changed from the base of the PR and between b430753 and e9b8adf.

📒 Files selected for processing (6)

xz-core/src/alloc.rs
xz-core/src/alloc/c.rs
xz-core/src/alloc/custom.rs
xz-core/src/alloc/rust.rs
xz-core/src/alloc/rust_only.rs
xz-core/src/common/common.rs

liblzma->xz

240a033

coderabbitai Bot reviewed Apr 22, 2026

View reviewed changes

youknowone force-pushed the xz-crate branch from d61b258 to e8efc2d Compare April 23, 2026 06:23

coderabbitai Bot reviewed Apr 23, 2026

View reviewed changes

Rename core crate and centralize C allocator

b430753

youknowone force-pushed the xz-crate branch from e8efc2d to b430753 Compare April 24, 2026 16:05

coderabbitai Bot reviewed Apr 24, 2026

View reviewed changes

Split allocator implementation modules

e9b8adf

coderabbitai Bot reviewed Apr 25, 2026

View reviewed changes

youknowone merged commit 36be6fe into main Apr 25, 2026
47 checks passed

This was referenced Apr 28, 2026

trim alloc #9

Merged

trim #10

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

xz crate#8

xz crate#8
youknowone merged 3 commits intomainfrom
xz-crate

youknowone commented Apr 22, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Apr 22, 2026 •

edited

Loading

❌ Failed checks (1 warning, 1 inconclusive)

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Apr 22, 2026

Uh oh!

coderabbitai Bot Apr 22, 2026

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Apr 23, 2026

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

coderabbitai Bot Apr 24, 2026

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		- if: "!startsWith(github.ref, 'refs/tags/liblzma-sys') && !startsWith(github.ref, 'refs/tags/xz-sys') && startsWith(github.ref, 'refs/tags/xz')"
		name: Publish xz crate

Conversation

youknowone commented Apr 22, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Apr 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

❌ Failed checks (1 warning, 1 inconclusive)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 22, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 23, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

youknowone commented Apr 22, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Apr 22, 2026 •

edited

Loading