Add batch size and gradient accumulation parameters to quantization scripts #2456

Open

xin3he wants to merge 1 commit into master from xinhe/4-28
Conversation

Contributor

@xin3he xin3he commented Apr 28, 2026

Type of Change

example update

Description

Reduce peak memory usage so quantization can run on low-memory GPUs.

Expected Behavior & Potential Risk

The expected behavior triggered by this PR.

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Commit: Add batch size and gradient accumulation parameters to quantization scripts

Signed-off-by: Xin He <xin3.he@intel.com>
Contributor Author

xin3he commented Apr 28, 2026

bash run_quant.sh --topology=Llama-3.3-70B --dtype=mxfp4_mixed --input_model=meta-llama/Llama-3.3-70B-Instruct --output_model=./Llama-3.3-70B_mxfp4_mixed --gradient_accumulate_steps=2
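The `--gradient_accumulate_steps` flag in the command above trades extra forward/backward passes for lower peak memory: instead of back-propagating one large batch, the workload is split into micro-batches whose gradients are summed before a single update. A minimal, self-contained sketch of the idea, with illustrative names that are not the actual `run_quant.sh` implementation:

```python
def accumulate_gradients(micro_batches, compute_grad):
    """Average gradients over micro-batches before one parameter update.

    Only one micro-batch's activations need to be resident at a time,
    which is what lowers peak GPU memory relative to one big batch.
    """
    total = None
    for batch in micro_batches:
        g = compute_grad(batch)  # gradient for this micro-batch only
        total = g if total is None else [a + b for a, b in zip(total, g)]
    n = len(micro_batches)
    return [a / n for a in total]


# An effective batch of 4 samples processed as 2 micro-batches of 2,
# with a stand-in gradient function for illustration:
grads = accumulate_gradients(
    [[1.0, 2.0], [3.0, 4.0]],
    lambda batch: [x * 0.1 for x in batch],
)
```

With `--gradient_accumulate_steps=2` the effective batch statistics stay the same while per-step memory roughly halves, which is why the example command targets low-memory GPUs.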
