Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
61 commits
Select commit Hold shift + click to select a range
5cddcb5
feat: Add model analysis and conversion framework with Transformers i…
antmikinka Mar 14, 2026
0aa1505
fix: Use Transformers integration for HF Hub models in gap analysis
antmikinka Mar 14, 2026
61fb52a
Fix CLI scan command to print summary directly from info object
antmikinka Mar 14, 2026
d890840
Remove silent AST scanner fallback from gap analysis
antmikinka Mar 14, 2026
6236d65
Fix gap analysis to properly detect sliding window as unsupported
antmikinka Mar 14, 2026
1bf709d
Add operator specification generator (#76)
antmikinka Mar 14, 2026
f3c30fe
Fix Transformers 5.x compatibility for multi-modal models (#77)
antmikinka Mar 14, 2026
b06fce7
Add operator creation guide and update README (#78)
antmikinka Mar 14, 2026
bc4cda2
Archive duplicate files from model_convert (#79)
antmikinka Mar 14, 2026
8a0fa4b
Consolidate model_analysis imports and improve documentation (#80)
antmikinka Mar 14, 2026
ef842ca
Add comprehensive data sources guide for operator creation (#81)
antmikinka Mar 15, 2026
ce9002e
Add master document generator for operator implementation (#82)
antmikinka Mar 15, 2026
c5818bd
Export generate_master_document in __init__.py (#82)
antmikinka Mar 15, 2026
ace8c76
Add Reduction operator for AIE2 and AIE2P (#83)
antmikinka Mar 15, 2026
154acc2
Add Conv2D operator for AIE2 and AIE2P (#84)
antmikinka Mar 15, 2026
aa1cbcd
Add MaxPool operator for AIE2 and AIE2P (#85)
antmikinka Mar 15, 2026
dc2039f
Add AveragePool operator for AIE2 and AIE2P (#86)
antmikinka Mar 15, 2026
11da5b6
Add Conv3D operator for AIE2 and AIE2P (#87)
antmikinka Mar 15, 2026
9023b4b
Fix syntax error in conv3d_bf16_large_kernel weight_idx calculation
antmikinka Mar 15, 2026
6c4f30d
Update CONV3D_STRATEGY.md to reflect completed implementation
antmikinka Mar 15, 2026
afcb559
Add conv3d_bf16_large_kernel for AIE2 architecture
antmikinka Mar 15, 2026
6364a54
Update CONV3D_STRATEGY.md for complete AIE2 large_kernel support
antmikinka Mar 15, 2026
ee61d48
Add conv3d_bf16_scalar for AIE2P architecture
antmikinka Mar 15, 2026
f3378e2
Update CONV3D_STRATEGY.md to reflect complete kernel parity
antmikinka Mar 15, 2026
46baf11
Add ONNX Runtime GenAI Windows backend for NPU runtime (Task #52)
antmikinka Mar 15, 2026
a69a610
Complete ONNX Runtime GenAI API implementation (Task #53)
antmikinka Mar 15, 2026
26a7bc9
Add Task #52 & #53 completion report
antmikinka Mar 15, 2026
556655b
Add IronServer C++ backend implementation and integration guide
antmikinka Mar 15, 2026
3027cf0
Add session summary for continuation session
antmikinka Mar 15, 2026
127304a
docs: Add comprehensive IronServer integration documentation
antmikinka Mar 15, 2026
9d24489
docs: Add Llama3.2 operator analysis and support plan
antmikinka Mar 16, 2026
4d642b9
feat: Phase 2 Baseline Complete - Benchmark Framework + Operator Impl…
antmikinka Mar 16, 2026
40a029c
feat: Phase 3 Week 1 complete - Foundation components for Llama3.2 in…
antmikinka Mar 16, 2026
6745eab
feat: Phase 3 Week 2 complete - Llama3.2 model config and weight loader
antmikinka Mar 16, 2026
904c8e6
docs: Update PROJECT_STATUS_TRACKER for Week 2 completion
antmikinka Mar 16, 2026
991dca7
feat: Phase 3 Week 3 generation infrastructure - STRUCTURE COMPLETE
antmikinka Mar 16, 2026
4cfc824
feat: Phase 3 Week 3 REMEDIATION COMPLETE - _forward_layer() implemented
antmikinka Mar 18, 2026
fe9a5d8
feat: Add block_size config for paged KV cache integration
antmikinka Mar 18, 2026
06f3bee
feat: Implement P0 benchmark regression fixes across 10 operator files
antmikinka Mar 18, 2026
eaeaab4
feat: P3 benchmark infrastructure complete - tile/column scaling stud…
antmikinka Mar 19, 2026
969594f
docs: Update .gitignore to exclude documentation and AI folders
antmikinka Mar 19, 2026
0b35142
fix: Gracefully skip NPU hardware tests when AIE toolchain unavailable
antmikinka Mar 19, 2026
36b9929
docs: Add cross-analysis verification report for comprehensive benchm…
antmikinka Mar 19, 2026
7fc8191
fix(p0-critical): Resolve severe performance regressions in 6 operators
antmikinka Mar 19, 2026
84b2333
fix(p1-high): Address bandwidth and stability regressions in 5 operators
antmikinka Mar 19, 2026
380714e
fix(p2-medium): Resolve stddev regressions in GEMM and GEMV operators
antmikinka Mar 19, 2026
6bdf735
fix(p1-high): Resolve AXPY 4-column 2-channel bandwidth regression
antmikinka Mar 19, 2026
5a0bd8d
docs: Update benchmark analysis tracking documentation
antmikinka Mar 19, 2026
c6d330f
docs: Add SWIGLU_DECODE fix plan documentation
antmikinka Mar 21, 2026
589a793
docs: Add SWIGLU_DECODE-FIX-PLAN.md to task tracking table
antmikinka Mar 21, 2026
82f3f14
fix(p2-medium): Add FIFO depth=3 for TANH 2-column stability
antmikinka Mar 21, 2026
b814d9e
docs: Update task tracking with TANH 2-column fix (Task #119)
antmikinka Mar 21, 2026
ef079f6
docs: Add TRANSPOSE fix status and update task tracking (Task #120)
antmikinka Mar 21, 2026
24fa898
fix(p1-high): Enhanced FIFO depth for WEIGHTED_RMS_NORM stability
antmikinka Mar 21, 2026
8cb875d
docs: Update task tracking with WEIGHTED_RMS_NORM fix (Task #121)
antmikinka Mar 21, 2026
64e745f
fix: Batch commit for 17 operator benchmark fixes
antmikinka Mar 21, 2026
ffd699d
chore: Apply Black formatting to Python files
antmikinka Mar 21, 2026
dae6f6c
fix: Critical import regression and numpy.softmax errors in generatio…
antmikinka Mar 21, 2026
fd7783c
fix(p0-critical): AXPY operator FIFO depth with tile_size_factor
antmikinka Mar 21, 2026
5ee11e3
fix(p1-high): DEQUANT operator FIFO depth with tile_size_factor
antmikinka Mar 21, 2026
63f0d6f
fix(p1-high): DEQUANT operator add large tile (>=2048) factor
antmikinka Mar 21, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .clang-format
Original file line number Diff line number Diff line change
Expand Up @@ -40,3 +40,4 @@ AllowAllParametersOfDeclarationOnNextLine: false
BinPackParameters: false
BinPackArguments: false
ConstructorInitializerAllOnOneLineOrOnePerLine: true
UseCRLF: true
42 changes: 42 additions & 0 deletions .claude/settings.local.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,42 @@
{
"permissions": {
"allow": [
"mcp__clear-thought-server__sequentialthinking",
"mcp__sequential-thinking__sequentialthinking",
"Bash(git add:*)",
"Bash(git commit:*)",
"Bash(git push:*)",
"Bash(test:*)",
"Bash(python3:*)",
"Bash(python -m py_compile:*)",
"Bash(python:*)",
"Bash(ls:*)",
"Bash(cmd /c:*)",
"Bash(cmake:*)",
"Bash(wc:*)",
"Bash(git pull:*)",
"Bash(git stash:*)",
"Bash(git rebase:*)",
"Bash(dir:*)",
"Bash(git -C /c/Users/antmi/IRON log --oneline -10)",
"Bash(git -C /c/Users/antmi/IRON log --oneline -20)",
"Bash(find:*)",
"Bash(black:*)",
"Bash(clang-format:*)",
"Bash(unix2dos:*)",
"Bash(findstr:*)",
"Bash(gh pr view:*)",
"Bash(gh api:*)",
"WebFetch(domain:github.com)",
"Bash(sort:*)",
"Bash(git show:*)",
"Bash(git diff-tree:*)",
"Bash(git -C \"/c/Users/antmi/IRON\" show 6bdf735 --stat)",
"Bash(git -C \"/c/Users/antmi/IRON\" diff iron/operators/gelu/design.py)",
"Bash(flake8:*)",
"Bash(pip install:*)",
"Bash(cat:*)",
"Bash(reuse lint:*)"
]
}
}
5 changes: 5 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -20,3 +20,8 @@ id_ed25519.pub
*.model
.cline_storage
*.egg-info

# Documentation and AI folders
docs/
chroma-data/
.claude/
Loading
Loading