Draft: merge any_model: mip_and_realize_models by danielkorzekwa · Pull Request #995 · NVIDIA/Model-Optimizer

danielkorzekwa · 2026-03-06T14:13:41Z

What does this PR do?

This PR merges dkorzekwa/mip_and_realize_models into dkorzekwa/any_model_calc_one_block_scores and is opened for review only. Ultimately, dkorzekwa/mip_and_realize_models should be merged into feature/puzzletron once dkorzekwa/any_model_calc_one_block_scores has been merged there.

Summary by CodeRabbit

Release Notes

New Features
- Enabled model realization step during compression workflow after scoring phase completes.
Bug Fixes
- Fixed key-value head calculation in attention configuration sourcing.
Tests
- Strengthened validation checks for compression artifacts and output directories; added rank-aware assertions for model compression expectations.
Chores
- Minor documentation formatting updates.


coderabbitai · 2026-03-06T14:18:57Z

Walkthrough

This PR modifies the Puzzletron model optimization pipeline by changing how key-value head counts are computed from block configuration, enabling the MIP realization step in the main pipeline, and replacing disabled test assertions with active validation checks for post-processing artifacts.

Changes

| Cohort / File(s) | Summary |
|---|---|
| Pipeline Logic: `modelopt/torch/puzzletron/mip/run_puzzle.py`, `modelopt/torch/puzzletron/puzzletron.py` | Updated key-value head computation to source directly from `block_config.attention.num_key_value_heads` instead of deriving from subblock args. Uncommented and enabled Step 6 MIP realization execution in the main pipeline workflow. |
| Validation and Testing: `modelopt/torch/puzzletron/tools/validate_model.py`, `tests/gpu/torch/puzzletron/test_puzzletron.py` | Minor docstring formatting in validate_model.py. Replaced disabled test assertions with active rank-aware validation checks that verify pruning files, checkpoint directories, MIP solution artifacts, loss metrics, build artifacts, and scoring results. |
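
The walkthrough says the key-value head count is now read directly from `block_config.attention.num_key_value_heads`. A minimal sketch of that lookup follows, assuming simplified stand-in dataclasses: `AttentionConfig`, `BlockConfig`, and the helper `kv_heads_per_block` are hypothetical names used for illustration and are not the project's actual classes.

```python
from dataclasses import dataclass


@dataclass
class AttentionConfig:
    # Hypothetical stand-in; only the field named in the walkthrough is modeled.
    num_key_value_heads: int


@dataclass
class BlockConfig:
    # Hypothetical stand-in for a per-block configuration object.
    attention: AttentionConfig


def kv_heads_per_block(block_configs: list[BlockConfig]) -> list[int]:
    """Read the KV head count straight from each block's attention config."""
    return [block.attention.num_key_value_heads for block in block_configs]
```

Reading the value from the block configuration keeps a single source of truth per block, which matters when different blocks may end up with different attention shapes.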

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

🚥 Pre-merge checks | ✅ 3 | ❌ 1

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
|---|---|---|---|
| Title check | ⚠️ Warning | The title 'Draft: merge any_model: mip_and_realize_models' is misleading. It describes a merge operation into an intermediate branch, but the actual changes enable MIP model realization and fix pruning validation logic, not a merge operation. | Use a title that reflects the substantive changes: something like 'Enable MIP model realization and enhance pruning validation' or 'Implement MIP and realize models with improved test coverage'. |

✅ Passed checks (3 passed)

| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |
| Security Anti-Patterns | ✅ Passed | The modifications in run_puzzle.py, puzzletron.py, and validate_model.py only adjust internal logic and uncomment a function call, introducing no new torch.load calls, numpy.load usage, hardcoded trust_remote_code flags, eval/exec statements, or "# nosec" comments. |


coderabbitai

🧹 Nitpick comments (2)

tests/gpu/torch/puzzletron/test_puzzletron.py (1)

144-152: Consider defensive handling for missing directory.

If mip_solutions_dir doesn't exist (e.g., due to an earlier step failure), iterdir() will raise FileNotFoundError. A more explicit check could improve debuggability.

Optional: Add existence check before iteration

             # assertions for the mip_and_realize_models step 6
             # Find the MIP solution directory dynamically (e.g., stats_num_local_experts_*)
             mip_solutions_dir = puzzle_dir / "mip/puzzle_solutions"
+            assert mip_solutions_dir.exists(), (
+                f"MIP solutions directory not found: {mip_solutions_dir}"
+            )
             solution_dirs = [
                 d
                 for d in mip_solutions_dir.iterdir()
                 if d.is_dir() and d.name.startswith("stats_num_local_experts_")
             ]

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/gpu/torch/puzzletron/test_puzzletron.py` around lines 144 - 152, The
code assumes mip_solutions_dir exists before calling
mip_solutions_dir.iterdir(); add a defensive existence check (e.g., if not
mip_solutions_dir.exists() or not mip_solutions_dir.is_dir()) before iterating
and raise/assert with a clear error message that includes puzzle_dir and
mip_solutions_dir when missing, or set solution_dirs to an empty list and fail
the subsequent assertion with that informative message; update the block around
mip_solutions_dir, solution_dirs and the following assert to use this check so
FileNotFoundError is avoided and debugging is easier.
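
The existence check suggested above can also be factored into a small helper so the failure message names both paths. This is a sketch only: the function name `find_solution_dirs` and its `prefix` parameter are hypothetical, while the directory layout and the `stats_num_local_experts_` prefix are taken from the snippet in the review comment.

```python
from pathlib import Path


def find_solution_dirs(
    puzzle_dir: Path, prefix: str = "stats_num_local_experts_"
) -> list[Path]:
    """Return MIP solution directories, failing clearly if the parent is missing."""
    mip_solutions_dir = puzzle_dir / "mip" / "puzzle_solutions"
    # Fail with an informative message instead of a bare FileNotFoundError
    # from iterdir() when an earlier pipeline step did not produce output.
    assert mip_solutions_dir.is_dir(), (
        f"MIP solutions directory not found: {mip_solutions_dir} "
        f"(puzzle_dir={puzzle_dir})"
    )
    return sorted(
        d
        for d in mip_solutions_dir.iterdir()
        if d.is_dir() and d.name.startswith(prefix)
    )
```

Using `is_dir()` rather than `exists()` also covers the case where the path exists but is a regular file.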

modelopt/torch/puzzletron/puzzletron.py (1)

65-74: Minor: Step numbering skips Step 3.

The comments indicate Steps 0, 1, 2, 4, 5, 6 — Step 3 is missing. This might be intentional (perhaps a removed step), but it could cause confusion when referencing steps in logs or documentation.

Consider renumbering for clarity

-    # Step 4: build_library_and_stats (single process)
+    # Step 3: build_library_and_stats (single process)
     if dist.is_master():
         build_library_and_stats.launch_build_library_and_stats(hydra_cfg)
     dist.barrier()

-    # Step 5: calc_one_block_scores (distributed processing)
+    # Step 4: calc_one_block_scores (distributed processing)
     scoring.launch_scoring(hydra_cfg)

-    # Step 6: mip_and_realize_models (distributed processing)
+    # Step 5: mip_and_realize_models (distributed processing)
     mip_and_realize_models.launch_mip_and_realize_model(hydra_cfg)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@modelopt/torch/puzzletron/puzzletron.py` around lines 65 - 74, The step
comments in puzzletron.py skip Step 3 which is confusing; update the comments
around the sequence that includes dist.is_master(),
build_library_and_stats.launch_build_library_and_stats, dist.barrier(),
scoring.launch_scoring, and mip_and_realize_models.launch_mip_and_realize_model
to either renumber them sequentially (e.g., change "Step 4"→"Step 3", "Step
5"→"Step 4", "Step 6"→"Step 5") or add a brief comment like "Step 3 removed
intentionally" to make the gap explicit and avoid confusion when referencing
steps in logs/docs.
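
The pattern in the snippet above (a single-process step guarded by `dist.is_master()`, then `dist.barrier()`, then a distributed step) can be illustrated without a GPU cluster. The sketch below is a stand-in under stated assumptions: it uses Python threads and `threading.Barrier` in place of the project's `dist` helper, and `run_pipeline` and the event labels are hypothetical names chosen for the example.

```python
import threading


def run_pipeline(world_size: int = 2) -> list[str]:
    """Simulate 'master-only step, barrier, then all ranks proceed'."""
    events: list[str] = []
    lock = threading.Lock()
    barrier = threading.Barrier(world_size)

    def worker(rank: int) -> None:
        # Only the master rank (rank 0 here) runs the single-process step.
        if rank == 0:
            with lock:
                events.append("build_library_and_stats")
        # No rank passes this point until every rank has reached it,
        # so the master-only step is finished before scoring starts.
        barrier.wait()
        with lock:
            events.append(f"scoring:rank{rank}")

    threads = [
        threading.Thread(target=worker, args=(rank,)) for rank in range(world_size)
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return events
```

Because the master appends its event before it reaches the barrier, the first recorded event is always the single-process step, whatever order the ranks are scheduled in.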

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 7ed33a66-d1fb-4928-ad4d-611f6fb09e49

📥 Commits

Reviewing files that changed from the base of the PR and between eb4b210 and 47b4479.

📒 Files selected for processing (4)

modelopt/torch/puzzletron/mip/run_puzzle.py
modelopt/torch/puzzletron/puzzletron.py
modelopt/torch/puzzletron/tools/validate_model.py
tests/gpu/torch/puzzletron/test_puzzletron.py

💤 Files with no reviewable changes (1)

modelopt/torch/puzzletron/tools/validate_model.py

codecov · 2026-03-13T07:47:25Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 72.12%. Comparing base (eb4b210) to head (1c1e983).
⚠️ Report is 1 commit behind head on feature/puzzletron.

Additional details and impacted files

@@                  Coverage Diff                   @@
##           feature/puzzletron     #995      +/-   ##
======================================================
- Coverage               72.13%   72.12%   -0.02%     
======================================================
  Files                     209      209              
  Lines                   23628    23628              
======================================================
- Hits                    17045    17042       -3     
- Misses                   6583     6586       +3

danielkorzekwa added 26 commits March 4, 2026 11:33

Add anymodel directories to feature/puzzletron

e82164f

- Add converter, model_descriptor, puzzformer, and llama model support - Selective merge of anymodel functionality Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Make any_model conversion working.

2099df3

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Update child_init.py with anymodel version

eb5cf8a

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

fix attention pruning

c9de41c

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Add trust_remote_code to load_model_config (default to false)

3c1bc1f

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Make activation scoring working

8357136

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Comment all tested models aside of llama_3_1_8b_instruct

6cc2194

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Delete not needed decilm test

ee4e1e3

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Fix broken tests

449b523

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Update puzzletron_nas_pluging to any_model version

fb27bba

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Correct test resources used by tests.

b350f82

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Disable puzzletron tests (will be enabled after all any_model logic i…

fafe5a3

…s merged) Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Merge branch 'dkorzekwa/anymodel_core' into dkorzekwa/anymodel_activa…

e988248

…tion_scoring

Comment out not implemented models.

c717852

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

format python docs

030f126

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Merge branch 'dkorzekwa/anymodel_core' into dkorzekwa/anymodel_activa…

8dcdfbf

…tion_scoring

Use trust_remote_code in force_cache_dynamic_modules()

70df0df

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Merge branch 'dkorzekwa/anymodel_core' into dkorzekwa/anymodel_activa…

bb56662

…tion_scoring

Fix anymodel pruning

ecd953e

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Fix buid docs issue.

ee8f538

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Merge branch 'dkorzekwa/anymodel_core' into dkorzekwa/anymodel_activa…

c9b76a1

…tion_scoring

Merge branch 'dkorzekwa/anymodel_activation_scoring' into dkorzekwa/a…

6e3af61

…nymodel_pruning

Merging build_library_and_stats

0ad6d92

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Merging anymodel: calc_one_block_scores

995eb1a

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Mering any_model: calc_one_block_scores

34081c9

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

merge any_model: mip_and_realize_models

ed5c00f

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

danielkorzekwa requested a review from a team as a code owner March 6, 2026 14:13

kevalmorabia97 approved these changes Mar 6, 2026

Clarify readme and avoid reusing the same reference in llama_converter.

47414d5

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

danielkorzekwa added 17 commits March 9, 2026 09:36

Fix tied-embedding handling before writing the safetensors index.

a8305d8

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Fix NaN ranking currently selects NaNs as “best” experts by default.

68421a5

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Code clean up.

d6b8028

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Code clean up.

ecd2341

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

code clean up

f9d845d

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Merge branch 'dkorzekwa/anymodel_core' into dkorzekwa/anymodel_activa…

d171b01

…tion_scoring

Merge branch 'dkorzekwa/anymodel_activation_scoring' into dkorzekwa/a…

722da90

…nymodel_pruning

code clean up

934ab2f

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Merge branch 'dkorzekwa/anymodel_pruning' into dkorzekwa/anymodel_bui…

0f14ec3

…ld_library_and_stats

remove not needed comment

dcb9e02

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Merge branch 'dkorzekwa/anymodel_build_library_and_stats' into dkorze…

0c9ea5d

…kwa/any_model_calc_one_block_scores

Merge branch 'dkorzekwa/any_model_calc_one_block_scores' into dkorzek…

5b310e2

…wa/mip_and_realize_models

Fix a broken test_puzzletron test on 2 gpus.

176a435

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Merge branch 'dkorzekwa/anymodel_activation_scoring' into dkorzekwa/a…

02e2c9b

…nymodel_pruning

Merge branch 'dkorzekwa/anymodel_pruning' into dkorzekwa/anymodel_bui…

92c4419

…ld_library_and_stats

Merge branch 'dkorzekwa/anymodel_build_library_and_stats' into dkorze…

aa1eb3e

…kwa/any_model_calc_one_block_scores

Merge branch 'dkorzekwa/any_model_calc_one_block_scores' into dkorzek…

2b84a96

…wa/mip_and_realize_models

Base automatically changed from dkorzekwa/any_model_calc_one_block_scores to feature/puzzletron March 12, 2026 23:37

danielkorzekwa requested a review from a team as a code owner March 12, 2026 23:37

danielkorzekwa requested a review from realAsma March 12, 2026 23:37

danielkorzekwa added 2 commits March 13, 2026 00:32

Merge branch 'feature/puzzletron' into dkorzekwa/mip_and_realize_models

313260d

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Fix comments

47b4479

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

coderabbitai bot reviewed Mar 13, 2026

fix tox -e build-docs issue

1c1e983

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

danielkorzekwa merged commit 8fe318d into feature/puzzletron Mar 13, 2026
28 checks passed

danielkorzekwa deleted the dkorzekwa/mip_and_realize_models branch March 13, 2026 09:22

danielkorzekwa merged 47 commits into feature/puzzletron from dkorzekwa/mip_and_realize_models
