Dkorzekwa/any model other models by danielkorzekwa · Pull Request #1007 · NVIDIA/Model-Optimizer

danielkorzekwa · 2026-03-09T16:03:54Z

What does this PR do?

Merging dkorzekwa/any_model_other_models into dkorzekwa/mip_and_realize_models - this MR is only for reviewing. Ultimately dkorzekwa/any_model_other_models should be merged into feature/puzzletron once dkorzekwa/mip_and_realize_models is merged there.

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

coderabbitai · 2026-03-09T16:05:07Z

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

🗂️ Base branches to auto review (3)

main
release/.*
feature/.*

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: d8e9eb0c-bd29-4252-992d-863b204b6427

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch dkorzekwa/any_model_other_models

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

kevalmorabia97 · 2026-03-09T16:35:39Z

tests/gpu/torch/puzzletron/resources/hf_configs/llama_3_2_3b_instruct/config.json

Why do we store all these configs instead of directly reading from HF using config = AutoConfig.from_pretrained("Qwen/Qwen3-8B")?

To make sure we always use the same version for the test. I will add your suggestion to TODO so we will discuss it.

kevalmorabia97 · 2026-03-09T16:37:11Z

...torch/puzzletron/resources/configs/mistral-small-24b-instruct-2501/pruning/attn_pruning.yaml

Can we define one or more common yamls for all model tests and only define overwrites in one separate per-model yaml to avoid so many duplications and keep things simple?

added to TODOs

kevalmorabia97 · 2026-03-09T16:41:30Z

tests/gpu/torch/puzzletron/resources/hf_configs/nemotron-nano-12b-v2/modeling_nemotron_h.py

We dont want to maintain copies of all these HF files when we can just load them with Auto{Model,Config}.from_pretrained()

added to TODOs

…del_other_models

danielkorzekwa added 3 commits March 6, 2026 07:13

Add all anymodel models but gptoss

993b5ec

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

Make nemotron-nano-12b-v2 to work (set trust_remote_code=true)

6e9f03b

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

merge anymodel for nemotron-3-nano-30b-a3b-base-bf16

e8b7a7d

Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>

danielkorzekwa requested review from a team as code owners March 9, 2026 16:03

danielkorzekwa requested review from kevalmorabia97 and removed request for a team March 9, 2026 16:03

kevalmorabia97 reviewed Mar 9, 2026

View reviewed changes

danielkorzekwa added 2 commits March 10, 2026 05:00

Merge branch 'dkorzekwa/mip_and_realize_models' into dkorzekwa/any_mo…

4f82b1c

…del_other_models

Merge branch 'dkorzekwa/mip_and_realize_models' into dkorzekwa/any_mo…

fb838c0

…del_other_models

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dkorzekwa/any model other models#1007

Dkorzekwa/any model other models#1007
danielkorzekwa wants to merge 5 commits intodkorzekwa/mip_and_realize_modelsfrom
dkorzekwa/any_model_other_models

danielkorzekwa commented Mar 9, 2026

Uh oh!

coderabbitai bot commented Mar 9, 2026 •

edited

Loading

Review skipped

Uh oh!

kevalmorabia97 Mar 9, 2026

Uh oh!

danielkorzekwa Mar 10, 2026

Uh oh!

kevalmorabia97 Mar 9, 2026

Uh oh!

danielkorzekwa Mar 10, 2026

Uh oh!

kevalmorabia97 Mar 9, 2026

Uh oh!

danielkorzekwa Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

danielkorzekwa commented Mar 9, 2026

What does this PR do?

Uh oh!

coderabbitai bot commented Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Uh oh!

kevalmorabia97 Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

danielkorzekwa Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

kevalmorabia97 Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

danielkorzekwa Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

kevalmorabia97 Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

danielkorzekwa Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

coderabbitai bot commented Mar 9, 2026 •

edited

Loading