add nn.WeightNorm layer by mm65x · Pull Request #3296 · ml-explore/mlx

mm65x · 2026-03-22T12:44:59Z

Proposed changes

adds nn.WeightNorm, a module wrapper that applies weight normalization to a
parameter of a given module. reparameterizes a weight w into a magnitude
weight_g and direction weight_v such that w = g * v / ||v||, recomputed
on each forward pass.

implemented as a pure nn.Module layer with no C++ or free functions, as
suggested in #1921. the wrapped module's original weight is frozen so only
weight_g and weight_v are trainable, using the same freeze/unfreeze pattern
as BatchNorm's running stats.

works with any module that has a weight parameter (Linear, Conv1d, Conv2d, etc.).

Checklist

I have read the CONTRIBUTING document
I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
I have added tests that prove my fix is effective or that my feature works
I have updated the necessary documentation (if needed)

add nn.WeightNorm layer

c145489

mm65x marked this pull request as draft March 22, 2026 13:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add nn.WeightNorm layer#3296

add nn.WeightNorm layer#3296
mm65x wants to merge 1 commit intoml-explore:mainfrom
mm65x:add-weight-norm

mm65x commented Mar 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mm65x commented Mar 22, 2026

Proposed changes

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant