IM2Deep 2.0 API by rodvrees · Pull Request #18 · CompOmics/IM2Deep

rodvrees · 2026-01-08T14:47:25Z

No description provided.

Copilot

Pull request overview

This PR introduces the IM2Deep 2.0 API, a major refactoring from version 1.2.0 to 2.0.0-beta. It modernizes the codebase with a comprehensive test suite, an architecture rebuilt on PyTorch Lightning, and an API design with a clearer separation of concerns.

Changes:

Complete API refactoring with new modular structure (core, calibration, model_ops, constants)
Added comprehensive test suite with 9 test modules covering ~90% of functionality
Replaced DeepLC-based models with native PyTorch Lightning implementations
Updated dependencies: removed version constraint on deeplc, added torch and lightning as core dependencies
Enhanced CLI with better argument handling and profiling support

Reviewed changes

Copilot reviewed 27 out of 36 changed files in this pull request and generated 29 comments.

File	Description
tests/*	New comprehensive test suite with fixtures, unit tests, and integration tests
pytest.ini	Pytest configuration with markers for integration and slow tests
pyproject.toml	Updated dependencies: removed deeplc constraint and `er` extras, added torch/lightning
im2deep/utils.py	Major expansion with input parsing, validation, CCS conversion, and CLI utilities
im2deep/_model_ops.py	New file for PyTorch model loading and prediction operations
im2deep/core.py	New high-level API with predict() and predict_and_calibrate() functions
im2deep/constants.py	New file centralizing model paths, configurations, and physical constants
im2deep/calibration.py	Refactored calibration with abstract base class and LinearCCSCalibration implementation
im2deep/_architecture.py	New file with PyTorch Lightning model architectures (IM2Deep, IM2DeepMulti, IM2DeepMultiTransfer)
im2deep/_exceptions.py	Minor formatting improvements to exception classes
im2deep/__main__.py	Major CLI refactor with DefaultCommandGroup, improved argument handling, and profiling
im2deep/__init__.py	Updated to version 2.0.0-beta with new API exports
README.md	Comprehensive documentation update with CLI examples and API usage
.github/workflows/test.yml	New CI/CD workflow for automated testing
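
The table describes calibration.py as an abstract base class with a LinearCCSCalibration implementation. As a rough illustration of that pattern only, the sketch below uses hypothetical names (`CalibrationSketch`, `LinearFitSketch`, `fit`, `transform`) and a plain least-squares line. The actual IM2Deep classes and their signatures are not shown in this excerpt and may differ.

```python
from abc import ABC, abstractmethod
from typing import List


class CalibrationSketch(ABC):
    """Hypothetical base class: fit on reference pairs, then transform predictions."""

    @abstractmethod
    def fit(self, predicted: List[float], observed: List[float]) -> None:
        """Learn calibration parameters from matching predicted/observed values."""

    @abstractmethod
    def transform(self, predicted: List[float]) -> List[float]:
        """Apply the learned calibration to new predictions."""


class LinearFitSketch(CalibrationSketch):
    """Hypothetical linear calibration: observed ~ slope * predicted + intercept."""

    def __init__(self) -> None:
        self.slope = 1.0
        self.intercept = 0.0

    def fit(self, predicted: List[float], observed: List[float]) -> None:
        if len(predicted) != len(observed) or len(predicted) < 2:
            raise ValueError("need at least two matching reference pairs")
        n = len(predicted)
        mean_x = sum(predicted) / n
        mean_y = sum(observed) / n
        var_x = sum((x - mean_x) ** 2 for x in predicted)
        if var_x == 0:
            raise ValueError("predicted values must not all be identical")
        cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(predicted, observed))
        self.slope = cov_xy / var_x
        self.intercept = mean_y - self.slope * mean_x

    def transform(self, predicted: List[float]) -> List[float]:
        return [self.slope * x + self.intercept for x in predicted]


# Toy reference data in which every observation is shifted by +10.
calibration = LinearFitSketch()
calibration.fit(predicted=[400.0, 450.0, 500.0], observed=[410.0, 460.0, 510.0])
calibrated = calibration.transform([420.0])
```

Keeping the interface abstract means another calibration strategy could be swapped in without changing the calling code.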

tests/test_constants.py

tests/test_utils.py

im2deep/utils.py

tests/test_exceptions.py

RalfG

Some general comments:

I think we could reconsider making some more modules private, or splitting them into a public and a private part (for instance utils.py).
_architecture.py is quite a long file. Perhaps it could be split up into submodules for the loss functions and each of the models?
Exceptions raised in public functions should also be public, so they can be caught in downstream applications if needed. (This was also wrong in DeepLC 4.0.)
Are the functions in _model_ops.py still necessary, or could they be replaced with a Lightning Trainer class?
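
The point about public exceptions can be illustrated with a small sketch. The class names (`IM2DeepErrorSketch`, `CalibrationErrorSketch`) and the `predict_sketch` function are hypothetical stand-ins, not the package's actual API. The idea is only that downstream code can catch an exception type it can import from a public module.

```python
from typing import List, Optional


class IM2DeepErrorSketch(Exception):
    """Hypothetical public base class for all package errors."""


class CalibrationErrorSketch(IM2DeepErrorSketch):
    """Hypothetical error raised when calibration cannot be performed."""


def predict_sketch(sequences: List[str], reference: Optional[List[float]]) -> List[float]:
    """Stand-in for a public function that raises a public exception."""
    if reference is not None and len(reference) == 0:
        raise CalibrationErrorSketch("empty calibration reference")
    # Placeholder "prediction": not a real CCS model.
    return [float(len(seq)) for seq in sequences]


def run_downstream() -> str:
    """Downstream code that catches the public base class."""
    try:
        predict_sketch(["PEPTIDE"], reference=[])
    except IM2DeepErrorSketch as exc:
        return f"handled: {exc}"
    return "ok"


result = run_downstream()
```

Catching the base class covers every subclass, so new error types can be added later without breaking existing handlers.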

.github/workflows/test.yml

im2deep/__init__.py

im2deep/_exceptions.py

im2deep/_model_ops.py

im2deep/__init__.py

im2deep/__main__.py

im2deep/calibration.py

Copilot

Pull request overview

Copilot reviewed 37 out of 47 changed files in this pull request and generated 11 comments.

im2deep/core.py

+from im2deep import _model_ops
+from im2deep.calibration import Calibration, LinearCCSCalibration
+from im2deep.constants import DEFAULT_MODEL, DEFAULT_MULTI_MODEL
+from im2deep.utils import validate_psm_list


im2deep/core.py

+    LOGGER.info("Predicting CCS values using IM2Deep.")
+    psm_list = validate_psm_list(psm_list)
+    return _model_ops.predict(
+        model=model or DEFAULT_MODEL if not multi else DEFAULT_MULTI_MODEL,
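
In Python, a conditional expression binds more loosely than `or`, so the `model=` argument above parses as `(model or DEFAULT_MODEL) if not multi else DEFAULT_MULTI_MODEL`. With `multi=True`, a user-supplied `model` is therefore ignored. Whether that is intended is not stated in the diff. The sketch below uses placeholder string constants and a hypothetical helper, `resolve_model`, to contrast the expression as written with one possible alternative that always honors an explicit `model`.

```python
from typing import Optional

# Placeholder values; the real constants live in im2deep.constants.
DEFAULT_MODEL = "default-single"
DEFAULT_MULTI_MODEL = "default-multi"


def resolve_as_written(model: Optional[str], multi: bool) -> str:
    # Same expression as in the diff, which Python parses as:
    # (model or DEFAULT_MODEL) if not multi else DEFAULT_MULTI_MODEL
    return model or DEFAULT_MODEL if not multi else DEFAULT_MULTI_MODEL


def resolve_model(model: Optional[str], multi: bool) -> str:
    # Alternative: an explicit model always wins; only the default depends on `multi`.
    if model is not None:
        return model
    return DEFAULT_MULTI_MODEL if multi else DEFAULT_MODEL


as_written = resolve_as_written("custom.ckpt", multi=True)
alternative = resolve_model("custom.ckpt", multi=True)
```

Here `as_written` falls back to the multi-model default even though a custom checkpoint was passed, while `alternative` keeps the custom checkpoint.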


im2deep/core.py

+    )
+
+    # Assign the predicted CCS to the PSM metadata
+    for idx, psm in enumerate(psm_list):


im2deep/_io_helpers.py

+    psm_list_filtered = psm_list[charges != None]  # noqa: E711
+    psm_list_filtered = psm_list_filtered[charges <= 6]
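
Two things are worth checking in the filtering lines above. The second mask is computed from the original `charges` array, whose length no longer matches `psm_list_filtered` once entries have been dropped. In addition, `charges <= 6` is evaluated on an array that may still contain `None`. A minimal sketch, assuming `charges` is a NumPy object array and using a plain array as a stand-in for the PSM list, builds one combined mask and applies it once. The helper name `valid_charge_mask` and the `max_charge` parameter are hypothetical.

```python
import numpy as np


def valid_charge_mask(charges: np.ndarray, max_charge: int = 6) -> np.ndarray:
    """Return a boolean mask that is True where a charge is present and <= max_charge."""
    return np.array(
        [c is not None and c <= max_charge for c in charges],
        dtype=bool,
    )


# Stand-ins for the PSM list and its precursor charges.
items = np.array(["psm_a", "psm_b", "psm_c", "psm_d"])
charges = np.array([2, None, 7, 3], dtype=object)

# One mask with the same length as the original list, applied in a single step.
mask = valid_charge_mask(charges)
filtered = items[mask]
```

Because the mask is built once from the unfiltered charges, its length always matches the list it indexes.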


im2deep/_io_helpers.py

+            if (
+                psm.ion_mobility is not None
+                and psm.metadata is not None
+                and psm.metadata.get("CCS") is None
+            ):
+                psm.metadata["CCS"] = str(
+                    im2ccs(
+                        psm.ion_mobility,
+                        psm.peptidoform.theoretical_mz,
+                        psm.peptidoform.precursor_charge,
+                    )
+                )
+            # Ensure CCS is always stored as float
+            elif psm.metadata.get("CCS") is not None:
+                ccs_value = psm.metadata["CCS"]
+                if not isinstance(ccs_value, float):
+                    psm.metadata["CCS"] = float(ccs_value)
+
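
The first branch above stores the computed CCS as a string, and because the float conversion sits in an `elif`, it is skipped for exactly those entries, even though the comment says CCS should always be a float. Whether metadata values should be strings or floats is a project decision that the diff does not settle. The sketch below only shows one way to make the two branches consistent, using a plain dict as a stand-in for `psm.metadata`, a hypothetical helper `ensure_float_ccs`, and a caller-supplied `compute_ccs` callback in place of `im2ccs`.

```python
from typing import Callable, Optional


def ensure_float_ccs(
    metadata: dict,
    ion_mobility: Optional[float],
    compute_ccs: Callable[[float], float],
) -> dict:
    """Fill in a missing CCS from ion mobility, then store CCS as float in all cases."""
    if metadata.get("CCS") is None and ion_mobility is not None:
        metadata["CCS"] = compute_ccs(ion_mobility)
    # A plain `if` (not `elif`) so that freshly computed values are normalized too.
    if metadata.get("CCS") is not None:
        metadata["CCS"] = float(metadata["CCS"])
    return metadata


def fake_im2ccs(ion_mobility: float) -> float:
    """Placeholder conversion; the real code calls im2ccs with m/z and charge."""
    return 400.0 * ion_mobility


computed = ensure_float_ccs({}, ion_mobility=1.0, compute_ccs=fake_im2ccs)
parsed = ensure_float_ccs({"CCS": "412.5"}, ion_mobility=None, compute_ccs=fake_im2ccs)
untouched = ensure_float_ccs({}, ion_mobility=None, compute_ccs=fake_im2ccs)
```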


tests/test_utils.py

+    def test_ccs2im_basic(self):
+        """Test basic CCS to ion mobility conversion."""
+        ccs = 450.0
+        charge = 2
+        mz = 500.0
+        im = ccs2im(ccs, charge, mz)
+
+        assert isinstance(im, float)
+        assert im > 0
+
+    def test_ccs2im_array(self):
+        """Test CCS to ion mobility conversion with arrays."""
+        ccs = np.array([450.0, 520.0, 480.0])
+        charge = np.array([2, 3, 2])
+        mz = np.array([500.0, 600.0, 550.0])
+
+        im = ccs2im(ccs, charge, mz)
+
+        assert isinstance(im, np.ndarray)
+        assert len(im) == len(ccs)
+        assert np.all(im > 0)
+
+    def test_im2ccs_basic(self):
+        """Test basic ion mobility to CCS conversion."""
+        im = 1.0
+        charge = 2
+        mz = 500.0
+        ccs = im2ccs(im, charge, mz)
+
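
The tests above call the converters as `ccs2im(ccs, charge, mz)` and `im2ccs(im, charge, mz)`, whereas the `_io_helpers.py` excerpt passes `theoretical_mz` before `precursor_charge`. Both arguments are numbers, so a swapped order would not raise an error. One hedged way to guard against this is to use keyword-only parameters and add a round-trip test. The functions below are hypothetical stand-ins, not the package's implementations. The Mason-Schamp-style formula, the constant, and the default gas mass and temperature are assumptions made for illustration.

```python
import math

# Assumed prefactor and defaults, for illustration only.
SUMMARY_CONSTANT = 18509.8632163405
MASS_GAS = 28.013      # assumed drift gas mass (N2), in Da
TEMP_KELVIN = 305.0    # assumed temperature, in K


def _reduced_mass(mz: float, charge: int) -> float:
    """Reduced mass of the ion (mass approximated as mz * charge) and the drift gas."""
    ion_mass = mz * charge
    return (ion_mass * MASS_GAS) / (ion_mass + MASS_GAS)


def im2ccs_sketch(reverse_im: float, *, mz: float, charge: int) -> float:
    """Convert reduced ion mobility (1/K0) to CCS; mz and charge are keyword-only."""
    scale = math.sqrt(_reduced_mass(mz, charge) * TEMP_KELVIN)
    return (SUMMARY_CONSTANT * charge * reverse_im) / scale


def ccs2im_sketch(ccs: float, *, mz: float, charge: int) -> float:
    """Inverse of im2ccs_sketch under the same assumptions."""
    scale = math.sqrt(_reduced_mass(mz, charge) * TEMP_KELVIN)
    return (ccs * scale) / (SUMMARY_CONSTANT * charge)


# Round trip: converting CCS to ion mobility and back should recover the input.
ccs_in = 450.0
im = ccs2im_sketch(ccs_in, mz=500.0, charge=2)
ccs_out = im2ccs_sketch(im, mz=500.0, charge=2)

# Swapping the two values silently changes the result, which keywords make visible.
ccs_swapped = im2ccs_sketch(im, mz=2.0, charge=500)
```

With keyword-only parameters, a call such as `im2ccs_sketch(im, 500.0, 2)` fails with a `TypeError` instead of silently mixing up m/z and charge.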


im2deep/__main__.py

+        predictions = core.predict_and_calibrate(psm_list, psm_list_cal, *args, **kwargs)
+    else:
+        LOGGER.info(
+            "No calibration file provided (calibration is HIGHLY recommended), performing prediction only..."
        )
+        predictions = core.predict(*args, **kwargs)


im2deep/_model_ops.py

+    # TODO: Implement load_model function here (also config) and path to default model?
+    model = _get_architecture(
+        multi=multi,
+    ).load_from_checkpoint(
+        checkpoint_path=model,  # type: ignore # TODO: Match with function signature
+        config=_get_model_config(multi=multi),
+        criterion=_get_loss_function(multi=multi),
+    )
+    model.eval()


im2deep/_model_ops.py

+    )
+    model.eval()
+    LOGGER.debug(f"Model loaded on device: {device}")
+
+    data_loader = DataLoader(
+        data,
+        batch_size=batch_size,
+        shuffle=False,
+        num_workers=num_workers,
+    )
+    LOGGER.debug("DataLoader created for prediction.")
+    LOGGER.debug("Starting prediction loop.")
+    predictions = _predict_loop(model, data_loader, device)
+    return predictions.cpu().detach()
+


im2deep/_architectures/callbacks.py

@@ -0,0 +1,26 @@
+"""Architecture callbacks."""
+
+import pytorch_lightning as L


RalfG and others added 5 commits January 8, 2026 11:23

General linting and typing notation updates

1cf1ee0

Remove numpy version pin

2c5a047

Merge pull request #17 from CompOmics/fix/linting-and-numpy-pin

07e227e

Remove Numpy version pin; general linting and typing updates

Bump version

2c80cc4

Add core module, outline core functionality

1f206d6

rodvrees added this to the v2.0 milestone Jan 8, 2026

rodvrees added 24 commits January 8, 2026 17:14

model_ops and calibration

9735019

Calibration refactoring done

52a6b0e

CLI functionality start

7c12fc5

Implement prediction logic

e2f69d0

New model

07428d7

get_default_reference in Calibration class

3ff0946

Do not use PSMList during calibration

436fb18

Fix calibration logic

8943fb1

Fix calibration for multiconformer prediction

1e2a027

delete outdated modules and model files

8206671

add tests + adapt code where needed

dca3582

Add test workflow

b9e1fb4

update dependencies

e99a871

pytorch-lightning -> lightning

49bc58f

update pyproject and README.md

35ccac0

Remove non-optional optional dependencies

1a4c5ce

Add profiling + use dataframes for calibration

52e1999

Fix tests

8902973

Fix error if CCS is directly in df

35f4451

Fix tests

10b7c32

Fix if CCS is directly in df

e3c0a18

Further fixes

074cb9c

Fix general shift allocation

e3c39eb

throw out im2deeptrainer dependency

40ca412

rodvrees added 2 commits January 14, 2026 10:03

update pyproject.toml

ddd62ef

wandb optional

be33db8

rodvrees marked this pull request as ready for review January 19, 2026 13:36

rodvrees requested review from RalfG and Copilot January 19, 2026 13:36

update version

681998e

Copilot started reviewing on behalf of rodvrees January 19, 2026 13:36

Copilot AI reviewed Jan 19, 2026

rodvrees added 2 commits February 17, 2026 08:39

update __init__.,py

112c5c3

Ruff Ruff

74ff2cb

RalfG reviewed Mar 16, 2026

RalfG requested changes Mar 16, 2026

RalfG added 9 commits March 17, 2026 10:32

Update CI workflows:

ed966f7

- Use ruff for faster linting, including formatting checks
- Use uv for faster installs and builds, including caching
- Fix issue where test workflow would be run on 'master' branch, while default branch is 'main'

Merge branch 'new-api' of https://github.com/CompOmics/IM2Deep into new-api

84e63c9

Split up architecture module into subpackage

0e4ab37

Consolidate pytest config to pyproject.toml; fix test workflow; move optional dependencies to dependency groups

874fc9f

Make exceptions public

e0efbc9

Fix linting and formatting issues

f6e349b

Split utils module into public utils and private _io_helpers; fix remaining typing issues; add some to-do's.

d7a016a

Change version to alpha.1

c9fdc43

Add type checking to CI

888db74

RalfG changed the base branch from main to release/2.0 March 17, 2026 17:16

RalfG requested a review from Copilot March 17, 2026 17:16

Copilot started reviewing on behalf of RalfG March 17, 2026 17:17

Move type checking to separate job with installed package

da16941

Copilot AI reviewed Mar 17, 2026

3 participants