Simple adaptive integration by pkienzle · Pull Request #658 · SasView/sasmodels

pkienzle · 2025-07-30T16:18:57Z

Alternative to the #608 using a simple heuristic based on qr.

Implements adaptive integration for all shapes except superball. The paracrystal models (bcc, fcc, sc) need a different approach.

Accuracy is usually comparable to a 10000 point gaussian integration for every qr. The target is 0.1% difference, though it isn't always achieved. For example:

$ python -m sasmodels.compare background=0 core_shell_cylinder -ngauss=0,10000 -engine=single,single! -random=83174 -nq=1000 -pars -neval=10
Randomize using -random=83174
scale: 0.291548
background: 0
sld_core: 8.92736
sld_shell: 11.2834
sld_solvent: 10.7719
radius: 291.224
thickness: 33528.1
length: 1.53983
GPU[32] t=12.22 ms, intensity=36122148864
DLL[32] t=293.91 ms, intensity=36122148864
|GPU[32]-DLL[32]|            max:3.648e+03  median:2.930e-03  98%:6.400e+02  rms:2.012e+02  zero-offset:+3.928e+01
|(GPU[32]-DLL[32])/DLL[32]|  max:3.443e-02  median:2.903e-06  98%:2.554e-03  rms:1.932e-03  zero-offset:+2.716e-04

Because we include a 20 point gaussian integration scheme, speed is frequently faster than the fixed 76 point gaussian integration in master, at least for small shapes. For large shapes it can be several times slower than the fixed scheme, though the increase in accuracy easily justifies the cost.

Shapes with nested integrals (e.g., triaxial ellipsoid) can be very slow. For example:

$ python -m sasmodels.compare background=0 triaxial_ellipsoid -random=709580 -nq=100 -pars
Randomize using -random=709580
scale: 0.0515113
background: 0
sld: 4.27038
sld_solvent: 8.05591
radius_equat_minor: 30.0115
radius_equat_major: 28083
radius_polar: 42754.1
GPU[32] t=11001.53 ms, intensity=69025144

Because the cost for a 10000 point gaussian with nested integration is so high these models have only be checked for accuracy at a few Q points.

Refs #248

pkienzle · 2025-07-30T19:08:27Z

Example of bad triaxial ellipsoid (20% error):

$ python -m sasmodels.compare background=0 triaxial_ellipsoid -ngauss=0,10000 -engine=single,single! -nq=30 -random=716856 -pars
Randomize using -random=716856
scale: 0.00343363
background: 0
sld: 11.2141
sld_solvent: 10.9297
radius_equat_minor: 41.5349
radius_equat_major: 9142.92
radius_polar: 74.0436
GPU[32] t=58.31 ms, intensity=33
DLL[32] t=12646.33 ms, intensity=33
|GPU[32]-DLL[32]|            max:1.941e-03  median:1.907e-06  98%:1.849e-03  rms:6.002e-04  zero-offset:+2.745e-04
|(GPU[32]-DLL[32])/DLL[32]|  max:1.884e-01  median:1.179e-06  98%:1.810e-01  rms:5.891e-02  zero-offset:+2.319e-02

The fixed 76 point integration scheme works better for this example (0.3% error).

Maybe it is worth exploring Lebedev and other surface quadrature schemes for these nested integrals. It is messy, though, because not all of them are of the form ∫∫ F(q) sin(θ) dφ dθ.

butlerpd · 2025-10-21T14:52:16Z

This was briefly discussed at today's fortnightly call and tagged as of interest to the upcoming camp. Question is whether it provides a minimal change to provide a reasonable speedup. It is noted that this PR not only adds the new adaptive integation it changes all the model files that currently use the GaussXX methods with this one. Probably would have been cleaner as two separate PRs?

Also at issue is what to do with the integration speedup already proposed a few years earlier and sitting in #608
Michael Wagner agreed to look at this.

~~NOTE: there are conflicts that will need to be resolved before this can be merged~~

pkienzle · 2026-01-14T00:46:57Z

This works well for rotationally symmetric shapes that only use 1D integrals.

Performance is unsatisfactory on shapes such as triaxial ellipsoid that need 2D integrals.

I could revert changes for those models until we've had a chance to explore other schemes such as Lebedev or Fibonacci.

…sasmodels into ticket-535-adaptive-integration

…uracy tests

pkienzle · 2026-04-13T21:11:23Z

List of shapes with 2D integrals:

triaxial_ellipsoid
elliptical_cylinder, core_shell_bicelle_elliptical[_belt_rough]
parallelepiped, core_shell_parallelepiped, rectangular_prism, hollow_rectangular_prism[_thin_walls]
[bcc|fcc|sc]_paracrystal
barbell, capped_cylinder
superball, octehedron_truncated, nanoprisms
pringle

For these shapes the computational cost is quadratic in the number of integration points, so it is not feasible to fit large shapes accurately.

Consider returning NaN for q values that require more than a million evaluations to get better than 3e-3 accuracy. If these q are dropped from the residuals calculation the fit can still proceed for the low q points but the high q points will be ignored. This may end up biasing the fit toward large shapes since the estimated log likelihood will be reduced.

Triaxial ellipsoid, the five rectangular prisms and the three elliptical cylinders should be reasonably accurate for dimensions below 1 μm, though they can take several seconds per evaluation. [I only tested triaxial ellipsoid, parallelepiped and elliptical cylinder; the others follow the same code patterns so they are probably good but should still be tested.]

butlerpd · 2026-04-13T22:13:12Z

would unrolling the integral to distribute on GPU's help the speed?

pkienzle · 2026-04-13T22:29:52Z

would unrolling the integral to distribute on GPU's help the speed?

Yes, but not much. With 15000 cores and 150 q points evaluated in parallel we could potentially see a 100x improvement over the current speed. For a 1 μm cube this would turn a 5 s evaluation into a 0.05 s evaluation. But cost is growing as (qr)² or worse, so a 10 μm cube would be back at 5 s again. We need better algorithms for USAXS/USANS calculations.

pkienzle · 2026-04-14T00:28:39Z

... except that USAXS/USANS will be at lower q, so in practice it shouldn't be a problem.

The issue is with slit resolution, which pulls from a very high q values. With $q^4$ fall off the large q values don't contribute much to the resolution integral, so it didn't matter that they weren't computed accurately. This PR will make the resolution calculation take too long for accuracy that it doesn't need.

A couple of options:

limit the number of gaussian points when computing q values far above the nominal q
replace I(q) with a power law function when computing q values far above the nominal q
add an additional function for each model ∫I(q)dq from q0 to infinity to use in slit resolution calculations

All of these will require icky code in the interface between resolution function and model calculations.

Given that it'll break USAXS/USANS, I don't think we should merge this PR until we figure out how to handle slit resolution.

Paul Kienzle and others added 5 commits July 25, 2025 19:28

adaptive integration for cylinder using qr heuristic

c0eb28e

add core shell bicelle; fix gpu build errors (mac)

5936967

adaptive integration for most shape models

e6126ac

[pre-commit.ci lite] apply automatic fixes for ruff linting errors

e5703a1

use sasmodels.compare -seed=<n> for reproducibile -sets

bffeaf0

Paul Kienzle and others added 5 commits November 3, 2025 16:18

adaptive integration for cylinder using qr heuristic

1c2b23c

add core shell bicelle; fix gpu build errors (mac)

56630af

adaptive integration for most shape models

9bb7d2c

[pre-commit.ci lite] apply automatic fixes for ruff linting errors

c6d9666

use sasmodels.compare -seed=<n> for reproducibile -sets

615df71

DrPaulSharp force-pushed the ticket-535-adaptive-integration branch from bffeaf0 to 615df71 Compare November 3, 2025 16:18

pkienzle mentioned this pull request Jan 14, 2026

Adding a pure python truncated octahedron model with Fibonacci orientation averaging #694

Open

8 tasks

Paul Kienzle added 9 commits April 8, 2026 12:34

Merge branch 'ticket-535-adaptive-integration' of github.com:SasView/…

2be337b

…sasmodels into ticket-535-adaptive-integration

improve accuracy of large triaxial ellipsoids

40df890

add missing lib/nonadaptive.c which is used for sasmodels.compare acc…

dea77dd

…uracy tests

remove merge conflict

5881ff7

improve accuracy of long rectangular prism models

26a5f8d

improve accuracy of long rectangular prism models

da3b533

improve accuracy of elliptical cylinder model

10c9f31

Tag gaussian integration variables n,z,w with _outer

8a49273

improve accuracy of elliptical bicelle models

6d0d448

pkienzle mentioned this pull request Apr 13, 2026

Nanoprism model #685

Open

improve accuracy of barbell and capped cylinder

0899275

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simple adaptive integration#658

Simple adaptive integration#658
pkienzle wants to merge 20 commits intomasterfrom
ticket-535-adaptive-integration

pkienzle commented Jul 30, 2025 •

edited

Loading

Uh oh!

pkienzle commented Jul 30, 2025

Uh oh!

butlerpd commented Oct 21, 2025 •

edited by DrPaulSharp

Loading

Uh oh!

pkienzle commented Jan 14, 2026

Uh oh!

pkienzle commented Apr 13, 2026

Uh oh!

butlerpd commented Apr 13, 2026

Uh oh!

pkienzle commented Apr 13, 2026

Uh oh!

pkienzle commented Apr 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pkienzle commented Jul 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pkienzle commented Jul 30, 2025

Uh oh!

butlerpd commented Oct 21, 2025 • edited by DrPaulSharp Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pkienzle commented Jan 14, 2026

Uh oh!

pkienzle commented Apr 13, 2026

Uh oh!

butlerpd commented Apr 13, 2026

Uh oh!

pkienzle commented Apr 13, 2026

Uh oh!

pkienzle commented Apr 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pkienzle commented Jul 30, 2025 •

edited

Loading

butlerpd commented Oct 21, 2025 •

edited by DrPaulSharp

Loading