Makefile: add link-time optimization support by pjonsson · Pull Request #420 · davidfrantz/force

pjonsson · 2026-03-19T10:00:08Z

Add flags for enabling link-time
optmization during compilation
by calling "make LTO=yes" and use
this in the Dockerfile.

I cannot find a definition of
LDFLAGS, so just chuck it into
GDAL_FLAGS which should be used
for the binaries that does heavy
computation.

I don't have any performance
comparisons, but compiling with LTO
makes the binary noticeably smaller,
before:

$ ls -l which force-l2ps
-rwxr-xr-x 1 root root 2668408 Mar 16 22:02 /usr/local/bin/force/force-l2ps

and after:

$ ls -l which force-l2ps
-rwxr-xr-x 1 root root 1986848 Mar 16 22:14 /usr/local/bin/force/force-l2ps

Reducing the binary size is likely
to improve the instruction cache
hit rate even if we pessimistically
assume that LTO didn't manage to make
any other optimization.

Add flags for enabling link-time optmization during compilation by calling "make LTO=yes". I cannot find a definition of LDFLAGS, so just chuck it into GDAL_FLAGS which should be used for the binaries that does heavy computation. I don't have any performance comparisons, but compiling with LTO makes the binary noticeably smaller, before: $ ls -l `which force-l2ps` -rwxr-xr-x 1 root root 2668408 Mar 16 22:02 /usr/local/bin/force/force-l2ps and after: $ ls -l `which force-l2ps` -rwxr-xr-x 1 root root 1986848 Mar 16 22:14 /usr/local/bin/force/force-l2ps Reducing the binary size is likely to improve the instruction cache hit rate even if we pessimistically assume that LTO didn't manage to make any other optimization.

pjonsson · 2026-03-19T14:54:51Z

The sample size is 1, but here are some rough measurements of CPU-minutes required for creating a L2A product from a Sentinel-2 L1C product with force-l2ps on a 16-core AMD Ryzen 9950X3D:

Compilation flags	CPU Usage
`-O3`	42:48
`-O3 -flto`	42:04
`-O2 -flto`	40:28

It's my desktop machine so there are web browsers and other things running in the background at the same time, but the window focused on by my mouse cursor was the FORCE container.

davidfrantz · 2026-03-20T07:19:26Z

This seems interesting. I will run some tests before merging, though

davidfrantz · 2026-04-17T12:22:39Z

I found some time to make a quick test:

I tested using a Landsat image, converted to L2 ARD, using 32 threads on a testing machine.

I ran the 1st row twice to get rid of caching-related differences.

Compilation flags	CPU Usage
-O3	4:58
-O2	5:10
-O3 -flto	4:59
-O2 -flto	4:47

The results are interesting. There might be a small performance gain with the last combo. I'd like to make some more tests as n is still quite low.

pjonsson · 2026-04-17T12:38:43Z

GDAL 3.13 is just around the corner, and that Docker image will be based on Ubuntu 26.04 that contains GCC 15 instead of the current GCC 13 in Ubuntu 24.04. I should probably warn you that there's a risk that your careful measurements on GCC 13 might not be that relevant in a couple of weeks.

Since we're on the subject of performance: when I run force-l2ps on a single Sentinel-2 L1C image on a 16 core machine, my CPU usage starts with a reasonably long time at 200%, and then it increases to 1500+%. The CPU sitting pegged at 200% usage for an extended time seems like something is hardcoded to use exactly 2 threads.

Is the initial cap to only 2 threads working an old decision based on measurements, or is it because the initial part of the processing only gets a speedup from at most 2 threads?

pjonsson added 2 commits March 19, 2026 10:51

Dockerfile: enable link-time optimization

ad972ae

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Makefile: add link-time optimization support#420

Makefile: add link-time optimization support#420
pjonsson wants to merge 2 commits intodavidfrantz:developfrom
pjonsson:docker-enable-lto

pjonsson commented Mar 19, 2026

Uh oh!

pjonsson commented Mar 19, 2026

Uh oh!

davidfrantz commented Mar 20, 2026

Uh oh!

davidfrantz commented Apr 17, 2026 •

edited

Loading

Uh oh!

pjonsson commented Apr 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pjonsson commented Mar 19, 2026

Uh oh!

pjonsson commented Mar 19, 2026

Uh oh!

davidfrantz commented Mar 20, 2026

Uh oh!

davidfrantz commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pjonsson commented Apr 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

davidfrantz commented Apr 17, 2026 •

edited

Loading