optimizations by HoneyryderChuck · Pull Request #190 · igrigorik/http-2

HoneyryderChuck · 2026-05-25T22:25:50Z

this is a series of optimizations that reduce memory usage and do less work across a number of code paths, both hot and cold.

The main change is how flags are represented in frames: instead of arrays of symbols, like [:end_stream], they're now integers representing bitflags, i.e. 1 (0000 0001). It's a semi-breaking change, in the sense that frames get exposed to end users via on(:frame_received | :frame_sent) callbacks, but it's an internal representation regardless.

this replaces multiple calls to #read_str (which allocates a string per byte popped) with an accounting loop using String#getbyte and a single #read_str call at the end. the algo should be roughly the same speed, with less GC pressure as a result
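A minimal sketch of that pattern, assuming the loop in question decodes an HPACK-style prefixed integer. `consume` is a hypothetical stand-in for #read_str and `read_integer` is an illustrative name, not the gem's actual code:

```ruby
# Hypothetical stand-in for #read_str: removes and returns the first n bytes.
def consume(buffer, n)
  chunk = buffer.byteslice(0, n)
  buffer.replace(buffer.byteslice(n, buffer.bytesize - n))
  chunk
end

# Decodes a prefixed integer from the front of a binary buffer.
# Bytes are only inspected via String#getbyte (no per-byte strings),
# and the consumed bytes are removed with a single call at the end.
def read_integer(buffer, prefix_bits)
  limit = (1 << prefix_bits) - 1
  first = buffer.getbyte(0)
  raise ArgumentError, "empty buffer" if first.nil?

  value = first & limit
  offset = 1
  if value == limit
    shift = 0
    loop do
      byte = buffer.getbyte(offset)
      raise ArgumentError, "truncated integer" if byte.nil?

      offset += 1
      value += (byte & 0x7f) << shift
      shift += 7
      break if (byte & 0x80).zero?
    end
  end
  consume(buffer, offset) # one removal for all bytes accounted for
  value
end
```

The `offset` counter is the "accounting": it tracks how many bytes were looked at so they can be dropped in one go.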

frame containers would store flag information as arrays of symbols, which would then be passed up and down the encoding/decoding chain in that format before being translated to/from the actual bits that are encoded into the buffer. this would cause some overhead, given that 1) arrays of symbols generate more GC pressure than raw integers (tagged objects); 2) array operations vs bitwise operations; 3) translation overhead (and given it used bit positions rather than the bit representation, bit shifting). this approach is replaced by carrying the bits around, which solves all problems, at the expense of making the flag less readable when exposed to the end user via the frame_sent/received callbacks. for that reason, one can consider it a "regression", but IMO this is an internal representation that end users shouldn't be relying on for anything.
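To illustrate the two representations, here is a small sketch. The constant and method names are hypothetical; only the bit values (0x1 for END_STREAM, 0x4 for END_HEADERS on HEADERS frames) come from the HTTP/2 specification:

```ruby
# Hypothetical constants for illustration; values follow the HTTP/2 spec.
END_STREAM  = 0x1
END_HEADERS = 0x4

# Old style: an array of symbols allocated for every frame.
old_frame = { type: :headers, flags: [:end_stream, :end_headers] }

# New style: one immediate integer carrying the bits as they go on the wire.
new_frame = { type: :headers, flags: END_STREAM | END_HEADERS }

# Membership test is a bitwise AND instead of Array#include?.
def end_stream?(flags)
  (flags & END_STREAM) != 0
end

# Setting a flag is a bitwise OR instead of Array#<<.
def with_end_stream(flags)
  flags | END_STREAM
end
```

A callback that previously checked `frame[:flags].include?(:end_stream)` would need a check along the lines of `end_stream?(frame[:flags])` instead.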

…ve operations. the prior version of this would append the huffman string THEN add the length prefix via String#insert, which needs to resize the string in the middle, thereby doing a memmove operation, which is bad. this avoids it by either appending the length prefix before the string when possible, or pre-appending a null byte and then changing it after generating the huffman string; in the rare case where the length requires more than one byte, it removes the appended byte and uses the old strategy relying on String#insert, because we can't have nice things
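A rough sketch of the placeholder-byte strategy, assuming a binary buffer and an HPACK string literal whose length uses a 7-bit prefix with the Huffman bit (0x80) set. Both method names are hypothetical, and the block stands in for the Huffman encoder writing into the buffer:

```ruby
# Encodes an HPACK-style prefixed integer as a binary string.
def encode_integer(value, prefix_bits, first_byte_flags)
  limit = (1 << prefix_bits) - 1
  return [first_byte_flags | value].pack("C") if value < limit

  out = [first_byte_flags | limit].pack("C")
  value -= limit
  while value >= 128
    out << ((value & 0x7f) | 0x80)
    value >>= 7
  end
  out << value
end

# Reserves one byte for the length, lets the block append the payload,
# then patches the reserved byte in place.
def append_with_length_prefix(buffer)
  pos = buffer.bytesize
  buffer << 0                       # placeholder for a one-byte prefix
  yield buffer                      # encoder appends directly to the buffer
  len = buffer.bytesize - pos - 1
  if len < 127
    buffer.setbyte(pos, 0x80 | len) # common case: no bytes are moved
  else
    buffer.slice!(pos, 1)           # rare case: drop the placeholder...
    buffer.insert(pos, encode_integer(len, 7, 0x80)) # ...and insert the prefix
  end
  buffer
end
```

Only the rare multi-byte path still shifts the payload; short strings are patched with a single String#setbyte.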

by keeping a side hash which keeps the offset of matched entries, lookups don't require table scans anymore, which makes this more scalable across different dynamic table sizes (at the expense of an extra container where to store things)
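One way such a side hash could work is sketched below. This is an assumption about the general technique, not the PR's actual data structure: the class and its fields are hypothetical, and it stores an absolute insertion number per entry so that indices do not need rewriting when new entries arrive.

```ruby
# Hypothetical dynamic table with a side index for O(1) lookups.
class IndexedTable
  def initialize
    @entries = []    # oldest entry first
    @positions = {}  # [name, value] => absolute insertion number
    @inserted = 0    # total number of entries ever added
    @evicted = 0     # total number of entries ever evicted
  end

  def add(name, value)
    entry = [name, value]
    @entries << entry
    @positions[entry] = @inserted
    @inserted += 1
  end

  def evict_oldest
    return if @entries.empty?

    entry = @entries.shift
    # Only drop the mapping if it still points at the evicted entry
    # (a newer duplicate may have overwritten it).
    @positions.delete(entry) if @positions[entry] == @evicted
    @evicted += 1
  end

  # Index relative to the newest entry (0 = most recently added),
  # or nil when the pair is not in the table. No table scan involved.
  def index_of(name, value)
    seq = @positions[[name, value]]
    seq && (@inserted - 1 - seq)
  end
end
```

The trade-off matches the one described above: lookups stop depending on table size, at the cost of keeping a second container in sync.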

…closed management with a twist though; instead of traversing the whole hash, break the loop as early as possible, thereby having the same algo advantage of Hash#delete_while while not having to tear down the list each time
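The commit title is truncated here, so the following is only a guess at the shape of the technique: it assumes a hash of closed stream ids mapped to the time they were closed, kept in insertion order (which Ruby hashes preserve). The method name and parameters are hypothetical.

```ruby
# Removes entries older than ttl from a hash of stream_id => closed_at.
# Because entries are inserted in chronological order, the first entry
# that is still fresh means every later entry is fresh too, so the loop
# can stop there instead of visiting the whole hash.
def purge_expired(recently_closed, now, ttl)
  recently_closed.each do |stream_id, closed_at|
    break if now - closed_at < ttl

    recently_closed.delete(stream_id)
  end
  recently_closed
end
```

This relies on deleting the current key during iteration, which Ruby permits for hashes (adding new keys during iteration is what raises).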

HoneyryderChuck added 25 commits May 25, 2026 18:16

memoize padding bytes, speed up huffman encoding ~2x

dc86ffa

use bytesplice (when available) when reading chunks from a string buffer

092918e

this removes one extra string allocation compared to the previous strategy
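A hedged sketch of what a chunk reader using String#bytesplice could look like. `read_chunk` is a hypothetical name (the gem's helper is #read_str, whose exact body is not shown here), and a binary buffer is assumed:

```ruby
# Removes and returns up to n bytes from the front of a binary buffer.
def read_chunk(buffer, n)
  chunk = buffer.byteslice(0, n)
  if buffer.respond_to?(:bytesplice)
    # Ruby 3.2+: drop the consumed bytes in place.
    # The return value is ignored on purpose, as it differs across versions.
    buffer.bytesplice(0, chunk.bytesize, "")
  else
    # Older rubies: slice! is character-based, which equals byte-based
    # for a binary (ASCII-8BIT) buffer.
    buffer.slice!(0, chunk.bytesize)
  end
  chunk
end
```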

correctly set buffer size for SETTINGS frame buffer

35e7407

eliminate DEFINED_SETTINGS O(n) walk by direct lookup

8e99b96

this comes at the cost of having an extra global hash, albeit a small one. to compensate, DEFINED_SETTINGS was turned into an array: this saves space, and Array#include?, despite being O(n), should be more performant than Hash#key.
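For illustration, the difference between the reverse scan and a direct lookup can be sketched as follows. All constant names are hypothetical; only the identifiers 0x1 to 0x6 are taken from the HTTP/2 specification:

```ruby
# Hypothetical name => identifier table (identifiers per the HTTP/2 spec).
SETTINGS_BY_NAME = {
  settings_header_table_size: 1,
  settings_enable_push: 2,
  settings_max_concurrent_streams: 3,
  settings_initial_window_size: 4,
  settings_max_frame_size: 5,
  settings_max_header_list_size: 6
}.freeze

# Extra reverse table, built once: decoding an identifier is one hash lookup.
SETTINGS_BY_ID = SETTINGS_BY_NAME.invert.freeze

# Plain list of known names, for membership checks via Array#include?.
SETTING_NAMES = SETTINGS_BY_NAME.keys.freeze

# Old style: Hash#key walks the entries until a value matches.
def setting_name_slow(id)
  SETTINGS_BY_NAME.key(id)
end

# New style: direct lookup in the reverse table.
def setting_name_fast(id)
  SETTINGS_BY_ID[id]
end
```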

improving typing

e0adbbc

removing support for blocked frame type

914870d

this seems like a leftover from a WIP version of the HTTP/2 RFC that never made it to the final spec

create settings array via map instead of new array then append pattern

e746110

avoids the whole array resizing logic

took encode_headers logic away from #encode, which does raw frame encoding now

cea03b1

avoid slicing string too much when generating continuation frames

afb5a7f

while ruby allows sharing the underlying buffer string when slicing to the end of the string, it still allocates an extra 40-byte ruby object, and the code doesn't get more complicated by avoiding it, so why not

when processing continuation frames, bookkeep size of payloads to avoid O(n)'ing on every size check

e68000c
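The bookkeeping idea can be sketched like this, with a hypothetical container class (not the gem's actual structure) that keeps a running byte count next to the collected payloads:

```ruby
# Hypothetical accumulator for header block fragments.
class ContinuationBuffer
  attr_reader :bytesize

  def initialize
    @payloads = []
    @bytesize = 0
  end

  # Appending updates the running total, so size checks are O(1)
  # instead of summing every payload each time.
  def <<(payload)
    @payloads << payload
    @bytesize += payload.bytesize
    self
  end

  def to_s
    @payloads.join
  end
end
```

A size limit check then reads `buffer.bytesize` directly rather than recomputing something like `payloads.sum(&:bytesize)` for every incoming frame.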

pack priority frame data in one single #pack call

53e1245
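As a sketch of what a single #pack call for a PRIORITY payload can look like (the method name and keyword arguments are hypothetical; the layout of a 1-bit exclusive flag, 31-bit stream dependency and 1-byte weight follows the HTTP/2 specification):

```ruby
# Packs a PRIORITY frame payload: 4 bytes of exclusive bit + dependency,
# followed by 1 byte holding weight - 1.
def pack_priority(dependency:, weight:, exclusive: false)
  head = dependency & 0x7fffffff
  head |= 0x80000000 if exclusive
  # One Array#pack call with a combined template, instead of packing
  # each field separately and concatenating the resulting strings.
  [head, weight - 1].pack("NC")
end
```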

replace String#ord with String#getbyte for single character byte

94fedb8

no intermediate 1 char string
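A tiny illustration of the difference, with hypothetical helper names. Both return the same integer, but the first one builds a one-character string on the way:

```ruby
# Old style: String#[] allocates a 1-character string, then #ord reads it.
def first_byte_via_ord(data)
  data[0].ord
end

# New style: String#getbyte returns the integer directly.
def first_byte_via_getbyte(data)
  data.getbyte(0)
end
```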

refactor Connection#send as a plain case/when block

a7fa55f

remove needless #force_encoding call

e8c4e85

the buffer will be already force-encoded after the method is called

removing redundant cast to to_a

1377f3d

adjusted frame usage to make type checking more accurate

69e48ab

these were identified while running rbs runtime tests against the record union type for frames, something which is still making steep block... but is nevertheless where this should land

simplify h2c code by allowing frame to generate settings frame from an already encoded settings list

44556e1

replace #append_str usage for chr with String#<<

1d8745b

which supports appending byte numbers (integers) directly
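For example, assuming a binary (ASCII-8BIT) buffer and hypothetical helper names:

```ruby
# Old style: Integer#chr builds a throwaway one-byte string first.
def append_via_chr(buffer, byte)
  buffer << byte.chr
end

# New style: String#<< accepts an integer and appends it directly.
def append_direct(buffer, byte)
  buffer << byte
end
```

This only appends a single raw byte because the buffer is binary; on a UTF-8 string, String#<< treats an integer as a codepoint, so values above 0x7f would be appended as multi-byte characters.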

create continuation frame hash instead of merging with headers

04d110e

the resulting hash from merge was bigger than it needed to be

frame buffer: forego allocation of intermediate string while partitioning max size chunks

ab55a71

this uses #read_str helper, which will use String#bytesplice when available

HoneyryderChuck force-pushed the opts branch from dd7f822 to 9b4bbe2 Compare May 25, 2026 22:46

HoneyryderChuck added 2 commits May 26, 2026 09:22

make sure a string is returned.

2bc6cbe

turn off typecheck when running h2spec

518582a

some of the functions are raising type errors instead of the error which would close the connection or stream.

HoneyryderChuck force-pushed the opts branch from 9b4bbe2 to 518582a Compare May 26, 2026 08:25

only post simplecov results if working from the main repo

f6d7d09

HoneyryderChuck requested a review from mullermp May 26, 2026 10:29

HoneyryderChuck wants to merge 28 commits into igrigorik:main from HoneyryderChuck:opts