fix(datasets-raw): replace random provider selection with attempt-based rotation by Theodus · Pull Request #1995 · edgeandnode/amp

Theodus · 2026-03-19T14:07:13Z

Replace random provider shuffling with deterministic provider selection based on job attempt count. On each retry, the job selects the next provider in lexicographic order, cycling through all available providers.

Add find_providers method returning all matching providers in deterministic order
Select provider in job_impl using attempt_count % num_providers
Remove rand dependency

shiyasmohd

LGTM. Added some suggestions as comments.

My only concern is, this approach forces us to have a fallback provider.
Eg:- If only one provider is configured, and if it fails, the job process fails instantly without retry (correct me if i'm wrong). This job will the retry only when the scheduler retries the next time. My suggestion would be to add 3 or 5 retries with fallback. Leaving this decision to you. This could be added in a follow up PR or this PR itself.

crates/core/providers-registry/src/lib.rs

Replace random provider selection with deterministic ordering and add BlockStreamerWithFallback to try each provider once on recoverable errors. - Add `BlockStreamerWithFallback` that iterates providers in order, falling back to the next on recoverable errors and aborting on fatal - Add `find_providers` and `create_block_stream_clients` to `ProvidersRegistry` returning all matching providers in name order - Add `AllClientCreationsFailed` error variant to distinguish from `ProviderNotFound` when providers exist but all fail to connect - Replace random shuffle with deterministic lexicographic selection - Remove `rand` dependency from `providers-registry`

…ed rotation Replace random provider shuffling with deterministic provider selection based on job attempt count. On each retry, the job selects the next provider in lexicographic order, cycling through all available providers. - Add find_providers method returning all matching providers in deterministic order - Select provider in job_impl using attempt_count % num_providers - Remove rand dependency

claude

Code Review Summary

Most important finding: Likely off-by-one bug in provider rotation. get_attempt_count returns 1 on the first job execution (it counts SCHEDULED events), so 1 % num_providers skips provider index 0 on the initial attempt. The first provider in lexicographic order is only used when attempt is a multiple of num_providers.

Other findings:

Silent error swallowing when get_attempt_count DB query fails (no logging)
Silent skip of unparseable provider configs in find_providers (no warning logged, unlike env substitution failures)
Doc/implementation mismatch in provider-registry.md — describes single-provider selection but find_providers returns a list for callers to select from
The ordering guarantee of find_providers depends on BTreeMap iteration order — worth documenting this dependency
eth_call always picks the first provider while block streaming rotates — potential provider affinity concern noted via TODO

crates/core/worker-datasets-raw/src/job_impl.rs

crates/core/providers-registry/src/lib.rs

crates/core/common/src/udfs/eth_call/cache.rs

docs/feat/provider-registry.md

Theodus · 2026-03-23T21:47:26Z

@shiyasmohd, can I get another review on this since there has been significant changes?

Theodus requested review from LNSD, shiyasmohd and sistemd March 19, 2026 14:07

leoyvens added the claude-review label Mar 19, 2026

This comment was marked as outdated.

Sign in to view

shiyasmohd approved these changes Mar 22, 2026

View reviewed changes

crates/core/providers-registry/src/lib.rs Outdated Show resolved Hide resolved

crates/core/providers-registry/src/lib.rs Outdated Show resolved Hide resolved

Theodus marked this pull request as draft March 23, 2026 14:34

Theodus force-pushed the theodus/rpc-fallback branch 2 times, most recently from 85e4bfc to 3cd3996 Compare March 23, 2026 15:40

Theodus changed the title ~~feat(datasets-raw): add provider fallback for block streaming~~ fix(datasets-raw): replace random provider selection with attempt-based rotation Mar 23, 2026

Theodus force-pushed the theodus/rpc-fallback branch from 3cd3996 to 1a4dcd1 Compare March 23, 2026 15:52

Theodus added claude-review and removed claude-review labels Mar 23, 2026

claude bot reviewed Mar 23, 2026

View reviewed changes

Theodus added 2 commits March 23, 2026 10:07

add test

7523856

CR feedback

3037507

Theodus force-pushed the theodus/rpc-fallback branch from 1938b9f to 3037507 Compare March 23, 2026 16:16

Theodus requested a review from shiyasmohd March 23, 2026 16:21

Theodus marked this pull request as ready for review March 23, 2026 16:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(datasets-raw): replace random provider selection with attempt-based rotation#1995

fix(datasets-raw): replace random provider selection with attempt-based rotation#1995
Theodus wants to merge 4 commits intomainfrom
theodus/rpc-fallback

Theodus commented Mar 19, 2026 •

edited

Loading

Uh oh!

This comment was marked as outdated.

Uh oh!

shiyasmohd left a comment

Uh oh!

Uh oh!

Uh oh!

claude bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Theodus commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Theodus commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

shiyasmohd left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

claude bot left a comment

Choose a reason for hiding this comment

Code Review Summary

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Theodus commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Theodus commented Mar 19, 2026 •

edited

Loading