HIVE-29432: Autogather column statistics missing for tables containing a column with an unsupported type #6374

Open
thomasrebele wants to merge 1 commit into apache:master from thomasrebele:tr/HIVE-29432
thomasrebele commented Mar 17, 2026

HIVE-29432

What changes were proposed in this pull request?

When column statistics are computed, skip the columns with unsupported types and still compute the statistics for every column with a supported type.

Why are the changes needed?

If a table contains a column of an unsupported type (union, struct, or list, but also TIMESTAMP WITH LOCAL TIME ZONE), no column statistics are gathered for that table at all, not even for its columns with supported types.
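The intended behavior can be illustrated with a minimal, standalone sketch. It is not the code from this PR: the class `ColumnStatsFilterSketch` and the helpers `isSupportedForStats` and `columnsToAnalyze` are hypothetical, and the set of unsupported types is assumed to be only the ones named above, written as Hive type strings (`uniontype<...>`, `struct<...>`, `array<...>`, `timestamp with local time zone`).

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class ColumnStatsFilterSketch {

    // Hypothetical check: treats only the types named in the PR description
    // as unsupported. The real list in Hive may differ.
    static boolean isSupportedForStats(String typeName) {
        String t = typeName.trim().toLowerCase(Locale.ROOT);
        return !(t.startsWith("uniontype<")
                || t.startsWith("struct<")
                || t.startsWith("array<")
                || t.equals("timestamp with local time zone"));
    }

    // Keeps the supported columns instead of giving up on the whole table
    // as soon as one unsupported column is found.
    static List<String> columnsToAnalyze(Map<String, String> columnTypes) {
        List<String> result = new ArrayList<>();
        for (Map.Entry<String, String> e : columnTypes.entrySet()) {
            if (isSupportedForStats(e.getValue())) {
                result.add(e.getKey());
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> cols = new LinkedHashMap<>();
        cols.put("id", "int");
        cols.put("tags", "array<string>");
        cols.put("created", "timestamp with local time zone");
        cols.put("name", "string");
        System.out.println(columnsToAnalyze(cols)); // prints [id, name]
    }
}
```

In this sketch the table still gets statistics for `id` and `name`, while `tags` and `created` are skipped.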

Does this PR introduce any user-facing change?

No.

How was this patch tested?

I've added a q file test.

@thomasrebele (Contributor, Author)

I think the failure of TestReplicationMetricCollector.testSuccessOptimizedBootstrapDumpMetrics is not related to this PR. The test passes when executed locally, and it did not fail on previous versions of the PR, so TestReplicationMetricCollector may be flaky.

@thomasrebele thomasrebele marked this pull request as ready for review March 20, 2026 22:34
}
}

public static boolean canRunAutogatherStats(Table destinationTable, Operator curr) {
@thomasrebele (Contributor, Author)

I'll use the parameterized type for the Operator parameters in the next update of the PR.
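To show what replacing a raw `Operator` parameter with a parameterized one could look like, here is a small sketch with stand-in types. `OperatorDesc`, `Operator<T>`, `DemoDesc`, and `describe` below are simplified, hypothetical stand-ins defined only for this example; they are not Hive's classes, and the exact signature the PR will use is not shown in this conversation.

```java
public class ParameterizedOperatorSketch {

    // Hypothetical stand-in for a descriptor type.
    interface OperatorDesc {
        String describe();
    }

    // Hypothetical stand-in for a generic operator holding its descriptor.
    static class Operator<T extends OperatorDesc> {
        private final T conf;

        Operator(T conf) {
            this.conf = conf;
        }

        T getConf() {
            return conf;
        }
    }

    // Hypothetical descriptor used only for the demo.
    static class DemoDesc implements OperatorDesc {
        @Override
        public String describe() {
            return "demo";
        }
    }

    // With a raw parameter (Operator curr), getConf() would return Object and
    // need a cast. The wildcard keeps the bound, so no cast is required.
    static String describe(Operator<? extends OperatorDesc> curr) {
        return curr.getConf().describe();
    }

    public static void main(String[] args) {
        System.out.println(describe(new Operator<>(new DemoDesc()))); // prints demo
    }
}
```

The bounded wildcard accepts an operator with any descriptor subtype and avoids the unchecked warnings that raw types produce.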
