GH-3411 Expose row group index via Parquet reader by uros7251brick · Pull Request #3412 · apache/parquet-java

uros7251brick · 2026-03-04T23:58:47Z

Rationale for this change

Engines like Apache Spark need to know which row group a record belongs to — for example, to expose row group metadata as a hidden column, or to correlate records with row group-level statistics. Without this API, callers have no way to determine the current row group index during sequential reads.

What changes are included in this PR?

Similar to how getCurrentRowIndex() was introduced to expose the current row's file-level index, this adds getCurrentRowGroupIndex() to expose the index of the row group currently being read.

New API:

ParquetFileReader.getCurrentRowGroupIndex() — returns the 0-based index of the last row group read via readNextRowGroup() / readNextFilteredRowGroup(). Returns -1 before any row group has been read.
ParquetReader.getCurrentRowGroupIndex() — same semantics, for the high-level record reader.
ParquetRecordReader.getCurrentRowGroupIndex() — same, for the Hadoop MapReduce record reader.

The returned index is the actual file-level row group index, meaning it correctly reflects gaps when empty row groups are skipped (e.g. if row group 1 is empty, the indices reported will be 0, 2, ... not 0, 1, ...).

Are these changes tested?

Yes.

Are there any user-facing changes?

No.

Closes #3411

add getCurrentRowGroupIndex method to Parquet readers

08e7c48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GH-3411 Expose row group index via Parquet reader#3412

GH-3411 Expose row group index via Parquet reader#3412
uros7251brick wants to merge 1 commit intoapache:masterfrom
uros7251brick:expose-row-group-idx

uros7251brick commented Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

uros7251brick commented Mar 4, 2026

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant