feat: citation requirement #725

Open

akihikokuroda wants to merge 9 commits into generative-computing:main from akihikokuroda:citation

Member

akihikokuroda commented Mar 23, 2026 (edited)

Misc PR

Type of PR

Bug Fix
New Feature
Documentation
Other

Description

Link to Issue: Fixes Hallucination Detection Requirements #503

Testing

Tests added to the respective file if code was changed
New code has 100% coverage if code was added
Ensure existing tests and github automation passes (a maintainer will kick off the github automation when the rest of the PR is populated)

akihikokuroda requested a review from a team as a code owner

March 23, 2026 20:06

Contributor

github-actions bot commented Mar 23, 2026

The PR description has been updated. Please fill out the template for your PR to be reviewed.

mergify bot commented Mar 23, 2026

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert|release)(?:\(.+\))?:
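For reference, the title rule above can be checked locally with a few lines of Python. This is a minimal sketch that assumes `~=` means "matches this regular expression"; the helper name `is_conventional_title` is hypothetical and is not part of Mergify or mellea.

```python
import re

# Pattern copied from the merge protection rule above.
CONVENTIONAL_TITLE = re.compile(
    r"^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert|release)(?:\(.+\))?:"
)


def is_conventional_title(title: str) -> bool:
    """Return True when the title starts with a conventional-commit prefix."""
    return CONVENTIONAL_TITLE.match(title) is not None


# The title of this PR passes; a title without a "type:" prefix does not.
print(is_conventional_title("feat: citation requirement"))
print(is_conventional_title("citation requirement"))
```

An optional scope in parentheses, as in `feat(rag): ...`, is also accepted by this pattern.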

akihikokuroda marked this pull request as draft

March 23, 2026 20:34

akihikokuroda marked this pull request as ready for review

March 23, 2026 21:50

frreiss requested changes

Collaborator

frreiss left a comment

This needs some work. The fundamental calculation does not appear to be valid, and there are a number of puzzling API design decisions.

mellea/stdlib/requirements/rag.py

+                      # Use constructor documents if provided, otherwise get from message
+                      if self.documents is not None:
+                          documents = self.documents

Collaborator

frreiss Mar 24, 2026

Checking against a different set of documents than those used to generate the last message is outside the target domain of the Citations intrinsic. Have you done anything to validate the assumption that such a trick would work?

Member Author

akihikokuroda Mar 24, 2026

This implementation is based on the current example and the find_citations implementation below:
https://github.com/generative-computing/mellea/blob/main/docs/examples/intrinsics/citations.py

mellea/mellea/stdlib/components/intrinsic/rag.py

Line 97 in 1a3a01f

def find_citations(

mellea/stdlib/requirements/rag.py Outdated

+                      all_messages = ctx.as_list()
+                      if len(all_messages) > 1:
+                          # Rebuild context without last message
+                          from ..context import ChatContext

Collaborator

frreiss Mar 24, 2026

Why is this import not at the top of the file?

Member Author

akihikokuroda Mar 24, 2026

It's to avoid circular dependencies

Collaborator

frreiss Mar 24, 2026

The fact that you're seeing a circular dependency here is concerning. The only imports at the top of this file are:

from ...backends.adapters import AdapterMixin
from ...core import Backend, Context, Requirement, ValidationResult
from ..components import Document, Message

An import of mellea.stdlib.context.ChatContext ought to be compatible with these imports. What is the root cause of the dependency cycle when that dependency is added to this file?

Member Author

akihikokuroda Mar 25, 2026

Yes, you are right. After looking at the imports again, there is no circular dependency for ChatContext. I'll move the import to the top. Thanks!

mellea/stdlib/requirements/rag.py Outdated

+                          citation["response_end"] - citation["response_begin"]
+                          for citation in citations
+                      )
+                      coverage_ratio = cited_chars / total_chars

Collaborator

frreiss Mar 24, 2026

Checking a ratio of characters is not in line with the original requirement in #503: "requires [sic.] citations for factual claims". You should be checking the fraction of factual claims that are backed by citations.

Member Author

akihikokuroda Mar 24, 2026

Implemented the "fraction of factual claims" check and also added an option to choose between the character-based and the factual-claim-based check.
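To illustrate why the two measures can disagree, here is a minimal sketch that computes both for the same response. It is not the code in this PR. The `response_begin` and `response_end` keys come from the diff above, but `claim_spans` is a hypothetical stand-in that treats every sentence as one factual claim, which is a strong simplification (it also splits on decimal points). The character measure here deduplicates overlapping spans, which the summed version in the diff does not.

```python
import re


def char_coverage(response: str, citations: list[dict]) -> float:
    """Fraction of response characters that lie inside at least one cited span."""
    if not response:
        return 0.0
    covered = [False] * len(response)
    for c in citations:
        begin = max(0, c["response_begin"])
        end = min(len(response), c["response_end"])
        for i in range(begin, end):
            covered[i] = True
    return sum(covered) / len(response)


def claim_spans(response: str) -> list[tuple[int, int]]:
    """Hypothetical claim splitter: one (begin, end) span per sentence."""
    spans = []
    for m in re.finditer(r"[^.!?]+[.!?]*", response):
        text = m.group()
        stripped = text.strip()
        if stripped:
            begin = m.start() + (len(text) - len(text.lstrip()))
            spans.append((begin, begin + len(stripped)))
    return spans


def claim_coverage(response: str, citations: list[dict]) -> float:
    """Fraction of claims that overlap at least one cited span."""
    spans = claim_spans(response)
    if not spans:
        return 0.0
    cited = sum(
        any(c["response_begin"] < end and c["response_end"] > begin for c in citations)
        for begin, end in spans
    )
    return cited / len(spans)


# One short cited sentence followed by a longer uncited one.
first = "Paris is in France."
response = first + " Berlin is in Germany and has been its capital since 1990."
citations = [{"response_begin": 0, "response_end": len(first)}]

print(claim_coverage(response, citations))  # 0.5: one of two claims is cited
print(char_coverage(response, citations))   # lower, because the cited sentence is short
```

A short cited sentence next to a long uncited one scores low on characters but covers half of the claims, so a threshold tuned for one measure does not carry over to the other.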

mellea/stdlib/requirements/rag.py

+                          reason += f"\n\nCitations found ({num_citations}):"
+                          for i, citation in enumerate(citations[:5]):  # Show first 5
+                              response_text = citation["response_text"].strip()
+                              doc_id = citation.get("citation_doc_id", "unknown")

Collaborator

frreiss Mar 24, 2026

Why are you passing default values to get() here and on the next line?
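The practical difference behind this question can be sketched as follows. A default passed to `get()` turns a missing field into a placeholder string, while plain indexing raises `KeyError` at the point of the problem. Whether citation records always carry `citation_doc_id` is not established in this thread, and both helper functions below are hypothetical.

```python
def describe_lenient(citation: dict) -> str:
    """Format a citation, silently substituting "unknown" for a missing doc id."""
    doc_id = citation.get("citation_doc_id", "unknown")
    return f"doc {doc_id}: {citation['response_text'].strip()}"


def describe_strict(citation: dict) -> str:
    """Format a citation, raising KeyError if the doc id is missing."""
    return f"doc {citation['citation_doc_id']}: {citation['response_text'].strip()}"


# A malformed record that lacks the document id.
malformed = {"response_text": " Paris is in France. "}

print(describe_lenient(malformed))  # the missing field is hidden behind "unknown"

try:
    describe_strict(malformed)
except KeyError as err:
    print(f"missing field: {err}")
```

If the field is guaranteed to be present, the strict form surfaces a broken record immediately. If it is genuinely optional, the default is reasonable but worth documenting.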

test/stdlib/requirements/test_rag_requirements.py

+              @pytest.mark.requires_heavy_ram
+              async def test_citation_requirement_basic():
+                  """Test basic citation requirement functionality."""
+                  backend = LocalHFBackend(model_id="ibm-granite/granite-4.0-micro")

Collaborator

frreiss Mar 24, 2026

Every test in this file loads the base model. Use a fixture instead.

Member Author

akihikokuroda Mar 24, 2026

The file contains both kinds of tests: some access the model and some use mocks. Test markers such as @pytest.mark.huggingface make it possible to run them selectively.

test/stdlib/requirements/test_rag_requirements.py Outdated

+              @pytest.mark.llm
+              @pytest.mark.requires_heavy_ram
+              async def test_citation_requirement_threshold_boundary():
+                  """Test citation requirement at exact threshold boundary."""

Collaborator

frreiss Mar 24, 2026

Test case does not do what comment says it does. You would need to construct an input that produces the exact target threshold boundary, probably by mocking the citation intrinsic, to do what this comment says.

Member Author

akihikokuroda Mar 24, 2026

Thanks! I'll fix it.
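A boundary test along the lines the reviewer describes could be sketched as follows. The check function `meets_threshold` is hypothetical and only mirrors the character-ratio calculation from the diff above; the use of `>=` at the boundary is also an assumption. The citation finder is replaced by a `unittest.mock.Mock`, so the ratio lands exactly on the threshold without running a model.

```python
from unittest.mock import Mock


def meets_threshold(response: str, find_citations, min_ratio: float) -> bool:
    """Hypothetical check: pass when cited chars / total chars >= min_ratio."""
    if not response:
        return False
    citations = find_citations(response)
    cited_chars = sum(c["response_end"] - c["response_begin"] for c in citations)
    return cited_chars / len(response) >= min_ratio


# A 10-character response with exactly 5 cited characters gives a ratio of 0.5.
response = "abcdefghij"
at_boundary = Mock(return_value=[{"response_begin": 0, "response_end": 5}])
just_below = Mock(return_value=[{"response_begin": 0, "response_end": 4}])

assert meets_threshold(response, at_boundary, 0.5)      # exactly at the threshold
assert not meets_threshold(response, just_below, 0.5)   # one character short
at_boundary.assert_called_once_with(response)
```

Choosing lengths such as 5 and 10 keeps the ratio exactly representable in floating point, which avoids a test that passes or fails because of rounding.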

Collaborator

frreiss commented Mar 24, 2026

Member Author

akihikokuroda commented Mar 24, 2026

@frreiss Thanks for the review. I have addressed all of your comments so far.

akihikokuroda requested a review from frreiss

March 24, 2026 16:37

Member Author

akihikokuroda commented Mar 24, 2026 (edited)

@frreiss I missed one. I'll work on it.
It's done now.

akihikokuroda force-pushed the citation branch 2 times, most recently from 113ae32 to 0f47f16 Compare

March 24, 2026 20:20

psschwei closed this

psschwei reopened this

akihikokuroda added 8 commits

March 24, 2026 17:02


          feat: citation requirement

8bf9523

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>


          fix test error

52ae5f4

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>


          fix example

1b5cc82

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>


          fix dockstring issue

8f3779c

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>


          fix test name conflict

6677b5e

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>


          review comments

70d70cb

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>


          feat: review comments

e08b8be

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>


          feat: review comments

c18c9b2

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>

akihikokuroda force-pushed the citation branch from 0f47f16 to c18c9b2 Compare

March 24, 2026 21:02

github-actions bot added the enhancement label


          review comment

84472ff

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>
