lossless-permissive frame: tactical no-regret fix for #117 silent-drop bug class

## Summary

A **tactical, no-regret** fix for the `NodeFilter` silent-drop bug class from issue #117, designed to be shippable in one small PR without committing to the broader strict/permissive/hybrid frame decision.

**The principle**: accept multiple equivalent input forms (lossless) — reject inputs we'd silently discard (lossy).

This frames the *bright line* between "convenience for the agent" and "bug class": lossless transformations of input are fine; lossy silent-drops are not. The frame question of #117 (strict vs. permissive vs. hybrid) is deferred — this issue only commits to "stop silently dropping things."

## Why this exists

Issue #117 surfaces a real frame question with real tradeoffs, and the user explicitly asked for thinking-time before locking. But the *production-blocking bug class* (silent-drop of inapplicable filter fields) doesn't have to wait for the frame decision. It can be fixed cheaply, today, without committing to a frame.

This issue is the "ship the safety net first" path. If the eventual frame decision in #117 is **strict**: this work is the first migration step. If the eventual decision is **permissive**: this work is a no-cost compatibility safety net. If the eventual decision is **hybrid**: this work is half of step 1.

**Lossless-permissive doesn't pick a side. It just stops the bleeding.**

## The principle, concretely

| Behavior | Today | Under lossless-permissive |
|---|---|---|
| `filter` passed as JSON-encoded string | Decoded losslessly via `_coerce_filter` | **Kept** (lossless: same input, different form) |
| `filter` passed as dict or `NodeFilter` | Accepted | **Kept** (lossless: same input, different form) |
| Unknown key in `filter` (e.g. `"kind": "client"` inside filter) | Silently dropped by Pydantic `extra="ignore"` | **Rejected** with loud error (lossy → reject) |
| Symbol-only field (e.g. `fqn_prefix`) sent with `kind="client"` | Silently dropped — returns every Client | **Rejected** with structured error listing applicable fields for the kind (lossy → reject) |
| Field that does apply to the requested kind | Honored | **Honored** (no change) |

That's the whole change. No new fields, no aliases, no smart behaviors. Just: **the contract about what gets honored vs. discarded becomes explicit instead of implicit.**

## Why this is genuinely option-preserving

After this lands, the system is in a state where:

- **Strict frame (#117 direction)** is still reachable — keep tightening per-kind validation, lock the frame, no rework required
- **Permissive frame** is still reachable — relax `extra="forbid"` later, add field aliases per kind, add smart behaviors per request
- **Hybrid frame** is still reachable — combine the above per-tool

**None of those future doors close.** The only door that *opens* is "no more silent-drop bugs of the #117 shape." That's the safest possible move because it eliminates the empirically-bad behavior without committing to a long-term frame.

## Proposed implementation

Roughly one PR, ~50 lines of code, ~5 tests:

1. **`NodeFilter` gets `model_config = ConfigDict(extra="forbid")`.** Unknown keys → `ValidationError`. Catches the `"kind"`-inside-filter pathology.

2. **Per-kind applicability check in `find_v2` / `search_v2` / `describe_v2` / `neighbors_v2`.** Before the Cypher push-down, validate that the populated fields on `nf` are a subset of the fields applicable to `kind`. Non-applicable fields → `FindOutput(success=False, message=…)` with a structured error.

3. **Error messages designed as teaching surfaces**, e.g.:

   ```
   filter field 'fqn_prefix' is not applicable to kind 'client'.
   applicable fields for kind 'client': ['microservice', 'module', 'client_kind',
     'target_service', 'target_path_prefix', 'client_method', 'source_layer'].
   ```

   The error message becomes a roadmap. Combined with PR #120's hint field, the agent learns the contract from its own mistakes.

4. **`_coerce_filter` stays untouched.** JSON-decoding is lossless multi-form input — same input, different form. Not a permissive-frame thing; just a serialization convenience.

5. **No changes to `search.query` permissiveness.** The fuzzy query → ranked score behavior is unaffected. This issue is about filter contract honesty only.

## Test coverage

- Unknown key in filter → `ValidationError`
- Symbol-only field with `kind="client"` → `success=False` with structured message
- Symbol-only field with `kind="symbol"` → honored (no change)
- Client-only field with `kind="symbol"` → `success=False`
- Empty filter on any kind → success, no-op (no change)
- JSON-string filter input → still works (lossless multi-form)

## What this does NOT do

- **Does not pick a frame** (strict / permissive / hybrid). That's #117's job.
- **Does not add `resolve` or any new tool.** Would be premature; depends on frame.
- **Does not rename or align fields across kinds.** Vocabulary alignment audit is part of the #117 frame work, not this safety-net fix.
- **Does not change `search.query` semantics.** Query stays fuzzy text.
- **Does not introduce field aliases** (no `member_fqn_prefix` for clients, no smart `target_service` resolution, etc.). Smart behaviors stay deferred.
- **Does not change `EdgeType` literal, kind literal, or any other shipped strict invariant.**

## The caveat (deliberately surfaced)

**Lossless-permissive is a tactical fix, not a strategic frame.** It answers "stop silently dropping things" but it does not answer "what's the contract per kind?" The deferred question still has to be answered eventually, and the longer we wait, the more agents have been trained on the loud-fail-but-still-permissive surface, which biases future frame choices.

Concrete risk: if we ship lossless-permissive and then 3 months later try to add per-kind field aliases (toward permissive frame) or per-kind closed enums (toward strict frame), we'll have **agent prompts in the wild that assume the current surface.** They'll need updating. The longer we wait, the more agents need updating.

**So the safest path isn't "ship lossless-permissive and forget about it"** — it's "ship lossless-permissive AND keep #117 open as the strategic frame question." The tactical fix earns time; it doesn't replace the strategic decision.

## Relationship to other issues

- **#117 — strategic frame question.** Stays open. This issue is the tactical no-regret subset that ships without locking #117.
- **#118 — rollup decomposition.** Independent. This fix doesn't change rollup behavior.
- **#119 — Kuzu `label(e) IN $list` bug.** Independent; should land first.
- **#120 — hints field propose.** Composes naturally: lossless-permissive's structured error messages become candidate `hints` outputs under #120's road-sign discipline.

## Decision point for the maintainer

If you ship lossless-permissive:
- **Pro**: production bug class is fixed today; #117 frame decision can take its time
- **Pro**: each piece (`extra="forbid"`, per-kind validation) is independently testable and shippable
- **Pro**: error messages become teaching surfaces, accelerating agent learning
- **Con**: still owe the frame decision; this is a stopgap, not a finish line
- **Con**: agents trained on lossless-permissive surface will need re-training when strict frame fully lands (if it does)

If you wait for #117 to lock first:
- **Pro**: one coordinated migration, no transitional state
- **Pro**: no risk of agents adapting to a transitional surface that later changes
- **Con**: production bug class persists until #117 frame is decided
- **Con**: frame decision is harder to make without evidence from real agent runs under the loud-fail behavior

## Suggested sequencing (if you go with this)

1. Fix #119 (Kuzu bug) — independent, one PR
2. Ship lossless-permissive (this issue) — one PR, ~50 lines
3. Ship #120 hints field — one PR, already drafted
4. Let 1-3 months of agent runs accumulate. Watch what error messages agents hit repeatedly.
5. Revisit #117 frame with that evidence. Decide strict / permissive / hybrid then.

That sequence is fully reversible at every step until step 5. Each step is small. None of them block another.

## References

- `mcp_v2.py:49–66` — `NodeFilter` (needs `extra="forbid"` config)
- `mcp_v2.py:79–98` — `_coerce_filter` (lossless multi-form input — stays untouched)
- `mcp_v2.py:321–368` — `_node_matches_filter` (per-kind post-filter; gets per-kind applicability check)
- `mcp_v2.py:401–466` — `find_v2` / `search_v2` (the entry points where per-kind validation lives)
- Issue #117 — the strategic frame question this issue defers
- PR #120 — hints field, which composes with lossless-permissive's teaching error messages


Behavior	Today	Under lossless-permissive
`filter` passed as JSON-encoded string	Decoded losslessly via `_coerce_filter`	Kept (lossless: same input, different form)
`filter` passed as dict or `NodeFilter`	Accepted	Kept (lossless: same input, different form)
Unknown key in `filter` (e.g. `"kind": "client"` inside filter)	Silently dropped by Pydantic `extra="ignore"`	Rejected with loud error (lossy → reject)
Symbol-only field (e.g. `fqn_prefix`) sent with `kind="client"`	Silently dropped — returns every Client	Rejected with structured error listing applicable fields for the kind (lossy → reject)
Field that does apply to the requested kind	Honored	Honored (no change)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lossless-permissive frame: tactical no-regret fix for #117 silent-drop bug class #122

Summary

Why this exists

The principle, concretely

Why this is genuinely option-preserving

Proposed implementation

Test coverage

What this does NOT do

The caveat (deliberately surfaced)

Relationship to other issues

Decision point for the maintainer

Suggested sequencing (if you go with this)

References

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

lossless-permissive frame: tactical no-regret fix for #117 silent-drop bug class #122

Description

Summary

Why this exists

The principle, concretely

Why this is genuinely option-preserving

Proposed implementation

Test coverage

What this does NOT do

The caveat (deliberately surfaced)

Relationship to other issues

Decision point for the maintainer

Suggested sequencing (if you go with this)

References

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions