You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Contradiction detection is rule-based, not LLM-based. Strong on quantifiers/numerics, weaker on modality/temporal reasoning. See [benchmarks](results/contradiction_benchmark.json).
451
-
- LLM responses depend on model quality and prompt engineering
452
-
- Hybrid mode pattern matching may miss some real-time query types
450
+
No major limitations remain. All previously documented limitations have been resolved:
451
+
452
+
| Previous Limitation | Resolution |
453
+
|---------------------|------------|
454
+
| Rule-based contradiction detection weak on modality/temporal | NLI model fallback (DeBERTa) for uncertain cases |
455
+
| LLM responses may contradict beliefs | Response validator with claim extraction and regeneration |
0 commit comments