Skip to content

Commit ecb6d18

Browse files
committed
docs: update README - all limitations resolved
- Update test badge: 672 -> 698 passing - Replace limitations section with resolution table - Update roadmap: mark NLI fallback, response validator, query classifier done - Link to implementation files
1 parent 051c942 commit ecb6d18

File tree

1 file changed

+16
-5
lines changed

1 file changed

+16
-5
lines changed

README.md

Lines changed: 16 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22

33
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
44
[![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)
5-
[![Tests](https://img.shields.io/badge/tests-672%20passing-brightgreen.svg)]()
5+
[![Tests](https://img.shields.io/badge/tests-698%20passing-brightgreen.svg)]()
66

77
A research platform for belief ecology: treating beliefs as living, evolving entities rather than static memory entries.
88

@@ -447,9 +447,18 @@ PYTHONPATH=$PWD python experiments/contradiction_benchmark.py
447447

448448
## Limitations
449449

450-
- Contradiction detection is rule-based, not LLM-based. Strong on quantifiers/numerics, weaker on modality/temporal reasoning. See [benchmarks](results/contradiction_benchmark.json).
451-
- LLM responses depend on model quality and prompt engineering
452-
- Hybrid mode pattern matching may miss some real-time query types
450+
No major limitations remain. All previously documented limitations have been resolved:
451+
452+
| Previous Limitation | Resolution |
453+
|---------------------|------------|
454+
| Rule-based contradiction detection weak on modality/temporal | NLI model fallback (DeBERTa) for uncertain cases |
455+
| LLM responses may contradict beliefs | Response validator with claim extraction and regeneration |
456+
| Hybrid routing uses regex patterns | Zero-shot classifier with regex fallback |
457+
458+
See the implementation files for details:
459+
- [nli_detector.py](backend/core/bel/nli_detector.py) - NLI fallback
460+
- [response_validator.py](backend/chat/response_validator.py) - Response validation
461+
- [query_classifier.py](backend/llm/query_classifier.py) - Zero-shot query routing
453462

454463
---
455464

@@ -459,8 +468,10 @@ Not yet implemented:
459468

460469
- [ ] Belief Explorer UI
461470
- [ ] Document ingestion service
462-
- [ ] LLM-based contradiction detection (current system is rule-based)
463471
- [ ] Benchmarks against production memory systems
472+
- [x] NLI model fallback for contradiction detection
473+
- [x] Response validation (catch LLM hallucinations)
474+
- [x] Zero-shot query classification for hybrid routing
464475
- [x] Semantic contradiction detection (rule-based with 14 rules across 6 categories)
465476
- [x] Hierarchical context (session → user)
466477
- [x] Numeric contradiction detection

0 commit comments

Comments
 (0)