GitHub - phosphorco/layered-nlp: Rust natural language processing model with a focus on mapping back to source and "layerable" recognizers

Incrementally build up recognizers over an abstract token that combine to create multiple possible interpretations.

Key features:

Abstract over token type to support "rich" tokens like we have at Story.ai.
May generate multiple interpretations of the same token span.
Produces a set of ranges over the input token list with different attributes.

Quick Start (Humans)

# Build workspace
cargo build

# Run all tests
cargo test

# Run fixture harness (most common during development)
cargo test -p layered-nlp-specs runner::tests::test_full_pipeline_integration -- --nocapture

Repository Map

src/ — core line-level engine (LLLine, matchers, selection)
layered-contracts/ — contract NLP resolvers (terms, obligations, diff)
layered-clauses/ — clause/linking resolvers (list items, cross-refs)
layered-nlp-specs/ — fixture harness + assertions
docs/ — architecture notes and resolver patterns
examples/ — runnable examples
web/ + layered-nlp-demo-wasm/ — demo UI + WASM build

Development Workflow (Humans)

Pick a lane:
- Expansion: add fixtures (workflows/expansion.md)
- Investigation: root-cause failures (workflows/investigation.md)
- Implementation: fix resolvers (workflows/implementation.md)
Run the fixture harness and inspect failures.
Update layered-nlp-specs/fixtures/expected_failures.toml for new, known gaps.
Keep changes narrow and layered; avoid overfitting a single fixture.

Demos

cd layered-nlp-demo-wasm && wasm-pack build --target web --out-dir ../web/pkg
cd ../web && python3 -m http.server 8080
# Open http://localhost:8080/contract-viewer.html

Troubleshooting

Fixtures fail but resolver looks correct: check layered-nlp-specs/src/runner.rs span extraction rules.
Snapshot updates: run cargo insta review.
Expected failures: ensure failures are recorded in layered-nlp-specs/fixtures/expected_failures.toml.

Layering

The key idea here is to enable starting from a bunch of vague tags and slowly building meaning up through incrementally adding information that builds on itself.

Simplification: Money = '$' + Number

    $   123   .     00
                    ╰Natural
              ╰Punct
        ╰Natural
        ╰Amt(Decimal)╯
    ╰Money($/£, Num)─╯

Simplification:

Location(NYC) = 'New' + 'York' + 'City'
Location(AMS) = 'Amsterdam'
Address(Person, Location) = Person + Verb('live') + Predicate('in') + Location

    I     live      in      New York City
                                     ╰Noun
                                ╰Noun
                            ╰Adj
                    ╰Predicate
          ╰Verb
    ╰Noun
    ╰Person(Self)
                            ╰──Location─╯
    ╰────Address(Person, Location)─────╯

Contributor Workflows

If you are working on fixture coverage or implementation, start here:

Expansion (fixtures + coverage gaps): workflows/expansion.md
Investigation (failure forensics): workflows/investigation.md
Implementation (make fixtures pass): workflows/implementation.md

Agent loop (short version): Pick a lane, complete the full checklist in that workflow, run tests, and hand off to the next lane.

Agent loop (super simple): Pick lane → do the full lane checklist → run tests → hand off.

Architecture Notes

Versioned diff design: docs/versioned-diff-architecture.md
Resolver design playbook (recipe + pseudocode): docs/resolver-design-exercise.md

Name		Name	Last commit message	Last commit date
Latest commit History 147 Commits
.context		.context
assets		assets
docs		docs
examples		examples
layered-amount		layered-amount
layered-clauses		layered-clauses
layered-contracts		layered-contracts
layered-deixis		layered-deixis
layered-nlp-demo-wasm		layered-nlp-demo-wasm
layered-nlp-document		layered-nlp-document
layered-nlp-specs		layered-nlp-specs
layered-part-of-speech		layered-part-of-speech
scripts		scripts
src		src
web		web
workflows		workflows
.gitignore		.gitignore
.gitmodules		.gitmodules
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
COORDINATION.md		COORDINATION.md
COORDINATION_LOG.md		COORDINATION_LOG.md
Cargo.toml		Cargo.toml
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
README.md		README.md
mise.toml		mise.toml
ntm.sh		ntm.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Quick Start (Humans)

Repository Map

Development Workflow (Humans)

Demos

Troubleshooting

Layering

Contributor Workflows

Architecture Notes

About

Uh oh!

Releases

Packages

Languages

License

phosphorco/layered-nlp

Folders and files

Latest commit

History

Repository files navigation

Quick Start (Humans)

Repository Map

Development Workflow (Humans)

Demos

Troubleshooting

Layering

Contributor Workflows

Architecture Notes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages