A source of inefficiency in LR(1) parsers.

Hello @osa1!

There's one observation that I made about a source of inefficiencies in traditional LR(1) parsers [here](https://github.com/mdaines/grammophone/issues/26#issuecomment-1597730431).

I'd love to hear your thoughts on it. I hope to find an algorithm that allows me to split states that would not introduce this redundancy, that is, one that doesn't need a post processing step and only splits states that are necessary.

(below is a copy of the linked comment:)

---

[This](https://mdaines.github.io/grammophone/?s=UyAtPiBhIFggYSB8IGIgWCBiIHwgYSBZIGIgfCBiIFkgYS4KWCAtPiBjICJYJyIuClkgLT4gYyAiWSciLgoiWCciIC0+IGMuCiJZJyIgLT4gYy4=) grammar demonstrates a source of inefficiency in the automaton of the standard LR(1) construction.

Looking at the LR(1) automaton, I think that state 13 & 18 and state 17 & 12 could be merged and the resulting automaton would still resolve the ambiguity that LR(1) was meant to resolve. only 6 & 9 and 14 & 19 are required to be distinct new states.

This grammar is given as an example in some [slides](https://jzimmerman.io/langcc_stanford_oct_20_2022.pdf) that introduce [langcc](https://github.com/mdaines/grammophone/issues/29). However, I'm not sure if langcc eliminates this source of inefficiency.

_I don't know what this observation is called in the literature, but I would be interested in finding that out._


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A source of inefficiency in LR(1) parsers. #11

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

A source of inefficiency in LR(1) parsers. #11

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions