Results and code

This repository contains the results and code from the PhD thesis Extraction of Quantitative Grammatical Rules from Syntactic Treebanks.

Results

All results are in the results directory. The JSON files contain the scope, the conclusion and the extracted rules for each experience of the thesis.

Code

The main code for rule extraction is in src. Within this directory:

grex2: the rule extraction scripts, including those for the decision tree, sparse logistic regression, and the RuleFit implementation.

The work is built upon a modified version of Grex2. The original Grex2 project is maintained at https://github.com/FilippoC/grex2. The code was primarily developed by Caio Corro.

The official documentation for Grex2 (currently under development) can be found at https://grex.grew.fr.

If you use this software, please cite our paper:

@inproceedings{herrera2024grex,
    title = "Sparse Logistic Regression with High-order Features for Automatic Grammar Rule Extraction from Treebanks",
    author = "Herrera, Santiago and Corro, Caio and Kahane, Sylvain",
    booktitle = "Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation",
    month = may,
    year = "2024",
    address = "Torino, Italia",
    url = "https://arxiv.org/abs/2403.17534",
}

univariate: script for compute univariate measures over features.
evaluation: evaluations and global measure scripts.

Experiments

This directory includes bash scripts for executing the tasks of each experiment. It also contains other scripts or notebooks specific to each experiment.

Setup

Clone the repository and then install the project with pip install -e .

I recommend installing the project in a virtual environment, e.g., python -m venv .venv

Some paths need to be adjusted in order to run the experiments.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
experiments		experiments
results		results
src		src
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Results and code

Results

Code

Experiments

Setup

About

Uh oh!

Releases

Packages

Languages

s-herrera/phd-thesis

Folders and files

Latest commit

History

Repository files navigation

Results and code

Results

Code

Experiments

Setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages