posh-bench

This is the repository for the paper: A Unified Assessment of the Poverty of the Stimulus Argument for Neural Language Models by Xiulin Yang, Arianna Bisazza, Nathan Schneider, and Ethan Gotlieb Wilcox

Setup

To set up the environment, run:

conda create -n posh-bench python=3.11
conda activate posh-bench
pip install -r requirements.txt
pip install -e . --no-dependencies

Experiments

To run the experiments, use the following command:

# train models
bash train_model.sh $dataset_size $vocab_size $model_type $baby_or_wiki # you can find the options available in ```generate_config.py```
# evaluate models
python benchmark_eval.py model_name --eval_dataset posh --best_checkpoint

Dataset

Training data: it is stored in OSF
Evaluation data: different benchmarks are listed in different folders in this repository, e.g., posh: posh-bench

Citation

@misc{yang2026unifiedassessmentpovertystimulus,
      title={A Unified Assessment of the Poverty of the Stimulus Argument for Neural Language Models}, 
      author={Xiulin Yang and Arianna Bisazza and Nathan Schneider and Ethan Gotlieb Wilcox},
      year={2026},
      eprint={2602.09992},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2602.09992}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
blimp		blimp
posh		posh
scamp_implausible		scamp_implausible
scamp_plausible		scamp_plausible
src		src
zorro		zorro
README.md		README.md
benchmark_eval.py		benchmark_eval.py
generate_config.py		generate_config.py
requirements.txt		requirements.txt
save_config.py		save_config.py
train_clm.py		train_clm.py
train_model.sh		train_model.sh
train_tokenizer.py		train_tokenizer.py
upload_models.sh		upload_models.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

posh-bench

Setup

Experiments

Dataset

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

xiulinyang/posh-bench

Folders and files

Latest commit

History

Repository files navigation

posh-bench

Setup

Experiments

Dataset

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages