Corpus Tools for creating an OCFL repository for Eastern Central Arrernte dictionary recordings.
- Node.js (v22 or newer needed for oni-ocfl)
- npm
- Visual Studio Code
Clone the repository and install dependencies:
git clone https://github.com/Language-Research-Technology/corpus-tools-ec-arrernte.git
cd corpus-tools-ec-arrernte
npm installYou can run the main corpus tool using the provided Makefile:
makeYou can override the incldued makefile to include your data location such as:
#!/usr/bin/env bash
make BASE_DATA_DIR=/opt/storage/ECArrernte \
TEMPLATE_DIR=/opt/storage/ECArrernte \
REPO_OUT_DIR=/opt/storage/oni/ocfl \
REPO_SCRATCH_DIR=/opt/storage/oni/scratch-ocfl \
BASE_TMP_DIR=./storage/temp \
NAMESPACE=ec-arrernte-dictionary-recordings \
CORPUS_NAME=ec-arrernte-dictionary-recordings \
DATA_DIR="/opt/storage/ECArrernte/DICTIONARY RECORDINGS TRANSCRIBED ETC/" \
REPO_NAME=LDaCA \
DEBUG=trueOr directly with Node.js:
node index.js -s ec-arrernte-dictionary-recordings \
-t "./template" \
-c ec-arrernte-dictionary-recordings -n LDaCA \
-r "./ocfl-repo" -x "/data/override" \
-d "/data/override" \
-D true \
-p "temp" -z "scratch"make cleanRun all tests using Mocha:
npm testOr run a specific test file:
npx mocha test/elan.spec.js- Open the project folder in VS Code.
- Go to the "Run and Debug" panel (Ctrl+Shift+D).
- Select a debug configuration, e.g. Debug ELAN Tests.
- Set breakpoints in your test or source files.
- Click the green "Start Debugging" button.
The .vscode/launch.json includes a configuration for debugging tests:
index.js— Main entry pointlib/— Library modulestest/— Mocha test filesMakeFile— Makefile for common tasks
For issues or questions, please open an issue
{ "type": "node", "request": "launch", "name": "Debug ELAN Tests", "program": "${workspaceFolder}/node_modules/mocha/bin/_mocha", "args": [ "--timeout", "999999", "--colors", "${workspaceFolder}/test/elan.spec.js" ], "console": "integratedTerminal", "internalConsoleOptions": "neverOpen" }