Skip to content

eriknovak/ErikNovak

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Erik Novak

Data Scientist at Event Registry

Research Focus

Artificial Intelligence • Natural Language Processing • Cross-lingual Language Models • Semi-Automatic Text Processing • Data Visualization

Connect: LinkedIn | Homepage

Open Source Contributions

Python Packages

Package Description GitHub Stars PyPI
anonipy Data anonymization library supporting multiple anonymization strategies and techniques Stars PyPi
datachart Flexible data visualization library with simple API and extensive customization options Stars PyPi

Research Datasets

Dataset Description GitHub Stars Repository
OG2021 Comprehensive dataset from the 2021 Tokyo Olympics Stars Clarin.si
SloATOMIC 2020 Slovene translation of the ATOMIC 2020 commonsense reasoning dataset Stars Clarin.si

Project Templates

Machine Learning with DVC

eriknovak/cookiecutter-ml-dvc — Template for machine learning experiments using DVC for version control and reproducibility (in development).

# Install pipx for running cookiecutter
pip install pipx
# Create a new project using the template
pipx run cookiecutter gh:eriknovak/cookiecutter-ml-dvc

Machine Learning on HPC Systems

eriknovak/cookiecutter-ml-hpc — Template for machine learning experiments on HPC clusters with SLURM workload manager (in development).

# Install pipx for running cookiecutter
pip install pipx
# Create a new project using the template
pipx run cookiecutter gh:eriknovak/cookiecutter-ml-hpc

About

Home project

Topics

Resources

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors