Commit 5054e0f

Merge pull request #11 from VectorInstitute/add_yaml_files_script
Add yaml files for repos, add script to parse data
2 parents cb9f45d + 0572c47 commit 5054e0f

12 files changed: +938 -389 lines changed
Lines changed: 12 additions & 10 deletions
@@ -1,4 +1,4 @@
-name: sync-readme
+name: sync-repositories-to-docs
 permissions:
   contents: write
   pull-requests: write
@@ -8,20 +8,22 @@ on:
     branches:
       - main
     paths:
-      - 'README.md'
-      - 'scripts/sync_readme_to_docs.py'
-      - '.github/workflows/sync_readme.yml'
+      - 'scripts/sync_repositories_to_docs.py'
+      - '.github/workflows/sync_repos.yml'
+      - 'repositories/**'
+      - 'docs/bibtex/**'
   pull_request:
     branches:
       - main
     paths:
-      - 'README.md'
-      - 'scripts/sync_readme_to_docs.py'
-      - '.github/workflows/sync_readme.yml'
+      - 'scripts/sync_repositories_to_docs.py'
+      - '.github/workflows/sync_repos.yml'
+      - 'repositories/**'
+      - 'docs/bibtex/**'
   workflow_dispatch: # Allow manual triggering

 jobs:
-  sync-readme:
+  sync-repositories:
     runs-on: ubuntu-latest
     steps:
       - name: Checkout code
@@ -42,7 +44,7 @@ jobs:
         run: uv sync --dev --group docs

       - name: Run sync script
-        run: python scripts/sync_readme_to_docs.py
+        run: source .venv/bin/activate && python scripts/sync_repositories_to_docs.py

       - name: Check for changes
         id: git-check
@@ -59,7 +61,7 @@ jobs:
         if: steps.git-check.outputs.changes == 'true' && github.event_name == 'push'
         run: |
           git add docs/index.md
-          git commit -m "docs: sync README.md reference implementations to docs/index.md
+          git commit -m "docs: sync repositories to docs/index.md

           Automated update triggered by changes to README.md
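The new `paths:` filters in the workflow above decide which changed files trigger a run. A minimal sketch of that decision in Python follows, assuming `fnmatch` as a stand-in for the matcher. GitHub Actions does the real matching itself, and its glob rules differ (there `*` does not cross `/`, while in `fnmatch` it does). The names `TRIGGER_PATHS` and `workflow_should_run`, and the example file `repositories/example.yaml`, are hypothetical and do not come from the commit.

```python
from fnmatch import fnmatchcase

# Path filters copied from the updated workflow's `paths:` lists.
TRIGGER_PATHS = [
    "scripts/sync_repositories_to_docs.py",
    ".github/workflows/sync_repos.yml",
    "repositories/**",
    "docs/bibtex/**",
]


def workflow_should_run(changed_files):
    """Return True if any changed file matches any trigger pattern.

    Approximation only: fnmatch treats `*` as matching `/` too, which is
    looser than GitHub's own filter semantics.
    """
    return any(
        fnmatchcase(path, pattern)
        for path in changed_files
        for pattern in TRIGGER_PATHS
    )


if __name__ == "__main__":
    # README.md no longer appears in the filters, so a README-only change is skipped.
    print(workflow_should_run(["README.md"]))
    # Hypothetical file under repositories/, used only for illustration.
    print(workflow_should_run(["repositories/example.yaml"]))
```

Under this approximation, a change to `README.md` alone no longer matches any filter, while any change under `repositories/` or `docs/bibtex/` does.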

README.md

Lines changed: 1 addition & 1 deletion
@@ -19,7 +19,7 @@ This catalog is a collection of repositories for various Machine Learning techni
 | [bias-mitigation-unlearning][bmu-repo] | This repository contains code for the paper [Can Machine Unlearning Reduce Social Bias in Language Models?][bmu-paper] which was published at EMNLP'24 in the Industry track. <br>Authors are Omkar Dige, Diljot Arneja, Tsz Fung Yau, Qixuan Zhang, Mohammad Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak. | PCGU, Task vectors and DPO for Machine Unlearning | 20 | [BBQ][bbq-bmu], [Stereoset][stereoset-bmu], [Link1][link1-bmu], [Link2][link2-bmu] | applied-research | 2024 |
 | [cyclops-workshop][cyclops-repo] | This repository contains demos for using [CyclOps] package for clinical ML evaluation and monitoring. | XGBoost | 1 | [Diabetes 130-US hospitals dataset for years 1999-2008][diabetes-cyclops] | bootcamp | 2024 |
 | [odyssey][odyssey-repo] | This is a library created with research done for the paper [EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records][odyssey-paper] published at ArXiv'24. <br>Authors are Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, Amrit Krishnan. | EHRMamba, XGBoost, Bi-LSTM | 1 | [MIMIC-IV] | tool | 2024 |
-| [diffusion-model-bootcamp][diffusion-repo] | This repository contains demos for various diffusion models for tabular and time series data. | TabDDPM, TabSyn, ClavaDDPM, CSDI, TSDiff | 12 | [Physionet Challenge 2012], [wiki2000] | bootcamp | 2024 |
+| [diffusion-models][diffusion-repo] | This repository contains demos for various diffusion models for tabular and time series data. | TabDDPM, TabSyn, ClavaDDPM, CSDI, TSDiff | 12 | [Physionet Challenge 2012], [wiki2000] | bootcamp | 2024 |
 | [News Media Bias][nmb-repo] | This repository contains code for libraries and experiments to recognise and evaluate bias and fakeness within news media articles via LLMs. | Bias evaluation via LLMs, finetuning and data annotation via LLM for fake news detection, Supervised finetuning for debiasing sentence, NER for biased phrases via LLMS, Evaluate using DeepEval library | 4 | [News Media Bias Full data][nmb-data], [Toxigen], [Nela GT], [Debiaser data] | bootcamp | 2024 |
 | [News Media Bias Plus][nmb-plus-repo] | Continuation of News Media Bias project, this repository contains code for libraries and experiments to collect and annotate data, recognise and evaluate bias and fakeness within news media articles via LLMs and LVMs. | Bias evaluation via LLMs and VLMs, finetuning and data annotation via LLM for fake news detection, supervised finetuning for debiasing sentence, NER for biased entities via LLMS | 2 | [News Media Bias Plus Full Data][nmb-plus-full-data], [NMB Plus Named Entities][nmb-plus-entities] | bootcamp | 2024 |
 | [Anomaly Detection Project][anomaly-repo] | This repository contains demos for various supervised and unsupervised anomaly detection techniques in domains such as Fraud Detection, Network Intrusion Detection, System Monitoring and image, Video Analysis. | AMNet, GCN, SAGE, OCGNN, DON, AdONE, MLP, FTTransformer, DeepSAD, XGBoost, CBLOF, CFA for Target-Oriented Anomaly Localization, Draem for surface anomaly detection, Logistic Regression, CATBoost, Random Forest, Diversity Measurable Anomaly Detection, Two-stream I3D Convolutional Network, DeepCNN, LightGBM, Isolation Forest, TabNet, AutoEncoder, Internal Contrastive Learning | 5 | [On Vector Cluster][cluster-anomaly] | bootcamp | 2023 |
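
The commit title says the repository data moves into YAML files that a script parses. The script itself is not shown in this diff, so the following is only a sketch of how one parsed entry could be rendered into a catalog row shaped like those above. The `RepoEntry` class, its field names, and `render_row` are all hypothetical. The meaning of the numeric column is not stated in the diff, so it is called `count` here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RepoEntry:
    """Hypothetical shape of one parsed repository record (not from the commit)."""
    name: str              # link text, e.g. "diffusion-models"
    ref: str               # reference-style link label, e.g. "diffusion-repo"
    description: str
    algorithms: List[str]
    count: int             # numeric column; its meaning is not given in the diff
    datasets: List[str]    # already formatted as markdown links
    kind: str              # e.g. "bootcamp", "tool", "applied-research"
    year: int


def render_row(entry: RepoEntry) -> str:
    """Render one entry as a pipe-delimited markdown table row."""
    cells = [
        f"[{entry.name}][{entry.ref}]",
        entry.description,
        ", ".join(entry.algorithms),
        str(entry.count),
        ", ".join(entry.datasets),
        entry.kind,
        str(entry.year),
    ]
    return "| " + " | ".join(cells) + " |"


if __name__ == "__main__":
    # Values taken from the added README row in the diff above.
    entry = RepoEntry(
        name="diffusion-models",
        ref="diffusion-repo",
        description=(
            "This repository contains demos for various diffusion models "
            "for tabular and time series data."
        ),
        algorithms=["TabDDPM", "TabSyn", "ClavaDDPM", "CSDI", "TSDiff"],
        count=12,
        datasets=["[Physionet Challenge 2012]", "[wiki2000]"],
        kind="bootcamp",
        year=2024,
    )
    print(render_row(entry))
```

Keeping each row as structured data and generating the table from it means a rename like `diffusion-model-bootcamp` to `diffusion-models` becomes a one-field change rather than a hand edit of a long table line.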

docs/bibtex/papers.bib

Lines changed: 7 additions & 0 deletions
@@ -0,0 +1,7 @@
+@inproceedings{dige2024can,
+  title={Can Machine Unlearning Reduce Social Bias in Language Models?},
+  author={Dige, Omkar and Arneja, Diljot and Yau, Tsz Fung and Zhang, Qixuan and Bolandraftar, Mohammad and Zhu, Xiaodan and Khattak, Faiza},
+  booktitle={Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track},
+  pages={954--969},
+  year={2024}
+}
