Commit 671e3f7

Add new column
1 parent 765494b commit 671e3f7

3 files changed: +107 -34 lines changed

README.md

Lines changed: 2 additions & 2 deletions
@@ -16,9 +16,9 @@ This catalog is a collection of repositories for various Machine Learning techni
1616
| [rag-bootcamp][rag-repo] | This repository contains demos for various Retrieval Augmented Generation techniques using different libraries. | Cloud search via LlamaHub, Document search via LangChain, LlamaIndex for OpenAI and Cohere models, Hybrid Search via Weaviate Vector Store, Evaluation via RAGAS library, Websearch via LangChain | 3 | [Vectors 2021 Annual Report], [PubMed Doc], [Banking Deposits] | bootcamp | 2024 |
1717
| [finetuning-and-alignment][fa-repo] | This repository contains demos for finetuning techniques for LLMs focussed on reducing computational cost. | DDP, FSDP, Instruction Tuning, LoRA, DoRA, QLora, Supervised finetuning | 3 | [samsam], [imdb], [Bias-DeBiased] | bootcamp | 2024 |
1818
| [Prompt Engineering Laboratory][pe-lab-repo] | This repository contains demos for various Prompt Engineering techniques, along with examples for Bias quantification, text classification. | Stereotypical Bias Analysis, Sentiment inference, Finetuning using HF Library, Activation Generation, Train and Test Model for Activations without Prompts, RAG, ABSA, Few shot prompting, Zero shot prompting (Stochastic, Greedy, Likelihood Estimation), Role play prompting, LLM Prompt Summarization, Zero shot and few shot prompt translation, Few shot CoT, Zero shot CoT, Self-Consistent CoT prompting (Zero shot, 5-shot), Balanced Choice of Plausible Alternatives, Bootstrap Ensembling(Generation & MC formulation), Vote Ensembling | 11 | [Crows-pairs][crow-pairs-pe-lab], [sst5][sst5-pe-lab], [czarnowska templates][czar-templ-pe-lab], [cnn_dailymail], [ag_news], [Weather and sports data], [Other] | bootcamp | 2024 |
19-
| [bias-mitigation-unlearning][bmu-repo] | This repository contains code for the paper [Can Machine Unlearning Reduce Social Bias in Language Models?][bmu-paper] which was published at EMNLP'24 in the Industry track. <br>Authors are Omkar Dige, Diljot Arneja, Tsz Fung Yau, Qixuan Zhang, Mohammad Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak. | PCGU, Task vectors and DPO for Machine Unlearning | 20 | [BBQ][bbq-bmu], [Stereoset][stereoset-bmu], [Link1][link1-bmu], [Link2][link2-bmu] | bootcamp | 2024 |
19+
| [bias-mitigation-unlearning][bmu-repo] | This repository contains code for the paper [Can Machine Unlearning Reduce Social Bias in Language Models?][bmu-paper] which was published at EMNLP'24 in the Industry track. <br>Authors are Omkar Dige, Diljot Arneja, Tsz Fung Yau, Qixuan Zhang, Mohammad Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak. | PCGU, Task vectors and DPO for Machine Unlearning | 20 | [BBQ][bbq-bmu], [Stereoset][stereoset-bmu], [Link1][link1-bmu], [Link2][link2-bmu] | applied-research | 2024 |
2020
| [cyclops-workshop][cyclops-repo] | This repository contains demos for using [CyclOps] package for clinical ML evaluation and monitoring. | XGBoost | 1 | [Diabetes 130-US hospitals dataset for years 1999-2008][diabetes-cyclops] | bootcamp | 2024 |
21-
| [odyssey][odyssey-repo] | This is a library created with research done for the paper [EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records][odyssey-paper] published at ArXiv'24. <br>Authors are Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, Amrit Krishnan. | EHRMamba, XGBoost, Bi-LSTM | 1 | [MIMIC-IV] | bootcamp | 2024 |
21+
| [odyssey][odyssey-repo] | This is a library created with research done for the paper [EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records][odyssey-paper] published at ArXiv'24. <br>Authors are Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, Amrit Krishnan. | EHRMamba, XGBoost, Bi-LSTM | 1 | [MIMIC-IV] | tool | 2024 |
2222
| [diffusion-model-bootcamp][diffusion-repo] | This repository contains demos for various diffusion models for tabular and time series data. | TabDDPM, TabSyn, ClavaDDPM, CSDI, TSDiff | 12 | [Physionet Challenge 2012], [wiki2000] | bootcamp | 2024 |
2323
| [News Media Bias][nmb-repo] | This repository contains code for libraries and experiments to recognise and evaluate bias and fakeness within news media articles via LLMs. | Bias evaluation via LLMs, finetuning and data annotation via LLM for fake news detection, Supervised finetuning for debiasing sentence, NER for biased phrases via LLMS, Evaluate using DeepEval library | 4 | [News Media Bias Full data][nmb-data], [Toxigen], [Nela GT], [Debiaser data] | bootcamp | 2024 |
2424
| [News Media Bias Plus][nmb-plus-repo] | Continuation of News Media Bias project, this repository contains code for libraries and experiments to collect and annotate data, recognise and evaluate bias and fakeness within news media articles via LLMs and LVMs. | Bias evaluation via LLMs and VLMs, finetuning and data annotation via LLM for fake news detection, supervised finetuning for debiasing sentence, NER for biased entities via LLMS | 2 | [News Media Bias Plus Full Data][nmb-plus-full-data], [NMB Plus Named Entities][nmb-plus-entities] | bootcamp | 2024 |

docs/index.md

Lines changed: 51 additions & 31 deletions
@@ -15,6 +15,8 @@ hide:
1515
</div>
1616

1717
<!-- Custom styling for the hero section -->
18+
19+
1820
<style>
1921
.hero-section {
2022
position: relative;
@@ -99,6 +101,8 @@ hide:
99101

100102

101103

104+
105+
102106
<div class="catalog-stats">
103107
<div class="stat">
104108
<div class="stat-number">100+</div>
@@ -141,6 +145,10 @@ hide:
141145

142146

143147

148+
149+
150+
151+
144152

145153

146154

@@ -204,21 +212,6 @@ hide:
204212
</div>
205213
</div>
206214
<div class="card" markdown>
207-
<div class="header">
208-
<h3><a href="https://github.com/VectorInstitute/bmu" title="Go to Repository">bias-mitigation-unlearning</a></h3>
209-
<span class="tag year-tag">2024</span>
210-
<span class="tag type-tag">bootcamp</span>
211-
</div>
212-
<p>This repository contains code for the paper [Can Machine Unlearning Reduce Social Bias in Language Models?][bmu-paper] which was published at EMNLP'24 in the Industry track. <br>Authors are Omkar Dige, Diljot Arneja, Tsz Fung Yau, Qixuan Zhang, Mohammad Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak.</p>
213-
<div class="tag-container">
214-
<span class="tag" data-tippy="PCGU">PCGU</span>
215-
<span class="tag" data-tippy="Task vectors and DPO for Machine Unlearning">Task vectors and DPO for Machine Unlearning</span>
216-
</div>
217-
<div class="datasets">
218-
<strong>Datasets:</strong> <span class="dataset-tag">BBQ</span> <span class="dataset-tag">bbq-bmu</span> <span class="dataset-tag">Stereoset</span> <span class="dataset-tag">stereoset-bmu</span> <span class="dataset-tag">Link1</span> <span class="dataset-tag">link1-bmu</span> <span class="dataset-tag">Link2</span> <span class="dataset-tag">link2-bmu</span>
219-
</div>
220-
</div>
221-
<div class="card" markdown>
222215
<div class="header">
223216
<h3><a href="https://github.com/VectorInstitute/cyclops" title="Go to Repository">cyclops-workshop</a></h3>
224217
<span class="tag year-tag">2024</span>
@@ -233,22 +226,6 @@ hide:
233226
</div>
234227
</div>
235228
<div class="card" markdown>
236-
<div class="header">
237-
<h3><a href="https://github.com/VectorInstitute/odyssey" title="Go to Repository">odyssey</a></h3>
238-
<span class="tag year-tag">2024</span>
239-
<span class="tag type-tag">bootcamp</span>
240-
</div>
241-
<p>This is a library created with research done for the paper [EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records][odyssey-paper] published at ArXiv'24. <br>Authors are Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, Amrit Krishnan.</p>
242-
<div class="tag-container">
243-
<span class="tag" data-tippy="EHRMamba">EHRMamba</span>
244-
<span class="tag" data-tippy="XGBoost">XGBoost</span>
245-
<span class="tag" data-tippy="Bi-LSTM">Bi-LSTM</span>
246-
</div>
247-
<div class="datasets">
248-
<strong>Datasets:</strong> <span class="dataset-tag">MIMIC-IV</span>
249-
</div>
250-
</div>
251-
<div class="card" markdown>
252229
<div class="header">
253230
<h3><a href="https://github.com/VectorInstitute/diffusion" title="Go to Repository">diffusion-model-bootcamp</a></h3>
254231
<span class="tag year-tag">2024</span>
@@ -544,3 +521,46 @@ hide:
544521

545522
</div>
546523

524+
=== "tool"
525+
526+
<div class="grid cards" markdown>
527+
<div class="card" markdown>
528+
<div class="header">
529+
<h3><a href="https://github.com/VectorInstitute/odyssey" title="Go to Repository">odyssey</a></h3>
530+
<span class="tag year-tag">2024</span>
531+
<span class="tag type-tag">tool</span>
532+
</div>
533+
<p>This is a library created with research done for the paper [EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records][odyssey-paper] published at ArXiv'24. <br>Authors are Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, Amrit Krishnan.</p>
534+
<div class="tag-container">
535+
<span class="tag" data-tippy="EHRMamba">EHRMamba</span>
536+
<span class="tag" data-tippy="XGBoost">XGBoost</span>
537+
<span class="tag" data-tippy="Bi-LSTM">Bi-LSTM</span>
538+
</div>
539+
<div class="datasets">
540+
<strong>Datasets:</strong> <span class="dataset-tag">MIMIC-IV</span>
541+
</div>
542+
</div>
543+
544+
</div>
545+
546+
=== "applied-research"
547+
548+
<div class="grid cards" markdown>
549+
<div class="card" markdown>
550+
<div class="header">
551+
<h3><a href="https://github.com/VectorInstitute/bmu" title="Go to Repository">bias-mitigation-unlearning</a></h3>
552+
<span class="tag year-tag">2024</span>
553+
<span class="tag type-tag">applied-research</span>
554+
</div>
555+
<p>This repository contains code for the paper [Can Machine Unlearning Reduce Social Bias in Language Models?][bmu-paper] which was published at EMNLP'24 in the Industry track. <br>Authors are Omkar Dige, Diljot Arneja, Tsz Fung Yau, Qixuan Zhang, Mohammad Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak.</p>
556+
<div class="tag-container">
557+
<span class="tag" data-tippy="PCGU">PCGU</span>
558+
<span class="tag" data-tippy="Task vectors and DPO for Machine Unlearning">Task vectors and DPO for Machine Unlearning</span>
559+
</div>
560+
<div class="datasets">
561+
<strong>Datasets:</strong> <span class="dataset-tag">BBQ</span> <span class="dataset-tag">bbq-bmu</span> <span class="dataset-tag">Stereoset</span> <span class="dataset-tag">stereoset-bmu</span> <span class="dataset-tag">Link1</span> <span class="dataset-tag">link1-bmu</span> <span class="dataset-tag">Link2</span> <span class="dataset-tag">link2-bmu</span>
562+
</div>
563+
</div>
564+
565+
</div>
566+

scripts/sync_readme_to_docs.py

Lines changed: 54 additions & 1 deletion
@@ -320,9 +320,56 @@ def update_docs_index(implementations_by_type: Dict[str, List[Dict]]) -> None:
320320

321321
original_content = docs_index_path.read_text(encoding="utf-8")
322322

323-
# Ensure we have CSS for dataset tags and type tags
323+
# Ensure we have CSS for dataset tags, type tags, year tags, and hero section
324324
css_for_tags = """
325325
<style>
326+
.hero-section {
327+
position: relative;
328+
padding: 5rem 4rem;
329+
text-align: center;
330+
color: white;
331+
background-color: var(--md-primary-fg-color);
332+
background-image: linear-gradient(rgba(0, 0, 0, 0.35), rgba(0, 0, 0, 0.35)), url('assets/splash.png');
333+
background-size: cover;
334+
background-position: center;
335+
display: flex;
336+
justify-content: center;
337+
align-items: center;
338+
margin: 0;
339+
padding: 0;
340+
width: 100%;
341+
position: relative;
342+
min-height: 70vh;
343+
}
344+
345+
.hero-content {
346+
max-width: 800px;
347+
z-index: 10;
348+
}
349+
350+
.hero-content h1 {
351+
font-size: 3rem;
352+
margin-bottom: 1rem;
353+
text-shadow: 0 2px 8px rgba(0,0,0,0.7);
354+
font-weight: 600;
355+
letter-spacing: 0.5px;
356+
color: #ffffff;
357+
font-family: 'Roboto', sans-serif;
358+
}
359+
360+
.hero-content p {
361+
font-size: 1.5rem;
362+
margin-bottom: 2rem;
363+
text-shadow: 0 2px 6px rgba(0,0,0,0.7);
364+
max-width: 700px;
365+
margin-left: auto;
366+
margin-right: auto;
367+
line-height: 1.4;
368+
color: #f8f8f8;
369+
font-family: 'Roboto', sans-serif;
370+
font-weight: 300;
371+
}
372+
326373
.dataset-tag {
327374
display: inline-block;
328375
background-color: #6a5acd;
@@ -348,6 +395,12 @@ def update_docs_index(implementations_by_type: Dict[str, List[Dict]]) -> None:
348395
font-weight: 500;
349396
white-space: nowrap;
350397
}
398+
399+
.year-tag {
400+
background-color: #eb088a; /* Pink color instead of black */
401+
color: white;
402+
float: right;
403+
}
351404
</style>
352405
"""
353406

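The commit moves two README rows from the `bootcamp` type to `tool` and `applied-research`, and the generated `docs/index.md` gains a matching `=== "tool"` and `=== "applied-research"` tab. The following is a minimal sketch of how README table rows could be grouped by their type column to produce the `implementations_by_type` shape that `update_docs_index` accepts. It is not the code in `scripts/sync_readme_to_docs.py`: the helper names (`parse_row`, `group_by_type`, `render_tab_headers`), the dictionary keys in `COLUMNS`, and the shortened sample rows are all assumptions made for illustration, since the README header row is not part of this diff.

```python
from collections import defaultdict
from typing import Dict, List

# Hypothetical key names: the README header row is not shown in the diff,
# so these labels only reflect the apparent order of the seven columns.
COLUMNS = ["name", "description", "algorithms", "count", "datasets", "type", "year"]


def parse_row(line: str) -> Dict[str, str]:
    """Split one Markdown table row into a dict keyed by COLUMNS.

    Assumes no cell contains a literal '|' character.
    """
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    if len(cells) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} cells, got {len(cells)}")
    return dict(zip(COLUMNS, cells))


def group_by_type(rows: List[str]) -> Dict[str, List[Dict[str, str]]]:
    """Group parsed rows by their 'type' cell, keeping first-seen order."""
    grouped: Dict[str, List[Dict[str, str]]] = defaultdict(list)
    for line in rows:
        entry = parse_row(line)
        grouped[entry["type"]].append(entry)
    return dict(grouped)


def render_tab_headers(grouped: Dict[str, List[Dict[str, str]]]) -> List[str]:
    """Return one content-tab header per type, in the style seen in docs/index.md."""
    return [f'=== "{type_name}"' for type_name in grouped]


# Shortened, illustrative rows (descriptions abbreviated; not the real README text).
sample_rows = [
    "| [odyssey][odyssey-repo] | EHR library | EHRMamba, XGBoost, Bi-LSTM | 1 | [MIMIC-IV] | tool | 2024 |",
    "| [bias-mitigation-unlearning][bmu-repo] | Unlearning code | PCGU | 20 | [BBQ][bbq-bmu] | applied-research | 2024 |",
    "| [cyclops-workshop][cyclops-repo] | CyclOps demos | XGBoost | 1 | [Diabetes] | bootcamp | 2024 |",
]

grouped = group_by_type(sample_rows)
print(render_tab_headers(grouped))
```

Grouping on the type cell means that changing a single value in the README table (for example `bootcamp` to `tool`) is enough to move a card into a different tab the next time the docs are regenerated, which matches the pattern of removals and additions in the `docs/index.md` hunks.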