Add new column

amrit110 · amrit110 · commit 671e3f76b8e1 · 2025-05-12T22:55:19.000-04:00
diff --git a/README.md b/README.md
@@ -16,9 +16,9 @@ This catalog is a collection of repositories for various Machine Learning techni
 | [rag-bootcamp][rag-repo] | This repository contains demos for various Retrieval Augmented Generation techniques using different libraries. | Cloud search via LlamaHub, Document search via LangChain, LlamaIndex for OpenAI and Cohere models, Hybrid Search via Weaviate Vector Store, Evaluation via RAGAS library, Websearch via LangChain | 3 | [Vectors 2021 Annual Report], [PubMed Doc], [Banking Deposits] | bootcamp | 2024 |
 | [finetuning-and-alignment][fa-repo] | This repository contains demos for finetuning techniques for LLMs focussed on reducing computational cost. | DDP, FSDP, Instruction Tuning, LoRA, DoRA, QLora, Supervised finetuning | 3 | [samsam], [imdb], [Bias-DeBiased] | bootcamp | 2024 |
 | [Prompt Engineering Laboratory][pe-lab-repo] | This repository contains demos for various Prompt Engineering techniques, along with examples for Bias quantification, text classification. | Stereotypical Bias Analysis, Sentiment inference, Finetuning using HF Library, Activation Generation, Train and Test Model for Activations without Prompts, RAG, ABSA, Few shot prompting, Zero shot prompting (Stochastic, Greedy, Likelihood Estimation), Role play prompting, LLM Prompt Summarization, Zero shot and few shot prompt translation, Few shot CoT, Zero shot CoT, Self-Consistent CoT prompting (Zero shot, 5-shot), Balanced Choice of Plausible Alternatives, Bootstrap Ensembling(Generation & MC formulation), Vote Ensembling | 11 | [Crows-pairs][crow-pairs-pe-lab], [sst5][sst5-pe-lab], [czarnowska templates][czar-templ-pe-lab], [cnn_dailymail], [ag_news], [Weather and sports data], [Other] | bootcamp | 2024 |
-| [bias-mitigation-unlearning][bmu-repo] | This repository contains code for the paper [Can Machine Unlearning Reduce Social Bias in Language Models?][bmu-paper] which was published at EMNLP'24 in the Industry track. <br>Authors are Omkar Dige, Diljot Arneja, Tsz Fung Yau, Qixuan Zhang, Mohammad Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak. | PCGU, Task vectors and DPO for Machine Unlearning | 20 | [BBQ][bbq-bmu], [Stereoset][stereoset-bmu], [Link1][link1-bmu], [Link2][link2-bmu] | bootcamp | 2024 |
+| [bias-mitigation-unlearning][bmu-repo] | This repository contains code for the paper [Can Machine Unlearning Reduce Social Bias in Language Models?][bmu-paper] which was published at EMNLP'24 in the Industry track. <br>Authors are Omkar Dige, Diljot Arneja, Tsz Fung Yau, Qixuan Zhang, Mohammad Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak. | PCGU, Task vectors and DPO for Machine Unlearning | 20 | [BBQ][bbq-bmu], [Stereoset][stereoset-bmu], [Link1][link1-bmu], [Link2][link2-bmu] | applied-research | 2024 |
 | [cyclops-workshop][cyclops-repo] | This repository contains demos for using [CyclOps] package for clinical ML evaluation and monitoring. | XGBoost | 1 | [Diabetes 130-US hospitals dataset for years 1999-2008][diabetes-cyclops] | bootcamp | 2024 |
-| [odyssey][odyssey-repo] | This is a library created with research done for the paper [EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records][odyssey-paper] published at ArXiv'24. <br>Authors are Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, Amrit Krishnan. | EHRMamba, XGBoost, Bi-LSTM | 1 | [MIMIC-IV] | bootcamp | 2024 |
+| [odyssey][odyssey-repo] | This is a library created with research done for the paper [EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records][odyssey-paper] published at ArXiv'24. <br>Authors are Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, Amrit Krishnan. | EHRMamba, XGBoost, Bi-LSTM | 1 | [MIMIC-IV] | tool | 2024 |
 | [diffusion-model-bootcamp][diffusion-repo] | This repository contains demos for various diffusion models for tabular and time series data. | TabDDPM, TabSyn, ClavaDDPM, CSDI, TSDiff | 12 | [Physionet Challenge 2012], [wiki2000] | bootcamp | 2024 |
 | [News Media Bias][nmb-repo] | This repository contains code for libraries and experiments to recognise and evaluate bias and fakeness within news media articles via LLMs. | Bias evaluation via LLMs, finetuning and data annotation via LLM for fake news detection, Supervised finetuning for debiasing sentence, NER for biased phrases via LLMS, Evaluate using DeepEval library | 4 | [News Media Bias Full data][nmb-data], [Toxigen], [Nela GT], [Debiaser data] | bootcamp | 2024 |
 | [News Media Bias Plus][nmb-plus-repo] | Continuation of News Media Bias project, this repository contains code for libraries and experiments to collect and annotate data, recognise and evaluate bias and fakeness within news media articles via LLMs and LVMs. | Bias evaluation via LLMs and VLMs, finetuning and data annotation via LLM for fake news detection, supervised finetuning for debiasing sentence, NER for biased entities via LLMS | 2 | [News Media Bias Plus Full Data][nmb-plus-full-data], [NMB Plus Named Entities][nmb-plus-entities] | bootcamp | 2024 |
diff --git a/docs/index.md b/docs/index.md
@@ -15,6 +15,8 @@ hide:
 </div>
 
 <!-- Custom styling for the hero section -->
+
+
 <style>
 .hero-section {
   position: relative;
@@ -99,6 +101,8 @@ hide:
 
 
 
+
+
 <div class="catalog-stats">
   <div class="stat">
     <div class="stat-number">100+</div>
@@ -141,6 +145,10 @@ hide:
 
 
 
+
+
+
+
 
 
 
@@ -204,21 +212,6 @@ hide:
     </div>
     </div>
     <div class="card" markdown>
-    <div class="header">
-        <h3><a href="https://github.com/VectorInstitute/bmu" title="Go to Repository">bias-mitigation-unlearning</a></h3>
-        <span class="tag year-tag">2024</span>
-        <span class="tag type-tag">bootcamp</span>
-    </div>
-    <p>This repository contains code for the paper [Can Machine Unlearning Reduce Social Bias in Language Models?][bmu-paper] which was published at EMNLP'24 in the Industry track. <br>Authors are Omkar Dige, Diljot Arneja, Tsz Fung Yau, Qixuan Zhang, Mohammad Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak.</p>
-    <div class="tag-container">
-        <span class="tag" data-tippy="PCGU">PCGU</span>
-        <span class="tag" data-tippy="Task vectors and DPO for Machine Unlearning">Task vectors and DPO for Machine Unlearning</span>
-    </div>
-    <div class="datasets">
-        <strong>Datasets:</strong> <span class="dataset-tag">BBQ</span> <span class="dataset-tag">bbq-bmu</span>  <span class="dataset-tag">Stereoset</span> <span class="dataset-tag">stereoset-bmu</span>  <span class="dataset-tag">Link1</span> <span class="dataset-tag">link1-bmu</span>  <span class="dataset-tag">Link2</span> <span class="dataset-tag">link2-bmu</span>
-    </div>
-    </div>
-    <div class="card" markdown>
     <div class="header">
         <h3><a href="https://github.com/VectorInstitute/cyclops" title="Go to Repository">cyclops-workshop</a></h3>
         <span class="tag year-tag">2024</span>
@@ -233,22 +226,6 @@ hide:
     </div>
     </div>
     <div class="card" markdown>
-    <div class="header">
-        <h3><a href="https://github.com/VectorInstitute/odyssey" title="Go to Repository">odyssey</a></h3>
-        <span class="tag year-tag">2024</span>
-        <span class="tag type-tag">bootcamp</span>
-    </div>
-    <p>This is a library created with research done for the paper [EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records][odyssey-paper] published at ArXiv'24. <br>Authors are Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, Amrit Krishnan.</p>
-    <div class="tag-container">
-        <span class="tag" data-tippy="EHRMamba">EHRMamba</span>
-        <span class="tag" data-tippy="XGBoost">XGBoost</span>
-        <span class="tag" data-tippy="Bi-LSTM">Bi-LSTM</span>
-    </div>
-    <div class="datasets">
-        <strong>Datasets:</strong> <span class="dataset-tag">MIMIC-IV</span>
-    </div>
-    </div>
-    <div class="card" markdown>
     <div class="header">
         <h3><a href="https://github.com/VectorInstitute/diffusion" title="Go to Repository">diffusion-model-bootcamp</a></h3>
         <span class="tag year-tag">2024</span>
@@ -544,3 +521,46 @@ hide:
 
     </div>
 
+=== "tool"
+
+    <div class="grid cards" markdown>
+    <div class="card" markdown>
+    <div class="header">
+        <h3><a href="https://github.com/VectorInstitute/odyssey" title="Go to Repository">odyssey</a></h3>
+        <span class="tag year-tag">2024</span>
+        <span class="tag type-tag">tool</span>
+    </div>
+    <p>This is a library created with research done for the paper [EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records][odyssey-paper] published at ArXiv'24. <br>Authors are Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, Amrit Krishnan.</p>
+    <div class="tag-container">
+        <span class="tag" data-tippy="EHRMamba">EHRMamba</span>
+        <span class="tag" data-tippy="XGBoost">XGBoost</span>
+        <span class="tag" data-tippy="Bi-LSTM">Bi-LSTM</span>
+    </div>
+    <div class="datasets">
+        <strong>Datasets:</strong> <span class="dataset-tag">MIMIC-IV</span>
+    </div>
+    </div>
+
+    </div>
+
+=== "applied-research"
+
+    <div class="grid cards" markdown>
+    <div class="card" markdown>
+    <div class="header">
+        <h3><a href="https://github.com/VectorInstitute/bmu" title="Go to Repository">bias-mitigation-unlearning</a></h3>
+        <span class="tag year-tag">2024</span>
+        <span class="tag type-tag">applied-research</span>
+    </div>
+    <p>This repository contains code for the paper [Can Machine Unlearning Reduce Social Bias in Language Models?][bmu-paper] which was published at EMNLP'24 in the Industry track. <br>Authors are Omkar Dige, Diljot Arneja, Tsz Fung Yau, Qixuan Zhang, Mohammad Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak.</p>
+    <div class="tag-container">
+        <span class="tag" data-tippy="PCGU">PCGU</span>
+        <span class="tag" data-tippy="Task vectors and DPO for Machine Unlearning">Task vectors and DPO for Machine Unlearning</span>
+    </div>
+    <div class="datasets">
+        <strong>Datasets:</strong> <span class="dataset-tag">BBQ</span> <span class="dataset-tag">bbq-bmu</span>  <span class="dataset-tag">Stereoset</span> <span class="dataset-tag">stereoset-bmu</span>  <span class="dataset-tag">Link1</span> <span class="dataset-tag">link1-bmu</span>  <span class="dataset-tag">Link2</span> <span class="dataset-tag">link2-bmu</span>
+    </div>
+    </div>
+
+    </div>
+
diff --git a/scripts/sync_readme_to_docs.py b/scripts/sync_readme_to_docs.py
@@ -320,9 +320,56 @@ def update_docs_index(implementations_by_type: Dict[str, List[Dict]]) -> None:
 
     original_content = docs_index_path.read_text(encoding="utf-8")
 
-    # Ensure we have CSS for dataset tags and type tags
+    # Ensure we have CSS for dataset tags, type tags, year tags, and hero section
     css_for_tags = """
 <style>
+.hero-section {
+  position: relative;
+  padding: 5rem 4rem;
+  text-align: center;
+  color: white;
+  background-color: var(--md-primary-fg-color);
+  background-image: linear-gradient(rgba(0, 0, 0, 0.35), rgba(0, 0, 0, 0.35)), url('assets/splash.png');
+  background-size: cover;
+  background-position: center;
+  display: flex;
+  justify-content: center;
+  align-items: center;
+  margin: 0;
+  padding: 0;
+  width: 100%;
+  position: relative;
+  min-height: 70vh;
+}
+
+.hero-content {
+  max-width: 800px;
+  z-index: 10;
+}
+
+.hero-content h1 {
+  font-size: 3rem;
+  margin-bottom: 1rem;
+  text-shadow: 0 2px 8px rgba(0,0,0,0.7);
+  font-weight: 600;
+  letter-spacing: 0.5px;
+  color: #ffffff;
+  font-family: 'Roboto', sans-serif;
+}
+
+.hero-content p {
+  font-size: 1.5rem;
+  margin-bottom: 2rem;
+  text-shadow: 0 2px 6px rgba(0,0,0,0.7);
+  max-width: 700px;
+  margin-left: auto;
+  margin-right: auto;
+  line-height: 1.4;
+  color: #f8f8f8;
+  font-family: 'Roboto', sans-serif;
+  font-weight: 300;
+}
+
 .dataset-tag {
   display: inline-block;
   background-color: #6a5acd;
@@ -348,6 +395,12 @@ def update_docs_index(implementations_by_type: Dict[str, List[Dict]]) -> None:
   font-weight: 500;
   white-space: nowrap;
 }
+
+.year-tag {
+  background-color: #eb088a; /* Pink color instead of black */
+  color: white;
+  float: right;
+}
 </style>
 """