You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/implementation_details.md
+15Lines changed: 15 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,13 +6,28 @@ Each repository in this catalog contains implementations of specific machine lea
6
6
-**Description**: Brief introduction to the repository's purpose and links to relevant research papers
7
7
-**Algorithms**: List of ML algorithms demonstrated in the repository
8
8
-**Datasets**: Information on datasets used, with links to publicly available data
9
+
-**Type**: The category of implementation:
10
+
***bootcamp**: Educational implementations developed for workshops and learning purposes
11
+
***tool**: Utility libraries and tools for broader use
12
+
***applied-research**: Research code tied to specific projects or papers
13
+
-**Year**: The year the implementation was published
9
14
10
15
## Usage Notes
11
16
12
17
!!! note
13
18
- Many repositories contain code for reference purposes only. To run them, updates may be required to the code and environment files.
14
19
- Links for only publicly available datasets are provided. Many datasets used in the repositories are only available on the Vector cluster.
15
20
21
+
## Repository Organization
22
+
23
+
The catalog is organized by implementation type to help you quickly find the resources you need:
24
+
25
+
***Bootcamp implementations**: Educational resources designed for workshops and learning purposes
26
+
***Tool implementations**: Utility libraries and general-purpose tools
27
+
***Applied Research implementations**: Code tied to specific research projects or papers
28
+
29
+
Each implementation includes algorithm tags, dataset information, and other metadata to aid in discovery.
30
+
16
31
## Contributing
17
32
18
33
If you are a Vector researcher or engineer and would like to add your implementation to this catalog, you can contribute by following our [contribution guidelines](https://github.com/VectorInstitute/reference-implementation-catalog/blob/main/CONTRIBUTING.md).
<h3><a href="https://github.com/VectorInstitute/rag" title="Go to Repository">RAG</a></h3>
154
+
<h3><a href="https://github.com/VectorInstitute/rag" title="Go to Repository">rag-bootcamp</a></h3>
126
155
<span class="tag year-tag">2024</span>
156
+
<span class="tag type-tag">bootcamp</span>
127
157
</div>
128
158
<p>This repository contains demos for various Retrieval Augmented Generation techniques using different libraries.</p>
129
159
<div class="tag-container">
@@ -139,8 +169,9 @@ hide:
139
169
</div>
140
170
<div class="card" markdown>
141
171
<div class="header">
142
-
<h3><a href="https://github.com/VectorInstitute/fa" title="Go to Repository">Finetuning and Alignment</a></h3>
172
+
<h3><a href="https://github.com/VectorInstitute/fa" title="Go to Repository">finetuning-and-alignment</a></h3>
143
173
<span class="tag year-tag">2024</span>
174
+
<span class="tag type-tag">bootcamp</span>
144
175
</div>
145
176
<p>This repository contains demos for finetuning techniques for LLMs focussed on reducing computational cost.</p>
146
177
<div class="tag-container">
@@ -158,6 +189,7 @@ hide:
158
189
<div class="header">
159
190
<h3><a href="https://github.com/VectorInstitute/pe-lab" title="Go to Repository">Prompt Engineering Laboratory</a></h3>
160
191
<span class="tag year-tag">2024</span>
192
+
<span class="tag type-tag">bootcamp</span>
161
193
</div>
162
194
<p>This repository contains demos for various Prompt Engineering techniques, along with examples for Bias quantification, text classification.</p>
163
195
<div class="tag-container">
@@ -175,6 +207,7 @@ hide:
175
207
<div class="header">
176
208
<h3><a href="https://github.com/VectorInstitute/bmu" title="Go to Repository">bias-mitigation-unlearning</a></h3>
177
209
<span class="tag year-tag">2024</span>
210
+
<span class="tag type-tag">bootcamp</span>
178
211
</div>
179
212
<p>This repository contains code for the paper [Can Machine Unlearning Reduce Social Bias in Language Models?][bmu-paper] which was published at EMNLP'24 in the Industry track. <br>Authors are Omkar Dige, Diljot Arneja, Tsz Fung Yau, Qixuan Zhang, Mohammad Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak.</p>
180
213
<div class="tag-container">
@@ -189,6 +222,7 @@ hide:
189
222
<div class="header">
190
223
<h3><a href="https://github.com/VectorInstitute/cyclops" title="Go to Repository">cyclops-workshop</a></h3>
191
224
<span class="tag year-tag">2024</span>
225
+
<span class="tag type-tag">bootcamp</span>
192
226
</div>
193
227
<p>This repository contains demos for using [CyclOps] package for clinical ML evaluation and monitoring.</p>
194
228
<div class="tag-container">
@@ -202,6 +236,7 @@ hide:
202
236
<div class="header">
203
237
<h3><a href="https://github.com/VectorInstitute/odyssey" title="Go to Repository">odyssey</a></h3>
204
238
<span class="tag year-tag">2024</span>
239
+
<span class="tag type-tag">bootcamp</span>
205
240
</div>
206
241
<p>This is a library created with research done for the paper [EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records][odyssey-paper] published at ArXiv'24. <br>Authors are Adibvafa Fallahpour, Mahshid Alinoori, Wenqian Ye, Xu Cao, Arash Afkanpour, Amrit Krishnan.</p>
207
242
<div class="tag-container">
@@ -215,8 +250,9 @@ hide:
215
250
</div>
216
251
<div class="card" markdown>
217
252
<div class="header">
218
-
<h3><a href="https://github.com/VectorInstitute/diffusion" title="Go to Repository">Diffusion modelbootcamp</a></h3>
253
+
<h3><a href="https://github.com/VectorInstitute/diffusion" title="Go to Repository">diffusion-model-bootcamp</a></h3>
219
254
<span class="tag year-tag">2024</span>
255
+
<span class="tag type-tag">bootcamp</span>
220
256
</div>
221
257
<p>This repository contains demos for various diffusion models for tabular and time series data.</p>
222
258
<div class="tag-container">
@@ -234,6 +270,7 @@ hide:
234
270
<div class="header">
235
271
<h3><a href="https://github.com/VectorInstitute/nmb" title="Go to Repository">News Media Bias</a></h3>
236
272
<span class="tag year-tag">2024</span>
273
+
<span class="tag type-tag">bootcamp</span>
237
274
</div>
238
275
<p>This repository contains code for libraries and experiments to recognise and evaluate bias and fakeness within news media articles via LLMs.</p>
239
276
<div class="tag-container">
@@ -251,6 +288,7 @@ hide:
251
288
<div class="header">
252
289
<h3><a href="https://github.com/VectorInstitute/nmb-plus" title="Go to Repository">News Media Bias Plus</a></h3>
253
290
<span class="tag year-tag">2024</span>
291
+
<span class="tag type-tag">bootcamp</span>
254
292
</div>
255
293
<p>Continuation of News Media Bias project, this repository contains code for libraries and experiments to collect and annotate data, recognise and evaluate bias and fakeness within news media articles via LLMs and LVMs.</p>
256
294
<div class="tag-container">
@@ -263,16 +301,11 @@ hide:
263
301
<strong>Datasets:</strong> <span class="dataset-tag">News Media Bias Plus Full Data</span> <span class="dataset-tag">nmb-plus-full-data</span> <span class="dataset-tag">NMB Plus Named Entities</span> <span class="dataset-tag">nmb-plus-entities</span>
264
302
</div>
265
303
</div>
266
-
267
-
</div>
268
-
269
-
=== "2023"
270
-
271
-
<div class="grid cards" markdown>
272
304
<div class="card" markdown>
273
305
<div class="header">
274
306
<h3><a href="https://github.com/VectorInstitute/anomaly" title="Go to Repository">Anomaly Detection Project</a></h3>
275
307
<span class="tag year-tag">2023</span>
308
+
<span class="tag type-tag">bootcamp</span>
276
309
</div>
277
310
<p>This repository contains demos for various supervised and unsupervised anomaly detection techniques in domains such as Fraud Detection, Network Intrusion Detection, System Monitoring and image, Video Analysis.</p>
278
311
<div class="tag-container">
@@ -290,6 +323,7 @@ hide:
290
323
<div class="header">
291
324
<h3><a href="https://github.com/VectorInstitute/ssl" title="Go to Repository">SSL Bootcamp</a></h3>
292
325
<span class="tag year-tag">2023</span>
326
+
<span class="tag type-tag">bootcamp</span>
293
327
</div>
294
328
<p>This repository contains demos for self-supervised techniques such as contrastive learning, masked modeling and self distillation.</p>
295
329
<div class="tag-container">
@@ -307,6 +341,7 @@ hide:
307
341
<div class="header">
308
342
<h3><a href="https://github.com/VectorInstitute/ci-lab" title="Go to Repository">Causal Inference Lab</a></h3>
309
343
<span class="tag year-tag">2023</span>
344
+
<span class="tag type-tag">bootcamp</span>
310
345
</div>
311
346
<p>This repository contains code to estimate the causal effects of an intervention on some measurable outcome primarily in the health domain.</p>
312
347
<div class="tag-container">
@@ -324,6 +359,7 @@ hide:
324
359
<div class="header">
325
360
<h3><a href="https://github.com/VectorInstitute/hvaic" title="Go to Repository">HV-Ai-C</a></h3>
326
361
<span class="tag year-tag">2023</span>
362
+
<span class="tag type-tag">bootcamp</span>
327
363
</div>
328
364
<p>This repository implements a Reinforcement Learning agent to optimize energy consumption within Data Centers.</p>
329
365
<div class="tag-container">
@@ -340,6 +376,7 @@ hide:
340
376
<div class="header">
341
377
<h3><a href="https://github.com/VectorInstitute/flex-model" title="Go to Repository">Flex Model</a></h3>
342
378
<span class="tag year-tag">2023</span>
379
+
<span class="tag type-tag">bootcamp</span>
343
380
</div>
344
381
<p>This repository contains code for the paper [FlexModel: A Framework for Interpretability of Distributed Large Language Models][flex-model-paper]. <br> Authors are Matthew Choi, Muhammad Adil Asif, John Willes, David Emerson.</p>
345
382
<div class="tag-container">
@@ -353,6 +390,7 @@ hide:
353
390
<div class="header">
354
391
<h3><a href="https://github.com/VectorInstitute/vbll" title="Go to Repository">VBLL</a></h3>
355
392
<span class="tag year-tag">2023</span>
393
+
<span class="tag type-tag">bootcamp</span>
356
394
</div>
357
395
<p>This repository contains code for the paper [Variational Bayesian Last Layers][vbll-paper]. <br> Authors are James Harrison, John Willes, Jasper Snoek.</p>
<h3><a href="https://github.com/VectorInstitute/recsys" title="Go to Repository">Recommendation Systems</a></h3>
374
406
<span class="tag year-tag">2022</span>
407
+
<span class="tag type-tag">bootcamp</span>
375
408
</div>
376
409
<p>This repository contains demos for various RecSys techniques such as Collaborative Filtering, Knowledge Graph, RL based, Sequence Aware, Session based etc.</p>
377
410
<div class="tag-container">
@@ -389,6 +422,7 @@ hide:
389
422
<div class="header">
390
423
<h3><a href="https://github.com/VectorInstitute/forecasting-dl" title="Go to Repository">Forecasting with Deep Learning</a></h3>
391
424
<span class="tag year-tag">2022</span>
425
+
<span class="tag type-tag">bootcamp</span>
392
426
</div>
393
427
<p>This repository contains demos for a variety of forecasting techniques for Univariate and Multivariate time series, spatiotemporal forecasting etc.</p>
394
428
<div class="tag-container">
@@ -406,6 +440,7 @@ hide:
406
440
<div class="header">
407
441
<h3><a href="https://github.com/VectorInstitute/pe" title="Go to Repository">Prompt Engineering</a></h3>
408
442
<span class="tag year-tag">2022</span>
443
+
<span class="tag type-tag">bootcamp</span>
409
444
</div>
410
445
<p>This repository contains demos for a variety of Prompt Engineering techniques such as fairness measurement via sentiment analysis, finetuning, prompt tuning, prompt ensembling etc.</p>
411
446
<div class="tag-container">
@@ -423,6 +458,7 @@ hide:
423
458
<div class="header">
424
459
<h3><a href="https://github.com/VectorInstitute/naa" title="Go to Repository">NAA</a></h3>
425
460
<span class="tag year-tag">2022</span>
461
+
<span class="tag type-tag">bootcamp</span>
426
462
</div>
427
463
<p>This repository contains code for the paper [Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service Support][naa-paper] published at EMNLP'22 in the industry track. <br> Authors are Stephen Obadinma, Faiza Khan Khattak, Shirley Wang, Tania Sidhorn, Elaine Lau, Sean Robertson, Jingcheng Niu, Winnie Au, Alif Munim, Karthik Raja Kalaiselvi Bhaskar.</p>
<h3><a href="https://github.com/VectorInstitute/pet" title="Go to Repository">Privacy Enhancing Technologies</a></h3>
448
478
<span class="tag year-tag">2021</span>
479
+
<span class="tag type-tag">bootcamp</span>
449
480
</div>
450
481
<p>This repository contains demos for Privacy, Homomorphic Encryption, Horizontal and Vertical Federated Learning, MIA, and PATE.</p>
451
482
<div class="tag-container">
@@ -463,6 +494,7 @@ hide:
463
494
<div class="header">
464
495
<h3><a href="https://github.com/VectorInstitute/ssgvap" title="Go to Repository">SSGVQAP</a></h3>
465
496
<span class="tag year-tag">2021</span>
497
+
<span class="tag type-tag">bootcamp</span>
466
498
</div>
467
499
<p>This repository contains code for the paper [A Smart System to Generate and Validate Question Answer Pairs for COVID-19 Literature][ssgvap-paper] which was accepted in ACL'20. <br> Authors are Rohan Bhambhoria, Luna Feng, Dawn Sepehr, John Chen, Conner Cowling, Sedef Kocak, Elham Dolatabadi.</p>
468
500
<div class="tag-container">
@@ -478,6 +510,7 @@ hide:
478
510
<div class="header">
479
511
<h3><a href="https://github.com/VectorInstitute/fpf" title="Go to Repository">foodprice-forecasting</a></h3>
480
512
<span class="tag year-tag">2021</span>
513
+
<span class="tag type-tag">bootcamp</span>
481
514
</div>
482
515
<p>This repository replicates the experiments described on pages 16 and 17 of the [2022 Edition of Canada's Food Price Report][fpf-paper].</p>
<h3><a href="https://github.com/VectorInstitute/cvp" title="Go to Repository">Computer_Vision_Project</a></h3>
502
529
<span class="tag year-tag">2020</span>
530
+
<span class="tag type-tag">bootcamp</span>
503
531
</div>
504
532
<p>This repository tackles different problems such as defect detection, footprint extraction, road obstacle detection, traffic incident detection, and segmentation of medical procedures.</p>
0 commit comments