-
Notifications
You must be signed in to change notification settings - Fork 3
Rohan's Notepad
Rohan Sen edited this page May 23, 2025
·
5 revisions
My initial work would be researching and creating an evaluation model for the FIM and also Researching how to efficiently stop AI hallucinations
- Syntax-Aware Fill-in-the-Middle (SAFIM): https://github.com/gonglinyuan/safim
- Iterating over an answer and finding optimal code (might help in research about AI hallucination): https://www.linkedin.com/posts/stasbel_google-deepmind-just-dropped-a-bombshell-activity-7329140225190834176-M57l
- TruthfulQA (OpenAI): https://arxiv.org/abs/2109.07958
- A good ready-made evaluation tool: https://github.com/confident-ai/deepeval