|
31 | 31 | </a> |
32 | 32 | </p> |
33 | 33 | <p align="center"> |
34 | | - <a href="https://arxiv.org/pdf/2507.06196"> |
35 | | - <img src="https://img.shields.io/badge/JMLR-Published-blue?style=flat" alt="JMLR Publication"> |
| 34 | + <a href="https://www.jmlr.org/papers/v27/25-1557.html"> |
| 35 | + <img src="https://img.shields.io/badge/JMLR-Published-112467?style=flat&style=for-the-badge&logo=semantic-scholar&logoColor=white" alt="JMLR Publication"> |
36 | 36 | </a> |
37 | 37 | <a href="https://openreview.net/pdf?id=WOFspd4lq5"> |
38 | | - <img src="https://img.shields.io/badge/TMLR-Published-blue?style=flat" alt="TMLR Publication"> |
| 38 | + <img src="https://img.shields.io/badge/TMLR-Published-4FA1CA?style=flat&logo=semantic-scholar&logoColor=white" alt="TMLR Publication"> |
39 | 39 | </a> |
40 | 40 | <a href="https://arxiv.org/abs/2602.17431"> |
41 | | - <img src="https://img.shields.io/badge/arXiv-2602.17431-B31B1B.svg" alt="arXiv"> |
| 41 | + <img src="https://img.shields.io/badge/arXiv-LongTextUQ-B31B1B?logo=arXiv&logoColor=white" alt="arXiv"> |
42 | 42 | </a> |
43 | 43 | </p> |
44 | 44 |
|
| 45 | + |
45 | 46 | UQLM is a Python library for Large Language Model (LLM) hallucination detection using state-of-the-art uncertainty quantification techniques. |
46 | 47 |
|
47 | 48 | ## Installation |
@@ -312,30 +313,45 @@ The examples directory contains tutorials for: |
312 | 313 | Each notebook includes detailed explanations and code samples that you can adapt to your specific use case. |
313 | 314 |
|
314 | 315 | ## Citation |
315 | | -A technical description of the `uqlm` scorers and extensive experimental results are presented in **[this paper](https://arxiv.org/abs/2504.19254)**. If you use our framework or toolkit, please cite: |
| 316 | +A technical description of the `uqlm` scorers and extensive experimental results are presented in **[this paper](https://openreview.net/pdf?id=WOFspd4lq5)**, published in **Transactions on Machine Learning Research (TMLR)**. If you use our framework or toolkit, please cite: |
316 | 317 |
|
317 | 318 | ```bibtex |
318 | | -@misc{bouchard2025uncertaintyquantificationlanguagemodels, |
319 | | - title={Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers}, |
320 | | - author={Dylan Bouchard and Mohit Singh Chauhan}, |
321 | | - year={2025}, |
322 | | - eprint={2504.19254}, |
323 | | - archivePrefix={arXiv}, |
324 | | - primaryClass={cs.CL}, |
325 | | - url={https://arxiv.org/abs/2504.19254}, |
| 319 | +@article{ |
| 320 | +bouchard2025uncertainty, |
| 321 | +title={Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, {LLM} Judge, and Ensemble Scorers}, |
| 322 | +author={Dylan Bouchard and Mohit Singh Chauhan}, |
| 323 | +journal={Transactions on Machine Learning Research}, |
| 324 | +issn={2835-8856}, |
| 325 | +year={2025}, |
| 326 | +url={https://openreview.net/forum?id=WOFspd4lq5}, |
| 327 | +note={} |
326 | 328 | } |
327 | 329 | ``` |
328 | 330 |
|
329 | | -The `uqlm` software package is described in this **[this paper](https://arxiv.org/abs/2507.06196)**. If you use the software, please cite: |
| 331 | +The `uqlm` software package is described in this **[this paper](https://arxiv.org/abs/2507.06196)**, published in the **Journal of Machine Learning Research (JMLR)**. If you use the software, please cite: |
| 332 | + |
| 333 | +```bibtex |
| 334 | +@article{JMLR:v27:25-1557, |
| 335 | + author = {Dylan Bouchard and Mohit Singh Chauhan and David Skarbrevik and Ho-Kyeong Ra and Viren Bajaj and Zeya Ahmad}, |
| 336 | + title = {UQLM: A Python Package for Uncertainty Quantification in Large Language Models}, |
| 337 | + journal = {Journal of Machine Learning Research}, |
| 338 | + year = {2026}, |
| 339 | + volume = {27}, |
| 340 | + number = {13}, |
| 341 | + pages = {1--10}, |
| 342 | + url = {http://jmlr.org/papers/v27/25-1557.html} |
| 343 | +} |
| 344 | +``` |
330 | 345 |
|
| 346 | +The long-text methods and experiment results are described in **this paper**, available as a preprint on arXiv. To cite: |
331 | 347 | ```bibtex |
332 | | -@misc{bouchard2025uqlmpythonpackageuncertainty, |
333 | | - title={UQLM: A Python Package for Uncertainty Quantification in Large Language Models}, |
334 | | - author={Dylan Bouchard and Mohit Singh Chauhan and David Skarbrevik and Ho-Kyeong Ra and Viren Bajaj and Zeya Ahmad}, |
335 | | - year={2025}, |
336 | | - eprint={2507.06196}, |
| 348 | +@misc{bouchard2026finegraineduncertaintyquantificationlongform, |
| 349 | + title={Fine-Grained Uncertainty Quantification for Long-Form Language Model Outputs: A Comparative Study}, |
| 350 | + author={Dylan Bouchard and Mohit Singh Chauhan and Viren Bajaj and David Skarbrevik}, |
| 351 | + year={2026}, |
| 352 | + eprint={2602.17431}, |
337 | 353 | archivePrefix={arXiv}, |
338 | 354 | primaryClass={cs.CL}, |
339 | | - url={https://arxiv.org/abs/2507.06196}, |
| 355 | + url={https://arxiv.org/abs/2602.17431}, |
340 | 356 | } |
341 | 357 | ``` |
0 commit comments