@@ -32,42 +32,47 @@ Data Valuation is the task of estimating the intrinsic value of a data point
 wrt. the training set, the model and a scoring function. We currently implement
 methods from the following papers:
 
-- Ghorbani, Amirata, and James Zou.
-  [Data Shapley: Equitable Valuation of Data for Machine Learning](http://proceedings.mlr.press/v97/ghorbani19c.html).
-  In International Conference on Machine Learning, 2242–51. PMLR, 2019.
+- Castro, Javier, Daniel Gómez, and Juan Tejada. [Polynomial Calculation of the
+  Shapley Value Based on Sampling](https://doi.org/10.1016/j.cor.2008.04.004).
+  Computers & Operations Research, Selected papers presented at the Tenth
+  International Symposium on Locational Decisions (ISOLDE X), 36, no. 5 (May 1,
+  2009): 1726–30.
+- Ghorbani, Amirata, and James Zou. [Data Shapley: Equitable Valuation of Data
+  for Machine Learning](http://proceedings.mlr.press/v97/ghorbani19c.html). In
+  International Conference on Machine Learning, 2242–51. PMLR, 2019.
 - Wang, Tianhao, Yu Yang, and Ruoxi Jia.
-  [Improving Cooperative Game Theory-Based Data Valuation via Data Utility Learning](https://doi.org/10.48550/arXiv.2107.06336).
-  arXiv, 2022.
-- Jia, Ruoxi, David Dao, Boxin Wang, Frances Ann Hubis, Nezihe Merve Gurel, Bo Li,
-  Ce Zhang, Costas Spanos, and Dawn Song.
-  [Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms](https://doi.org/10.14778/3342263.3342637).
+  [Improving Cooperative Game Theory-Based Data Valuation via Data Utility
+  Learning](https://doi.org/10.48550/arXiv.2107.06336). arXiv, 2022.
+- Jia, Ruoxi, David Dao, Boxin Wang, Frances Ann Hubis, Nezihe Merve Gurel, Bo
+  Li, Ce Zhang, Costas Spanos, and Dawn Song. [Efficient Task-Specific Data
+  Valuation for Nearest Neighbor Algorithms](https://doi.org/10.14778/3342263.3342637).
   Proceedings of the VLDB Endowment 12, no. 11 (1 July 2019): 1610–23.
-- Okhrati, Ramin, and Aldo Lipani.
-  [A Multilinear Sampling Algorithm to Estimate Shapley Values](https://doi.org/10.1109/ICPR48806.2021.9412511).
-  In 25th International Conference on Pattern Recognition (ICPR 2020), 7992–99.
-  IEEE, 2021.
-- Yan, T., & Procaccia, A. D.
-  [If You Like Shapley Then You’ll Love the Core]().
-  Proceedings of the AAAI Conference on Artificial Intelligence, 35(6) (2021): 5751-5759.
+- Okhrati, Ramin, and Aldo Lipani. [A Multilinear Sampling Algorithm to Estimate
+  Shapley Values](https://doi.org/10.1109/ICPR48806.2021.9412511). In 25th
+  International Conference on Pattern Recognition (ICPR 2020), 7992–99. IEEE,
+  2021.
+- Yan, T., & Procaccia, A. D. [If You Like Shapley Then You’ll Love the
+  Core](https://ojs.aaai.org/index.php/AAAI/article/view/16721). Proceedings of
+  the AAAI Conference on Artificial Intelligence, 35(6) (2021): 5751-5759.
 - Jia, Ruoxi, David Dao, Boxin Wang, Frances Ann Hubis, Nick Hynes, Nezihe Merve
-  Gürel, Bo Li, Ce Zhang, Dawn Song, and Costas J. Spanos.
-  [Towards Efficient Data Valuation Based on the Shapley Value](http://proceedings.mlr.press/v89/jia19a.html).
+  Gürel, Bo Li, Ce Zhang, Dawn Song, and Costas J. Spanos. [Towards Efficient
+  Data Valuation Based on the Shapley Value](http://proceedings.mlr.press/v89/jia19a.html).
   In 22nd International Conference on Artificial Intelligence and Statistics,
   1167–76. PMLR, 2019.
-- Wang, Jiachen T., and Ruoxi Jia.
-  [Data Banzhaf: A Robust Data Valuation Framework for Machine Learning](https://doi.org/10.48550/arXiv.2205.15466).
+- Wang, Jiachen T., and Ruoxi Jia. [Data Banzhaf: A Robust Data Valuation
+  Framework for Machine Learning](https://doi.org/10.48550/arXiv.2205.15466).
   arXiv, October 22, 2022.
-- Kwon, Yongchan, and James Zou.
-  [Beta Shapley: A Unified and Noise-Reduced Data Valuation Framework for Machine Learning](http://arxiv.org/abs/2110.14049).
+- Kwon, Yongchan, and James Zou. [Beta Shapley: A Unified and Noise-Reduced Data
+  Valuation Framework for Machine Learning](http://arxiv.org/abs/2110.14049).
   In Proceedings of the 25th International Conference on Artificial Intelligence
   and Statistics (AISTATS) 2022, Vol. 151. Valencia, Spain: PMLR, 2022.
 
 Influence Functions compute the effect that single points have on an estimator /
 model. We implement methods from the following papers:
 
-- Koh, Pang Wei, and Percy Liang.
-  [Understanding Black-Box Predictions via Influence Functions](http://proceedings.mlr.press/v70/koh17a.html).
-  In Proceedings of the 34th International Conference on Machine Learning,
+- Koh, Pang Wei, and Percy Liang. [Understanding Black-Box Predictions via
+  Influence Functions](http://proceedings.mlr.press/v70/koh17a.html). In
+  Proceedings of the 34th International Conference on Machine Learning,
   70:1885–94. Sydney, Australia: PMLR, 2017.
 
 # Installation