On sampled metrics for item recommendation

23/08/2020

On sampled metrics for item recommendation

Walid Krichene, Steffen Rendle

Keywords: item recommendation, sampled metric, evaluation, metrics

Abstract Paper Similar Papers

Abstract: The task of item recommendation requires ranking a large catalogue of items given a context. Item recommendation algorithms are evaluated using ranking metrics that depend on the positions of relevant items. To speed up the computation of metrics, recent work often uses sampled metrics where only a smaller set of random items and the relevant items are ranked. This paper investigates sampled metrics in more detail and shows that they are inconsistent with their exact version, in the sense that they do not persist relative statements, e.g., recommender A is better than B, not even in expectation. Moreover, the smaller the sampling size, the less difference there is between metrics, and for very small sampling size, all metrics collapse to the AUC metric. We show that it is possible to improve the quality of the sampled metrics by applying a correction, obtained by minimizing different criteria such as bias or mean squared error. We conclude with an empirical evaluation of the naive sampled metrics and their corrected variants. To summarize, our work suggests that sampling should be avoided for metric calculation, however if an experimental study needs to sample, the proposed corrections can improve the quality of the estimate.

The video of this talk cannot be embedded. You can watch it here:

https://dl.acm.org/doi/10.1145/3394486.3403226#sec-supp

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at KDD 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

19/08/2021

On Sampled Metrics for Item Recommendation (Extended Abstract)

Walid Krichene, Steffen Rendle

Keywords Paper

Machine Learning, Recommender Systems

0

0

0

0

15:39

06/12/2020

Sampling-Decomposable Generative Adversarial Recommender

Binbin Jin, Defu Lian, Zheng Liu and
Qi Liu, Jianhui Ma, Xing Xie, Enhong Chen

Keywords Paper

0

0

0

0

3:17

06/12/2021

The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations

Peter Hase, Harry Xie, Mohit Bansal

Keywords Paper

machine learning, interpretability

0

0

0

0

15:05

02/02/2021

Hierarchical Negative Binomial Factorization for Recommender Systems on Implicit Feedback

Li-Yen Kuo, Ming-Syan Chen

Keywords Paper

0

0

0

0

18:19

06/12/2021

Realistic evaluation of transductive few-shot learning

Olivier Veilleux, Malik Boudiaf, Pablo Piantanida, Ismail Ben Ayed

Keywords Paper

optimization, machine learning, few shot learning

0

0

0

0

10:21

26/08/2020

Balanced Off-Policy Evaluation in General Action Spaces

Arjun Sondhi, David Arbour, Drew Dimmery

Keywords Paper

0

0

0

0

12:36

18/07/2021

Fair Selective Classification Via Sufficiency

Joshua Lee, Yuheng Bu, Deepta Rajan and
Prasanna Sattigeri, Rameswar Panda, Subhro Das, Gregory Wornell

Keywords Paper

Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

0

0

0

0

18:20

26/04/2020

SUMO: Unbiased Estimation of Log Marginal Probability for Latent Variable Models

Yucen Luo, Alex Beatson, Mohammad Norouzi and
Jun Zhu, David Duvenaud, Ryan P. Adams, Ricky T. Q. Chen

Keywords Paper

0

0

0

0

5:14

03/05/2021

No Cost Likelihood Manipulation at Test Time for Making Better Mistakes in Deep Networks

Shyamgopal Karthik, Ameya Prabhu, Puneet Dokania, Vineet Gandhi

Keywords Paper

Conditional Risk Minimization, Hierarchy-Aware Classification, Post-Hoc Correction

0

0

0

0

4:53

06/12/2021

On Component Interactions in Two-Stage Recommender Systems

Jiri Hron, Karl Krauth, Michael Jordan, Niki Kilbertus

Keywords Paper

bandits

0

0

0

0

13:25

26/08/2020

A Framework for Sample Efficient Interval Estimation with Control Variates

Shengjia Zhao, Christopher Yeh, Stefano Ermon

Keywords Paper

0

0

0

0

12:01

13/04/2021

Comparing the value of labeled and unlabeled data in method-of-moments latent variable estimation

Mayee Chen, Benjamin Cohen-Wang, Stephen Mussmann and
Frederic Sala, Christopher Re

Keywords Paper

0

0

0

0

3:04

06/12/2020

Exemplar Guided Active Learning

Jason Hartford, Kevin Leyton-Brown, Hadas Raviv and
Dan Padnos, Shahar Lev, Barak Lenz

Keywords Paper

Algorithms -> Multitask and Transfer Learning; Algorithms -> Representation Learning; Algorithms -> Semi-Supervised Learning; A, Algorithms -> Unsupervised Learning

0

0

0

0

3:23

06/12/2021

Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism

Paria Rashidinejad, Banghua Zhu, Cong Ma and
Jiantao Jiao, Stuart Russell

Keywords Paper

theory, reinforcement learning and planning, bandits

0

0

0

0

12:21

02/02/2021

Agreement-Discrepancy-Selection: Active Learning with Progressive Distribution Alignment

Mengying Fu, Tianning Yuan, Fang Wan and
Songcen Xu, Qixiang Ye

Keywords Paper

0

0

0

0

15:06

06/12/2021

Self-Diagnosing GAN: Diagnosing Underrepresented Samples in Generative Adversarial Networks

Jinhee Lee, Haeri Kim, Youngkyu Hong, Hye Won Chung

Keywords Paper

generative model

0

0

0

0

10:35

19/04/2021

We need to talk about random splits

Anders Søgaard, Sebastian Ebert, Jasmijn Bastings, Katja Filippova

Keywords Paper

0

0

0

0

7:49

19/08/2021

Likelihood-free Out-of-Distribution Detection with Invertible Generative Models

Amirhossein Ahmadian, Fredrik Lindsten

Keywords Paper

Machine Learning, Deep Learning, Uncertainty Representations, Anomaly/Outlier Detection

0

0

0

0

15:18

26/08/2020

Semi-Modular Inference: enhanced learning in multi-modular models by tempering the influence of components

Christian Carmona, Geoff Nicholls

Keywords Paper

0

0

0

0

14:59

04/07/2020

Investigating Word-Class Distributions in Word Vector Spaces

Ryohei Sasano, Anna Korhonen

Keywords Paper

modeling distribution, centroid-based model, discriminative models, Word-Class Distributions

0

0

0

0

11:53

19/08/2021

Improved Guarantees and a Multiple-descent Curve for Column Subset Selection and the Nystrom Method (Extended Abstract)

Michał Dereziński, Rajiv Khanna, Michael W. Mahoney

Keywords Paper

Machine Learning, Dimensionality Reduction, Explainable/Interpretable Machine Learning, Kernel Methods, Unsupervised Learning

0

0

0

0

13:48

18/07/2021

Online A-Optimal Design and Active Linear Regression

Xavier Fontaine, Pierre Perrault, Michal Valko, Vianney Perchet

Keywords Paper

Algorithms, Online Learning Algorithms

0

0

0

0

5:21

06/12/2021

Differentiable Annealed Importance Sampling and the Perils of Gradient Noise

Guodong Zhang, Kyle Hsu, Jianing Li and
Chelsea Finn, Roger Grosse

Keywords Paper

optimization, generative model

0

0

0

0

15:30

06/12/2020

Generalized Hindsight for Reinforcement Learning

Alex Li, Lerrel Pinto, Pieter Abbeel

Keywords Paper

0

0

0

0

3:20

14/06/2020

Revisiting Saliency Metrics: Farthest-Neighbor Area Under Curve

Sen Jia, Neil D. B. Bruce

Keywords Paper

visual saliency, saliency metric, center bias, area under curve

0

0

0

0

4:50

03/08/2020

On Counterfactual Explanations under Predictive Multiplicity

Martin Pawelczyk, Klaus Broelemann, Gjergji. Kasneci

Keywords Paper

0

0

0

0

8:03

18/07/2021

Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient

Botao Hao, Yaqi Duan, Tor Lattimore and
Csaba Szepesvari, Mengdi Wang

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:20

02/02/2021

Preference Elicitation as Average-Case Sorting

Dominik Peters, Ariel D. Procaccia

Keywords Paper

0

0

0

0

17:10

13/04/2021

Automatic differentiation variational inference with mixtures

Warren Morningstar, Sharad Vikram, Cusuh Ham and
Andrew Gallagher, Joshua Dillon

Keywords Paper

0

0

0

0

3:05

04/07/2020

Is Your Classifier Actually Biased? Measuring Fairness under Uncertainty with Bernstein Bounds

Kawin Ethayarajh

Keywords Paper

Classifier, Bernstein Bounds, classifiers, co-reference system

0

0

0

0

6:57

13/04/2021

Improving adversarial robustness via unlabeled out-of-domain data

Zhun Deng, Linjun Zhang, Amirata Ghorbani, James Zou

Keywords Paper

0

0

0

0

3:01

14/06/2020

Effectively Unbiased FID and Inception Score and Where to Find Them

Min Jin Chong, David Forsyth

Keywords Paper

fid, inception score, evaluation, generative models, gans, sobol sequence

0

0

0

0

1:01

12/07/2020

Doubly robust off-policy evaluation with shrinkage

Yi Su, Maria Dimakopoulou, Akshay Krishnamurthy, Miroslav Dudik

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

0

0

15:08

02/02/2021

Tripartite Collaborative Filtering with Observability and Selection for Debiasing Rating Estimation on Missing-Not-at-Random Data

Qi Zhang, Longbing Cao, Chongyang Shi, Liang Hu

Keywords Paper

0

0

0

0

17:01

09/07/2020

Robust causal inference under covariate shift via worst-case subpopulation treatment effects

Sookyo Jeong, Hongseok Namkoong

Keywords Paper

Privacy, fairness, causal inference

0

0

0

0

15:40

06/12/2021

Evaluating model performance under worst-case subpopulations

Mike Li, Hongseok Namkoong, Shangzhou Xia

Keywords Paper

robustness, fairness

0

0

0

0

5:45

03/05/2021

Contemplating Real-World Object Classification

Ali Borji

Keywords Paper

Robustness, object recognition, deep learning, ObjectNet

0

0

0

0

5:12

26/08/2020

Regularized Autoencoders via Relaxed Injective Probability Flow

Abhishek Kumar, Ben Poole, Kevin Murphy

Keywords Paper

0

0

0

0

14:03

06/12/2020

On ranking via sorting by estimated expected utility

Clement Calauzenes, Nicolas Usunier

Keywords Paper

Optimization -> Convex Optimization, Optimization -> Stochastic Optimization

0

0

0

0

3:23

02/02/2021

A Few Queries Go a Long Way: Information-Distortion Tradeoffs in Matching

Georgios Amanatidis, Georgios Birmpas, Aris Filos-Ratsikas, Alexandros A. Voudouris

Keywords Paper

0

0

0

0

18:01