Skip to content

Commit a7afbaf

Browse files
authored
Merge pull request #60 from BMSguerra/master
Added UMAP25 papers
2 parents 0bf5f38 + cc18a78 commit a7afbaf

File tree

7 files changed

+118
-0
lines changed

7 files changed

+118
-0
lines changed

_posts/2024-06-16-umap-moscati.md

Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,25 @@
1+
---
2+
layout: post
3+
title: "Familiarizing with Music: Discovery Patterns for Different Music Discovery Needs"
4+
date: 2025-06-16 10:00:00 +0200
5+
category: Publication
6+
author: bmassonisguerra
7+
readtime: 4
8+
people:
9+
- dafchar
10+
- bmassonisguerra
11+
publication_type: conference
12+
publication_title: "Familiarizing with Music: Discovery Patterns for Different Music Discovery Needs"
13+
publication_year: 2025
14+
publication_authors: M. Moscati, D. Afchar, M. schedl, B. Sguerra
15+
publication_conference: "UMAP"
16+
publication_code: "https://github.com/hcai-mms/familiarizing_with_music"
17+
publication_preprint: "https://arxiv.org/abs/2505.03568"
18+
domains:
19+
domains:
20+
- RECSYS
21+
---
22+
23+
Humans have the tendency to discover and explore. This natural tendency is reflected in data from streaming platforms as the amount of previously unknown content accessed by users. Additionally, in domains such as that of music streaming there is evidence that recommending novel content improves users' experience with the platform. Therefore, understanding users' discovery patterns, such as the amount to which and the way users access previously unknown content, is a topic of relevance for both the scientific community and the streaming industry, particularly the music one. Previous works studied how music consumption differs for users of different traits and looked at diversity, novelty, and consistency over time of users' music preferences. However, very little is known about how users discover and explore previously unknown music, and how this behavior differs for users of varying discovery needs. In this paper we bridge this gap by analyzing data from a survey answered by users of the major music streaming platform Deezer in combination with their streaming data. We first address questions regarding whether users who declare a higher interest in unfamiliar music listen to more diverse music, have more stable music preferences over time, and explore more music within a same time window, compared to those who declare a lower interest. We then investigate which type of music tracks users choose to listen to when they explore unfamiliar music, identifying clear patterns of popularity and genre representativeness that vary for users of different discovery needs.
24+
25+
Our findings open up possibilities to infer users' interest in unfamiliar music from streaming data as well as possibilities to develop recommender systems that guide users in exploring music in a more natural way.

_posts/2024-06-16-umap-sguerra.md

Lines changed: 63 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,63 @@
1+
---
2+
layout: post
3+
title: "Uncertainty in Repeated Implicit Feedback as a Measure of Reliability"
4+
date: 2025-06-16 10:00:00 +0200
5+
category: Publication
6+
author: bmassonisguerra
7+
readtime: 6
8+
people:
9+
- bmassonisguerra
10+
- vatran
11+
- rhennequin
12+
- mmoussallam
13+
publication_type: conference
14+
publication_title: "Uncertainty in Repeated Implicit Feedback as a Measure of Reliability"
15+
publication_year: 2025
16+
publication_authors: B. Sguerra, V. Tran, R. Hennequin, M. Moussallam
17+
publication_conference: "UMAP"
18+
publication_code: "https://github.com/deezer/uncertainty_feedback"
19+
publication_preprint: "https://arxiv.org/pdf/2505.02492"
20+
domains:
21+
domains:
22+
- RECSYS
23+
---
24+
25+
In recommender systems, feedback usually comes in two forms. Explicit feedback is when users directly state their preferences, for example through ratings or reviews. Implicit feedback is inferred from their actions. At Deezer, most of the feedback users leave behind is implicit: listening to a song counts as positive, while skipping it counts as negative. A skip usually means dropping the track before the 30-second mark. The challenge is that implicit feedback is noisy, which makes it harder to build accurate models of user preferences.
26+
27+
One particularity about music consumption is repetition: people repeat songs a lot. Compared to other media like movies or books, repetition in music is off the charts. And every time we hit replay, different things can happen in our brains. Sometimes a few listens are enough to make a track "click". Psychologists call this the mere exposure effect, the idea that familiarity makes us like something more. That is why a song you hear in a TikTok or at the supermarket can suddenly become your new favorite. But there is also a downside: play a song too much and you eventually get bored and start dropping it.
28+
29+
This brings us to the main idea of our work: can these repetitive listening patterns help us measure uncertainty in user feedback? In other words, can these patterns reveal which actions reflect real interest and which are just noise?
30+
31+
When we look at how people repeat songs, both in terms of the number of times they listen (play count) and the time between listens, we start to see some trends. For example, the likelihood of replaying a song (that someone ends up enjoying it) grows up to around ten plays, then slowly drops as overexposure sets in. Timing matters too: after about a day, people are very likely to replay the same track, often because they are in the same context as the day before, such as working, commuting, or doing sports. These patterns create areas in the play count x time interval space where listening behavior is more predictable than in others.
32+
33+
To capture this, we use a simple Bayesian model that quantifies uncertainty in user feedback. Each interaction is represented by a pair (play count x time interval), and the listening tendency is modeled with a Beta distribution. The Bayesian approach has two big advantages: it allows us to incorporate prior beliefs and it naturally provides a measure of uncertainty after updating with data.
34+
35+
As we collect more interactions for a given pair, the prior distribution gets updated into a posterior distribution. This posterior reflects both prior knowledge and the observed data. From it, we can estimate how likely users are to listen, and just as importantly, how confident we are in that estimate. We use the highest density interval (HDI) to capture the range where most of the probability mass lies, giving us a direct measure of certainty.
36+
37+
<div class="publication-illustration">
38+
<img
39+
style="width: 80%;"
40+
src="{{ '/static/images/publis/sguerra25umap/uncertainty1.jpg' | prepend: site.url }}"
41+
alt="Overview of the methodology."/>
42+
</div>
43+
44+
Practically, we split the space into discrete play count x time interval bins, each starting with a weak prior. As user data comes in, we update these priors to get posterior distributions across the whole space.
45+
46+
<div class="publication-illustration">
47+
<img
48+
style="width: 80%;"
49+
src="{{ '/static/images/publis/sguerra25umap/uncertainty2.jpg' | prepend: site.url }}"
50+
alt="Overview of the methodology."/>
51+
</div>
52+
53+
The results are intuitive. For example a user replaying a track for the ninth time with a one-day gap shows a strong, repeatable pattern, while listening to a track forty times with very long gaps is much less consistent. The posteriors capture this difference, allowing us to pinpoint reliable signals from noisy outliers.
54+
55+
<div class="publication-illustration">
56+
<img
57+
style="width: 80%;"
58+
src="{{ '/static/images/publis/sguerra25umap/uncertainty3.jpg' | prepend: site.url }}"
59+
alt="Overview of the methodology."/>
60+
</div>
61+
62+
By weighting interactions according to their uncertainty, we can downplay the noisy ones and let the recommender system focus on the interactions that really matter. In practice, this means paying more attention to reliable signals and less to noisy ones. And when we tested this approach on two different datasets, the outcome was clear: explicitly modeling uncertainty in feedback led to better recommendation performance.
63+

_posts/2025-06-16-umap-marey.md

Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
---
2+
layout: post
3+
title: "Modeling Musical Genre Trajectories through Pathlet Learning"
4+
date: 2025-06-16 10:00:00 +0200
5+
category: Publication
6+
author: lmarey
7+
readtime: 4
8+
people:
9+
- lmarey
10+
- bmassonisguerra
11+
- mmoussallam
12+
publication_type: conference
13+
publication_title: "Modeling Musical Genre Trajectories through Pathlet Learning"
14+
publication_year: 2025
15+
publication_authors: L. Marey, C. Laclau, B. Sguerra, T. Viard, M. Moussallam
16+
publication_conference: "EXUM: Adjunct to UMAP"
17+
publication_code: "https://github.com/lilianmarey/music_pathlets"
18+
publication_preprint: "https://telecom-paris.hal.science/hal-05120696/"
19+
domains:
20+
- RECSYS
21+
---
22+
23+
The increasing availability of user data on music streaming platforms opens up new possibilities for analyzing music consumption. However, understanding the evolution of user preferences remains a complex challenge, particularly as their musical tastes change over time. This paper uses the dictionary learning paradigm to model user trajectories across different musical genres. We define a new framework that captures recurring patterns in genre trajectories, called pathlets, enabling the creation of comprehensible trajectory embeddings. We show that pathlet learning reveals relevant listening patterns that can be analyzed both qualitatively and quantitatively. This work improves our understanding of users’ interactions with music and opens up avenues of research into user behavior and fostering diversity in recommender systems. A dataset of 2000 user histories tagged by genre over 17 months, supplied by Deezer (a leading music streaming company), is also released with the code.
24+
25+
<div class="publication-illustration">
26+
<img
27+
style="width: 95%;"
28+
src="{{ '/static/images/publis/marey25umap/pathlets.jpg' | prepend: site.url }}"
29+
alt="Overview of the methodology."/>
30+
</div>
1.79 MB
Loading
1.83 MB
Loading
2.19 MB
Loading
3.07 MB
Loading

0 commit comments

Comments
 (0)