You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+12Lines changed: 12 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,10 +7,22 @@ We will use Kaggle's "Top Spotify Songs in 73 Countries (Daily Updated)" dataset
7
7
- Do countries prefer explicit music or non explicit music?
8
8
9
9
**Setup/usage instructions**
10
+
<<<<<<< HEAD
10
11
Follow these steps:
11
12
1. clone the repository,
12
13
2. activate the virtual environment (for Windows .venv2\Scripts\activate),
13
14
3. install the requirements (pip install -r reqiurements.txt),
14
15
4. copy your BigQuery key into the secrets file and save it (remember not to commit the secrets file),
15
16
5. run the main dashboard page (streamlit run Spotify_Dashboard.py) with Streamlit.
16
17
18
+
=======
19
+
Please use Python 3.12.2 or older
20
+
21
+
Follow these steps:
22
+
1. clone the repository
23
+
2. create and activate the virtual environment (depending on your computer and Windows version you can use a variation of: (1) python -m venv .venv for creation and (2) .venv\Scripts\activate for activation)
24
+
3. Install the requirements (pip install -r requirements.txt). Please exercise patience. It takes a really long time
25
+
5=4. run the main dashboard page (streamlit run Spotify_Dashboard.py) with Streamlit.
st.write("We will use Kaggle's \"Top Spotify Songs in 73 Countries (Daily Updated)\" dataset. This dataset is updated daily to include the top songs and artists streamed across the world.")
9
+
st.markdown("[Link to dataset](https://www.kaggle.com/datasets/asaniczka/top-spotify-songs-in-73-countries-daily-updated?resource=download)")
10
+
11
+
# Research Questions
12
+
st.header("What are your research question(s)?")
13
+
st.write("We are interested in answering the following questions:")
14
+
st.markdown("""
15
+
- Which genres of music and artists are most popular Italy and the US?
16
+
- What is the relationship between song features (speechiness, explicitness, etc) and their popularity?
17
+
- Do countries tend to listen to domestic music (i.e. music produced in the user's country or sung in the home language) more than foreign music?
18
+
- Which songs/artists are most popular and when do they start to lose popularity?
19
+
20
+
We integrated Roberto's feedback and eliminated "which country listens to the most music". We also slightly modified the wording when it comes to song features. Lastly, we decided to focus on just the US (Alexa's home) and Italy (Giulio's home)
21
+
""")
22
+
23
+
# Notebook Link
24
+
st.header("What's the link to your notebook?")
25
+
st.markdown("[Link to our notebook](https://colab.research.google.com/drive/1H0l4hN8gyangmzVwuEojwT8GAi1kykyc?usp=sharing)")
26
+
27
+
# Target Visualization
28
+
st.header("What's your target visualization?")
29
+
st.write("Depending on which research question we choose, we have several ideas for how to graph this data (such as a choropleth map of artist popularity across the world or a word cluster chart that shows the name of the most popular artists globally).")
30
+
st.write("If we choose to do a line graph of song/artist popularity over time (where each line represents one song or artist), then our target visualization could look like this:")
31
+
32
+
# Known Unknowns
33
+
st.header("What are your known unknowns?")
34
+
st.markdown("""
35
+
- The dataset specifies that it pulls data from 73 countries -- are these countries biased towards certain continents?
36
+
- Are there inconsistencies with how the song/artist titles are written (meaning all upper case/lower case, special characters, etc.) that may make it difficult to properly aggregate the frequency for individual songs?
37
+
""")
38
+
39
+
# Anticipated Challenges
40
+
st.header("What challenges do you anticipate?")
41
+
st.markdown("""
42
+
- If we choose to do a geographic representation of our data, then linking country codes could present a challenge
43
+
- Making sure that the data is written uniformly may be time-consuming
44
+
- Normalizing the rankings of each song/artist between differently sized countries to better understand a song's popularity
st.write("Thanks for stopping by our dashboard! This app uses Kaggle's \"Top Spotify Songs in 73 Countries (Daily Updated)\" dataset to analyze music trends in Italy (Giulio's patria) and the US (Alexa's home). Hope you enjoy!")
144
+
st.markdown("[Link to dataset](https://www.kaggle.com/datasets/asaniczka/top-spotify-songs-in-73-countries-daily-updated?resource=download)")
145
+
146
+
#creating ability to choose which country to look at
0 commit comments