Alan Lupatini António Galvão Carmelina MBesso Rui Parreira
We are newly employed data analysts in the Customer Experience (CX) team at Vanguard, the US-based investment management company, thrown straight into the deep end with our first task. Before our arrival, the team launched an exciting digital experiment, and now they are eagerly waiting to uncover the results and need our help!
The team ran an A/B test from 3/15/2017 to 6/20/2017.
Control Group: Clients interacted with Vanguard's traditional online process. Test Group: Clients experienced the new, spruced-up digital interface. Both groups navigated an identical process sequence: an initial page, three subsequent steps, and finally a confirmation page signaling process completion. The goal is to see whether the new design leads to a better user experience and higher process completion rates.
-
Objective: The digital world is evolving, and so are Vanguard’s clients. Vanguard believed that a more intuitive and modern User Interface (UI), coupled with timely in-context prompts (cues, messages, hints, or instructions provided to users directly within the context of their current task or action), could make the online process smoother for clients. The critical question was: would these changes encourage more clients to complete the process?
-
Dataset:
Client Profiles (df_final_demo): Demographics like age, gender, and account details of our clients. Digital Footprints (df_final_web_data): A detailed trace of client interactions online, divided into two parts: pt_1 and pt_2. Experiment Roster (df_final_experiment_clients): A list revealing which clients were part of the grand experiment.
Dataset Features: The first dataset (df_final_experiment_clients) contains client_id and variation, which tells us whether a client is in the Test or Control group. The second dataset (df_final_web_data) contains client_id, visitor_id, process_step, and date_time. The third dataset (df_final_demo) contains the demographics: client_id, clnt_tenure_yr, clnt_tenure_mnth, clnt_age, gendr, num_accts, bal, calls_6_mnth, logons_6_mnth.
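As a quick sketch of how the experiment roster can be inspected, assuming the column names above (the sample rows below are made up; the real data comes from the project CSVs):

```python
import pandas as pd

# Hypothetical sample of df_final_experiment_clients (real data comes from the CSV).
df_exp = pd.DataFrame({
    "client_id": [1001, 1002, 1003, 1004],
    "variation": ["Test", "Control", "Test", None],
})

# Clients with a missing variation were not part of the experiment.
df_exp = df_exp.dropna(subset=["variation"])

# How many clients fall into each group?
print(df_exp["variation"].value_counts())
```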
The results from the analysis can be found in the presentation slides of the project:
Tableau Public https://public.tableau.com/app/profile/carmelina.mbesso/viz/Kpisdashboard_17678791075060/STORY?publish=yes
- Python 3
- Pandas (data manipulation)
- NumPy (numerical operations)
- Matplotlib, Seaborn (visualization)
- Trello (project management)
- GitHub (collaboration)
- MySQL (data manipulation)
- drawDB (ERD visualization)
- Tableau (visualization)
- Google Slides (Presentation)
On Days 1 and 2, the team set up the environment, the repository, and the Trello board. We merged and cleaned the datasets for exploration, analyzed the columns, and made decisions such as keeping ages as floats and keeping "unknown" values in the gender column.
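The merge step can be sketched like this, with made-up sample rows standing in for the real datasets (column names as listed above):

```python
import pandas as pd

# Made-up samples standing in for the real datasets.
web_pt1 = pd.DataFrame({"client_id": [1, 2], "process_step": ["start", "step_1"],
                        "date_time": ["2017-04-01 10:00:00", "2017-04-01 10:05:00"]})
web_pt2 = pd.DataFrame({"client_id": [3], "process_step": ["confirm"],
                        "date_time": ["2017-04-02 09:00:00"]})
demo = pd.DataFrame({"client_id": [1, 2, 3], "clnt_age": [34.5, 61.0, 28.0],
                     "gendr": ["M", "U", "F"]})
roster = pd.DataFrame({"client_id": [1, 2, 3], "variation": ["Test", "Control", "Test"]})

# Stack the two web-data parts, then attach demographics and group labels.
web = pd.concat([web_pt1, web_pt2], ignore_index=True)
web["date_time"] = pd.to_datetime(web["date_time"])
full = (web.merge(demo, on="client_id", how="left")
           .merge(roster, on="client_id", how="left"))

# Ages stay as floats; "U" (unknown) gender values are kept rather than dropped.
print(full[["client_id", "process_step", "clnt_age", "gendr", "variation"]])
```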
We organized the tasks for the whole project. https://trello.com/b/q2e3vDqd/mon-tableau-trello
https://github.com/alanlupatini/targaryan/tree/main We created the repository and defined collaboration access for all group members. We use branching and merging so that all files stay up to date, and we have practiced working collaboratively on the repository.
We set out to answer: Who are the primary clients using this online process? Are they younger or older, new or long-standing? We carried out a client behaviour analysis to answer these and any other relevant questions.
Explore datasets (EDA). We drew an ERD diagram for the datasets: https://www.drawdb.app/editor?shareId=ab97aa8aec0f25c0ad4d3e7055352ad2
- Open datasets in Python
- Check shape & columns
- Check missing values
- Check duplicates
- Save observations in notebook
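The checks above can be sketched as follows (with a toy DataFrame standing in for the real data, deliberately containing a missing value and a duplicated row):

```python
import pandas as pd

# Toy data with missing values and a duplicated row, to exercise the checks.
df = pd.DataFrame({
    "client_id": [1, 2, 2, 4],
    "clnt_age": [34.5, None, None, 61.0],
})

print(df.shape)                # (rows, columns)
print(df.columns.tolist())     # column names
print(df.isna().sum())         # missing values per column
print(df.duplicated().sum())   # number of fully duplicated rows
```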
We used at least completion rate, time spent on each step, and error rate, plus any other KPIs we found relevant. Completion Rate: the proportion of users who reach the final 'confirm' step. Time Spent on Each Step: the average duration users spend on each step. Error Rate: moving from a later step back to an earlier one counts as an error, since it may indicate confusion. The question to answer: based on the chosen KPIs, how does the new design's performance compare to the old one?
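A sketch of how these KPIs can be computed from the web log (toy data below; step names 'start', 'step_1'…'step_3', 'confirm' are assumed):

```python
import pandas as pd

# Toy web log: client 1 completes; client 2 moves backward and never confirms.
log = pd.DataFrame({
    "client_id": [1, 1, 1, 2, 2, 2],
    "process_step": ["start", "step_1", "confirm", "start", "step_1", "start"],
    "date_time": pd.to_datetime([
        "2017-04-01 10:00", "2017-04-01 10:02", "2017-04-01 10:05",
        "2017-04-01 11:00", "2017-04-01 11:04", "2017-04-01 11:06",
    ]),
})
order = {"start": 0, "step_1": 1, "step_2": 2, "step_3": 3, "confirm": 4}
log = log.sort_values(["client_id", "date_time"])
log["step_num"] = log["process_step"].map(order)

# Completion rate: share of clients who ever reach 'confirm'.
completed = log.groupby("client_id")["process_step"].apply(lambda s: "confirm" in s.values)
completion_rate = completed.mean()

# Error rate: share of step transitions that move to an earlier step.
log["step_change"] = log.groupby("client_id")["step_num"].diff()
errors = (log["step_change"] < 0).sum()
transitions = log["step_change"].notna().sum()
error_rate = errors / transitions

# Time spent per step: gap until the same client's next page load.
log["time_on_step"] = -log.groupby("client_id")["date_time"].diff(-1)
print(completion_rate, error_rate)
print(log.groupby("process_step")["time_on_step"].mean())
```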
Conduct hypothesis testing to make data-driven conclusions about the effectiveness of the redesign
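One standard way to test whether the Test group's completion rate is significantly higher is a two-proportion z-test. The counts below are hypothetical placeholders (the real ones come from the data):

```python
from statistics import NormalDist

# Hypothetical counts: completions / starters per group (NOT real results).
test_success, test_n = 6_200, 10_000
ctrl_success, ctrl_n = 5_800, 10_000

p_test = test_success / test_n
p_ctrl = ctrl_success / ctrl_n

# Pooled proportion under H0: both groups share the same completion rate.
p_pool = (test_success + ctrl_success) / (test_n + ctrl_n)
se = (p_pool * (1 - p_pool) * (1 / test_n + 1 / ctrl_n)) ** 0.5
z = (p_test - p_ctrl) / se

# One-sided p-value: probability of a z this large if H0 were true.
p_value = 1 - NormalDist().cdf(z)
print(f"z = {z:.3f}, p = {p_value:.4f}")
```

In practice, statsmodels' `proportions_ztest` performs the same calculation off the shelf.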
- Defined the metrics of the A/B experiment to present visually in Tableau
- Imported the cleaned and processed data into Tableau
- Created a dashboard showcasing the A/B test results, including completion rates, time spent on each step, and error rates for both the Test and Control groups, plus other KPIs defined for this business case
- Used Tableau's filtering and drill-down capabilities to let viewers explore the data by demographics, such as age group or gender
- Incorporated visualizations from the EDA section to provide context
We created the full presentation in Google Slides and built a story in Tableau with some key insights.
Conversion Rate (CR) > The share of starters who reached the confirmation page. Error Rate > The rate at which users loaded the same page more than once or navigated backward in the funnel. Time Spent on Page > The time elapsed between loading a page in the funnel and the next page load (unavailable for the last page seen in the funnel). Confidence Level > A metric indicating whether the traffic and CR differences are statistically significant.
The middle age group (30-50) are the primary clients using the online process, with more than 4 logons in the last 6 months among the >16-year tenure group. We noted that all demographic groups and all tenure groups made at least 3 logons in the last 6 months.
Looking at average account balance by age, seniors (>50 years) are clearly the primary clients, peaking at 300,000. In this analysis, male clients also hold almost double the balance of female clients, with a peak of 170,000.
In the analysis of the new website's data, the new design (Test) resulted in FEWER errors!
- Users in the Test group experienced 31% fewer backward movements compared to Control
- The new, intuitive UI and in-context prompts successfully reduced confusion
- Users navigated the process more smoothly with fewer mistakes
- The modern interface helps users understand what to do at each step
Data cleaning and combining the data were challenging. The data combination had to be redone in Python so we could import a single table into Tableau. Translating some calculations from Python to Tableau wasn't easy, and calculating test confidence levels in SQL was really difficult, so we turned to Python.