sklift/datasets/descr/hillstrom.rst
12 additions & 8 deletions
@@ -4,17 +4,20 @@ Kevin Hillstrom Dataset: MineThatData
 Data description
 ################
 
-This is a copy of `MineThatData E-Mail Analytics And Data Mining Challenge dataset <https://blog.minethatdata.com/2008/03/minethatdata-e-mail-analytics-and-data.html/>`_.
+This is a copy of `MineThatData E-Mail Analytics And Data Mining Challenge dataset <https://blog.minethatdata.com/2008/03/minethatdata-e-mail-analytics-and-data.html>`_.
 
-date: March 20, 2008
-
-This dataset contains 64,000 customers who last purchased within twelve months. The customers were involved in an e-mail test.
+This dataset contains 64,000 customers who last purchased within twelve months.
+The customers were involved in an e-mail test.
 
 * 1/3 were randomly chosen to receive an e-mail campaign featuring Mens merchandise.
 * 1/3 were randomly chosen to receive an e-mail campaign featuring Womens merchandise.
 * 1/3 were randomly chosen to not receive an e-mail campaign.
 
-During a period of two weeks following the e-mail campaign, results were tracked. Your job is to tell the world if the Mens or Womens e-mail campaign was successful.
+During a period of two weeks following the e-mail campaign, results were tracked.
+Your job is to tell the world if the Mens or Womens e-mail campaign was successful.
+
+Fields
+################
 
 Historical customer attributes at your disposal include:
 
@@ -30,9 +33,10 @@ Historical customer attributes at your disposal include:
 Another variable describes the e-mail campaign the customer received:
 
 * Segment
-* Mens E-Mail
-* Womens E-Mail
-* No E-Mail
+
+* Mens E-Mail
+* Womens E-Mail
+* No E-Mail
 
 Finally, we have a series of variables describing activity in the two weeks following delivery of the e-mail campaign:
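The description above sets up a three-arm randomized test (Mens E-Mail, Womens E-Mail, No E-Mail) and asks whether either campaign was successful. One way to read that question is to compare each campaign's response rate with the no-e-mail group. The following is a minimal sketch under stated assumptions, not the dataset authors' method: the rows are invented toy data, `visit` is an assumed 0/1 outcome column, and `response_rates` and `uplift_vs_control` are hypothetical helper names.

```python
from collections import defaultdict

# Toy rows invented for illustration; NOT taken from the real dataset.
# "segment" mirrors the Segment field; "visit" is an assumed 0/1 outcome.
rows = (
    [{"segment": "Mens E-Mail", "visit": v} for v in (1, 1, 1, 0)]
    + [{"segment": "Womens E-Mail", "visit": v} for v in (1, 1, 0, 0)]
    + [{"segment": "No E-Mail", "visit": v} for v in (1, 0, 0, 0)]
)


def response_rates(rows, outcome="visit"):
    """Return the mean of the outcome column for each segment."""
    totals = defaultdict(int)
    counts = defaultdict(int)
    for row in rows:
        totals[row["segment"]] += row[outcome]
        counts[row["segment"]] += 1
    return {segment: totals[segment] / counts[segment] for segment in counts}


def uplift_vs_control(rates, control="No E-Mail"):
    """Return each campaign's response rate minus the control group's rate."""
    baseline = rates[control]
    return {
        segment: rate - baseline
        for segment, rate in rates.items()
        if segment != control
    }


rates = response_rates(rows)
uplift = uplift_vs_control(rates)
for segment, value in uplift.items():
    print(f"{segment}: uplift over control = {value:.2f}")
```

Because the segments were assigned at random, a plain difference in response rates is a reasonable first estimate of each campaign's effect. On the real 64,000-row data a significance check would still be needed before calling a campaign successful.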
* train.csv: a subset of clients for training. The column *treatment_flg* indicates if there was a communication. The column *target* shows if there was a purchase afterward;
 * clients.csv: general info about clients;
-* products.csv: general info about stock items;
 * purchases.csv: clients’ purchase history prior to communication.