Training Data
Test Data
195
ROWS
49
1
DUPLICATES
0
53.8 kb
RAM
13.5 kb
7
FEATURES
7
5
CATEGORICAL
5
2
NUMERICAL
2
0
TEXT
0
2.1.4
Get updates, docs & report issues here

Created & maintained by Francois Bertrand
Graphic design by Jean-Francois Hains
1
total_bill
VALUES:
195
(100%)
49
(100%)
MISSING:
---
---
DISTINCT:
185
(95%)
48
(98%)
ZEROES:
---
---
MAX
50.8
48.2
95%
38.0
37.8
Q3
24.1
24.5
AVG
19.8
19.7
MEDIAN
17.5
18.1
Q1
13.2
14.5
5%
10.1
7.4
MIN
7.2
3.1
RANGE
43.6
45.1
IQR
10.9
10.0
STD
8.83
9.28
VAR
77.9
86.2
KURT.
1.22
1.43
SKEW
1.20
0.940
SUM
3,865
963
2
tip
VALUES:
195
(100%)
49
(100%)
MISSING:
---
---
DISTINCT:
107
(55%)
32
(65%)
ZEROES:
---
---
MAX
10.00
7.58
95%
5.32
5.10
Q3
3.50
4.00
AVG
2.96
3.13
MEDIAN
2.74
3.00
Q1
2.00
2.00
5%
1.50
1.07
MIN
1.00
1.00
RANGE
9.00
6.58
IQR
1.50
2.00
STD
1.37
1.45
VAR
1.87
2.11
KURT.
4.75
0.663
SKEW
1.70
0.684
SUM
578
153
3
sex
VALUES:
195
(100%)
49
(100%)
MISSING:
---
---
DISTINCT:
2
(1%)
2
(4%)
4
smoker
VALUES:
195
(100%)
49
(100%)
MISSING:
---
---
DISTINCT:
2
(1%)
2
(4%)
5
day
VALUES:
195
(100%)
49
(100%)
MISSING:
---
---
DISTINCT:
4
(2%)
4
(8%)
6
time
VALUES:
195
(100%)
49
(100%)
MISSING:
---
---
DISTINCT:
2
(1%)
2
(4%)
7
size
VALUES:
195
(100%)
49
(100%)
MISSING:
---
---
DISTINCT:
6
(3%)
5
(10%)
Associations
[Only including dataset "Training Data"]
Squares are categorical associations (uncertainty coefficient & correlation ratio) from 0 to 1. The uncertainty coefficient is assymmetrical, (i.e. ROW LABEL values indicate how much they PROVIDE INFORMATION to each LABEL at the TOP).

Circles are the symmetrical numerical correlations (Pearson's) from -1 to 1. The trivial diagonal is intentionally left blank for clarity.
Associations
[Only including dataset "Test Data"]
Squares are categorical associations (uncertainty coefficient & correlation ratio) from 0 to 1. The uncertainty coefficient is assymmetrical, (i.e. ROW LABEL values indicate how much they PROVIDE INFORMATION to each LABEL at the TOP).

Circles are the symmetrical numerical correlations (Pearson's) from -1 to 1. The trivial diagonal is intentionally left blank for clarity.
total_bill
MISSING:
---
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

tip
0.66

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

size
0.59
time
0.22
day
0.19
sex
0.15
smoker
0.12
MOST FREQUENT VALUES

13.42
2
1.0%
1
2.0%
15.98
2
1.0%
0
0.0%
15.69
2
1.0%
0
0.0%
10.34
2
1.0%
0
0.0%
20.69
2
1.0%
0
0.0%
10.07
2
1.0%
0
0.0%
10.33
2
1.0%
0
0.0%
13.81
2
1.0%
0
0.0%
13.0
2
1.0%
0
0.0%
20.29
2
1.0%
0
0.0%
14.26
1
0.5%
0
0.0%
24.27
1
0.5%
0
0.0%
30.06
1
0.5%
0
0.0%
13.27
1
0.5%
0
0.0%
9.6
1
0.5%
0
0.0%
SMALLEST VALUES

7.25
1
0.5%
1
2.0%
7.56
1
0.5%
0
0.0%
8.35
1
0.5%
0
0.0%
8.58
1
0.5%
0
0.0%
8.77
1
0.5%
0
0.0%
9.55
1
0.5%
0
0.0%
9.6
1
0.5%
0
0.0%
9.78
1
0.5%
0
0.0%
9.94
1
0.5%
0
0.0%
10.07
2
1.0%
0
0.0%
10.09
1
0.5%
0
0.0%
10.27
1
0.5%
0
0.0%
10.29
1
0.5%
0
0.0%
10.33
2
1.0%
0
0.0%
10.34
2
1.0%
0
0.0%
LARGEST VALUES

50.81
1
0.5%
0
0.0%
48.33
1
0.5%
0
0.0%
48.27
1
0.5%
0
0.0%
45.35
1
0.5%
0
0.0%
44.3
1
0.5%
0
0.0%
41.19
1
0.5%
0
0.0%
40.55
1
0.5%
0
0.0%
40.17
1
0.5%
0
0.0%
38.73
1
0.5%
0
0.0%
38.07
1
0.5%
0
0.0%
38.01
1
0.5%
0
0.0%
35.83
1
0.5%
0
0.0%
34.83
1
0.5%
0
0.0%
34.81
1
0.5%
0
0.0%
34.65
1
0.5%
0
0.0%
tip
MISSING:
---
---
>
NUMERICAL ASSOCIATIONS
(PEARSON, -1 to 1)

total_bill
0.66

CATEGORICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)

size
0.46
time
0.14
day
0.13
sex
0.07
smoker
0.01
MOST FREQUENT VALUES

2.0
27
13.8%
6
12.2%
3.0
18
9.2%
5
10.2%
2.5
9
4.6%
1
2.0%
1.5
9
4.6%
0
0.0%
4.0
9
4.6%
3
6.1%
3.5
7
3.6%
2
4.1%
5.0
6
3.1%
4
8.2%
3.23
2
1.0%
0
0.0%
2.01
2
1.0%
0
0.0%
3.18
2
1.0%
0
0.0%
3.25
2
1.0%
0
0.0%
2.75
2
1.0%
0
0.0%
2.24
2
1.0%
0
0.0%
1.25
2
1.0%
1
2.0%
2.03
2
1.0%
0
0.0%
SMALLEST VALUES

1.0
1
0.5%
3
6.1%
1.01
1
0.5%
0
0.0%
1.1
1
0.5%
0
0.0%
1.25
2
1.0%
1
2.0%
1.36
1
0.5%
0
0.0%
1.44
1
0.5%
1
2.0%
1.45
1
0.5%
0
0.0%
1.47
1
0.5%
0
0.0%
1.5
9
4.6%
0
0.0%
1.56
1
0.5%
0
0.0%
1.57
1
0.5%
0
0.0%
1.58
1
0.5%
0
0.0%
1.61
1
0.5%
0
0.0%
1.63
1
0.5%
0
0.0%
1.64
1
0.5%
0
0.0%
LARGEST VALUES

10.0
1
0.5%
0
0.0%
9.0
1
0.5%
0
0.0%
6.73
1
0.5%
0
0.0%
6.7
1
0.5%
0
0.0%
6.5
1
0.5%
1
2.0%
6.0
1
0.5%
0
0.0%
5.92
1
0.5%
0
0.0%
5.85
1
0.5%
0
0.0%
5.65
1
0.5%
0
0.0%
5.6
1
0.5%
0
0.0%
5.2
1
0.5%
0
0.0%
5.17
1
0.5%
0
0.0%
5.15
1
0.5%
0
0.0%
5.14
1
0.5%
0
0.0%
5.07
1
0.5%
0
0.0%
sex
MISSING:
---
---
TOP CATEGORIES

Male
125
64%
32
65%
Female
70
36%
17
35%
ALL
195
100%
49
100%
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
sex
PROVIDES INFORMATION ON...

time
0.03
day
0.02
size
0.01
smoker
0.00

THESE FEATURES
GIVE INFORMATION
ON sex:

day
0.04
time
0.03
size
0.01
smoker
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
sex
CORRELATION RATIO WITH...

total_bill
0.15
tip
0.07
smoker
MISSING:
---
---
TOP CATEGORIES

No
121
62%
30
61%
Yes
74
38%
19
39%
ALL
195
100%
49
100%
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
smoker
PROVIDES INFORMATION ON...

day
0.04
size
0.01
sex
0.00
time
0.00

THESE FEATURES
GIVE INFORMATION
ON smoker:

day
0.07
size
0.02
sex
0.00
time
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
smoker
CORRELATION RATIO WITH...

total_bill
0.12
tip
0.01
day
MISSING:
---
---
TOP CATEGORIES

Sat
69
35%
18
37%
Sun
59
30%
17
35%
Thur
50
26%
12
24%
Fri
17
9%
2
4%
ALL
195
100%
49
100%
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
day
PROVIDES INFORMATION ON...

time
0.86
smoker
0.07
size
0.06
sex
0.04

THESE FEATURES
GIVE INFORMATION
ON day:

time
0.40
size
0.05
smoker
0.04
sex
0.02

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
day
CORRELATION RATIO WITH...

total_bill
0.19
tip
0.13
time
MISSING:
---
---
TOP CATEGORIES

Dinner
139
71%
37
76%
Lunch
56
29%
12
24%
ALL
195
100%
49
100%
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
time
PROVIDES INFORMATION ON...

day
0.40
size
0.04
sex
0.03
smoker
0.00

THESE FEATURES
GIVE INFORMATION
ON time:

day
0.86
size
0.06
sex
0.03
smoker
0.00

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
time
CORRELATION RATIO WITH...

total_bill
0.22
tip
0.14
size
MISSING:
---
---
TOP CATEGORIES

2
129
66%
27
55%
3
31
16%
7
14%
4
27
14%
10
20%
5
5
3%
0
0%
1
2
1%
2
4%
6
1
<1%
3
6%
ALL
195
100%
49
100%
CATEGORICAL ASSOCIATIONS
(UNCERTAINTY COEFFICIENT, 0 to 1)
size
PROVIDES INFORMATION ON...

time
0.06
day
0.05
smoker
0.02
sex
0.01

THESE FEATURES
GIVE INFORMATION
ON size:

day
0.06
time
0.04
smoker
0.01
sex
0.01

NUMERICAL ASSOCIATIONS
(CORRELATION RATIO, 0 to 1)
size
CORRELATION RATIO WITH...

total_bill
0.59
tip
0.46