Dataset and Analysis

Statistical representation of samples for demographic categories across Initial, Naive, and SMOTE datasets.

Gender

Male Female
Initial 240 500
Naïve 1019 1009
SMOTE 4443 4443

Fitzpatrick scale

The abbreviations for `Fitzpatrick scale’ are: T.1 - type i, T.1 - type ii, T.1 - type iii, T.1 - type iv, T.1 - type v, T.1 - type vi.

T.1 T.2 T.3 T.4 T.5 T.6
Initial 11 192 289 159 72 17
Naïve 25 681 743 345 164 70
SMOTE 1925 1893 2630 1132 322 984

Age Groups

18-24 25-30 31-36 37-42 43-50 51-60 61+
Initial 83 201 164 137 103 44 8
Naïve 293 282 297 293 283 274 285
SMOTE 1014 2918 1703 1236 978 458 579

Geo Location

The abbreviations for `Geo-location’ are: RN - Rio Grande do Norte, SP - Sao Paulo, RS - Rio Grande do Sul, GO -Goias, MT - Mato Grosso, PR - Parana, RJ - Rio de Janeiro, MG - Minas Gerais, PI - Piaui, PE - Pernambuco, MA - Maranhao.

MA MT RN GO PI RS RJ SP PE PR MG
Initial 9 27 25 11 7 28 130 379 38 55 31
Naïve 18 47 39 24 34 70 409 1110 59 119 99
SMOTE 892 854 845 841 830 805 796 787 769 744 723