Overview

Dataset statistics

Number of variables: 7
Number of observations: 3260
Missing cells: 2573
Missing cells (%): 11.3%
Total size in memory: 332.8 KiB
Average record size in memory: 104.5 B
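
The overview figures can be cross-checked against the per-variable missing counts listed under Alerts. The sketch below is only a sanity check, not how the profiler itself computes these values; it assumes that the missing-cell percentage is taken over all cells (observations times variables) and that 1 KiB is 1024 bytes. The function names are hypothetical.

```python
# Figures copied from the Overview and Alerts sections of this report.
n_variables = 7
n_observations = 3260
missing_per_variable = {"gender": 232, "age_at_diagnosis": 383, "tumor_grade": 1958}


def missing_cell_summary(missing, n_rows, n_cols):
    """Return (total missing cells, share of all cells in percent)."""
    total = sum(missing.values())
    return total, 100.0 * total / (n_rows * n_cols)


def average_record_size_bytes(total_kib, n_rows):
    """Convert a total size in KiB (assumed 1024 bytes each) to bytes per row."""
    return total_kib * 1024 / n_rows


total_missing, missing_pct = missing_cell_summary(
    missing_per_variable, n_observations, n_variables
)
record_size = average_record_size_bytes(332.8, n_observations)

# Compare with the reported 2573 missing cells, 11.3% and 104.5 B.
print(total_missing, round(missing_pct, 1), round(record_size, 1))
```

Under these assumptions the three alert counts account for every missing cell in the dataset, so the four variables without an alert have no missing values.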

Variable types

Categorical: 6
Numeric: 1

Alerts

gender has 232 (7.1%) missing values
age_at_diagnosis has 383 (11.7%) missing values
tumor_grade has 1958 (60.1%) missing values

Reproduction

Analysis started: 2025-06-19 17:55:02.108691
Analysis finished: 2025-06-19 17:55:02.158349
Duration: 0.05 seconds
Software version: ydata-profiling v4.16.1

Variables

category
Categorical

Distinct: 54
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 50.9 KiB

Blood or Bone marrow Acute myeloid leukemia | 213
Uterus Endometrioid adenocarcinoma | 161
Lung Adenocarcinoma | 161
Thyroid gland Papillary carcinoma | 136
Head and Neck Squamous cell carcinoma | 134
Other values (49) | 2455

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Skin Malignant melanoma
2nd row: Head and Neck Squamous cell carcinoma
3rd row: Head and Neck Squamous cell carcinoma
4th row: Head and Neck Squamous cell carcinoma
5th row: Head and Neck Squamous cell carcinoma

Common Values

Value | Count | Frequency (%)
Blood or Bone marrow Acute myeloid leukemia | 213 | 6.5%
Uterus Endometrioid adenocarcinoma | 161 | 4.9%
Lung Adenocarcinoma | 161 | 4.9%
Thyroid gland Papillary carcinoma | 136 | 4.2%
Head and Neck Squamous cell carcinoma | 134 | 4.1%
Breast Infiltrating duct carcinoma | 134 | 4.1%
Lung Squamous cell carcinoma | 129 | 4.0%
Lung Healthy | 124 | 3.8%
Kidney Renal cell carcinoma | 121 | 3.7%
Skin Malignant melanoma | 113 | 3.5%
Kidney Healthy | 111 | 3.4%
Prostate gland Adenocarcinoma | 109 | 3.3%
Cervix uteri Squamous cell carcinoma | 104 | 3.2%
Bladder Transitional cell carcinoma | 101 | 3.1%
Colon Adenocarcinoma | 100 | 3.1%
Brain Glioblastoma | 96 | 2.9%
Liver Hepatocellular carcinoma | 90 | 2.8%
Stomach Carcinoma | 82 | 2.5%
Kidney Clear cell adenocarcinoma | 72 | 2.2%
Kidney Papillary adenocarcinoma | 67 | 2.1%
Pancreas Infiltrating duct carcinoma | 60 | 1.8%
Blood or Bone marrow Healthy | 58 | 1.8%
Kidney Wilms tumor | 55 | 1.7%
Prostate gland Acinar cell carcinoma | 51 | 1.6%
Brain Astrocytoma | 49 | 1.5%
Brain Oligodendroglioma | 45 | 1.4%
Breast Lobular carcinoma | 40 | 1.2%
Adrenal gland Pheochromocytoma | 39 | 1.2%
Ovarian Serous cancer | 33 | 1.0%
Head and Neck Healthy | 31 | 1.0%
Blood or Bone marrow Chronic lymphocytic leukemia | 31 | 1.0%
Uterus Serous cystadenocarcinoma | 28 | 0.9%
Adrenal gland Neuroblastoma | 26 | 0.8%
Breast Healthy | 26 | 0.8%
Bones Osteosarcoma | 24 | 0.7%
Thymus Thymoma | 24 | 0.7%
Esophagus Adenocarcinoma | 22 | 0.7%
Testis Seminoma | 21 | 0.6%
Blood or Bone marrow Acute lymphocytic leukemia | 20 | 0.6%
Pancreas Healthy | 20 | 0.6%
Esophagus Squamous cell carcinoma | 20 | 0.6%
Kidney Malignant rhabdoid tumor | 20 | 0.6%
Prostate gland Healthy | 19 | 0.6%
Uterus Healthy | 14 | 0.4%
Liver Healthy | 14 | 0.4%
Blood or Bone marrow Acute myelomonocytic leukemia | 14 | 0.4%
Colon Healthy | 13 | 0.4%
Retroperitoneum Leiomyosarcoma | 13 | 0.4%
Anterior mediastinum Thymoma | 13 | 0.4%
Cervix uteri Adenocarcinoma | 12 | 0.4%
Thyroid gland Healthy | 12 | 0.4%
Adrenal gland Adrenal cortical carcinoma | 12 | 0.4%
Retroperitoneum Dedifferentiated liposarcoma | 12 | 0.4%
Pleura Epithelioid mesothelioma | 11 | 0.3%

gender
Categorical

Missing 

Distinct: 2
Distinct (%): 0.1%
Missing: 232
Missing (%): 7.1%
Memory size: 50.9 KiB

male | 1581
female | 1447

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: female
2nd row: male
3rd row: female
4th row: female
5th row: male

Common Values

Value | Count | Frequency (%)
male | 1581 | 48.5%
female | 1447 | 44.4%
(Missing) | 232 | 7.1%

age_at_diagnosis
Real number (ℝ)

Missing 

Distinct: 2265
Distinct (%): 78.7%
Missing: 383
Missing (%): 11.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 20467.99722
Minimum: 3
Maximum: 32872
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 180.0 KiB

Quantile statistics

Minimum: 3
5-th percentile: 3584.4
Q1: 17466
median: 21829
Q3: 25123
95-th percentile: 29330.2
Maximum: 32872
Range: 32869
Interquartile range (IQR): 7657

Descriptive statistics

Standard deviation: 7003.502502
Coefficient of variation (CV): 0.3421684314
Kurtosis: 1.089340156
Mean: 20467.99722
Median Absolute Deviation (MAD): 3746
Skewness: -1.107898079
Sum: 58886428
Variance: 49049047.3
Monotonicity: Not monotonic
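
Several of the statistics above are derived from one another, so they can be checked with simple arithmetic. The sketch below recomputes them from the reported mean, standard deviation, quartiles and extremes. The report does not state the unit of age_at_diagnosis; the conversion at the end assumes the values are ages in days, which is only a guess based on the maximum of 32872 being very close to 90 years. `DAYS_PER_YEAR` and `days_to_years` are hypothetical names introduced for illustration.

```python
# Figures copied from the age_at_diagnosis section of this report.
n_observations, n_missing = 3260, 383
mean, std = 20467.99722, 7003.502502
q1, median, q3 = 17466, 21829, 25123
minimum, maximum = 3, 32872

n_valid = n_observations - n_missing  # number of non-missing values

derived = {
    "range": maximum - minimum,   # reported: 32869
    "iqr": q3 - q1,               # reported: 7657
    "cv": std / mean,             # reported: 0.3421684314
    "variance": std ** 2,         # reported: 49049047.3
    "sum": mean * n_valid,        # reported: 58886428
}

DAYS_PER_YEAR = 365.25  # assumption: the column stores ages in days


def days_to_years(days):
    """Convert an age in days to years under the assumption above."""
    return days / DAYS_PER_YEAR


for name, value in derived.items():
    print(f"{name}: {value:.10g}")
print(f"median in years: {days_to_years(median):.1f}")
print(f"maximum in years: {days_to_years(maximum):.1f}")
```

If the days assumption holds, the 13 records at the maximum value would be consistent with ages being capped at about 90 years, but the report itself does not confirm this.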

Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
32872 | 13 | 0.4%
18748 | 6 | 0.2%
24594 | 6 | 0.2%
17349 | 5 | 0.2%
5594 | 5 | 0.2%
23226 | 5 | 0.2%
31016 | 5 | 0.2%
25123 | 5 | 0.2%
27636 | 4 | 0.1%
20974 | 4 | 0.1%
23927 | 4 | 0.1%
28108 | 4 | 0.1%
20747 | 4 | 0.1%
17760 | 4 | 0.1%
24122 | 4 | 0.1%
21519 | 4 | 0.1%
24631 | 4 | 0.1%
24745 | 4 | 0.1%
22241 | 4 | 0.1%
20349 | 4 | 0.1%
22471 | 4 | 0.1%
11650 | 3 | 0.1%
17807 | 3 | 0.1%
19737 | 3 | 0.1%
5940 | 3 | 0.1%
19939 | 3 | 0.1%
4623 | 3 | 0.1%
18856 | 3 | 0.1%
20851 | 3 | 0.1%
3285 | 3 | 0.1%
24138 | 3 | 0.1%
851 | 3 | 0.1%
17930 | 3 | 0.1%
24986 | 3 | 0.1%
20750 | 3 | 0.1%
21871 | 3 | 0.1%
19964 | 3 | 0.1%
30028 | 3 | 0.1%
26002 | 3 | 0.1%
25516 | 3 | 0.1%
24837 | 3 | 0.1%
17831 | 3 | 0.1%
21328 | 3 | 0.1%
31052 | 3 | 0.1%
1454 | 3 | 0.1%
3972 | 3 | 0.1%
5325 | 3 | 0.1%
6946 | 3 | 0.1%
3774 | 3 | 0.1%
1504 | 3 | 0.1%
969 | 3 | 0.1%
27971 | 3 | 0.1%
24043 | 3 | 0.1%
27162 | 3 | 0.1%
22156 | 3 | 0.1%
25627 | 3 | 0.1%
2855 | 3 | 0.1%
5544 | 3 | 0.1%
22378 | 3 | 0.1%
1248 | 3 | 0.1%
15559 | 3 | 0.1%
19892 | 3 | 0.1%
4607 | 3 | 0.1%
19568 | 3 | 0.1%
27968 | 3 | 0.1%
27958 | 3 | 0.1%
18163 | 3 | 0.1%
20787 | 3 | 0.1%
24775 | 3 | 0.1%
26819 | 3 | 0.1%
19413 | 3 | 0.1%
23908 | 3 | 0.1%
27547 | 3 | 0.1%
19426 | 3 | 0.1%
21725 | 3 | 0.1%
23318 | 3 | 0.1%
17465 | 3 | 0.1%
21864 | 2 | 0.1%
14470 | 2 | 0.1%
17195 | 2 | 0.1%
24757 | 2 | 0.1%
23398 | 2 | 0.1%
21655 | 2 | 0.1%
26424 | 2 | 0.1%
20023 | 2 | 0.1%
11735 | 2 | 0.1%
22005 | 2 | 0.1%
22126 | 2 | 0.1%
18171 | 2 | 0.1%
15616 | 2 | 0.1%
21557 | 2 | 0.1%
23784 | 2 | 0.1%
24735 | 2 | 0.1%
12068 | 2 | 0.1%
26603 | 2 | 0.1%
22279 | 2 | 0.1%
24603 | 2 | 0.1%
8499 | 2 | 0.1%
24376 | 2 | 0.1%
23653 | 2 | 0.1%
29657 | 2 | 0.1%
17296 | 2 | 0.1%
18600 | 2 | 0.1%
20780 | 2 | 0.1%
22902 | 2 | 0.1%
36 | 2 | 0.1%
20054 | 2 | 0.1%
26438 | 2 | 0.1%
11470 | 2 | 0.1%
26190 | 2 | 0.1%
27278 | 2 | 0.1%
20396 | 2 | 0.1%
20175 | 2 | 0.1%
3044 | 2 | 0.1%
27364 | 2 | 0.1%
14843 | 2 | 0.1%
11024 | 2 | 0.1%
30227 | 2 | 0.1%
815 | 2 | 0.1%
1720 | 2 | 0.1%
11932 | 2 | 0.1%
21323 | 2 | 0.1%
24103 | 2 | 0.1%
21336 | 2 | 0.1%
563 | 2 | 0.1%
19312 | 2 | 0.1%
24692 | 2 | 0.1%
25141 | 2 | 0.1%
22161 | 2 | 0.1%
23923 | 2 | 0.1%
12665 | 2 | 0.1%
18219 | 2 | 0.1%
24927 | 2 | 0.1%
17466 | 2 | 0.1%
21302 | 2 | 0.1%
22236 | 2 | 0.1%
21734 | 2 | 0.1%
25759 | 2 | 0.1%
23982 | 2 | 0.1%
16923 | 2 | 0.1%
28015 | 2 | 0.1%
21654 | 2 | 0.1%
20550 | 2 | 0.1%
24528 | 2 | 0.1%
28625 | 2 | 0.1%
28668 | 2 | 0.1%
17228 | 2 | 0.1%
21878 | 2 | 0.1%
24167 | 2 | 0.1%
24477 | 2 | 0.1%
27731 | 2 | 0.1%
25119 | 2 | 0.1%
20604 | 2 | 0.1%
15366 | 2 | 0.1%
24301 | 2 | 0.1%
11379 | 2 | 0.1%
16231 | 2 | 0.1%
22433 | 2 | 0.1%
27236 | 2 | 0.1%
22720 | 2 | 0.1%
22773 | 2 | 0.1%
27756 | 2 | 0.1%
23121 | 2 | 0.1%
8918 | 2 | 0.1%
24636 | 2 | 0.1%
18595 | 2 | 0.1%
19172 | 2 | 0.1%
28030 | 2 | 0.1%
20344 | 2 | 0.1%
20557 | 2 | 0.1%
24209 | 2 | 0.1%
24734 | 2 | 0.1%
21166 | 2 | 0.1%
18640 | 2 | 0.1%
26778 | 2 | 0.1%
23119 | 2 | 0.1%
28444 | 2 | 0.1%
11204 | 2 | 0.1%
29536 | 2 | 0.1%
22641 | 2 | 0.1%
25114 | 2 | 0.1%
24378 | 2 | 0.1%
21205 | 2 | 0.1%
16688 | 2 | 0.1%
26038 | 2 | 0.1%
19991 | 2 | 0.1%
26533 | 2 | 0.1%
22428 | 2 | 0.1%
18263 | 2 | 0.1%
25366 | 2 | 0.1%
23531 | 2 | 0.1%
23135 | 2 | 0.1%
23972 | 2 | 0.1%
19663 | 2 | 0.1%
24399 | 2 | 0.1%
22768 | 2 | 0.1%
26205 | 2 | 0.1%
23148 | 2 | 0.1%
20558 | 2 | 0.1%
18567 | 2 | 0.1%
25499 | 2 | 0.1%
26008 | 2 | 0.1%
25447 | 2 | 0.1%
26353 | 2 | 0.1%
21632 | 2 | 0.1%
23481 | 2 | 0.1%
28101 | 2 | 0.1%
26600 | 2 | 0.1%
22602 | 2 | 0.1%
26583 | 2 | 0.1%
24408 | 2 | 0.1%
20774 | 2 | 0.1%
23707 | 2 | 0.1%
20077 | 2 | 0.1%
21418 | 2 | 0.1%
25231 | 2 | 0.1%
22361 | 2 | 0.1%
21265 | 2 | 0.1%
24886 | 2 | 0.1%
21160 | 2 | 0.1%
22497 | 2 | 0.1%
19705 | 2 | 0.1%
25991 | 2 | 0.1%
25783 | 2 | 0.1%
18161 | 2 | 0.1%
19779 | 2 | 0.1%
23710 | 2 | 0.1%
23896 | 2 | 0.1%
24110 | 2 | 0.1%
25227 | 2 | 0.1%
22027 | 2 | 0.1%
29164 | 2 | 0.1%
25290 | 2 | 0.1%
25195 | 2 | 0.1%
22705 | 2 | 0.1%
17824 | 2 | 0.1%
22042 | 2 | 0.1%
18792 | 2 | 0.1%
27124 | 2 | 0.1%
25862 | 2 | 0.1%
20683 | 2 | 0.1%
18993 | 2 | 0.1%
23682 | 2 | 0.1%
32871 | 2 | 0.1%
13840 | 2 | 0.1%
24754 | 2 | 0.1%
20720 | 2 | 0.1%
23868 | 2 | 0.1%
17422 | 2 | 0.1%
23322 | 2 | 0.1%
Other values (2015) | 2261 | 69.4%
(Missing) | 383 | 11.7%

Value | Count | Frequency (%)
3 | 1 | < 0.1%
7 | 1 | < 0.1%
18 | 1 | < 0.1%
31 | 2 | 0.1%
36 | 2 | 0.1%

Value | Count | Frequency (%)
32872 | 13 | 0.4%
32871 | 2 | 0.1%
32754 | 1 | < 0.1%
32750 | 1 | < 0.1%
32682 | 1 | < 0.1%

Distinct: 25
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 180.0 KiB

Kidney | 446
Lung | 414
Blood or Bone marrow | 336
Uterus | 203
Breast | 200
Other values (20) | 1661

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Skin
2nd row: Head and Neck
3rd row: Head and Neck
4th row: Head and Neck
5th row: Head and Neck

Common Values

Value | Count | Frequency (%)
Kidney | 446 | 13.7%
Lung | 414 | 12.7%
Blood or Bone marrow | 336 | 10.3%
Uterus | 203 | 6.2%
Breast | 200 | 6.1%
Brain | 190 | 5.8%
Prostate gland | 179 | 5.5%
Head and Neck | 165 | 5.1%
Thyroid gland | 148 | 4.5%
Cervix uteri | 116 | 3.6%
Skin | 113 | 3.5%
Colon | 113 | 3.5%
Liver | 104 | 3.2%
Bladder | 101 | 3.1%
Stomach | 82 | 2.5%
Pancreas | 80 | 2.5%
Adrenal gland | 77 | 2.4%
Esophagus | 42 | 1.3%
Ovarian | 33 | 1.0%
Retroperitoneum | 25 | 0.8%
Thymus | 24 | 0.7%
Bones | 24 | 0.7%
Testis | 21 | 0.6%
Anterior mediastinum | 13 | 0.4%
Pleura | 11 | 0.3%

Distinct: 38
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 180.0 KiB

Healthy | 442
Adenocarcinoma | 432
Squamous cell carcinoma | 387
Acute myeloid leukemia | 213
Infiltrating duct carcinoma | 194
Other values (33) | 1592

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Malignant melanoma
2nd row: Squamous cell carcinoma
3rd row: Squamous cell carcinoma
4th row: Squamous cell carcinoma
5th row: Squamous cell carcinoma

Common Values

Value | Count | Frequency (%)
Healthy | 442 | 13.6%
Adenocarcinoma | 432 | 13.3%
Squamous cell carcinoma | 387 | 11.9%
Acute myeloid leukemia | 213 | 6.5%
Infiltrating duct carcinoma | 194 | 6.0%
Papillary adenocarcinoma | 170 | 5.2%
Endometrioid adenocarcinoma | 161 | 4.9%
Renal cell carcinoma | 121 | 3.7%
Malignant melanoma | 113 | 3.5%
Glioblastoma | 96 | 2.9%
Hepatocellular carcinoma | 90 | 2.8%
Transitional cell carcinoma | 74 | 2.3%
Clear cell adenocarcinoma | 72 | 2.2%
Wilms tumor | 55 | 1.7%
Acinar cell carcinoma | 51 | 1.6%
Astrocytoma | 49 | 1.5%
Oligodendroglioma | 45 | 1.4%
Lobular carcinoma | 40 | 1.2%
Pheochromocytoma | 39 | 1.2%
Thymoma | 37 | 1.1%
Serous cancer | 33 | 1.0%
Papillary carcinoma | 33 | 1.0%
Chronic lymphocytic leukemia | 31 | 1.0%
Serous cystadenocarcinoma | 28 | 0.9%
Papillary transitional cell carcinoma | 27 | 0.8%
Neuroblastoma | 26 | 0.8%
Osteosarcoma | 24 | 0.7%
Seminoma | 21 | 0.6%
Malignant rhabdoid tumor | 20 | 0.6%
Tubular adenocarcinoma | 20 | 0.6%
Acute lymphocytic leukemia | 20 | 0.6%
Mucinous adenocarcinoma | 18 | 0.6%
Carcinoma | 16 | 0.5%
Acute myelomonocytic leukemia | 14 | 0.4%
Leiomyosarcoma | 13 | 0.4%
Dedifferentiated liposarcoma | 12 | 0.4%
Adrenal cortical carcinoma | 12 | 0.4%
Epithelioid mesothelioma | 11 | 0.3%

tumor_grade
Categorical

Missing 

Distinct: 4
Distinct (%): 0.3%
Missing: 1958
Missing (%): 60.1%
Memory size: 180.0 KiB

G2 | 645
G3 | 492
G1 | 129
G4 | 36

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: G3
2nd row: G2
3rd row: G2
4th row: G2
5th row: G3

Common Values

Value | Count | Frequency (%)
G2 | 645 | 19.8%
G3 | 492 | 15.1%
G1 | 129 | 4.0%
G4 | 36 | 1.1%
(Missing) | 1958 | 60.1%

platform
Categorical

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 180.0 KiB

450K | 2309
EPIC | 939
EPICv2 | 12

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: EPIC
2nd row: 450K
3rd row: 450K
4th row: 450K
5th row: 450K

Common Values

Value | Count | Frequency (%)
450K | 2309 | 70.8%
EPIC | 939 | 28.8%
EPICv2 | 12 | 0.4%
