Overview
Dataset statistics
| Number of variables | 7 |
|---|---|
| Number of observations | 3260 |
| Missing cells | 2573 |
| Missing cells (%) | 11.3% |
| Total size in memory | 332.8 KiB |
| Average record size in memory | 104.5 B |
Variable types
| Categorical | 6 |
|---|---|
| Numeric | 1 |
Alerts
| Alert | Type |
|---|---|
| gender has 232 (7.1%) missing values | Missing |
| age_at_diagnosis has 383 (11.7%) missing values | Missing |
| tumor_grade has 1958 (60.1%) missing values | Missing |
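The headline figures in the dataset statistics can be cross-checked against the per-variable missing counts. The sketch below only re-derives numbers already printed in this report; it assumes that "Missing cells (%)" is taken over all cells (variables × observations) and that the average record size is the total memory divided by the number of observations. The variable names are illustrative.

```python
# Figures copied from the report; nothing is recomputed from the raw data.
n_variables = 7
n_observations = 3260
missing_per_variable = {"gender": 232, "age_at_diagnosis": 383, "tumor_grade": 1958}
total_memory_kib = 332.8

# Assumption: the missing-cell percentage is relative to all cells in the table.
total_cells = n_variables * n_observations
missing_cells = sum(missing_per_variable.values())
missing_pct = round(100 * missing_cells / total_cells, 1)

# Assumption: average record size = total memory / number of observations.
avg_record_bytes = round(total_memory_kib * 1024 / n_observations, 1)

print(f"Missing cells: {missing_cells} of {total_cells} ({missing_pct}%)")
print(f"Average record size: {avg_record_bytes} B")
```

Both derived values agree with the reported 11.3% and 104.5 B, which suggests the three variables flagged as Missing account for every missing cell in the dataset.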
Reproduction
| Analysis started | 2025-06-19 17:55:02.108691 |
|---|---|
| Analysis finished | 2025-06-19 17:55:02.158349 |
| Duration | 0.05 seconds |
| Software version | ydata-profiling v4.16.1 |
Variables
category
Categorical
| Distinct | 54 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.9 KiB |
| Blood or Bone marrow Acute myeloid leukemia | 213 |
|---|---|
| Uterus Endometrioid adenocarcinoma | 161 |
| Lung Adenocarcinoma | 161 |
| Thyroid gland Papillary carcinoma | 136 |
| Head and Neck Squamous cell carcinoma | 134 |
| Other values (49) | 2455 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Skin Malignant melanoma |
|---|---|
| 2nd row | Head and Neck Squamous cell carcinoma |
| 3rd row | Head and Neck Squamous cell carcinoma |
| 4th row | Head and Neck Squamous cell carcinoma |
| 5th row | Head and Neck Squamous cell carcinoma |
Common Values
| Value | Count | Frequency (%) |
| Blood or Bone marrow Acute myeloid leukemia | 213 | 6.5% |
| Uterus Endometrioid adenocarcinoma | 161 | 4.9% |
| Lung Adenocarcinoma | 161 | 4.9% |
| Thyroid gland Papillary carcinoma | 136 | 4.2% |
| Head and Neck Squamous cell carcinoma | 134 | 4.1% |
| Breast Infiltrating duct carcinoma | 134 | 4.1% |
| Lung Squamous cell carcinoma | 129 | 4.0% |
| Lung Healthy | 124 | 3.8% |
| Kidney Renal cell carcinoma | 121 | 3.7% |
| Skin Malignant melanoma | 113 | 3.5% |
| Kidney Healthy | 111 | 3.4% |
| Prostate gland Adenocarcinoma | 109 | 3.3% |
| Cervix uteri Squamous cell carcinoma | 104 | 3.2% |
| Bladder Transitional cell carcinoma | 101 | 3.1% |
| Colon Adenocarcinoma | 100 | 3.1% |
| Brain Glioblastoma | 96 | 2.9% |
| Liver Hepatocellular carcinoma | 90 | 2.8% |
| Stomach Carcinoma | 82 | 2.5% |
| Kidney Clear cell adenocarcinoma | 72 | 2.2% |
| Kidney Papillary adenocarcinoma | 67 | 2.1% |
| Pancreas Infiltrating duct carcinoma | 60 | 1.8% |
| Blood or Bone marrow Healthy | 58 | 1.8% |
| Kidney Wilms tumor | 55 | 1.7% |
| Prostate gland Acinar cell carcinoma | 51 | 1.6% |
| Brain Astrocytoma | 49 | 1.5% |
| Brain Oligodendroglioma | 45 | 1.4% |
| Breast Lobular carcinoma | 40 | 1.2% |
| Adrenal gland Pheochromocytoma | 39 | 1.2% |
| Ovarian Serous cancer | 33 | 1.0% |
| Head and Neck Healthy | 31 | 1.0% |
| Blood or Bone marrow Chronic lymphocytic leukemia | 31 | 1.0% |
| Uterus Serous cystadenocarcinoma | 28 | 0.9% |
| Adrenal gland Neuroblastoma | 26 | 0.8% |
| Breast Healthy | 26 | 0.8% |
| Bones Osteosarcoma | 24 | 0.7% |
| Thymus Thymoma | 24 | 0.7% |
| Esophagus Adenocarcinoma | 22 | 0.7% |
| Testis Seminoma | 21 | 0.6% |
| Blood or Bone marrow Acute lymphocytic leukemia | 20 | 0.6% |
| Pancreas Healthy | 20 | 0.6% |
| Esophagus Squamous cell carcinoma | 20 | 0.6% |
| Kidney Malignant rhabdoid tumor | 20 | 0.6% |
| Prostate gland Healthy | 19 | 0.6% |
| Uterus Healthy | 14 | 0.4% |
| Liver Healthy | 14 | 0.4% |
| Blood or Bone marrow Acute myelomonocytic leukemia | 14 | 0.4% |
| Colon Healthy | 13 | 0.4% |
| Retroperitoneum Leiomyosarcoma | 13 | 0.4% |
| Anterior mediastinum Thymoma | 13 | 0.4% |
| Cervix uteri Adenocarcinoma | 12 | 0.4% |
| Thyroid gland Healthy | 12 | 0.4% |
| Adrenal gland Adrenal cortical carcinoma | 12 | 0.4% |
| Retroperitoneum Dedifferentiated liposarcoma | 12 | 0.4% |
| Pleura Epithelioid mesothelioma | 11 | 0.3% |
gender
Categorical
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 232 |
| Missing (%) | 7.1% |
| Memory size | 50.9 KiB |
| male | 1581 |
|---|---|
| female | 1447 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | female |
|---|---|
| 2nd row | male |
| 3rd row | female |
| 4th row | female |
| 5th row | male |
Common Values
| Value | Count | Frequency (%) |
| male | 1581 | 48.5% |
| female | 1447 | 44.4% |
| (Missing) | 232 | 7.1% |
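The Frequency (%) column in the Common Values tables is consistent with each count divided by the total number of observations (3260), with missing rows included in the denominator. A minimal sketch of that calculation for gender, using only counts from this report (the helper name `frequency_pct` is illustrative):

```python
# Counts copied from the gender Common Values table.
n_observations = 3260
counts = {"male": 1581, "female": 1447, "(Missing)": 232}


def frequency_pct(count, total):
    """Return the share of `count` in `total` as a percentage with one decimal."""
    return round(100 * count / total, 1)


frequencies = {value: frequency_pct(n, n_observations) for value, n in counts.items()}

for value, pct in frequencies.items():
    print(f"{value}: {counts[value]} ({pct}%)")
```

The three counts add up to the 3260 observations, so the percentages cover the whole column, missing rows included.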
age_at_diagnosis
Real number (ℝ)
Missing 
| Distinct | 2265 |
|---|---|
| Distinct (%) | 78.7% |
| Missing | 383 |
| Missing (%) | 11.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20467.99722 |
| Minimum | 3 |
|---|---|
| Maximum | 32872 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 180.0 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 3584.4 |
| Q1 | 17466 |
| median | 21829 |
| Q3 | 25123 |
| 95-th percentile | 29330.2 |
| Maximum | 32872 |
| Range | 32869 |
| Interquartile range (IQR) | 7657 |
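The range and IQR follow directly from the quantiles listed above. The magnitudes (a median of 21829) also suggest that age_at_diagnosis is recorded in days rather than years, although the report does not state the unit. The sketch below re-derives the two spread measures and, under that explicit assumption, converts the quantiles to approximate years.

```python
# ASSUMPTION: age_at_diagnosis is stored in days; the report does not state the unit.
DAYS_PER_YEAR = 365.25

# Quantile statistics copied from the report.
quantiles_days = {
    "minimum": 3,
    "Q1": 17466,
    "median": 21829,
    "Q3": 25123,
    "maximum": 32872,
}

# Spread measures as defined in the table: IQR = Q3 - Q1, range = max - min.
iqr = quantiles_days["Q3"] - quantiles_days["Q1"]
value_range = quantiles_days["maximum"] - quantiles_days["minimum"]

# Approximate conversion to years, valid only if the unit really is days.
quantiles_years = {
    name: round(days / DAYS_PER_YEAR, 1) for name, days in quantiles_days.items()
}

print("IQR:", iqr)
print("Range:", value_range)
for name, years in quantiles_years.items():
    print(f"{name}: about {years} years (if the unit is days)")
```

If the unit is days, the median corresponds to roughly 60 years and the maximum to roughly 90 years. The unit should be confirmed against the data source before any conversion is applied.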
Descriptive statistics
| Standard deviation | 7003.502502 |
|---|---|
| Coefficient of variation (CV) | 0.3421684314 |
| Kurtosis | 1.089340156 |
| Mean | 20467.99722 |
| Median Absolute Deviation (MAD) | 3746 |
| Skewness | -1.107898079 |
| Sum | 58886428 |
| Variance | 49049047.3 |
| Monotonicity | Not monotonic |
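Several of the descriptive statistics are related by simple identities, which gives a quick consistency check on the table. The sketch below uses only values printed in this report and assumes the mean is taken over the non-missing rows (3260 observations minus 383 missing).

```python
# Values copied from the report.
n_observations = 3260
n_missing = 383
total_sum = 58886428
std_dev = 7003.502502

# Assumption: statistics are computed over non-missing values only.
n_valid = n_observations - n_missing

mean = total_sum / n_valid   # Mean = Sum / number of non-missing values
variance = std_dev ** 2      # Variance = standard deviation squared
cv = std_dev / mean          # Coefficient of variation = std / mean

print(f"Non-missing values: {n_valid}")
print(f"Mean: {mean:.5f}")
print(f"Variance: {variance:.1f}")
print(f"CV: {cv:.10f}")
```

The derived values match the reported mean, variance, and coefficient of variation to the precision shown in the table.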
Histogram with fixed size bins (bins=50)
Common Values
| Value | Count | Frequency (%) |
| 32872 | 13 | 0.4% |
| 18748 | 6 | 0.2% |
| 24594 | 6 | 0.2% |
| 17349 | 5 | 0.2% |
| 5594 | 5 | 0.2% |
| 23226 | 5 | 0.2% |
| 31016 | 5 | 0.2% |
| 25123 | 5 | 0.2% |
| 27636 | 4 | 0.1% |
| 20974 | 4 | 0.1% |
| 23927 | 4 | 0.1% |
| 28108 | 4 | 0.1% |
| 20747 | 4 | 0.1% |
| 17760 | 4 | 0.1% |
| 24122 | 4 | 0.1% |
| 21519 | 4 | 0.1% |
| 24631 | 4 | 0.1% |
| 24745 | 4 | 0.1% |
| 22241 | 4 | 0.1% |
| 20349 | 4 | 0.1% |
| 22471 | 4 | 0.1% |
| 11650 | 3 | 0.1% |
| 17807 | 3 | 0.1% |
| 19737 | 3 | 0.1% |
| 5940 | 3 | 0.1% |
| 19939 | 3 | 0.1% |
| 4623 | 3 | 0.1% |
| 18856 | 3 | 0.1% |
| 20851 | 3 | 0.1% |
| 3285 | 3 | 0.1% |
| 24138 | 3 | 0.1% |
| 851 | 3 | 0.1% |
| 17930 | 3 | 0.1% |
| 24986 | 3 | 0.1% |
| 20750 | 3 | 0.1% |
| 21871 | 3 | 0.1% |
| 19964 | 3 | 0.1% |
| 30028 | 3 | 0.1% |
| 26002 | 3 | 0.1% |
| 25516 | 3 | 0.1% |
| 24837 | 3 | 0.1% |
| 17831 | 3 | 0.1% |
| 21328 | 3 | 0.1% |
| 31052 | 3 | 0.1% |
| 1454 | 3 | 0.1% |
| 3972 | 3 | 0.1% |
| 5325 | 3 | 0.1% |
| 6946 | 3 | 0.1% |
| 3774 | 3 | 0.1% |
| 1504 | 3 | 0.1% |
| 969 | 3 | 0.1% |
| 27971 | 3 | 0.1% |
| 24043 | 3 | 0.1% |
| 27162 | 3 | 0.1% |
| 22156 | 3 | 0.1% |
| 25627 | 3 | 0.1% |
| 2855 | 3 | 0.1% |
| 5544 | 3 | 0.1% |
| 22378 | 3 | 0.1% |
| 1248 | 3 | 0.1% |
| 15559 | 3 | 0.1% |
| 19892 | 3 | 0.1% |
| 4607 | 3 | 0.1% |
| 19568 | 3 | 0.1% |
| 27968 | 3 | 0.1% |
| 27958 | 3 | 0.1% |
| 18163 | 3 | 0.1% |
| 20787 | 3 | 0.1% |
| 24775 | 3 | 0.1% |
| 26819 | 3 | 0.1% |
| 19413 | 3 | 0.1% |
| 23908 | 3 | 0.1% |
| 27547 | 3 | 0.1% |
| 19426 | 3 | 0.1% |
| 21725 | 3 | 0.1% |
| 23318 | 3 | 0.1% |
| 17465 | 3 | 0.1% |
| 21864 | 2 | 0.1% |
| 14470 | 2 | 0.1% |
| 17195 | 2 | 0.1% |
| 24757 | 2 | 0.1% |
| 23398 | 2 | 0.1% |
| 21655 | 2 | 0.1% |
| 26424 | 2 | 0.1% |
| 20023 | 2 | 0.1% |
| 11735 | 2 | 0.1% |
| 22005 | 2 | 0.1% |
| 22126 | 2 | 0.1% |
| 18171 | 2 | 0.1% |
| 15616 | 2 | 0.1% |
| 21557 | 2 | 0.1% |
| 23784 | 2 | 0.1% |
| 24735 | 2 | 0.1% |
| 12068 | 2 | 0.1% |
| 26603 | 2 | 0.1% |
| 22279 | 2 | 0.1% |
| 24603 | 2 | 0.1% |
| 8499 | 2 | 0.1% |
| 24376 | 2 | 0.1% |
| 23653 | 2 | 0.1% |
| 29657 | 2 | 0.1% |
| 17296 | 2 | 0.1% |
| 18600 | 2 | 0.1% |
| 20780 | 2 | 0.1% |
| 22902 | 2 | 0.1% |
| 36 | 2 | 0.1% |
| 20054 | 2 | 0.1% |
| 26438 | 2 | 0.1% |
| 11470 | 2 | 0.1% |
| 26190 | 2 | 0.1% |
| 27278 | 2 | 0.1% |
| 20396 | 2 | 0.1% |
| 20175 | 2 | 0.1% |
| 3044 | 2 | 0.1% |
| 27364 | 2 | 0.1% |
| 14843 | 2 | 0.1% |
| 11024 | 2 | 0.1% |
| 30227 | 2 | 0.1% |
| 815 | 2 | 0.1% |
| 1720 | 2 | 0.1% |
| 11932 | 2 | 0.1% |
| 21323 | 2 | 0.1% |
| 24103 | 2 | 0.1% |
| 21336 | 2 | 0.1% |
| 563 | 2 | 0.1% |
| 19312 | 2 | 0.1% |
| 24692 | 2 | 0.1% |
| 25141 | 2 | 0.1% |
| 22161 | 2 | 0.1% |
| 23923 | 2 | 0.1% |
| 12665 | 2 | 0.1% |
| 18219 | 2 | 0.1% |
| 24927 | 2 | 0.1% |
| 17466 | 2 | 0.1% |
| 21302 | 2 | 0.1% |
| 22236 | 2 | 0.1% |
| 21734 | 2 | 0.1% |
| 25759 | 2 | 0.1% |
| 23982 | 2 | 0.1% |
| 16923 | 2 | 0.1% |
| 28015 | 2 | 0.1% |
| 21654 | 2 | 0.1% |
| 20550 | 2 | 0.1% |
| 24528 | 2 | 0.1% |
| 28625 | 2 | 0.1% |
| 28668 | 2 | 0.1% |
| 17228 | 2 | 0.1% |
| 21878 | 2 | 0.1% |
| 24167 | 2 | 0.1% |
| 24477 | 2 | 0.1% |
| 27731 | 2 | 0.1% |
| 25119 | 2 | 0.1% |
| 20604 | 2 | 0.1% |
| 15366 | 2 | 0.1% |
| 24301 | 2 | 0.1% |
| 11379 | 2 | 0.1% |
| 16231 | 2 | 0.1% |
| 22433 | 2 | 0.1% |
| 27236 | 2 | 0.1% |
| 22720 | 2 | 0.1% |
| 22773 | 2 | 0.1% |
| 27756 | 2 | 0.1% |
| 23121 | 2 | 0.1% |
| 8918 | 2 | 0.1% |
| 24636 | 2 | 0.1% |
| 18595 | 2 | 0.1% |
| 19172 | 2 | 0.1% |
| 28030 | 2 | 0.1% |
| 20344 | 2 | 0.1% |
| 20557 | 2 | 0.1% |
| 24209 | 2 | 0.1% |
| 24734 | 2 | 0.1% |
| 21166 | 2 | 0.1% |
| 18640 | 2 | 0.1% |
| 26778 | 2 | 0.1% |
| 23119 | 2 | 0.1% |
| 28444 | 2 | 0.1% |
| 11204 | 2 | 0.1% |
| 29536 | 2 | 0.1% |
| 22641 | 2 | 0.1% |
| 25114 | 2 | 0.1% |
| 24378 | 2 | 0.1% |
| 21205 | 2 | 0.1% |
| 16688 | 2 | 0.1% |
| 26038 | 2 | 0.1% |
| 19991 | 2 | 0.1% |
| 26533 | 2 | 0.1% |
| 22428 | 2 | 0.1% |
| 18263 | 2 | 0.1% |
| 25366 | 2 | 0.1% |
| 23531 | 2 | 0.1% |
| 23135 | 2 | 0.1% |
| 23972 | 2 | 0.1% |
| 19663 | 2 | 0.1% |
| 24399 | 2 | 0.1% |
| 22768 | 2 | 0.1% |
| 26205 | 2 | 0.1% |
| 23148 | 2 | 0.1% |
| 20558 | 2 | 0.1% |
| 18567 | 2 | 0.1% |
| 25499 | 2 | 0.1% |
| 26008 | 2 | 0.1% |
| 25447 | 2 | 0.1% |
| 26353 | 2 | 0.1% |
| 21632 | 2 | 0.1% |
| 23481 | 2 | 0.1% |
| 28101 | 2 | 0.1% |
| 26600 | 2 | 0.1% |
| 22602 | 2 | 0.1% |
| 26583 | 2 | 0.1% |
| 24408 | 2 | 0.1% |
| 20774 | 2 | 0.1% |
| 23707 | 2 | 0.1% |
| 20077 | 2 | 0.1% |
| 21418 | 2 | 0.1% |
| 25231 | 2 | 0.1% |
| 22361 | 2 | 0.1% |
| 21265 | 2 | 0.1% |
| 24886 | 2 | 0.1% |
| 21160 | 2 | 0.1% |
| 22497 | 2 | 0.1% |
| 19705 | 2 | 0.1% |
| 25991 | 2 | 0.1% |
| 25783 | 2 | 0.1% |
| 18161 | 2 | 0.1% |
| 19779 | 2 | 0.1% |
| 23710 | 2 | 0.1% |
| 23896 | 2 | 0.1% |
| 24110 | 2 | 0.1% |
| 25227 | 2 | 0.1% |
| 22027 | 2 | 0.1% |
| 29164 | 2 | 0.1% |
| 25290 | 2 | 0.1% |
| 25195 | 2 | 0.1% |
| 22705 | 2 | 0.1% |
| 17824 | 2 | 0.1% |
| 22042 | 2 | 0.1% |
| 18792 | 2 | 0.1% |
| 27124 | 2 | 0.1% |
| 25862 | 2 | 0.1% |
| 20683 | 2 | 0.1% |
| 18993 | 2 | 0.1% |
| 23682 | 2 | 0.1% |
| 32871 | 2 | 0.1% |
| 13840 | 2 | 0.1% |
| 24754 | 2 | 0.1% |
| 20720 | 2 | 0.1% |
| 23868 | 2 | 0.1% |
| 17422 | 2 | 0.1% |
| 23322 | 2 | 0.1% |
| Other values (2015) | 2261 | 69.4% |
| (Missing) | 383 | 11.7% |
Minimum 5 values
| Value | Count | Frequency (%) |
| 3 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 31 | 2 | 0.1% |
| 36 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
| 32872 | 13 | 0.4% |
| 32871 | 2 | 0.1% |
| 32754 | 1 | < 0.1% |
| 32750 | 1 | < 0.1% |
| 32682 | 1 | < 0.1% |
tissue_or_organ_of_origin
Categorical
| Distinct | 25 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 180.0 KiB |
| Kidney | 446 |
|---|---|
| Lung | 414 |
| Blood or Bone marrow | 336 |
| Uterus | 203 |
| Breast | 200 |
| Other values (20) | 1661 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Skin |
|---|---|
| 2nd row | Head and Neck |
| 3rd row | Head and Neck |
| 4th row | Head and Neck |
| 5th row | Head and Neck |
Common Values
| Value | Count | Frequency (%) |
| Kidney | 446 | 13.7% |
| Lung | 414 | 12.7% |
| Blood or Bone marrow | 336 | 10.3% |
| Uterus | 203 | 6.2% |
| Breast | 200 | 6.1% |
| Brain | 190 | 5.8% |
| Prostate gland | 179 | 5.5% |
| Head and Neck | 165 | 5.1% |
| Thyroid gland | 148 | 4.5% |
| Cervix uteri | 116 | 3.6% |
| Skin | 113 | 3.5% |
| Colon | 113 | 3.5% |
| Liver | 104 | 3.2% |
| Bladder | 101 | 3.1% |
| Stomach | 82 | 2.5% |
| Pancreas | 80 | 2.5% |
| Adrenal gland | 77 | 2.4% |
| Esophagus | 42 | 1.3% |
| Ovarian | 33 | 1.0% |
| Retroperitoneum | 25 | 0.8% |
| Thymus | 24 | 0.7% |
| Bones | 24 | 0.7% |
| Testis | 21 | 0.6% |
| Anterior mediastinum | 13 | 0.4% |
| Pleura | 11 | 0.3% |
primary_diagnosis
Categorical
| Distinct | 38 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 180.0 KiB |
| Healthy | 442 |
|---|---|
| Adenocarcinoma | 432 |
| Squamous cell carcinoma | 387 |
| Acute myeloid leukemia | 213 |
| Infiltrating duct carcinoma | 194 |
| Other values (33) | 1592 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Malignant melanoma |
|---|---|
| 2nd row | Squamous cell carcinoma |
| 3rd row | Squamous cell carcinoma |
| 4th row | Squamous cell carcinoma |
| 5th row | Squamous cell carcinoma |
Common Values
| Value | Count | Frequency (%) |
| Healthy | 442 | 13.6% |
| Adenocarcinoma | 432 | 13.3% |
| Squamous cell carcinoma | 387 | 11.9% |
| Acute myeloid leukemia | 213 | 6.5% |
| Infiltrating duct carcinoma | 194 | 6.0% |
| Papillary adenocarcinoma | 170 | 5.2% |
| Endometrioid adenocarcinoma | 161 | 4.9% |
| Renal cell carcinoma | 121 | 3.7% |
| Malignant melanoma | 113 | 3.5% |
| Glioblastoma | 96 | 2.9% |
| Hepatocellular carcinoma | 90 | 2.8% |
| Transitional cell carcinoma | 74 | 2.3% |
| Clear cell adenocarcinoma | 72 | 2.2% |
| Wilms tumor | 55 | 1.7% |
| Acinar cell carcinoma | 51 | 1.6% |
| Astrocytoma | 49 | 1.5% |
| Oligodendroglioma | 45 | 1.4% |
| Lobular carcinoma | 40 | 1.2% |
| Pheochromocytoma | 39 | 1.2% |
| Thymoma | 37 | 1.1% |
| Serous cancer | 33 | 1.0% |
| Papillary carcinoma | 33 | 1.0% |
| Chronic lymphocytic leukemia | 31 | 1.0% |
| Serous cystadenocarcinoma | 28 | 0.9% |
| Papillary transitional cell carcinoma | 27 | 0.8% |
| Neuroblastoma | 26 | 0.8% |
| Osteosarcoma | 24 | 0.7% |
| Seminoma | 21 | 0.6% |
| Malignant rhabdoid tumor | 20 | 0.6% |
| Tubular adenocarcinoma | 20 | 0.6% |
| Acute lymphocytic leukemia | 20 | 0.6% |
| Mucinous adenocarcinoma | 18 | 0.6% |
| Carcinoma | 16 | 0.5% |
| Acute myelomonocytic leukemia | 14 | 0.4% |
| Leiomyosarcoma | 13 | 0.4% |
| Dedifferentiated liposarcoma | 12 | 0.4% |
| Adrenal cortical carcinoma | 12 | 0.4% |
| Epithelioid mesothelioma | 11 | 0.3% |
tumor_grade
Categorical
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1958 |
| Missing (%) | 60.1% |
| Memory size | 180.0 KiB |
| G2 | 645 |
|---|---|
| G3 | 492 |
| G1 | 129 |
| G4 | 36 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | G3 |
|---|---|
| 2nd row | G2 |
| 3rd row | G2 |
| 4th row | G2 |
| 5th row | G3 |
Common Values
| Value | Count | Frequency (%) |
| G2 | 645 | 19.8% |
| G3 | 492 | 15.1% |
| G1 | 129 | 4.0% |
| G4 | 36 | 1.1% |
| (Missing) | 1958 | 60.1% |
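The Distinct (%) figure for tumor_grade (0.3%) is higher than a division by all 3260 rows would give. The reported values for the three variables with missing data are consistent with dividing the distinct count by the number of non-missing rows instead. The sketch below checks that reading against the numbers in this report; it is an inference from the figures, not a statement about how the profiling tool is implemented.

```python
n_observations = 3260

# (distinct, missing, reported Distinct (%)) copied from the report.
variables = {
    "gender": (2, 232, 0.1),
    "age_at_diagnosis": (2265, 383, 78.7),
    "tumor_grade": (4, 1958, 0.3),
}

# Assumption: Distinct (%) = distinct / non-missing rows.
derived = {}
for name, (distinct, missing, _reported) in variables.items():
    non_missing = n_observations - missing
    derived[name] = round(100 * distinct / non_missing, 1)

for name, (_distinct, _missing, reported) in variables.items():
    print(f"{name}: derived {derived[name]}%, reported {reported}%")

# The four grade counts should add up to the non-missing rows of tumor_grade.
grade_counts = {"G2": 645, "G3": 492, "G1": 129, "G4": 36}
graded_rows = sum(grade_counts.values())
print("Rows with a tumor grade:", graded_rows)
```

With 60.1% of tumor_grade missing, only 1302 rows carry a grade, so any analysis stratified by grade works on well under half of the dataset.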
platform
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 180.0 KiB |
| 450K | 2309 |
|---|---|
| EPIC | 939 |
| EPICv2 | 12 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EPIC |
|---|---|
| 2nd row | 450K |
| 3rd row | 450K |
| 4th row | 450K |
| 5th row | 450K |
Common Values
| Value | Count | Frequency (%) |
| 450K | 2309 | 70.8% |
| EPIC | 939 | 28.8% |
| EPICv2 | 12 | 0.4% |