Overview
Brought to you by YData
Dataset statistics
| Number of variables | 4 |
|---|---|
| Number of observations | 2127 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Total size in memory | 83.1 KiB |
| Average record size in memory | 40.0 B |
Variable types
| Categorical | 4 |
|---|
Reproduction
| Analysis started | 2025-06-19 16:34:49.548317 |
|---|---|
| Analysis finished | 2025-06-19 16:34:49.577639 |
| Duration | 0.03 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
category
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.2 KiB |
| Blood or Bone marrow Acute myeloid leukemia | |
|---|---|
| Lung Adenocarcinoma | |
| Liver Hepatocellular carcinoma | |
| Skin Malignant melanoma | |
| Colon Healthy | |
| Other values (15) |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Lung Squamous cell carcinoma |
|---|---|
| 2nd row | Lung Squamous cell carcinoma |
| 3rd row | Lung Squamous cell carcinoma |
| 4th row | Lung Squamous cell carcinoma |
| 5th row | Lung Squamous cell carcinoma |
Common Values
| Value | Count | Frequency (%) |
| Blood or Bone marrow Acute myeloid leukemia | 316 | |
| Lung Adenocarcinoma | 305 | |
| Liver Hepatocellular carcinoma | 276 | |
| Skin Malignant melanoma | 233 | |
| Colon Healthy | 203 | |
| Brain Glioblastoma | 118 | 5.5% |
| Colon Adenoma | 112 | 5.3% |
| Uterus Endometrioid adenocarcinoma | 111 | 5.2% |
| Liver Healthy | 76 | 3.6% |
| Thyroid gland Papillary carcinoma | 68 | 3.2% |
| Colon Adenocarcinoma | 54 | 2.5% |
| Breast Infiltrating duct carcinoma | 53 | 2.5% |
| Thyroid gland Healthy | 48 | 2.3% |
| Stomach Carcinoma | 31 | 1.5% |
| Breast Healthy | 27 | 1.3% |
| Lung Squamous cell carcinoma | 25 | 1.2% |
| Prostate gland Adenocarcinoma | 22 | 1.0% |
| Lung Healthy | 19 | 0.9% |
| Head and Neck Squamous cell carcinoma | 16 | 0.8% |
| Prostate gland Healthy | 14 | 0.7% |
tissue_or_organ_of_origin
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.2 KiB |
| Colon | |
|---|---|
| Liver | |
| Lung | |
| Blood or Bone marrow | |
| Skin | |
| Other values (7) |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Lung |
|---|---|
| 2nd row | Lung |
| 3rd row | Lung |
| 4th row | Lung |
| 5th row | Lung |
Common Values
| Value | Count | Frequency (%) |
| Colon | 369 | |
| Liver | 352 | |
| Lung | 349 | |
| Blood or Bone marrow | 316 | |
| Skin | 233 | |
| Brain | 118 | 5.5% |
| Thyroid gland | 116 | 5.5% |
| Uterus | 111 | 5.2% |
| Breast | 80 | 3.8% |
| Prostate gland | 36 | 1.7% |
| Stomach | 31 | 1.5% |
| Head and Neck | 16 | 0.8% |
primary_diagnosis
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.2 KiB |
| Healthy | |
|---|---|
| Adenocarcinoma | |
| Acute myeloid leukemia | |
| Hepatocellular carcinoma | |
| Malignant melanoma | |
| Other values (7) |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Squamous cell carcinoma |
|---|---|
| 2nd row | Squamous cell carcinoma |
| 3rd row | Squamous cell carcinoma |
| 4th row | Squamous cell carcinoma |
| 5th row | Squamous cell carcinoma |
Common Values
| Value | Count | Frequency (%) |
| Healthy | 387 | |
| Adenocarcinoma | 381 | |
| Acute myeloid leukemia | 316 | |
| Hepatocellular carcinoma | 276 | |
| Malignant melanoma | 233 | |
| Glioblastoma | 118 | 5.5% |
| Adenoma | 112 | 5.3% |
| Endometrioid adenocarcinoma | 111 | 5.2% |
| Papillary carcinoma | 68 | 3.2% |
| Infiltrating duct carcinoma | 53 | 2.5% |
| Squamous cell carcinoma | 41 | 1.9% |
| Carcinoma | 31 | 1.5% |
GSE ID
Categorical
| Distinct | 26 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.2 KiB |
| GSE159907 | |
|---|---|
| GSE157341 | |
| GSE101764 | |
| GSE66836 | |
| GSE202097 | |
| Other values (21) |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | GSE124052 |
|---|---|
| 2nd row | GSE124052 |
| 3rd row | GSE124052 |
| 4th row | GSE124052 |
| 5th row | GSE124052 |
Common Values
| Value | Count | Frequency (%) |
| GSE159907 | 316 | |
| GSE157341 | 270 | |
| GSE101764 | 261 | |
| GSE66836 | 183 | |
| GSE202097 | 144 | |
| GSE256092 | 141 | |
| GSE193535 | 108 | 5.1% |
| GSE120878 | 89 | 4.2% |
| GSE86961 | 82 | 3.9% |
| GSE89852 | 74 | 3.5% |
| GSE136791 | 69 | 3.2% |
| GSE60274 | 68 | 3.2% |
| GSE66313 | 55 | 2.6% |
| GSE196490 | 50 | 2.4% |
| GSE121377 | 34 | 1.6% |
| GSE67116 | 33 | 1.6% |
| GSE116338 | 32 | 1.5% |
| GSE124052 | 30 | 1.4% |
| GSE85464 | 19 | 0.9% |
| GSE100503 | 13 | 0.6% |
| GSE164988 | 12 | 0.6% |
| GSE124367 | 12 | 0.6% |
| GSE95036 | 11 | 0.5% |
| GSE93589 | 9 | 0.4% |
| GSE199747 | 8 | 0.4% |
| GSE38240 | 4 | 0.2% |