Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 46265 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 3.4 MiB |
| Average record size in memory | 78.0 B |
Variable types
| Numeric | 2 |
|---|---|
| Boolean | 6 |
| DateTime | 1 |
| Categorical | 6 |
Alerts
| people_fully_vaccinated has constant value "0" | Constant |
| people_vaccinated has constant value "0" | Constant |
| positive_rate has constant value "0.0" | Constant |
| total_boosters has constant value "0" | Constant |
| total_vaccinations has constant value "0" | Constant |
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
| raw_conditions has a high cardinality: 21203 distinct values | High cardinality |
| condition__obesity is highly imbalanced (68.2%) | Imbalance |
| serial is uniformly distributed | Uniform |
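The 68.2% in the Imbalance alert is consistent with an entropy-based score: one minus the Shannon entropy of the class shares, normalised by the maximum entropy for that number of classes. The sketch below assumes that definition (the report itself does not state the formula) and uses the condition__obesity counts from this report; the function name `imbalance_score` is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0  # a single class is a "constant" column, not an imbalanced one
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts for condition__obesity from the report: False = 43598, True = 2667.
score = imbalance_score([43598, 2667])
print(f"{score:.1%}")
```

A score of 0 means perfectly balanced classes and a score of 1 means a single class holds every row.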
Reproduction
| Analysis started | 2023-05-04 15:10:46.655960 |
|---|---|
| Analysis finished | 2023-05-04 15:10:50.490403 |
| Duration | 3.83 seconds |
| Software version | ydata-profiling v4.1.2 |
| Download configuration | config.json |
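A report like this one could be regenerated along the following lines. This is a minimal sketch, not the original run: the CSV path, the report title, and the helper name `build_report` are assumptions, and the settings stored in config.json are not reproduced here. Column types may also be inferred differently depending on how the file is read.

```python
def build_report(csv_path, out_path="report.html"):
    """Profile a CSV file and write an HTML report to out_path.

    Imports are kept local so this module still loads where the optional
    dependencies (pandas, ydata-profiling) are not installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the date column so it is profiled as a date rather than as text.
    df = pd.read_csv(csv_path, parse_dates=["estimated_date"])
    profile = ProfileReport(df, title="Dataset profile")  # title is illustrative
    profile.to_file(out_path)
    return out_path
```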
serial
Real number (ℝ)
| Distinct | 46264 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23134.277 |
| Minimum | 1 |
|---|---|
| Maximum | 46266 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2315.2 |
| Q1 | 11568 |
| median | 23134 |
| Q3 | 34701 |
| 95-th percentile | 43953.8 |
| Maximum | 46266 |
| Range | 46265 |
| Interquartile range (IQR) | 23133 |
Descriptive statistics
| Standard deviation | 13356.137 |
|---|---|
| Coefficient of variation (CV) | 0.57733106 |
| Kurtosis | -1.1999921 |
| Mean | 23134.277 |
| Median Absolute Deviation (MAD) | 11567 |
| Skewness | 2.0580958 × 10⁻⁵ |
| Sum | 1.0703073 × 10⁹ |
| Variance | 1.7838639 × 10⁸ |
| Monotonicity | Not monotonic |
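The Uniform alert for serial can be cross-checked against what a discrete uniform distribution on 1..N predicts. The sketch below assumes serial behaves like a running index from 1 to the reported maximum (an assumption; the report only shows that it is nearly unique) and compares the textbook moments with the reported ones.

```python
import math

n_max = 46266        # Maximum of serial in the report
mean = 23134.277     # reported Mean
std = 13356.137      # reported Standard deviation

# Discrete uniform on 1..N:
#   mean = (N + 1) / 2
#   variance = (N**2 - 1) / 12
#   excess kurtosis = -1.2 * (N**2 + 1) / (N**2 - 1), which tends to -1.2
uniform_mean = (n_max + 1) / 2
uniform_std = math.sqrt((n_max**2 - 1) / 12)
uniform_kurtosis = -1.2 * (n_max**2 + 1) / (n_max**2 - 1)

cv = std / mean                        # coefficient of variation from the report
uniform_cv = uniform_std / uniform_mean

print(round(cv, 5), round(uniform_cv, 5), round(uniform_kurtosis, 5))
```

The reported kurtosis (about -1.2), near-zero skewness, and a coefficient of variation close to 1/√3 all agree with these values.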
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 46265 | 2 | < 0.1% |
| 46217 | 1 | < 0.1% |
| 30857 | 1 | < 0.1% |
| 30848 | 1 | < 0.1% |
| 30849 | 1 | < 0.1% |
| 30850 | 1 | < 0.1% |
| 30851 | 1 | < 0.1% |
| 30853 | 1 | < 0.1% |
| 30844 | 1 | < 0.1% |
| 30854 | 1 | < 0.1% |
| Other values (46254) | 46254 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 46266 | 1 | < 0.1% |
| 46265 | 2 | < 0.1% |
| 46264 | 1 | < 0.1% |
| 46263 | 1 | < 0.1% |
| 46262 | 1 | < 0.1% |
| 46261 | 1 | < 0.1% |
| 46260 | 1 | < 0.1% |
| 46259 | 1 | < 0.1% |
| 46258 | 1 | < 0.1% |
| 46257 | 1 | < 0.1% |
age
Real number (ℝ)
| Distinct | 95 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 73.892792 |
| Minimum | 0 |
|---|---|
| Maximum | 104 |
| Zeros | 5 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 51 |
| Q1 | 66 |
| median | 75 |
| Q3 | 83 |
| 95-th percentile | 91 |
| Maximum | 104 |
| Range | 104 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 12.625904 |
|---|---|
| Coefficient of variation (CV) | 0.17086788 |
| Kurtosis | 0.88115696 |
| Mean | 73.892792 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.77169218 |
| Sum | 3418650 |
| Variance | 159.41346 |
| Monotonicity | Not monotonic |
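Several of the age figures above are derived from the others, which makes for a quick internal consistency check. The sketch below uses only numbers printed in this report; the variable names are illustrative.

```python
n = 46265            # Number of observations
total = 3418650      # Sum of age
std = 12.625904      # Standard deviation
q1, q3 = 66, 83      # Q1 and Q3

mean = total / n         # should match the reported Mean (73.892792)
cv = std / mean          # should match the reported CV (0.17086788)
iqr = q3 - q1            # should match the reported IQR (17)
variance = std ** 2      # should match the reported Variance (159.41346)

print(mean, cv, iqr, variance)
```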
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 80 | 1551 | 3.4% |
| 79 | 1541 | 3.3% |
| 78 | 1528 | 3.3% |
| 81 | 1512 | 3.3% |
| 83 | 1475 | 3.2% |
| 77 | 1470 | 3.2% |
| 73 | 1447 | 3.1% |
| 82 | 1441 | 3.1% |
| 76 | 1428 | 3.1% |
| 84 | 1417 | 3.1% |
| Other values (85) | 31455 | 68.0% |
| Value | Count | Frequency (%) |
| 0 | 5 | < 0.1% |
| 1 | 2 | < 0.1% |
| 2 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 16 | 4 | < 0.1% |
| 17 | 2 | < 0.1% |
| 18 | 3 | < 0.1% |
| 19 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 104 | 2 | < 0.1% |
| 103 | 5 | < 0.1% |
| 102 | 6 | < 0.1% |
| 101 | 21 | < 0.1% |
| 100 | 23 | < 0.1% |
| 99 | 65 | 0.1% |
| 98 | 96 | 0.2% |
| 97 | 134 | 0.3% |
| 96 | 192 | 0.4% |
| 95 | 261 | 0.6% |
condition__blood_pressure
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.3 KiB |
| Value | Count | Frequency (%) |
| True | 30284 | 65.5% |
| False | 15981 | 34.5% |
condition__diabetes
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.3 KiB |
| Value | Count | Frequency (%) |
| False | 33001 | 71.3% |
| True | 13264 | 28.7% |
condition__heart
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.3 KiB |
| Value | Count | Frequency (%) |
| False | 31616 | 68.3% |
| True | 14649 | 31.7% |
condition__lungs
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.3 KiB |
| Value | Count | Frequency (%) |
| False | 40703 | 88.0% |
| True | 5562 | 12.0% |
condition__obesity
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.3 KiB |
| Value | Count | Frequency (%) |
| False | 43598 | 94.2% |
| True | 2667 | 5.8% |
estimated_date
Date
| Distinct | 611 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.6 KiB |
| Minimum | 2020-03-20 00:00:00 |
|---|---|
| Maximum | 2023-01-22 00:00:00 |
Histogram with fixed size bins (bins=50)
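The Minimum, Maximum, and Distinct values for estimated_date together indicate how densely the date range is covered. A small sketch using only the figures from this report (variable names are illustrative):

```python
from datetime import date

start = date(2020, 3, 20)   # Minimum
end = date(2023, 1, 22)     # Maximum
distinct_dates = 611        # Distinct

# Number of calendar days in the range, counting both endpoints.
span_days = (end - start).days + 1

# Share of calendar days in the range that appear at least once.
coverage = distinct_dates / span_days

print(span_days, f"{coverage:.1%}")
```

Under these figures, somewhat more than half of the calendar days in the range carry at least one record.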
is_male
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.3 KiB |
| Value | Count | Frequency (%) |
| True | 23487 | 50.8% |
| False | 22778 | 49.2% |
people_fully_vaccinated
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.6 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46265 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 46265 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46265 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46265 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46265 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 46265 |
people_vaccinated
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.6 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46265 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 46265 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46265 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46265 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46265 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 46265 |
positive_rate
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.6 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 138795 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 46265 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 46265 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 92530 | |
| . | 46265 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 92530 | |
| Other Punctuation | 46265 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 92530 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 46265 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 138795 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 92530 | |
| . | 46265 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 138795 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 92530 | |
| . | 46265 |
raw_conditions
Categorical
| Distinct | 21203 |
|---|---|
| Distinct (%) | 45.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.6 KiB |
| magasvérnyomás-betegség | 2544 |
|---|---|
| nem ismert alapbetegség | 1950 |
| magasvérnyomás-betegség, cukorbetegség | 1456 |
| magas vérnyomás | 850 |
| daganatos megbetegedés | 672 |
| Other values (21198) |
Length
| Max length | 434 |
|---|---|
| Median length | 193 |
| Mean length | 46.863071 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2168120 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 18742 |
|---|---|
| Unique (%) | 40.5% |
Sample
| 1st row | magasvérnyomás-betegség |
|---|---|
| 2nd row | agyi infraktus |
| 3rd row | cukorbetegség, hasi tályog |
| 4th row | tüdő rosszindulatú daganata, májbetegség |
| 5th row | tüdőfibrózis, cukorbetegség |
Common Values
| Value | Count | Frequency (%) |
| magasvérnyomás-betegség | 2544 | 5.5% |
| nem ismert alapbetegség | 1950 | 4.2% |
| magasvérnyomás-betegség, cukorbetegség | 1456 | 3.1% |
| magas vérnyomás | 850 | 1.8% |
| daganatos megbetegedés | 672 | 1.5% |
| magas vérnyomás, cukorbetegség | 589 | 1.3% |
| adat feltöltés alatt | 484 | 1.0% |
| cukorbetegség | 478 | 1.0% |
| magasvérnyomás-betegség, iszkémiás szívbetegség | 422 | 0.9% |
| cukorbetegség, magasvérnyomás-betegség | 418 | 0.9% |
| Other values (21193) | 36402 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| magasvérnyomás-betegség | 20768 | 12.2% |
| cukorbetegség | 13229 | 7.8% |
| magas | 9770 | 5.8% |
| szívbetegség | 9032 | 5.3% |
| vérnyomás | 9011 | 5.3% |
| iszkémiás | 5283 | 3.1% |
| veseelégtelenség | 4603 | 2.7% |
| krónikus | 4581 | 2.7% |
| tüdőbetegség | 4201 | 2.5% |
| megbetegedés | 3599 | 2.1% |
| Other values (3538) | 85664 |
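The word-level table above can be approximated by lower-casing each raw_conditions entry, splitting on whitespace, and stripping surrounding punctuation. This is a sketch of that idea under those assumptions, not necessarily the profiler's exact tokenisation; it runs on the five sample rows shown in this report, and `word_counts` is an illustrative name.

```python
from collections import Counter
import string

# The five sample rows of raw_conditions shown in the report.
sample_rows = [
    "magasvérnyomás-betegség",
    "agyi infraktus",
    "cukorbetegség, hasi tályog",
    "tüdő rosszindulatú daganata, májbetegség",
    "tüdőfibrózis, cukorbetegség",
]


def word_counts(rows):
    """Count whitespace-separated words, ignoring leading/trailing punctuation."""
    counts = Counter()
    for row in rows:
        for token in row.lower().split():
            word = token.strip(string.punctuation)  # inner hyphens are kept
            if word:
                counts[word] += 1
    return counts


counts = word_counts(sample_rows)
print(counts.most_common(3))
```

Because only surrounding punctuation is stripped, a hyphenated term such as magasvérnyomás-betegség stays a single word, as it does in the table.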
Most occurring characters
| Value | Count | Frequency (%) |
| s | 230510 | 10.6% |
| e | 214040 | 9.9% |
| g | 189067 | 8.7% |
| é | 138215 | 6.4% |
| (space) | 123457 | 5.7% |
| a | 122791 | 5.7% |
| t | 111853 | 5.2% |
| m | 96965 | 4.5% |
| r | 95320 | 4.4% |
| n | 72309 | 3.3% |
| Other values (46) | 773593 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1949662 | |
| Space Separator | 123477 | 5.7% |
| Other Punctuation | 69221 | 3.2% |
| Dash Punctuation | 25561 | 1.2% |
| Decimal Number | 82 | < 0.1% |
| Close Punctuation | 57 | < 0.1% |
| Open Punctuation | 57 | < 0.1% |
| Control | 2 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 230510 | 11.8% |
| e | 214040 | 11.0% |
| g | 189067 | 9.7% |
| é | 138215 | 7.1% |
| a | 122791 | 6.3% |
| t | 111853 | 5.7% |
| m | 96965 | 5.0% |
| r | 95320 | 4.9% |
| n | 72309 | 3.7% |
| o | 67590 | 3.5% |
| Other values (25) | 611002 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 63 | |
| 1 | 10 | 12.2% |
| 0 | 4 | 4.9% |
| 3 | 2 | 2.4% |
| 4 | 1 | 1.2% |
| 5 | 1 | 1.2% |
| 9 | 1 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 69014 | |
| . | 169 | 0.2% |
| ? | 18 | < 0.1% |
| ; | 16 | < 0.1% |
| / | 2 | < 0.1% |
| : | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 123457 | |
| (other space separator) | 20 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 56 | |
| „ | 1 | 1.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25561 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 57 |
Control
| Value | Count | Frequency (%) |
| (control character) | 2 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1949662 | |
| Common | 218458 | 10.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 230510 | 11.8% |
| e | 214040 | 11.0% |
| g | 189067 | 9.7% |
| é | 138215 | 7.1% |
| a | 122791 | 6.3% |
| t | 111853 | 5.7% |
| m | 96965 | 5.0% |
| r | 95320 | 4.9% |
| n | 72309 | 3.7% |
| o | 67590 | 3.5% |
| Other values (25) | 611002 |
Common
| Value | Count | Frequency (%) |
| (space) | 123457 | 56.5% |
| , | 69014 | 31.6% |
| - | 25561 | 11.7% |
| . | 169 | 0.1% |
| 2 | 63 | < 0.1% |
| ) | 57 | < 0.1% |
| ( | 56 | < 0.1% |
| (other space separator) | 20 | < 0.1% |
| ? | 18 | < 0.1% |
| ; | 16 | < 0.1% |
| Other values (11) | 27 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1916060 | |
| None | 252058 | 11.6% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 230510 | 12.0% |
| e | 214040 | 11.2% |
| g | 189067 | 9.9% |
| (space) | 123457 | 6.4% |
| a | 122791 | 6.4% |
| t | 111853 | 5.8% |
| m | 96965 | 5.1% |
| r | 95320 | 5.0% |
| n | 72309 | 3.8% |
| , | 69014 | 3.6% |
| Other values (32) | 590734 |
None
| Value | Count | Frequency (%) |
| é | 138215 | |
| á | 54087 | 21.5% |
| í | 22451 | 8.9% |
| ó | 12450 | 4.9% |
| ü | 8744 | 3.5% |
| ő | 7805 | 3.1% |
| ö | 3371 | 1.3% |
| û | 2629 | 1.0% |
| ú | 1987 | 0.8% |
| ű | 297 | 0.1% |
| Other values (2) | 22 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| „ | 1 | |
| ” | 1 |
total_boosters
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.6 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46265 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 46265 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46265 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46265 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46265 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 46265 |
total_vaccinations
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.6 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46265 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 46265 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46265 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46265 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 46265 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46265 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 46265 |
| | serial | age | condition__blood_pressure | condition__diabetes | condition__heart | condition__lungs | condition__obesity | is_male |
|---|---|---|---|---|---|---|---|---|
| serial | 1.000 | -0.032 | 0.049 | 0.034 | 0.068 | 0.058 | 0.094 | 0.031 |
| age | -0.032 | 1.000 | 0.182 | 0.114 | 0.200 | 0.075 | 0.211 | 0.226 |
| condition__blood_pressure | 0.049 | 0.182 | 1.000 | 0.196 | 0.088 | 0.026 | 0.053 | 0.070 |
| condition__diabetes | 0.034 | 0.114 | 0.196 | 1.000 | 0.020 | 0.038 | 0.085 | 0.000 |
| condition__heart | 0.068 | 0.200 | 0.088 | 0.020 | 1.000 | 0.026 | 0.029 | 0.023 |
| condition__lungs | 0.058 | 0.075 | 0.026 | 0.038 | 0.026 | 1.000 | 0.005 | 0.027 |
| condition__obesity | 0.094 | 0.211 | 0.053 | 0.085 | 0.029 | 0.005 | 1.000 | 0.009 |
| is_male | 0.031 | 0.226 | 0.070 | 0.000 | 0.023 | 0.027 | 0.009 | 1.000 |
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
| | serial | age | condition__blood_pressure | condition__diabetes | condition__heart | condition__lungs | condition__obesity | estimated_date | is_male | people_fully_vaccinated | people_vaccinated | positive_rate | raw_conditions | total_boosters | total_vaccinations |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 46217 | 90 | True | False | False | False | False | 2023-01-22 | True | 0 | 0 | 0.0 | magasvérnyomás-betegség | 0 | 0 |
| 1 | 46218 | 87 | False | False | False | False | False | 2023-01-22 | False | 0 | 0 | 0.0 | agyi infraktus | 0 | 0 |
| 2 | 46219 | 62 | False | True | False | False | False | 2023-01-22 | False | 0 | 0 | 0.0 | cukorbetegség, hasi tályog | 0 | 0 |
| 3 | 46220 | 68 | False | False | False | True | False | 2023-01-22 | True | 0 | 0 | 0.0 | tüdő rosszindulatú daganata, májbetegség | 0 | 0 |
| 4 | 46221 | 64 | False | True | False | True | False | 2023-01-22 | True | 0 | 0 | 0.0 | tüdőfibrózis, cukorbetegség | 0 | 0 |
| 5 | 46222 | 82 | True | False | False | False | False | 2023-01-22 | False | 0 | 0 | 0.0 | magasvérnyomás-betegség | 0 | 0 |
| 6 | 46223 | 79 | True | False | False | False | False | 2023-01-22 | True | 0 | 0 | 0.0 | magasvérnyomás-betegség | 0 | 0 |
| 7 | 46224 | 63 | True | False | False | False | False | 2023-01-22 | True | 0 | 0 | 0.0 | daganatos-megbetegedés, érelmeszesedés, magasvérnyomás-betegség | 0 | 0 |
| 8 | 46225 | 71 | True | False | False | True | False | 2023-01-22 | True | 0 | 0 | 0.0 | érelmeszesedés, magasvérnyomás-betegség, tüdő tumor, krónikus obstruktív tüdőbetegség | 0 | 0 |
| 9 | 46226 | 75 | True | True | True | False | False | 2023-01-22 | True | 0 | 0 | 0.0 | magasvérnyomás-betegség, cukorbetegség, pajzsmirigy-alulmûködés, szívizomelhalás | 0 | 0 |
| | serial | age | condition__blood_pressure | condition__diabetes | condition__heart | condition__lungs | condition__obesity | estimated_date | is_male | people_fully_vaccinated | people_vaccinated | positive_rate | raw_conditions | total_boosters | total_vaccinations |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 46255 | 46207 | 78 | True | False | True | False | False | 2023-01-22 | False | 0 | 0 | 0.0 | iszkémiás szívbetegség, magasvérnyomás-betegség | 0 | 0 |
| 46256 | 46208 | 88 | True | False | False | False | False | 2023-01-22 | False | 0 | 0 | 0.0 | magasvérnyomás-betegség | 0 | 0 |
| 46257 | 46209 | 72 | True | False | True | False | False | 2023-01-22 | True | 0 | 0 | 0.0 | iszkémiás szívbetegség, magasvérnyomás-betegség | 0 | 0 |
| 46258 | 46210 | 75 | True | False | True | False | False | 2023-01-22 | True | 0 | 0 | 0.0 | iszkémiás szívbetegség, magasvérnyomás betegség | 0 | 0 |
| 46259 | 46211 | 81 | True | True | True | False | False | 2023-01-22 | False | 0 | 0 | 0.0 | cukorbetegség, magasvérnyomás-betegség, stroke, szívelégtelenség, vérszegénység, krónikus veseelégtelenség | 0 | 0 |
| 46260 | 46212 | 70 | True | False | True | True | False | 2023-01-22 | False | 0 | 0 | 0.0 | szívelégtelenség, iszkémiás szívbetegség, magasvérnyomás-betegség, demencia, krónikus obstruktív tüdőbetegség | 0 | 0 |
| 46261 | 46213 | 75 | False | False | False | False | False | 2023-01-22 | True | 0 | 0 | 0.0 | stroke, súlyos érszûkület | 0 | 0 |
| 46262 | 46214 | 66 | True | False | False | False | False | 2023-01-22 | True | 0 | 0 | 0.0 | magasvérnyomás-betegség, daganatos megbetegedés | 0 | 0 |
| 46263 | 46215 | 72 | False | False | True | False | False | 2023-01-22 | False | 0 | 0 | 0.0 | szívinfarktus, vérszegénység, leukémia | 0 | 0 |
| 46264 | 46216 | 88 | True | False | False | False | False | 2023-01-22 | False | 0 | 0 | 0.0 | pajzsmirigy-betegség, magasvérnyomás-betegség, csontritkulás, veseelégtelenség | 0 | 0 |
Most frequently occurring
| | serial | age | condition__blood_pressure | condition__diabetes | condition__heart | condition__lungs | condition__obesity | estimated_date | is_male | people_fully_vaccinated | people_vaccinated | positive_rate | raw_conditions | total_boosters | total_vaccinations | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 46265 | 86 | False | False | False | False | False | 2023-01-22 | False | 0 | 0 | 0.0 | asztma, vastagbélgyulladás | 0 | 0 | 2 |
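Duplicate rows like the one above can be found by counting identical records. The sketch below uses a `Counter` over tuples and only three of the fifteen columns (serial, age, raw_conditions) for brevity, with values taken from rows shown in this report; with pandas, `DataFrame.duplicated` offers the same check on a full frame.

```python
from collections import Counter

# (serial, age, raw_conditions) for a few rows; the last record appears twice.
rows = [
    (46217, 90, "magasvérnyomás-betegség"),
    (46218, 87, "agyi infraktus"),
    (46265, 86, "asztma, vastagbélgyulladás"),
    (46265, 86, "asztma, vastagbélgyulladás"),
]

# Keep only records that occur more than once, with their occurrence count.
duplicates = {row: n for row, n in Counter(rows).items() if n > 1}

for row, n in duplicates.items():
    print(row, "occurs", n, "times")
```

One distinct record occurring twice corresponds to the single duplicate row reported in the dataset statistics.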