Dataset statistics
Number of variables | 15 |
---|---|
Number of observations | 69586 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 10857 |
Duplicate rows (%) | 15.6% |
Total size in memory | 8.0 MiB |
Average record size in memory | 120.0 B |
Variable types
Categorical | 14 |
---|---|
DateTime | 1 |
Alerts
project_name has constant value "" | Constant |
Dataset has 10857 (15.6%) duplicate rows | Duplicates |
country_code has a high cardinality: 75 distinct values | High cardinality |
installer_version has a high cardinality: 103 distinct values | High cardinality |
python_implementation_version has a high cardinality: 65 distinct values | High cardinality |
setuptools_version has a high cardinality: 125 distinct values | High cardinality |
sys_distro_version has a high cardinality: 84 distinct values | High cardinality |
country_code is highly overall correlated with python_implementation_name and 1 other fields | High correlation |
cpu is highly overall correlated with distribution_type and 6 other fields | High correlation |
distribution_type is highly overall correlated with cpu and 7 other fields | High correlation |
installer_name is highly overall correlated with distribution_type and 2 other fields | High correlation |
openssl_version is highly overall correlated with cpu and 6 other fields | High correlation |
package_version is highly overall correlated with openssl_version and 1 other fields | High correlation |
python_implementation_name is highly overall correlated with country_code and 8 other fields | High correlation |
python_implementation_version is highly overall correlated with cpu and 5 other fields | High correlation |
sys_distro_name is highly overall correlated with cpu and 4 other fields | High correlation |
sys_distro_version is highly overall correlated with cpu and 5 other fields | High correlation |
sys_name is highly overall correlated with country_code and 8 other fields | High correlation |
country_code is highly imbalanced (84.8%) | Imbalance |
cpu is highly imbalanced (82.1%) | Imbalance |
distribution_type is highly imbalanced (77.0%) | Imbalance |
installer_name is highly imbalanced (84.8%) | Imbalance |
installer_version is highly imbalanced (61.6%) | Imbalance |
openssl_version is highly imbalanced (55.3%) | Imbalance |
python_implementation_name is highly imbalanced (61.3%) | Imbalance |
python_implementation_version is highly imbalanced (56.7%) | Imbalance |
setuptools_version is highly imbalanced (61.6%) | Imbalance |
sys_distro_name is highly imbalanced (85.6%) | Imbalance |
sys_distro_version is highly imbalanced (70.5%) | Imbalance |
sys_name is highly imbalanced (75.1%) | Imbalance |
Reproduction
Analysis started | 2023-05-04 15:12:19.886254 |
---|---|
Analysis finished | 2023-05-04 15:12:23.498501 |
Duration | 3.61 seconds |
Software version | ydata-profiling vv4.1.2 |
Download configuration | config.json |
country_code
Categorical
HIGH CARDINALITY
HIGH CORRELATION
IMBALANCE
Distinct | 75 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
US | |
---|---|
NL | 1023 |
CN | 821 |
None | 537 |
JP | 477 |
Other values (70) | 4329 |
Common Values
Value | Count | Frequency (%) |
US | 62399 | |
NL | 1023 | 1.5% |
CN | 821 | 1.2% |
None | 537 | 0.8% |
JP | 477 | 0.7% |
TW | 394 | 0.6% |
RU | 387 | 0.6% |
FR | 354 | 0.5% |
GB | 343 | 0.5% |
DE | 290 | 0.4% |
Other values (65) | 2561 | 3.7% |
Length
Value | Count | Frequency (%) |
us | 62399 | |
nl | 1023 | 1.5% |
cn | 821 | 1.2% |
none | 537 | 0.8% |
jp | 477 | 0.7% |
tw | 394 | 0.6% |
ru | 387 | 0.6% |
fr | 354 | 0.5% |
gb | 343 | 0.5% |
de | 290 | 0.4% |
Other values (65) | 2561 | 3.7% |
Most occurring characters
Value | Count | Frequency (%) |
S | 62927 | |
U | 62890 | |
N | 2581 | 1.8% |
R | 1174 | 0.8% |
L | 1153 | 0.8% |
C | 1110 | 0.8% |
E | 748 | 0.5% |
T | 647 | 0.5% |
I | 609 | 0.4% |
K | 599 | 0.4% |
Other values (20) | 5808 | 4.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 138633 | |
Lowercase Letter | 1611 | 1.1% |
Decimal Number | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
S | 62927 | |
U | 62890 | |
N | 2581 | 1.9% |
R | 1174 | 0.8% |
L | 1153 | 0.8% |
C | 1110 | 0.8% |
E | 748 | 0.5% |
T | 647 | 0.5% |
I | 609 | 0.4% |
K | 599 | 0.4% |
Other values (16) | 4195 | 3.0% |
Lowercase Letter
Value | Count | Frequency (%) |
o | 537 | |
n | 537 | |
e | 537 |
Decimal Number
Value | Count | Frequency (%) |
1 | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 140244 | |
Common | 2 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
S | 62927 | |
U | 62890 | |
N | 2581 | 1.8% |
R | 1174 | 0.8% |
L | 1153 | 0.8% |
C | 1110 | 0.8% |
E | 748 | 0.5% |
T | 647 | 0.5% |
I | 609 | 0.4% |
K | 599 | 0.4% |
Other values (19) | 5806 | 4.1% |
Common
Value | Count | Frequency (%) |
1 | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 140246 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
S | 62927 | |
U | 62890 | |
N | 2581 | 1.8% |
R | 1174 | 0.8% |
L | 1153 | 0.8% |
C | 1110 | 0.8% |
E | 748 | 0.5% |
T | 647 | 0.5% |
I | 609 | 0.4% |
K | 599 | 0.4% |
Other values (20) | 5808 | 4.1% |
cpu
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
x86_64 | |
---|---|
None | 5266 |
AMD64 | 527 |
arm64 | 47 |
aarch64 | 13 |
Common Values
Value | Count | Frequency (%) |
x86_64 | 63728 | |
None | 5266 | 7.6% |
AMD64 | 527 | 0.8% |
arm64 | 47 | 0.1% |
aarch64 | 13 | < 0.1% |
armv7l | 5 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
x86_64 | 63728 | |
none | 5266 | 7.6% |
amd64 | 527 | 0.8% |
arm64 | 47 | 0.1% |
aarch64 | 13 | < 0.1% |
armv7l | 5 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
6 | 128043 | |
4 | 64315 | |
x | 63728 | |
8 | 63728 | |
_ | 63728 | |
N | 5266 | 1.3% |
o | 5266 | 1.3% |
n | 5266 | 1.3% |
e | 5266 | 1.3% |
D | 527 | 0.1% |
Other values (10) | 1290 | 0.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 256091 | |
Lowercase Letter | 79757 | 19.6% |
Connector Punctuation | 63728 | 15.7% |
Uppercase Letter | 6847 | 1.7% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
x | 63728 | |
o | 5266 | 6.6% |
n | 5266 | 6.6% |
e | 5266 | 6.6% |
a | 78 | 0.1% |
r | 65 | 0.1% |
m | 52 | 0.1% |
c | 13 | < 0.1% |
h | 13 | < 0.1% |
v | 5 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
6 | 128043 | |
4 | 64315 | |
8 | 63728 | |
7 | 5 | < 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
N | 5266 | |
D | 527 | 7.7% |
M | 527 | 7.7% |
A | 527 | 7.7% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 63728 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 319819 | |
Latin | 86604 | 21.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
x | 63728 | |
N | 5266 | 6.1% |
o | 5266 | 6.1% |
n | 5266 | 6.1% |
e | 5266 | 6.1% |
D | 527 | 0.6% |
M | 527 | 0.6% |
A | 527 | 0.6% |
a | 78 | 0.1% |
r | 65 | 0.1% |
Other values (5) | 88 | 0.1% |
Common
Value | Count | Frequency (%) |
6 | 128043 | |
4 | 64315 | |
8 | 63728 | |
_ | 63728 | |
7 | 5 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 406423 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
6 | 128043 | |
4 | 64315 | |
x | 63728 | |
8 | 63728 | |
_ | 63728 | |
N | 5266 | 1.3% |
o | 5266 | 1.3% |
n | 5266 | 1.3% |
e | 5266 | 1.3% |
D | 527 | 0.1% |
Other values (10) | 1290 | 0.3% |
distribution_type
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
bdist_wheel | |
---|---|
sdist | 2592 |
Common Values
Value | Count | Frequency (%) |
bdist_wheel | 66994 | |
sdist | 2592 | 3.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
bdist_wheel | 66994 | |
sdist | 2592 | 3.7% |
Most occurring characters
Value | Count | Frequency (%) |
e | 133988 | |
s | 72178 | |
d | 69586 | |
i | 69586 | |
t | 69586 | |
b | 66994 | |
_ | 66994 | |
w | 66994 | |
h | 66994 | |
l | 66994 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 682900 | |
Connector Punctuation | 66994 | 8.9% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 133988 | |
s | 72178 | |
d | 69586 | |
i | 69586 | |
t | 69586 | |
b | 66994 | |
w | 66994 | |
h | 66994 | |
l | 66994 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 66994 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 682900 | |
Common | 66994 | 8.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 133988 | |
s | 72178 | |
d | 69586 | |
i | 69586 | |
t | 69586 | |
b | 66994 | |
w | 66994 | |
h | 66994 | |
l | 66994 |
Common
Value | Count | Frequency (%) |
_ | 66994 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 749894 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 133988 | |
s | 72178 | |
d | 69586 | |
i | 69586 | |
t | 69586 | |
b | 66994 | |
_ | 66994 | |
w | 66994 | |
h | 66994 | |
l | 66994 |
installer_name
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 9 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
pip | |
---|---|
bandersnatch | 4067 |
Browser | 457 |
requests | 399 |
None | 213 |
Other values (4) | 130 |
Common Values
Value | Count | Frequency (%) |
pip | 64320 | |
bandersnatch | 4067 | 5.8% |
Browser | 457 | 0.7% |
requests | 399 | 0.6% |
None | 213 | 0.3% |
conda | 54 | 0.1% |
setuptools | 36 | 0.1% |
Nexus | 33 | < 0.1% |
Artifactory | 7 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
pip | 64320 | |
bandersnatch | 4067 | 5.8% |
browser | 457 | 0.7% |
requests | 399 | 0.6% |
none | 213 | 0.3% |
conda | 54 | 0.1% |
setuptools | 36 | 0.1% |
nexus | 33 | < 0.1% |
artifactory | 7 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
p | 128676 | |
i | 64327 | |
n | 8401 | 3.4% |
a | 8195 | 3.3% |
e | 5604 | 2.2% |
s | 5427 | 2.2% |
r | 5394 | 2.2% |
t | 4552 | 1.8% |
c | 4128 | 1.7% |
d | 4121 | 1.6% |
Other values (13) | 11054 | 4.4% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 249169 | |
Uppercase Letter | 710 | 0.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
p | 128676 | |
i | 64327 | |
n | 8401 | 3.4% |
a | 8195 | 3.3% |
e | 5604 | 2.2% |
s | 5427 | 2.2% |
r | 5394 | 2.2% |
t | 4552 | 1.8% |
c | 4128 | 1.7% |
d | 4121 | 1.7% |
Other values (10) | 10344 | 4.2% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 457 | |
N | 246 | |
A | 7 | 1.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 249879 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
p | 128676 | |
i | 64327 | |
n | 8401 | 3.4% |
a | 8195 | 3.3% |
e | 5604 | 2.2% |
s | 5427 | 2.2% |
r | 5394 | 2.2% |
t | 4552 | 1.8% |
c | 4128 | 1.7% |
d | 4121 | 1.6% |
Other values (13) | 11054 | 4.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 249879 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
p | 128676 | |
i | 64327 | |
n | 8401 | 3.4% |
a | 8195 | 3.3% |
e | 5604 | 2.2% |
s | 5427 | 2.2% |
r | 5394 | 2.2% |
t | 4552 | 1.8% |
c | 4128 | 1.7% |
d | 4121 | 1.6% |
Other values (13) | 11054 | 4.4% |
installer_version
Categorical
HIGH CARDINALITY
IMBALANCE
Distinct | 103 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
20.0.2 | |
---|---|
19.0.3 | |
21.0.1 | 3036 |
21.1.3 | 1584 |
19.3.1 | 1189 |
Other values (98) |
Common Values
Value | Count | Frequency (%) |
20.0.2 | 30678 | |
19.0.3 | 23095 | |
21.0.1 | 3036 | 4.4% |
21.1.3 | 1584 | 2.3% |
19.3.1 | 1189 | 1.7% |
4.4.0 | 971 | 1.4% |
21.2.4 | 937 | 1.3% |
20.1.1 | 894 | 1.3% |
5.0.0 | 837 | 1.2% |
21.3.1 | 683 | 1.0% |
Other values (93) | 5682 | 8.2% |
Length
Value | Count | Frequency (%) |
20.0.2 | 30678 | |
19.0.3 | 23095 | |
21.0.1 | 3036 | 4.4% |
21.1.3 | 1584 | 2.3% |
19.3.1 | 1189 | 1.7% |
4.4.0 | 971 | 1.4% |
21.2.4 | 937 | 1.3% |
20.1.1 | 894 | 1.3% |
5.0.0 | 837 | 1.2% |
21.3.1 | 683 | 1.0% |
Other values (93) | 5682 | 8.2% |
Most occurring characters
Value | Count | Frequency (%) |
. | 137678 | |
0 | 94165 | |
2 | 74188 | |
1 | 42722 | 10.4% |
3 | 28913 | 7.0% |
9 | 24604 | 6.0% |
4 | 4200 | 1.0% |
5 | 1844 | 0.4% |
e | 676 | 0.2% |
o | 672 | 0.2% |
Other values (11) | 2038 | 0.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 271283 | |
Other Punctuation | 137678 | |
Lowercase Letter | 2036 | 0.5% |
Uppercase Letter | 670 | 0.2% |
Dash Punctuation | 33 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 94165 | |
2 | 74188 | |
1 | 42722 | |
3 | 28913 | 10.7% |
9 | 24604 | 9.1% |
4 | 4200 | 1.5% |
5 | 1844 | 0.7% |
6 | 520 | 0.2% |
7 | 79 | < 0.1% |
8 | 48 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 676 | |
o | 672 | |
n | 670 | |
d | 6 | 0.3% |
v | 6 | 0.3% |
p | 2 | 0.1% |
s | 2 | 0.1% |
t | 2 | 0.1% |
Other Punctuation
Value | Count | Frequency (%) |
. | 137678 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 670 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 33 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 408994 | |
Latin | 2706 | 0.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 137678 | |
0 | 94165 | |
2 | 74188 | |
1 | 42722 | 10.4% |
3 | 28913 | 7.1% |
9 | 24604 | 6.0% |
4 | 4200 | 1.0% |
5 | 1844 | 0.5% |
6 | 520 | 0.1% |
7 | 79 | < 0.1% |
Other values (2) | 81 | < 0.1% |
Latin
Value | Count | Frequency (%) |
e | 676 | |
o | 672 | |
n | 670 | |
N | 670 | |
d | 6 | 0.2% |
v | 6 | 0.2% |
p | 2 | 0.1% |
s | 2 | 0.1% |
t | 2 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 411700 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 137678 | |
0 | 94165 | |
2 | 74188 | |
1 | 42722 | 10.4% |
3 | 28913 | 7.0% |
9 | 24604 | 6.0% |
4 | 4200 | 1.0% |
5 | 1844 | 0.4% |
e | 676 | 0.2% |
o | 672 | 0.2% |
Other values (11) | 2038 | 0.5% |
openssl_version
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 39 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
OpenSSL 1.1.1 11 Sep 2018 | |
---|---|
OpenSSL 1.0.2g 1 Mar 2016 | |
None | |
OpenSSL 1.1.1g 21 Apr 2020 | |
OpenSSL 1.1.1f 31 Mar 2020 | |
Other values (34) |
Length
Max length | 32 |
---|---|
Median length | 26 |
Mean length | 24.545814 |
Min length | 4 |
Characters and Unicode
Total characters | 1708045 |
---|---|
Distinct characters | 46 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 7 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | None |
---|---|
2nd row | None |
3rd row | OpenSSL 1.1.1l 24 Aug 2021 |
4th row | OpenSSL 1.1.1l 24 Aug 2021 |
5th row | OpenSSL 1.1.1l 24 Aug 2021 |
Common Values
Value | Count | Frequency (%) |
OpenSSL 1.1.1 11 Sep 2018 | 27793 | |
OpenSSL 1.0.2g 1 Mar 2016 | 22393 | |
None | 5266 | 7.6% |
OpenSSL 1.1.1g 21 Apr 2020 | 5071 | 7.3% |
OpenSSL 1.1.1f 31 Mar 2020 | 3779 | 5.4% |
OpenSSL 1.1.1k 25 Mar 2021 | 1335 | 1.9% |
OpenSSL 1.1.1l 24 Aug 2021 | 1075 | 1.5% |
OpenSSL 1.1.1d 10 Sep 2019 | 808 | 1.2% |
OpenSSL 1.1.1b 26 Feb 2019 | 691 | 1.0% |
OpenSSL 1.0.2k-fips 26 Jan 2017 | 257 | 0.4% |
Other values (29) | 1118 | 1.6% |
Length
Value | Count | Frequency (%) |
openssl | 64281 | |
sep | 28844 | |
2018 | 27855 | |
1.1.1 | 27793 | |
11 | 27793 | |
mar | 27773 | |
2016 | 22395 | 6.9% |
1.0.2g | 22393 | 6.9% |
1 | 22393 | 6.9% |
2020 | 9204 | 2.8% |
Other values (65) | 46027 |
Most occurring characters
Value | Count | Frequency (%) |
321431 | ||
1 | 290163 | |
S | 157472 | |
. | 128640 | |
2 | 108430 | 6.3% |
e | 99578 | 5.8% |
p | 98456 | 5.8% |
0 | 97221 | 5.7% |
n | 70130 | 4.1% |
L | 64359 | 3.8% |
Other values (36) | 272165 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 556022 | |
Lowercase Letter | 374874 | |
Uppercase Letter | 326821 | |
Space Separator | 321431 | |
Other Punctuation | 128640 | 7.5% |
Dash Punctuation | 257 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 99578 | |
p | 98456 | |
n | 70130 | |
r | 32885 | 8.8% |
g | 28551 | 7.6% |
a | 28100 | 7.5% |
o | 5307 | 1.4% |
f | 4068 | 1.1% |
k | 1602 | 0.4% |
b | 1584 | 0.4% |
Other values (12) | 4613 | 1.2% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 157472 | |
L | 64359 | |
O | 64281 | |
M | 27830 | 8.5% |
A | 6157 | 1.9% |
N | 5302 | 1.6% |
F | 856 | 0.3% |
D | 294 | 0.1% |
J | 266 | 0.1% |
I | 2 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 290163 | |
2 | 108430 | 19.5% |
0 | 97221 | 17.5% |
8 | 28060 | 5.0% |
6 | 23516 | 4.2% |
3 | 3834 | 0.7% |
5 | 1615 | 0.3% |
9 | 1544 | 0.3% |
4 | 1164 | 0.2% |
7 | 475 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
321431 |
Other Punctuation
Value | Count | Frequency (%) |
. | 128640 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 257 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1006350 | |
Latin | 701695 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
S | 157472 | |
e | 99578 | |
p | 98456 | |
n | 70130 | |
L | 64359 | |
O | 64281 | |
r | 32885 | 4.7% |
g | 28551 | 4.1% |
a | 28100 | 4.0% |
M | 27830 | 4.0% |
Other values (23) | 30053 | 4.3% |
Common
Value | Count | Frequency (%) |
321431 | ||
1 | 290163 | |
. | 128640 | |
2 | 108430 | 10.8% |
0 | 97221 | 9.7% |
8 | 28060 | 2.8% |
6 | 23516 | 2.3% |
3 | 3834 | 0.4% |
5 | 1615 | 0.2% |
9 | 1544 | 0.2% |
Other values (3) | 1896 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1708045 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
321431 | ||
1 | 290163 | |
S | 157472 | |
. | 128640 | |
2 | 108430 | 6.3% |
e | 99578 | 5.8% |
p | 98456 | 5.8% |
0 | 97221 | 5.7% |
n | 70130 | 4.1% |
L | 64359 | 3.8% |
Other values (36) | 272165 |
package_version
Categorical
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
1.2.2 | |
---|---|
1.2.3 | |
1.1.2 | |
1.2.1 | 1777 |
Common Values
Value | Count | Frequency (%) |
1.2.2 | 47174 | |
1.2.3 | 15050 | 21.6% |
1.1.2 | 5585 | 8.0% |
1.2.1 | 1777 | 2.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1.2.2 | 47174 | |
1.2.3 | 15050 | 21.6% |
1.1.2 | 5585 | 8.0% |
1.2.1 | 1777 | 2.6% |
Most occurring characters
Value | Count | Frequency (%) |
. | 139172 | |
2 | 116760 | |
1 | 76948 | |
3 | 15050 | 4.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 208758 | |
Other Punctuation | 139172 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 116760 | |
1 | 76948 | |
3 | 15050 | 7.2% |
Other Punctuation
Value | Count | Frequency (%) |
. | 139172 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 347930 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 139172 | |
2 | 116760 | |
1 | 76948 | |
3 | 15050 | 4.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 347930 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 139172 | |
2 | 116760 | |
1 | 76948 | |
3 | 15050 | 4.3% |
project_name
Categorical
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
scikit-mobility |
---|
Length
Max length | 15 |
---|---|
Median length | 15 |
Mean length | 15 |
Min length | 15 |
Characters and Unicode
Total characters | 1043790 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | scikit-mobility |
---|---|
2nd row | scikit-mobility |
3rd row | scikit-mobility |
4th row | scikit-mobility |
5th row | scikit-mobility |
Common Values
Value | Count | Frequency (%) |
scikit-mobility | 69586 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
scikit-mobility | 69586 |
Most occurring characters
Value | Count | Frequency (%) |
i | 278344 | |
t | 139172 | |
s | 69586 | 6.7% |
c | 69586 | 6.7% |
k | 69586 | 6.7% |
- | 69586 | 6.7% |
m | 69586 | 6.7% |
o | 69586 | 6.7% |
b | 69586 | 6.7% |
l | 69586 | 6.7% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 974204 | |
Dash Punctuation | 69586 | 6.7% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
i | 278344 | |
t | 139172 | |
s | 69586 | 7.1% |
c | 69586 | 7.1% |
k | 69586 | 7.1% |
m | 69586 | 7.1% |
o | 69586 | 7.1% |
b | 69586 | 7.1% |
l | 69586 | 7.1% |
y | 69586 | 7.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 69586 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 974204 | |
Common | 69586 | 6.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
i | 278344 | |
t | 139172 | |
s | 69586 | 7.1% |
c | 69586 | 7.1% |
k | 69586 | 7.1% |
m | 69586 | 7.1% |
o | 69586 | 7.1% |
b | 69586 | 7.1% |
l | 69586 | 7.1% |
y | 69586 | 7.1% |
Common
Value | Count | Frequency (%) |
- | 69586 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1043790 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
i | 278344 | |
t | 139172 | |
s | 69586 | 6.7% |
c | 69586 | 6.7% |
k | 69586 | 6.7% |
- | 69586 | 6.7% |
m | 69586 | 6.7% |
o | 69586 | 6.7% |
b | 69586 | 6.7% |
l | 69586 | 6.7% |
python_implementation_name
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
CPython | |
---|---|
None | 5266 |
Common Values
Value | Count | Frequency (%) |
CPython | 64320 | |
None | 5266 | 7.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
cpython | 64320 | |
none | 5266 | 7.6% |
Most occurring characters
Value | Count | Frequency (%) |
o | 69586 | |
n | 69586 | |
C | 64320 | |
P | 64320 | |
y | 64320 | |
t | 64320 | |
h | 64320 | |
N | 5266 | 1.1% |
e | 5266 | 1.1% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 337398 | |
Uppercase Letter | 133906 | 28.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 69586 | |
n | 69586 | |
y | 64320 | |
t | 64320 | |
h | 64320 | |
e | 5266 | 1.6% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 64320 | |
P | 64320 | |
N | 5266 | 3.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 471304 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 69586 | |
n | 69586 | |
C | 64320 | |
P | 64320 | |
y | 64320 | |
t | 64320 | |
h | 64320 | |
N | 5266 | 1.1% |
e | 5266 | 1.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 471304 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
o | 69586 | |
n | 69586 | |
C | 64320 | |
P | 64320 | |
y | 64320 | |
t | 64320 | |
h | 64320 | |
N | 5266 | 1.1% |
e | 5266 | 1.1% |
python_implementation_version
Categorical
HIGH CARDINALITY
HIGH CORRELATION
IMBALANCE
Distinct | 65 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
3.7.5 | |
---|---|
3.7.3 | |
3.7.6 | |
None | |
3.8.10 | |
Other values (60) |
Common Values
Value | Count | Frequency (%) |
3.7.5 | 24954 | |
3.7.3 | 23169 | |
3.7.6 | 5532 | 7.9% |
None | 5266 | 7.6% |
3.8.10 | 3491 | 5.0% |
3.7.10 | 1788 | 2.6% |
3.7.12 | 1013 | 1.5% |
3.8.12 | 660 | 0.9% |
3.7.11 | 438 | 0.6% |
3.8.5 | 435 | 0.6% |
Other values (55) | 2840 | 4.1% |
Length
Value | Count | Frequency (%) |
3.7.5 | 24954 | |
3.7.3 | 23169 | |
3.7.6 | 5532 | 7.9% |
none | 5266 | 7.6% |
3.8.10 | 3491 | 5.0% |
3.7.10 | 1788 | 2.6% |
3.7.12 | 1013 | 1.5% |
3.8.12 | 660 | 0.9% |
3.7.11 | 438 | 0.6% |
3.8.5 | 435 | 0.6% |
Other values (54) | 2840 | 4.1% |
Most occurring characters
Value | Count | Frequency (%) |
. | 128640 | |
3 | 87761 | |
7 | 58129 | |
5 | 25513 | 7.3% |
1 | 8857 | 2.5% |
6 | 6226 | 1.8% |
8 | 5756 | 1.6% |
0 | 5529 | 1.6% |
n | 5266 | 1.5% |
e | 5266 | 1.5% |
Other values (9) | 13809 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 201038 | |
Other Punctuation | 128640 | |
Lowercase Letter | 15807 | 4.5% |
Uppercase Letter | 5266 | 1.5% |
Math Symbol | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 87761 | |
7 | 58129 | |
5 | 25513 | 12.7% |
1 | 8857 | 4.4% |
6 | 6226 | 3.1% |
8 | 5756 | 2.9% |
0 | 5529 | 2.8% |
2 | 1965 | 1.0% |
9 | 1122 | 0.6% |
4 | 180 | 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
n | 5266 | |
e | 5266 | |
o | 5266 | |
r | 4 | < 0.1% |
c | 4 | < 0.1% |
b | 1 | < 0.1% |
Other Punctuation
Value | Count | Frequency (%) |
. | 128640 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 5266 |
Math Symbol
Value | Count | Frequency (%) |
+ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 329679 | |
Latin | 21073 | 6.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 128640 | |
3 | 87761 | |
7 | 58129 | |
5 | 25513 | 7.7% |
1 | 8857 | 2.7% |
6 | 6226 | 1.9% |
8 | 5756 | 1.7% |
0 | 5529 | 1.7% |
2 | 1965 | 0.6% |
9 | 1122 | 0.3% |
Other values (2) | 181 | 0.1% |
Latin
Value | Count | Frequency (%) |
n | 5266 | |
e | 5266 | |
o | 5266 | |
N | 5266 | |
r | 4 | < 0.1% |
c | 4 | < 0.1% |
b | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 350752 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 128640 | |
3 | 87761 | |
7 | 58129 | |
5 | 25513 | 7.3% |
1 | 8857 | 2.5% |
6 | 6226 | 1.8% |
8 | 5756 | 1.6% |
0 | 5529 | 1.6% |
n | 5266 | 1.5% |
e | 5266 | 1.5% |
Other values (9) | 13809 | 3.9% |
setuptools_version
Categorical
HIGH CARDINALITY
IMBALANCE
Distinct | 125 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
45.2.0 | |
---|---|
40.8.0 | |
None | |
45.2.0.post20200210 | |
52.0.0 | |
Other values (120) |
Length
Max length | 19 |
---|---|
Median length | 6 |
Mean length | 7.0954215 |
Min length | 4 |
Characters and Unicode
Total characters | 493742 |
---|---|
Distinct characters | 18 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 22 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | None |
---|---|
2nd row | None |
3rd row | 57.4.0 |
4th row | 47.3.1.post20200616 |
5th row | 47.3.1.post20200616 |
Common Values
Value | Count | Frequency (%) |
45.2.0 | 25440 | |
40.8.0 | 23102 | |
None | 5458 | 7.8% |
45.2.0.post20200210 | 5210 | 7.5% |
52.0.0 | 2701 | 3.9% |
57.4.0 | 1780 | 2.6% |
56.0.0 | 565 | 0.8% |
47.3.1.post20200616 | 561 | 0.8% |
57.0.0 | 458 | 0.7% |
49.6.0.post20210108 | 431 | 0.6% |
Other values (115) | 3880 | 5.6% |
Length
Value | Count | Frequency (%) |
45.2.0 | 25440 | |
40.8.0 | 23102 | |
none | 5458 | 7.8% |
45.2.0.post20200210 | 5210 | 7.5% |
52.0.0 | 2701 | 3.9% |
57.4.0 | 1780 | 2.6% |
56.0.0 | 565 | 0.8% |
47.3.1.post20200616 | 561 | 0.8% |
57.0.0 | 458 | 0.7% |
49.6.0.post20210108 | 431 | 0.6% |
Other values (115) | 3880 | 5.6% |
Most occurring characters
Value | Count | Frequency (%) |
. | 134959 | |
0 | 115569 | |
4 | 58457 | |
2 | 53689 | 10.9% |
5 | 39273 | 8.0% |
8 | 24159 | 4.9% |
o | 12161 | 2.5% |
1 | 10018 | 2.0% |
s | 6703 | 1.4% |
t | 6703 | 1.4% |
Other values (8) | 32051 | 6.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 310139 | |
Other Punctuation | 134959 | |
Lowercase Letter | 43186 | 8.7% |
Uppercase Letter | 5458 | 1.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 115569 | |
4 | 58457 | |
2 | 53689 | |
5 | 39273 | 12.7% |
8 | 24159 | 7.8% |
1 | 10018 | 3.2% |
7 | 3942 | 1.3% |
6 | 2726 | 0.9% |
3 | 1419 | 0.5% |
9 | 887 | 0.3% |
Lowercase Letter
Value | Count | Frequency (%) |
o | 12161 | |
s | 6703 | |
t | 6703 | |
p | 6703 | |
e | 5458 | |
n | 5458 |
Other Punctuation
Value | Count | Frequency (%) |
. | 134959 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 5458 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 445098 | |
Latin | 48644 | 9.9% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 134959 | |
0 | 115569 | |
4 | 58457 | |
2 | 53689 | 12.1% |
5 | 39273 | 8.8% |
8 | 24159 | 5.4% |
1 | 10018 | 2.3% |
7 | 3942 | 0.9% |
6 | 2726 | 0.6% |
3 | 1419 | 0.3% |
Latin
Value | Count | Frequency (%) |
o | 12161 | |
s | 6703 | |
t | 6703 | |
p | 6703 | |
e | 5458 | |
n | 5458 | |
N | 5458 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 493742 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 134959 | |
0 | 115569 | |
4 | 58457 | |
2 | 53689 | 10.9% |
5 | 39273 | 8.0% |
8 | 24159 | 4.9% |
o | 12161 | 2.5% |
1 | 10018 | 2.0% |
s | 6703 | 1.4% |
t | 6703 | 1.4% |
Other values (8) | 32051 | 6.5% |
sys_distro_name
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 29 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
Ubuntu | |
---|---|
None | 5794 |
Debian GNU/Linux | 1460 |
macOS | 382 |
CentOS Linux | 177 |
Other values (24) | 422 |
Common Values
Value | Count | Frequency (%) |
Ubuntu | 61351 | |
None | 5794 | 8.3% |
Debian GNU/Linux | 1460 | 2.1% |
macOS | 382 | 0.5% |
CentOS Linux | 177 | 0.3% |
Amazon Linux | 165 | 0.2% |
Amazon Linux AMI | 113 | 0.2% |
Red Hat Enterprise Linux | 44 | 0.1% |
Manjaro Linux | 16 | < 0.1% |
Arch Linux | 15 | < 0.1% |
Other values (19) | 69 | 0.1% |
Length
Value | Count | Frequency (%) |
ubuntu | 61351 | |
none | 5794 | 8.1% |
gnu/linux | 1474 | 2.1% |
debian | 1460 | 2.0% |
linux | 552 | 0.8% |
macos | 382 | 0.5% |
amazon | 278 | 0.4% |
centos | 177 | 0.2% |
ami | 113 | 0.2% |
enterprise | 55 | 0.1% |
Other values (27) | 224 | 0.3% |
Most occurring characters
Value | Count | Frequency (%) |
u | 124733 | |
n | 71203 | |
U | 62828 | |
b | 62816 | |
t | 61670 | |
e | 7664 | 1.8% |
N | 7268 | 1.7% |
o | 6119 | 1.4% |
i | 3573 | 0.8% |
2274 | 0.5% | |
Other values (38) | 14713 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 343824 | |
Uppercase Letter | 77276 | 18.2% |
Space Separator | 2274 | 0.5% |
Other Punctuation | 1481 | 0.3% |
Connector Punctuation | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
u | 124733 | |
n | 71203 | |
b | 62816 | |
t | 61670 | |
e | 7664 | 2.2% |
o | 6119 | 1.8% |
i | 3573 | 1.0% |
a | 2233 | 0.6% |
x | 2031 | 0.6% |
m | 661 | 0.2% |
Other values (13) | 1121 | 0.3% |
Uppercase Letter
Value | Count | Frequency (%) |
U | 62828 | |
N | 7268 | 9.4% |
L | 2030 | 2.6% |
G | 1484 | 1.9% |
D | 1465 | 1.9% |
S | 591 | 0.8% |
O | 570 | 0.7% |
A | 408 | 0.5% |
C | 177 | 0.2% |
M | 139 | 0.2% |
Other values (10) | 316 | 0.4% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1474 | |
! | 6 | 0.4% |
. | 1 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
2274 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 6 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 421100 | |
Common | 3761 | 0.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
u | 124733 | |
n | 71203 | |
U | 62828 | |
b | 62816 | |
t | 61670 | |
e | 7664 | 1.8% |
N | 7268 | 1.7% |
o | 6119 | 1.5% |
i | 3573 | 0.8% |
a | 2233 | 0.5% |
Other values (33) | 10993 | 2.6% |
Common
Value | Count | Frequency (%) |
2274 | ||
/ | 1474 | |
_ | 6 | 0.2% |
! | 6 | 0.2% |
. | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 424861 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
u | 124733 | |
n | 71203 | |
U | 62828 | |
b | 62816 | |
t | 61670 | |
e | 7664 | 1.8% |
N | 7268 | 1.7% |
o | 6119 | 1.4% |
i | 3573 | 0.8% |
2274 | 0.5% | |
Other values (38) | 14713 | 3.5% |
sys_distro_version
Categorical
HIGH CARDINALITY
HIGH CORRELATION
IMBALANCE
Distinct | 84 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
18.04 | |
---|---|
16.04 | |
None | |
20.04 | |
10 | 1019 |
Other values (79) | 1434 |
Common Values
Value | Count | Frequency (%) |
18.04 | 33415 | |
16.04 | 23087 | |
None | 5797 | 8.3% |
20.04 | 4834 | 6.9% |
10 | 1019 | 1.5% |
11 | 395 | 0.6% |
7 | 176 | 0.3% |
2 | 165 | 0.2% |
10.16 | 161 | 0.2% |
2018.03 | 113 | 0.2% |
Other values (74) | 424 | 0.6% |
Length
Value | Count | Frequency (%) |
18.04 | 33415 | |
16.04 | 23087 | |
none | 5797 | 8.3% |
20.04 | 4834 | 6.9% |
10 | 1019 | 1.5% |
11 | 395 | 0.6% |
7 | 176 | 0.3% |
2 | 165 | 0.2% |
10.16 | 161 | 0.2% |
2018.03 | 113 | 0.2% |
Other values (74) | 424 | 0.6% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 67774 | |
. | 62155 | |
4 | 61455 | |
1 | 59269 | |
8 | 33576 | |
6 | 23315 | 6.9% |
n | 5810 | 1.7% |
o | 5809 | 1.7% |
e | 5798 | 1.7% |
N | 5797 | 1.7% |
Other values (11) | 5921 | 1.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 251245 | |
Other Punctuation | 62155 | 18.5% |
Lowercase Letter | 17482 | 5.2% |
Uppercase Letter | 5797 | 1.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 67774 | |
4 | 61455 | |
1 | 59269 | |
8 | 33576 | |
6 | 23315 | 9.3% |
2 | 5294 | 2.1% |
7 | 259 | 0.1% |
3 | 144 | 0.1% |
5 | 98 | < 0.1% |
9 | 61 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
n | 5810 | |
o | 5809 | |
e | 5798 | |
l | 24 | 0.1% |
i | 13 | 0.1% |
g | 13 | 0.1% |
r | 12 | 0.1% |
t | 2 | < 0.1% |
s | 1 | < 0.1% |
Other Punctuation
Value | Count | Frequency (%) |
. | 62155 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 5797 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 313400 | |
Latin | 23279 | 6.9% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 67774 | |
. | 62155 | |
4 | 61455 | |
1 | 59269 | |
8 | 33576 | |
6 | 23315 | 7.4% |
2 | 5294 | 1.7% |
7 | 259 | 0.1% |
3 | 144 | < 0.1% |
5 | 98 | < 0.1% |
Latin
Value | Count | Frequency (%) |
n | 5810 | |
o | 5809 | |
e | 5798 | |
N | 5797 | |
l | 24 | 0.1% |
i | 13 | 0.1% |
g | 13 | 0.1% |
r | 12 | 0.1% |
t | 2 | < 0.1% |
s | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 336679 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 67774 | |
. | 62155 | |
4 | 61455 | |
1 | 59269 | |
8 | 33576 | |
6 | 23315 | 6.9% |
n | 5810 | 1.7% |
o | 5809 | 1.7% |
e | 5798 | 1.7% |
N | 5797 | 1.7% |
Other values (11) | 5921 | 1.8% |
sys_name
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
Linux | |
---|---|
None | 5266 |
Windows | 527 |
Darwin | 382 |
Common Values
Value | Count | Frequency (%) |
Linux | 63411 | |
None | 5266 | 7.6% |
Windows | 527 | 0.8% |
Darwin | 382 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
linux | 63411 | |
none | 5266 | 7.6% |
windows | 527 | 0.8% |
darwin | 382 | 0.5% |
Most occurring characters
Value | Count | Frequency (%) |
n | 69586 | |
i | 64320 | |
L | 63411 | |
u | 63411 | |
x | 63411 | |
o | 5793 | 1.7% |
N | 5266 | 1.5% |
e | 5266 | 1.5% |
w | 909 | 0.3% |
W | 527 | 0.2% |
Other values (5) | 2200 | 0.6% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 274514 | |
Uppercase Letter | 69586 | 20.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
n | 69586 | |
i | 64320 | |
u | 63411 | |
x | 63411 | |
o | 5793 | 2.1% |
e | 5266 | 1.9% |
w | 909 | 0.3% |
d | 527 | 0.2% |
s | 527 | 0.2% |
a | 382 | 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
L | 63411 | |
N | 5266 | 7.6% |
W | 527 | 0.8% |
D | 382 | 0.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 344100 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
n | 69586 | |
i | 64320 | |
L | 63411 | |
u | 63411 | |
x | 63411 | |
o | 5793 | 1.7% |
N | 5266 | 1.5% |
e | 5266 | 1.5% |
w | 909 | 0.3% |
W | 527 | 0.2% |
Other values (5) | 2200 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 344100 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
n | 69586 | |
i | 64320 | |
L | 63411 | |
u | 63411 | |
x | 63411 | |
o | 5793 | 1.7% |
N | 5266 | 1.5% |
e | 5266 | 1.5% |
w | 909 | 0.3% |
W | 527 | 0.2% |
Other values (5) | 2200 | 0.6% |
timestamp
Date
Distinct | 46713 |
---|---|
Distinct (%) | 67.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 543.8 KiB |
Minimum | 2020-11-18 19:00:03 |
---|---|
Maximum | 2022-04-18 22:03:10 |
country_code | cpu | distribution_type | installer_name | openssl_version | package_version | python_implementation_name | python_implementation_version | sys_distro_name | sys_distro_version | sys_name | |
---|---|---|---|---|---|---|---|---|---|---|---|
country_code | 1.000 | 0.376 | 0.477 | 0.315 | 0.256 | 0.242 | 0.691 | 0.219 | 0.280 | 0.203 | 0.502 |
cpu | 0.376 | 1.000 | 0.681 | 0.447 | 0.533 | 0.289 | 1.000 | 0.558 | 0.651 | 0.583 | 0.841 |
distribution_type | 0.477 | 0.681 | 1.000 | 0.690 | 0.681 | 0.348 | 0.680 | 0.682 | 0.647 | 0.648 | 0.680 |
installer_name | 0.315 | 0.447 | 0.690 | 1.000 | 0.353 | 0.294 | 1.000 | 0.352 | 0.335 | 0.334 | 0.577 |
openssl_version | 0.256 | 0.533 | 0.681 | 0.353 | 1.000 | 0.503 | 1.000 | 0.598 | 0.382 | 0.500 | 0.682 |
package_version | 0.242 | 0.289 | 0.348 | 0.294 | 0.503 | 1.000 | 0.498 | 0.531 | 0.288 | 0.404 | 0.289 |
python_implementation_name | 0.691 | 1.000 | 0.680 | 1.000 | 1.000 | 0.498 | 1.000 | 1.000 | 0.949 | 0.949 | 1.000 |
python_implementation_version | 0.219 | 0.558 | 0.682 | 0.352 | 0.598 | 0.531 | 1.000 | 1.000 | 0.364 | 0.362 | 0.690 |
sys_distro_name | 0.280 | 0.651 | 0.647 | 0.335 | 0.382 | 0.288 | 0.949 | 0.364 | 1.000 | 0.913 | 0.816 |
sys_distro_version | 0.203 | 0.583 | 0.648 | 0.334 | 0.500 | 0.404 | 0.949 | 0.362 | 0.913 | 1.000 | 0.815 |
sys_name | 0.502 | 0.841 | 0.680 | 0.577 | 0.682 | 0.289 | 1.000 | 0.690 | 0.816 | 0.815 | 1.000 |
country_code | cpu | distribution_type | installer_name | installer_version | openssl_version | package_version | project_name | python_implementation_name | python_implementation_version | setuptools_version | sys_distro_name | sys_distro_version | sys_name | timestamp | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | CN | None | sdist | Browser | None | None | 1.2.2 | scikit-mobility | None | None | None | None | None | None | 2021-11-11 03:01:34 |
1 | CN | None | sdist | Browser | None | None | 1.2.2 | scikit-mobility | None | None | None | None | None | None | 2021-11-11 08:25:11 |
2 | CA | arm64 | sdist | pip | 21.3.1 | OpenSSL 1.1.1l 24 Aug 2021 | 1.2.2 | scikit-mobility | CPython | 3.9.7 | 57.4.0 | macOS | 12.0.1 | Darwin | 2021-11-11 20:10:37 |
3 | US | x86_64 | bdist_wheel | pip | 20.1.1 | OpenSSL 1.1.1l 24 Aug 2021 | 1.1.2 | scikit-mobility | CPython | 3.7.10 | 47.3.1.post20200616 | Ubuntu | 20.04 | Linux | 2021-11-11 15:08:19 |
4 | US | x86_64 | bdist_wheel | pip | 20.1.1 | OpenSSL 1.1.1l 24 Aug 2021 | 1.1.2 | scikit-mobility | CPython | 3.7.10 | 47.3.1.post20200616 | Ubuntu | 20.04 | Linux | 2021-11-11 14:57:51 |
5 | US | x86_64 | bdist_wheel | pip | 20.1.1 | OpenSSL 1.1.1l 24 Aug 2021 | 1.1.2 | scikit-mobility | CPython | 3.7.10 | 47.3.1.post20200616 | Ubuntu | 20.04 | Linux | 2021-11-11 13:12:56 |
6 | US | x86_64 | bdist_wheel | pip | 20.1.1 | OpenSSL 1.1.1l 24 Aug 2021 | 1.1.2 | scikit-mobility | CPython | 3.7.10 | 47.3.1.post20200616 | Ubuntu | 20.04 | Linux | 2021-11-11 13:00:43 |
7 | US | x86_64 | bdist_wheel | pip | 20.1.1 | OpenSSL 1.1.1l 24 Aug 2021 | 1.1.2 | scikit-mobility | CPython | 3.7.10 | 47.3.1.post20200616 | Ubuntu | 20.04 | Linux | 2021-11-11 09:57:55 |
8 | US | x86_64 | bdist_wheel | pip | 20.1.1 | OpenSSL 1.1.1l 24 Aug 2021 | 1.1.2 | scikit-mobility | CPython | 3.7.10 | 47.3.1.post20200616 | Ubuntu | 20.04 | Linux | 2021-11-11 17:02:30 |
9 | US | x86_64 | bdist_wheel | pip | 20.0.2 | OpenSSL 1.1.1l 24 Aug 2021 | 1.1.2 | scikit-mobility | CPython | 3.7.10 | 45.2.0.post20200210 | Debian GNU/Linux | 10 | Linux | 2021-11-11 00:17:38 |
country_code | cpu | distribution_type | installer_name | installer_version | openssl_version | package_version | project_name | python_implementation_name | python_implementation_version | setuptools_version | sys_distro_name | sys_distro_version | sys_name | timestamp | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
69576 | US | x86_64 | bdist_wheel | pip | 21.0.1 | OpenSSL 1.1.1f 31 Mar 2020 | 1.2.2 | scikit-mobility | CPython | 3.8.10 | 52.0.0 | Ubuntu | 20.04 | Linux | 2021-11-19 18:09:52 |
69577 | US | x86_64 | bdist_wheel | pip | 21.0.1 | OpenSSL 1.1.1f 31 Mar 2020 | 1.2.2 | scikit-mobility | CPython | 3.8.10 | 52.0.0 | Ubuntu | 20.04 | Linux | 2021-11-19 18:27:24 |
69578 | US | x86_64 | bdist_wheel | pip | 21.0.1 | OpenSSL 1.1.1f 31 Mar 2020 | 1.2.2 | scikit-mobility | CPython | 3.8.10 | 52.0.0 | Ubuntu | 20.04 | Linux | 2021-11-19 18:19:14 |
69579 | US | x86_64 | bdist_wheel | pip | 21.0.1 | OpenSSL 1.1.1f 31 Mar 2020 | 1.2.2 | scikit-mobility | CPython | 3.8.10 | 52.0.0 | Ubuntu | 20.04 | Linux | 2021-11-19 18:27:01 |
69580 | US | x86_64 | bdist_wheel | pip | 21.0.1 | OpenSSL 1.1.1f 31 Mar 2020 | 1.2.2 | scikit-mobility | CPython | 3.8.10 | 52.0.0 | Ubuntu | 20.04 | Linux | 2021-11-19 18:44:33 |
69581 | US | x86_64 | bdist_wheel | pip | 21.0.1 | OpenSSL 1.1.1f 31 Mar 2020 | 1.2.2 | scikit-mobility | CPython | 3.8.10 | 52.0.0 | Ubuntu | 20.04 | Linux | 2021-11-19 18:21:47 |
69582 | US | x86_64 | bdist_wheel | pip | 21.0.1 | OpenSSL 1.1.1f 31 Mar 2020 | 1.2.2 | scikit-mobility | CPython | 3.8.10 | 52.0.0 | Ubuntu | 20.04 | Linux | 2021-11-19 00:02:51 |
69583 | US | x86_64 | bdist_wheel | pip | 21.0.1 | OpenSSL 1.1.1f 31 Mar 2020 | 1.2.2 | scikit-mobility | CPython | 3.8.10 | 52.0.0 | Ubuntu | 20.04 | Linux | 2021-11-19 00:02:44 |
69584 | US | x86_64 | bdist_wheel | pip | 21.1.3 | OpenSSL 1.1.1 11 Sep 2018 | 1.2.2 | scikit-mobility | CPython | 3.7.12 | 57.4.0 | Ubuntu | 18.04 | Linux | 2021-11-19 05:32:02 |
69585 | IT | x86_64 | bdist_wheel | pip | 21.3.1 | OpenSSL 1.1.1k 25 Mar 2021 | 1.2.2 | scikit-mobility | CPython | 3.8.12 | 57.5.0 | Debian GNU/Linux | 11 | Linux | 2021-11-19 10:27:53 |
Most frequently occurring
country_code | cpu | distribution_type | installer_name | installer_version | openssl_version | package_version | project_name | python_implementation_name | python_implementation_version | setuptools_version | sys_distro_name | sys_distro_version | sys_name | timestamp | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
433 | US | x86_64 | bdist_wheel | pip | 19.0.3 | OpenSSL 1.0.2g 1 Mar 2016 | 1.2.2 | scikit-mobility | CPython | 3.7.3 | 40.8.0 | Ubuntu | 16.04 | Linux | 2021-05-03 14:52:54 | 15 |
434 | US | x86_64 | bdist_wheel | pip | 19.0.3 | OpenSSL 1.0.2g 1 Mar 2016 | 1.2.2 | scikit-mobility | CPython | 3.7.3 | 40.8.0 | Ubuntu | 16.04 | Linux | 2021-05-03 15:02:10 | 15 |
4243 | US | x86_64 | bdist_wheel | pip | 20.0.2 | OpenSSL 1.1.1 11 Sep 2018 | 1.2.2 | scikit-mobility | CPython | 3.7.5 | 45.2.0 | Ubuntu | 18.04 | Linux | 2021-12-09 17:14:32 | 15 |
6350 | US | x86_64 | bdist_wheel | pip | 20.0.2 | OpenSSL 1.1.1 11 Sep 2018 | 1.2.2 | scikit-mobility | CPython | 3.7.5 | 45.2.0 | Ubuntu | 18.04 | Linux | 2022-01-10 20:56:33 | 15 |
8305 | US | x86_64 | bdist_wheel | pip | 20.0.2 | OpenSSL 1.1.1 11 Sep 2018 | 1.2.3 | scikit-mobility | CPython | 3.7.5 | 45.2.0 | Ubuntu | 18.04 | Linux | 2022-02-22 12:58:02 | 14 |
10443 | US | x86_64 | bdist_wheel | pip | 20.0.2 | OpenSSL 1.1.1 11 Sep 2018 | 1.2.3 | scikit-mobility | CPython | 3.7.5 | 45.2.0 | Ubuntu | 18.04 | Linux | 2022-04-14 18:27:55 | 13 |
505 | US | x86_64 | bdist_wheel | pip | 19.0.3 | OpenSSL 1.0.2g 1 Mar 2016 | 1.2.2 | scikit-mobility | CPython | 3.7.3 | 40.8.0 | Ubuntu | 16.04 | Linux | 2021-05-05 17:22:38 | 12 |
9124 | US | x86_64 | bdist_wheel | pip | 20.0.2 | OpenSSL 1.1.1 11 Sep 2018 | 1.2.3 | scikit-mobility | CPython | 3.7.5 | 45.2.0 | Ubuntu | 18.04 | Linux | 2022-03-14 14:16:09 | 12 |
10402 | US | x86_64 | bdist_wheel | pip | 20.0.2 | OpenSSL 1.1.1 11 Sep 2018 | 1.2.3 | scikit-mobility | CPython | 3.7.5 | 45.2.0 | Ubuntu | 18.04 | Linux | 2022-04-13 21:31:43 | 12 |
4771 | US | x86_64 | bdist_wheel | pip | 20.0.2 | OpenSSL 1.1.1 11 Sep 2018 | 1.2.2 | scikit-mobility | CPython | 3.7.5 | 45.2.0 | Ubuntu | 18.04 | Linux | 2021-12-22 14:21:02 | 11 |