Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 620540 |
Missing cells | 38006 |
Missing cells (%) | 0.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 61.5 MiB |
Average record size in memory | 104.0 B |
Variable types
Numeric | 11 |
---|---|
Categorical | 2 |
Alerts
categories has a high cardinality: 67004 distinct values | High cardinality |
citable_docs_3years is highly overall correlated with h_index and 4 other fields | High correlation |
h_index is highly overall correlated with citable_docs_3years and 7 other fields | High correlation |
journal_rating is highly overall correlated with h_index and 4 other fields | High correlation |
rank is highly overall correlated with h_index and 4 other fields | High correlation |
ref_per_doc is highly overall correlated with h_index and 2 other fields | High correlation |
total_cites_3years is highly overall correlated with citable_docs_3years and 6 other fields | High correlation |
total_docs is highly overall correlated with citable_docs_3years and 4 other fields | High correlation |
total_docs_3years is highly overall correlated with citable_docs_3years and 4 other fields | High correlation |
total_refs is highly overall correlated with citable_docs_3years and 7 other fields | High correlation |
sjr_best_quartile is highly overall correlated with rank | High correlation |
journal_rating has 38006 (6.1%) missing values | Missing |
citable_docs_3years is highly skewed (γ1 = 47.17653641) | Skewed |
total_cites_3years is highly skewed (γ1 = 33.27400829) | Skewed |
total_docs is highly skewed (γ1 = 44.54113692) | Skewed |
total_docs_3years is highly skewed (γ1 = 45.56996407) | Skewed |
total_refs is highly skewed (γ1 = 45.60770981) | Skewed |
citable_docs_3years has 29012 (4.7%) zeros | Zeros |
h_index has 9095 (1.5%) zeros | Zeros |
ref_per_doc has 153751 (24.8%) zeros | Zeros |
total_cites_3years has 65410 (10.5%) zeros | Zeros |
total_docs has 134190 (21.6%) zeros | Zeros |
total_docs_3years has 27877 (4.5%) zeros | Zeros |
total_refs has 153728 (24.8%) zeros | Zeros |
Reproduction
Analysis started | 2023-05-04 15:11:05.642523 |
---|---|
Analysis finished | 2023-05-04 15:11:45.790266 |
Duration | 40.15 seconds |
Software version | ydata-profiling vv4.1.2 |
Download configuration | config.json |
journal__sourceid
Real number (ℝ)
Distinct | 70227 |
---|---|
Distinct (%) | 11.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8.1230085 × 109 |
Minimum | 12000 |
---|---|
Maximum | 2.110106 × 1010 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 12000 |
---|---|
5-th percentile | 14065 |
Q1 | 22618 |
median | 144819 |
Q3 | 1.9900192 × 1010 |
95-th percentile | 2.1100837 × 1010 |
Maximum | 2.110106 × 1010 |
Range | 2.1101048 × 1010 |
Interquartile range (IQR) | 1.9900169 × 1010 |
Descriptive statistics
Standard deviation | 9.4602373 × 109 |
---|---|
Coefficient of variation (CV) | 1.1646224 |
Kurtosis | -1.6611209 |
Mean | 8.1230085 × 109 |
Median Absolute Deviation (MAD) | 132149 |
Skewness | 0.46384165 |
Sum | 5.0406517 × 1015 |
Variance | 8.949609 × 1019 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16801 | 23 | < 0.1% |
5700168957 | 23 | < 0.1% |
30588 | 23 | < 0.1% |
1.970018816 × 1010 | 23 | < 0.1% |
26333 | 23 | < 0.1% |
16671 | 23 | < 0.1% |
29670 | 23 | < 0.1% |
13473 | 23 | < 0.1% |
25571 | 23 | < 0.1% |
23717 | 23 | < 0.1% |
Other values (70217) | 620310 |
Value | Count | Frequency (%) |
12000 | 13 | |
12001 | 23 | |
12002 | 23 | |
12004 | 22 | |
12005 | 23 | |
12006 | 23 | |
12007 | 4 | < 0.1% |
12008 | 6 | < 0.1% |
12009 | 20 | |
12010 | 23 |
Value | Count | Frequency (%) |
2.110105979 × 1010 | 3 | < 0.1% |
2.110105978 × 1010 | 16 | |
2.110105978 × 1010 | 2 | < 0.1% |
2.110105949 × 1010 | 1 | < 0.1% |
2.11010593 × 1010 | 1 | < 0.1% |
2.11010593 × 1010 | 8 | |
2.110105901 × 1010 | 3 | < 0.1% |
2.110105901 × 1010 | 1 | < 0.1% |
2.110105897 × 1010 | 1 | < 0.1% |
2.110105896 × 1010 | 6 | < 0.1% |
year
Real number (ℝ)
Distinct | 23 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2011.3857 |
Minimum | 1999 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 1999 |
---|---|
5-th percentile | 2000 |
Q1 | 2007 |
median | 2012 |
Q3 | 2017 |
95-th percentile | 2020 |
Maximum | 2021 |
Range | 22 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 6.2753051 |
---|---|
Coefficient of variation (CV) | 0.0031198914 |
Kurtosis | -1.0005721 |
Mean | 2011.3857 |
Median Absolute Deviation (MAD) | 5 |
Skewness | -0.28788358 |
Sum | 1.2481453 × 109 |
Variance | 39.379454 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
2017 | 34766 | 5.6% |
2016 | 34279 | 5.5% |
2020 | 34169 | 5.5% |
2018 | 33939 | 5.5% |
2015 | 33651 | 5.4% |
2014 | 32937 | 5.3% |
2013 | 32470 | 5.2% |
2012 | 31864 | 5.1% |
2019 | 31861 | 5.1% |
2011 | 30913 | 5.0% |
Other values (13) | 289691 |
Value | Count | Frequency (%) |
1999 | 16987 | |
2000 | 17298 | |
2001 | 17972 | |
2002 | 19114 | |
2003 | 19638 | |
2004 | 20211 | |
2005 | 21092 | |
2006 | 22662 | |
2007 | 24195 | |
2008 | 26089 |
Value | Count | Frequency (%) |
2021 | 27339 | |
2020 | 34169 | |
2019 | 31861 | |
2018 | 33939 | |
2017 | 34766 | |
2016 | 34279 | |
2015 | 33651 | |
2014 | 32937 | |
2013 | 32470 | |
2012 | 31864 |
categories
Categorical
Distinct | 67004 |
---|---|
Distinct (%) | 10.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.7 MiB |
Medicine (miscellaneous) (Q4) | 12113 |
---|---|
Engineering (miscellaneous) | 7676 |
Medicine (miscellaneous) (Q3) | 7648 |
Software | 3593 |
Electrical and Electronic Engineering | 3067 |
Other values (66999) |
Length
Max length | 509 |
---|---|
Median length | 301 |
Mean length | 65.562819 |
Min length | 3 |
Characters and Unicode
Total characters | 40684352 |
---|---|
Distinct characters | 57 |
Distinct categories | 8 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 25914 ? |
---|---|
Unique (%) | 4.2% |
Sample
1st row | Biochemistry (Q1) |
---|---|
2nd row | Biochemistry, Genetics and Molecular Biology (miscellaneous) (Q1) |
3rd row | Immunology (Q1); Immunology and Allergy (Q1) |
4th row | Cell Biology (Q1); Developmental Biology (Q1) |
5th row | Neuroscience (miscellaneous) (Q1) |
Common Values
Value | Count | Frequency (%) |
Medicine (miscellaneous) (Q4) | 12113 | 2.0% |
Engineering (miscellaneous) | 7676 | 1.2% |
Medicine (miscellaneous) (Q3) | 7648 | 1.2% |
Software | 3593 | 0.6% |
Electrical and Electronic Engineering | 3067 | 0.5% |
Computer Networks and Communications | 2660 | 0.4% |
Medicine (miscellaneous) (Q2) | 2475 | 0.4% |
Medicine (miscellaneous) | 1459 | 0.2% |
Literature and Literary Theory (Q4) | 1458 | 0.2% |
Electrical and Electronic Engineering; Hardware and Architecture | 1450 | 0.2% |
Other values (66994) | 576941 |
Length
Value | Count | Frequency (%) |
and | 570125 | 11.8% |
q1 | 267430 | 5.5% |
q2 | 266021 | 5.5% |
q3 | 264153 | 5.5% |
q4 | 262127 | 5.4% |
miscellaneous | 217620 | 4.5% |
science | 153450 | 3.2% |
engineering | 147044 | 3.0% |
medicine | 108360 | 2.2% |
computer | 73497 | 1.5% |
Other values (378) | 2510647 |
Most occurring characters
Value | Count | Frequency (%) |
4220950 | 10.4% | |
e | 3116433 | 7.7% |
n | 2931015 | 7.2% |
i | 2807687 | 6.9% |
a | 2453829 | 6.0% |
o | 2189272 | 5.4% |
c | 1938240 | 4.8% |
l | 1836891 | 4.5% |
t | 1710166 | 4.2% |
s | 1553537 | 3.8% |
Other values (47) | 15926332 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 27871190 | |
Space Separator | 4220950 | 10.4% |
Uppercase Letter | 4004971 | 9.8% |
Open Punctuation | 1309111 | 3.2% |
Close Punctuation | 1309111 | 3.2% |
Decimal Number | 1059731 | 2.6% |
Other Punctuation | 890562 | 2.2% |
Dash Punctuation | 18726 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 3116433 | |
n | 2931015 | |
i | 2807687 | |
a | 2453829 | |
o | 2189272 | 7.9% |
c | 1938240 | 7.0% |
l | 1836891 | 6.6% |
t | 1710166 | 6.1% |
s | 1553537 | 5.6% |
r | 1466024 | 5.3% |
Other values (15) | 5868096 |
Uppercase Letter
Value | Count | Frequency (%) |
Q | 1066611 | |
S | 407734 | 10.2% |
E | 394214 | 9.8% |
M | 365109 | 9.1% |
C | 304612 | 7.6% |
P | 293668 | 7.3% |
A | 203207 | 5.1% |
H | 125352 | 3.1% |
I | 117754 | 2.9% |
B | 102816 | 2.6% |
Other values (12) | 623894 |
Decimal Number
Value | Count | Frequency (%) |
1 | 267430 | |
2 | 266021 | |
3 | 264153 | |
4 | 262127 |
Other Punctuation
Value | Count | Frequency (%) |
; | 748349 | |
, | 142213 | 16.0% |
Space Separator
Value | Count | Frequency (%) |
4220950 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1309111 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1309111 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 18726 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 31876161 | |
Common | 8808191 | 21.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 3116433 | 9.8% |
n | 2931015 | 9.2% |
i | 2807687 | 8.8% |
a | 2453829 | 7.7% |
o | 2189272 | 6.9% |
c | 1938240 | 6.1% |
l | 1836891 | 5.8% |
t | 1710166 | 5.4% |
s | 1553537 | 4.9% |
r | 1466024 | 4.6% |
Other values (37) | 9873067 |
Common
Value | Count | Frequency (%) |
4220950 | ||
( | 1309111 | 14.9% |
) | 1309111 | 14.9% |
; | 748349 | 8.5% |
1 | 267430 | 3.0% |
2 | 266021 | 3.0% |
3 | 264153 | 3.0% |
4 | 262127 | 3.0% |
, | 142213 | 1.6% |
- | 18726 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 40684352 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
4220950 | 10.4% | |
e | 3116433 | 7.7% |
n | 2931015 | 7.2% |
i | 2807687 | 6.9% |
a | 2453829 | 6.0% |
o | 2189272 | 5.4% |
c | 1938240 | 4.8% |
l | 1836891 | 4.5% |
t | 1710166 | 4.2% |
s | 1553537 | 3.8% |
Other values (47) | 15926332 |
citable_docs_3years
Real number (ℝ)
HIGH CORRELATION
SKEWED
ZEROS
Distinct | 5239 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 215.02452 |
Minimum | 0 |
---|---|
Maximum | 94370 |
Zeros | 29012 |
Zeros (%) | 4.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 38 |
median | 86 |
Q3 | 197 |
95-th percentile | 742 |
Maximum | 94370 |
Range | 94370 |
Interquartile range (IQR) | 159 |
Descriptive statistics
Standard deviation | 812.08001 |
---|---|
Coefficient of variation (CV) | 3.7766856 |
Kurtosis | 3550.1556 |
Mean | 215.02452 |
Median Absolute Deviation (MAD) | 61 |
Skewness | 47.176536 |
Sum | 1.3343132 × 108 |
Variance | 659473.94 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 29012 | 4.7% |
1 | 6135 | 1.0% |
48 | 3649 | 0.6% |
60 | 3639 | 0.6% |
56 | 3636 | 0.6% |
50 | 3635 | 0.6% |
45 | 3589 | 0.6% |
24 | 3572 | 0.6% |
40 | 3545 | 0.6% |
47 | 3542 | 0.6% |
Other values (5229) | 556586 |
Value | Count | Frequency (%) |
0 | 29012 | |
1 | 6135 | 1.0% |
2 | 3043 | 0.5% |
3 | 2688 | 0.4% |
4 | 2700 | 0.4% |
5 | 2829 | 0.5% |
6 | 3032 | 0.5% |
7 | 3031 | 0.5% |
8 | 3307 | 0.5% |
9 | 3207 | 0.5% |
Value | Count | Frequency (%) |
94370 | 1 | |
93640 | 1 | |
88176 | 1 | |
83873 | 1 | |
82993 | 1 | |
78745 | 1 | |
72423 | 1 | |
69798 | 1 | |
68541 | 1 | |
68277 | 1 |
h_index
Real number (ℝ)
HIGH CORRELATION
ZEROS
Distinct | 404 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.225832 |
Minimum | 0 |
---|---|
Maximum | 1276 |
Zeros | 9095 |
Zeros (%) | 1.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2 |
Q1 | 8 |
median | 20 |
Q3 | 50 |
95-th percentile | 137 |
Maximum | 1276 |
Range | 1276 |
Interquartile range (IQR) | 42 |
Descriptive statistics
Standard deviation | 53.600804 |
---|---|
Coefficient of variation (CV) | 1.366467 |
Kurtosis | 44.797354 |
Mean | 39.225832 |
Median Absolute Deviation (MAD) | 15 |
Skewness | 4.4230824 |
Sum | 24341198 |
Variance | 2873.0462 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 22152 | 3.6% |
4 | 21720 | 3.5% |
6 | 20510 | 3.3% |
3 | 20478 | 3.3% |
7 | 19338 | 3.1% |
8 | 18376 | 3.0% |
2 | 18132 | 2.9% |
9 | 17803 | 2.9% |
10 | 16147 | 2.6% |
11 | 15021 | 2.4% |
Other values (394) | 430863 |
Value | Count | Frequency (%) |
0 | 9095 | |
1 | 13958 | |
2 | 18132 | |
3 | 20478 | |
4 | 21720 | |
5 | 22152 | |
6 | 20510 | |
7 | 19338 | |
8 | 18376 | |
9 | 17803 |
Value | Count | Frequency (%) |
1276 | 23 | |
1229 | 23 | |
1079 | 23 | |
814 | 23 | |
807 | 23 | |
805 | 23 | |
745 | 23 | |
709 | 23 | |
647 | 23 | |
644 | 23 |
journal_rating
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 8524 |
---|---|
Distinct (%) | 1.5% |
Missing | 38006 |
Missing (%) | 6.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.55719131 |
Minimum | 0.1 |
---|---|
Maximum | 88.192 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 0.1 |
---|---|
5-th percentile | 0.101 |
Q1 | 0.123 |
median | 0.231 |
Q3 | 0.594 |
95-th percentile | 1.834 |
Maximum | 88.192 |
Range | 88.092 |
Interquartile range (IQR) | 0.471 |
Descriptive statistics
Standard deviation | 1.1592403 |
---|---|
Coefficient of variation (CV) | 2.0805067 |
Kurtosis | 350.0633 |
Mean | 0.55719131 |
Median Absolute Deviation (MAD) | 0.127 |
Skewness | 13.019474 |
Sum | 324582.88 |
Variance | 1.343838 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.1 | 26231 | 4.2% |
0.101 | 25271 | 4.1% |
0.102 | 11394 | 1.8% |
0.103 | 7718 | 1.2% |
0.111 | 6423 | 1.0% |
0.104 | 5988 | 1.0% |
0.105 | 5281 | 0.9% |
0.123 | 4622 | 0.7% |
0.107 | 4592 | 0.7% |
0.11 | 4469 | 0.7% |
Other values (8514) | 480545 | |
(Missing) | 38006 | 6.1% |
Value | Count | Frequency (%) |
0.1 | 26231 | |
0.101 | 25271 | |
0.102 | 11394 | |
0.103 | 7718 | 1.2% |
0.104 | 5988 | 1.0% |
0.105 | 5281 | 0.9% |
0.106 | 4210 | 0.7% |
0.107 | 4592 | 0.7% |
0.108 | 3915 | 0.6% |
0.109 | 3114 | 0.5% |
Value | Count | Frequency (%) |
88.192 | 1 | |
72.576 | 1 | |
62.937 | 1 | |
61.786 | 1 | |
56.204 | 1 | |
50.518 | 1 | |
49.268 | 1 | |
48.894 | 1 | |
47.751 | 1 | |
47.288 | 1 |
rank
Real number (ℝ)
Distinct | 34766 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14209.023 |
Minimum | 1 |
---|---|
Maximum | 34766 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1349.95 |
Q1 | 6745.75 |
median | 13490.5 |
Q3 | 20835 |
95-th percentile | 29983 |
Maximum | 34766 |
Range | 34765 |
Interquartile range (IQR) | 14089.25 |
Descriptive statistics
Standard deviation | 8870.0511 |
---|---|
Coefficient of variation (CV) | 0.62425481 |
Kurtosis | -0.90177875 |
Mean | 14209.023 |
Median Absolute Deviation (MAD) | 6999.5 |
Skewness | 0.29967013 |
Sum | 8.8172673 × 109 |
Variance | 78677807 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 23 | < 0.1% |
11333 | 23 | < 0.1% |
11319 | 23 | < 0.1% |
11320 | 23 | < 0.1% |
11321 | 23 | < 0.1% |
11322 | 23 | < 0.1% |
11323 | 23 | < 0.1% |
11324 | 23 | < 0.1% |
11325 | 23 | < 0.1% |
11326 | 23 | < 0.1% |
Other values (34756) | 620310 |
Value | Count | Frequency (%) |
1 | 23 | |
2 | 23 | |
3 | 23 | |
4 | 23 | |
5 | 23 | |
6 | 23 | |
7 | 23 | |
8 | 23 | |
9 | 23 | |
10 | 23 |
Value | Count | Frequency (%) |
34766 | 1 | |
34765 | 1 | |
34764 | 1 | |
34763 | 1 | |
34762 | 1 | |
34761 | 1 | |
34760 | 1 | |
34759 | 1 | |
34758 | 1 | |
34757 | 1 |
ref_per_doc
Real number (ℝ)
HIGH CORRELATION
ZEROS
Distinct | 16545 |
---|---|
Distinct (%) | 2.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 27.429817 |
Minimum | 0 |
---|---|
Maximum | 4841 |
Zeros | 153751 |
Zeros (%) | 24.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0.16 |
median | 23.9 |
Q3 | 39.15 |
95-th percentile | 69.13 |
Maximum | 4841 |
Range | 4841 |
Interquartile range (IQR) | 38.99 |
Descriptive statistics
Standard deviation | 35.475855 |
---|---|
Coefficient of variation (CV) | 1.2933318 |
Kurtosis | 1376.9439 |
Mean | 27.429817 |
Median Absolute Deviation (MAD) | 17.67 |
Skewness | 17.911385 |
Sum | 17021299 |
Variance | 1258.5363 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 153751 | 24.8% |
30 | 432 | 0.1% |
28 | 432 | 0.1% |
29 | 425 | 0.1% |
26 | 418 | 0.1% |
23 | 410 | 0.1% |
25 | 403 | 0.1% |
34 | 395 | 0.1% |
27 | 395 | 0.1% |
24 | 395 | 0.1% |
Other values (16535) | 463084 |
Value | Count | Frequency (%) |
0 | 153751 | |
0.01 | 97 | < 0.1% |
0.02 | 117 | < 0.1% |
0.03 | 114 | < 0.1% |
0.04 | 97 | < 0.1% |
0.05 | 103 | < 0.1% |
0.06 | 79 | < 0.1% |
0.07 | 113 | < 0.1% |
0.08 | 87 | < 0.1% |
0.09 | 86 | < 0.1% |
Value | Count | Frequency (%) |
4841 | 1 | |
4500 | 1 | |
3261 | 1 | |
2753 | 1 | |
2633 | 1 | |
2310 | 1 | |
1992 | 1 | |
1927 | 1 | |
1751 | 1 | |
1719 | 1 |
sjr_best_quartile
Categorical
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.7 MiB |
Q1 | |
---|---|
- | |
Q2 | |
Q3 | |
Q4 |
Common Values
Value | Count | Frequency (%) |
Q1 | 139258 | |
- | 138621 | |
Q2 | 117491 | |
Q3 | 114504 | |
Q4 | 110666 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
q1 | 139258 | |
138621 | ||
q2 | 117491 | |
q3 | 114504 | |
q4 | 110666 |
Most occurring characters
Value | Count | Frequency (%) |
Q | 481919 | |
1 | 139258 | 12.6% |
- | 138621 | 12.6% |
2 | 117491 | 10.7% |
3 | 114504 | 10.4% |
4 | 110666 | 10.0% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 481919 | |
Decimal Number | 481919 | |
Dash Punctuation | 138621 | 12.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 139258 | |
2 | 117491 | |
3 | 114504 | |
4 | 110666 |
Uppercase Letter
Value | Count | Frequency (%) |
Q | 481919 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 138621 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 620540 | |
Latin | 481919 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 139258 | |
- | 138621 | |
2 | 117491 | |
3 | 114504 | |
4 | 110666 |
Latin
Value | Count | Frequency (%) |
Q | 481919 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1102459 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
Q | 481919 | |
1 | 139258 | 12.6% |
- | 138621 | 12.6% |
2 | 117491 | 10.7% |
3 | 114504 | 10.4% |
4 | 110666 | 10.0% |
total_cites_3years
Real number (ℝ)
HIGH CORRELATION
SKEWED
ZEROS
Distinct | 11896 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 476.96202 |
Minimum | 0 |
---|---|
Maximum | 321255 |
Zeros | 65410 |
Zeros (%) | 10.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 7 |
median | 42 |
Q3 | 199 |
95-th percentile | 1797 |
Maximum | 321255 |
Range | 321255 |
Interquartile range (IQR) | 192 |
Descriptive statistics
Standard deviation | 2946.9901 |
---|---|
Coefficient of variation (CV) | 6.1786683 |
Kurtosis | 1976.6331 |
Mean | 476.96202 |
Median Absolute Deviation (MAD) | 41 |
Skewness | 33.274008 |
Sum | 2.9597401 × 108 |
Variance | 8684750.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 65410 | 10.5% |
1 | 20124 | 3.2% |
2 | 16098 | 2.6% |
3 | 13688 | 2.2% |
4 | 12125 | 2.0% |
5 | 10747 | 1.7% |
6 | 9917 | 1.6% |
7 | 9246 | 1.5% |
8 | 8579 | 1.4% |
9 | 7924 | 1.3% |
Other values (11886) | 446682 |
Value | Count | Frequency (%) |
0 | 65410 | |
1 | 20124 | 3.2% |
2 | 16098 | 2.6% |
3 | 13688 | 2.2% |
4 | 12125 | 2.0% |
5 | 10747 | 1.7% |
6 | 9917 | 1.6% |
7 | 9246 | 1.5% |
8 | 8579 | 1.4% |
9 | 7924 | 1.3% |
Value | Count | Frequency (%) |
321255 | 1 | |
307069 | 1 | |
282400 | 1 | |
277415 | 1 | |
275478 | 1 | |
274206 | 1 | |
266827 | 1 | |
261696 | 1 | |
245865 | 1 | |
224506 | 1 |
total_docs
Real number (ℝ)
HIGH CORRELATION
SKEWED
ZEROS
Distinct | 2921 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 79.936992 |
Minimum | 0 |
---|---|
Maximum | 35329 |
Zeros | 134190 |
Zeros (%) | 21.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 7 |
median | 29 |
Q3 | 72 |
95-th percentile | 288 |
Maximum | 35329 |
Range | 35329 |
Interquartile range (IQR) | 65 |
Descriptive statistics
Standard deviation | 322.32011 |
---|---|
Coefficient of variation (CV) | 4.0321771 |
Kurtosis | 3163.7742 |
Mean | 79.936992 |
Median Absolute Deviation (MAD) | 29 |
Skewness | 44.541137 |
Sum | 49604101 |
Variance | 103890.25 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 134190 | 21.6% |
20 | 8677 | 1.4% |
24 | 8258 | 1.3% |
21 | 8104 | 1.3% |
16 | 7876 | 1.3% |
22 | 7873 | 1.3% |
23 | 7695 | 1.2% |
18 | 7642 | 1.2% |
25 | 7639 | 1.2% |
19 | 7597 | 1.2% |
Other values (2911) | 414989 |
Value | Count | Frequency (%) |
0 | 134190 | |
1 | 4178 | 0.7% |
2 | 2526 | 0.4% |
3 | 2375 | 0.4% |
4 | 2688 | 0.4% |
5 | 3245 | 0.5% |
6 | 3814 | 0.6% |
7 | 3950 | 0.6% |
8 | 4702 | 0.8% |
9 | 5115 | 0.8% |
Value | Count | Frequency (%) |
35329 | 1 | |
34849 | 1 | |
34154 | 1 | |
31198 | 1 | |
30978 | 1 | |
29385 | 1 | |
29351 | 1 | |
29296 | 1 | |
28799 | 1 | |
27974 | 1 |
total_docs_3years
Real number (ℝ)
HIGH CORRELATION
SKEWED
ZEROS
Distinct | 5429 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 231.09512 |
Minimum | 0 |
---|---|
Maximum | 95106 |
Zeros | 27877 |
Zeros (%) | 4.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 41 |
median | 92 |
Q3 | 212 |
95-th percentile | 808 |
Maximum | 95106 |
Range | 95106 |
Interquartile range (IQR) | 171 |
Descriptive statistics
Standard deviation | 842.56359 |
---|---|
Coefficient of variation (CV) | 3.6459601 |
Kurtosis | 3356.2104 |
Mean | 231.09512 |
Median Absolute Deviation (MAD) | 65 |
Skewness | 45.569964 |
Sum | 1.4340377 × 108 |
Variance | 709913.41 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 27877 | 4.5% |
1 | 5188 | 0.8% |
47 | 3459 | 0.6% |
58 | 3456 | 0.6% |
48 | 3435 | 0.6% |
60 | 3405 | 0.5% |
64 | 3394 | 0.5% |
54 | 3381 | 0.5% |
36 | 3347 | 0.5% |
45 | 3341 | 0.5% |
Other values (5419) | 560257 |
Value | Count | Frequency (%) |
0 | 27877 | |
1 | 5188 | 0.8% |
2 | 2835 | 0.5% |
3 | 2280 | 0.4% |
4 | 2080 | 0.3% |
5 | 2262 | 0.4% |
6 | 2482 | 0.4% |
7 | 2707 | 0.4% |
8 | 2909 | 0.5% |
9 | 2958 | 0.5% |
Value | Count | Frequency (%) |
95106 | 1 | |
94281 | 1 | |
88185 | 1 | |
83879 | 1 | |
83565 | 1 | |
78760 | 1 | |
75545 | 1 | |
71349 | 1 | |
71205 | 1 | |
69817 | 1 |
total_refs
Real number (ℝ)
HIGH CORRELATION
SKEWED
ZEROS
Distinct | 25931 |
---|---|
Distinct (%) | 4.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2459.6655 |
Minimum | 0 |
---|---|
Maximum | 1469402 |
Zeros | 153728 |
Zeros (%) | 24.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 7 |
median | 806 |
Q3 | 2065 |
95-th percentile | 8946 |
Maximum | 1469402 |
Range | 1469402 |
Interquartile range (IQR) | 2058 |
Descriptive statistics
Standard deviation | 10490.972 |
---|---|
Coefficient of variation (CV) | 4.2652027 |
Kurtosis | 3952.8919 |
Mean | 2459.6655 |
Median Absolute Deviation (MAD) | 806 |
Skewness | 45.60771 |
Sum | 1.5263208 × 109 |
Variance | 1.100605 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 153728 | 24.8% |
534 | 261 | < 0.1% |
558 | 258 | < 0.1% |
471 | 258 | < 0.1% |
759 | 257 | < 0.1% |
645 | 255 | < 0.1% |
575 | 255 | < 0.1% |
621 | 254 | < 0.1% |
470 | 254 | < 0.1% |
611 | 253 | < 0.1% |
Other values (25921) | 464507 |
Value | Count | Frequency (%) |
0 | 153728 | |
1 | 182 | < 0.1% |
2 | 221 | < 0.1% |
3 | 219 | < 0.1% |
4 | 239 | < 0.1% |
5 | 226 | < 0.1% |
6 | 203 | < 0.1% |
7 | 178 | < 0.1% |
8 | 180 | < 0.1% |
9 | 203 | < 0.1% |
Value | Count | Frequency (%) |
1469402 | 1 | |
1388656 | 1 | |
1317965 | 1 | |
1197093 | 1 | |
1155366 | 1 | |
1115013 | 1 | |
1104125 | 1 | |
1033652 | 1 | |
1030225 | 1 | |
948277 | 1 |
journal__sourceid | year | citable_docs_3years | h_index | journal_rating | rank | ref_per_doc | total_cites_3years | total_docs | total_docs_3years | total_refs | sjr_best_quartile | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
journal__sourceid | 1.000 | 0.333 | -0.367 | -0.494 | -0.314 | 0.487 | -0.202 | -0.313 | -0.411 | -0.369 | -0.355 | 0.258 |
year | 0.333 | 1.000 | 0.033 | -0.109 | 0.071 | 0.264 | 0.150 | 0.163 | -0.007 | 0.040 | 0.096 | 0.074 |
citable_docs_3years | -0.367 | 0.033 | 1.000 | 0.591 | 0.429 | -0.449 | 0.167 | 0.768 | 0.726 | 0.991 | 0.613 | 0.011 |
h_index | -0.494 | -0.109 | 0.591 | 1.000 | 0.823 | -0.785 | 0.528 | 0.824 | 0.644 | 0.587 | 0.735 | 0.205 |
journal_rating | -0.314 | 0.071 | 0.429 | 0.823 | 1.000 | -0.911 | 0.530 | 0.844 | 0.498 | 0.423 | 0.648 | 0.049 |
rank | 0.487 | 0.264 | -0.449 | -0.785 | -0.911 | 1.000 | -0.399 | -0.732 | -0.469 | -0.442 | -0.556 | 0.516 |
ref_per_doc | -0.202 | 0.150 | 0.167 | 0.528 | 0.530 | -0.399 | 1.000 | 0.407 | 0.429 | 0.162 | 0.736 | 0.004 |
total_cites_3years | -0.313 | 0.163 | 0.768 | 0.824 | 0.844 | -0.732 | 0.407 | 1.000 | 0.627 | 0.763 | 0.707 | 0.029 |
total_docs | -0.411 | -0.007 | 0.726 | 0.644 | 0.498 | -0.469 | 0.429 | 0.627 | 1.000 | 0.736 | 0.858 | 0.010 |
total_docs_3years | -0.369 | 0.040 | 0.991 | 0.587 | 0.423 | -0.442 | 0.162 | 0.763 | 0.736 | 1.000 | 0.613 | 0.011 |
total_refs | -0.355 | 0.096 | 0.613 | 0.735 | 0.648 | -0.556 | 0.736 | 0.707 | 0.858 | 0.613 | 1.000 | 0.015 |
sjr_best_quartile | 0.258 | 0.074 | 0.011 | 0.205 | 0.049 | 0.516 | 0.004 | 0.029 | 0.010 | 0.011 | 0.015 | 1.000 |
journal__sourceid | year | categories | citable_docs_3years | h_index | journal_rating | rank | ref_per_doc | sjr_best_quartile | total_cites_3years | total_docs | total_docs_3years | total_refs | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 16801 | 1999 | Biochemistry (Q1) | 80 | 305 | 50.518 | 1 | 197.10 | Q1 | 3513 | 30 | 80 | 5913 |
1 | 18434 | 1999 | Biochemistry, Genetics and Molecular Biology (miscellaneous) (Q1) | 1332 | 814 | 43.449 | 2 | 45.48 | Q1 | 48292 | 351 | 1340 | 15964 |
2 | 20651 | 1999 | Immunology (Q1); Immunology and Allergy (Q1) | 81 | 309 | 43.020 | 3 | 180.55 | Q1 | 4116 | 29 | 81 | 5236 |
3 | 18395 | 1999 | Cell Biology (Q1); Developmental Biology (Q1) | 60 | 226 | 35.051 | 4 | 165.36 | Q1 | 1815 | 25 | 61 | 4134 |
4 | 14181 | 1999 | Neuroscience (miscellaneous) (Q1) | 60 | 248 | 25.760 | 5 | 162.81 | Q1 | 1631 | 21 | 60 | 3419 |
5 | 22126 | 1999 | Developmental Biology (Q1); Genetics (Q1) | 888 | 453 | 25.272 | 6 | 55.78 | Q1 | 17391 | 298 | 889 | 16623 |
6 | 20798 | 1999 | Immunology (Q1); Immunology and Allergy (Q1); Infectious Diseases (Q1) | 438 | 417 | 22.298 | 7 | 51.46 | Q1 | 9207 | 151 | 438 | 7770 |
7 | 18503 | 1999 | Cell Biology (Q1) | 298 | 267 | 21.691 | 8 | 56.05 | Q1 | 6602 | 104 | 318 | 5829 |
8 | 9500154114 | 1999 | Mathematics (miscellaneous) (Q1); Physics and Astronomy (miscellaneous) (Q1) | 55 | 75 | 20.965 | 9 | 37.94 | Q1 | 1228 | 48 | 55 | 1821 |
9 | 18606 | 1999 | Cell Biology (Q1); Molecular Biology (Q1) | 202 | 414 | 20.282 | 10 | 45.67 | Q1 | 3624 | 193 | 203 | 8814 |
journal__sourceid | year | categories | citable_docs_3years | h_index | journal_rating | rank | ref_per_doc | sjr_best_quartile | total_cites_3years | total_docs | total_docs_3years | total_refs | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
620530 | 21101043236 | 2021 | Pediatrics, Perinatology and Child Health | 0 | 1 | NaN | 27330 | 43.45 | - | 0 | 20 | 0 | 869 |
620531 | 21100853891 | 2021 | Emergency Medicine | 0 | 4 | NaN | 27331 | 20.66 | - | 0 | 86 | 0 | 1777 |
620532 | 21101042998 | 2021 | Internal Medicine | 0 | 1 | NaN | 27332 | 49.75 | - | 0 | 8 | 0 | 398 |
620533 | 21101042490 | 2021 | Transplantation | 0 | 1 | NaN | 27333 | 63.70 | - | 0 | 40 | 0 | 2548 |
620534 | 144806 | 2021 | Electrical and Electronic Engineering | 0 | 17 | NaN | 27334 | 22.32 | - | 0 | 31 | 0 | 692 |
620535 | 144807 | 2021 | Electrical and Electronic Engineering | 0 | 10 | NaN | 27335 | 20.80 | - | 0 | 20 | 0 | 416 |
620536 | 144808 | 2021 | Computer Science Applications; Information Systems | 0 | 21 | NaN | 27336 | 24.33 | - | 0 | 21 | 0 | 511 |
620537 | 144813 | 2021 | Computer Science Applications; Control and Systems Engineering | 0 | 27 | NaN | 27337 | 22.82 | - | 0 | 38 | 0 | 867 |
620538 | 21101046690 | 2021 | Arts and Humanities (miscellaneous); Communication; History; Library and Information Sciences | 0 | 0 | NaN | 27338 | 40.63 | - | 0 | 19 | 0 | 772 |
620539 | 20904 | 2021 | Medicine (miscellaneous) | 0 | 20 | NaN | 27339 | 27.19 | - | 0 | 134 | 0 | 3643 |