Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 844213 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 19.3 MiB |
Average record size in memory | 24.0 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 2 |
Alerts
issue__neid has a high cardinality: 82540 distinct values | High cardinality |
paper__pid has a high cardinality: 380892 distinct values | High cardinality |
ind has 67412 (8.0%) zeros | Zeros |
Reproduction
Analysis started | 2023-05-04 15:12:38.891182 |
---|---|
Analysis finished | 2023-05-04 15:12:41.209575 |
Duration | 2.32 seconds |
Software version | ydata-profiling vv4.1.2 |
Download configuration | config.json |
ind
Real number (ℝ)
Distinct | 235 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9.492714516 |
Minimum | 0 |
---|---|
Maximum | 242 |
Zeros | 67412 |
Zeros (%) | 8.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.4 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 3 |
median | 6 |
Q3 | 13 |
95-th percentile | 30 |
Maximum | 242 |
Range | 242 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 11.30564171 |
---|---|
Coefficient of variation (CV) | 1.190980904 |
Kurtosis | 42.42301256 |
Mean | 9.492714516 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 4.510915418 |
Sum | 8013873 |
Variance | 127.8175345 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 69458 | 8.2% |
2 | 67906 | 8.0% |
0 | 67412 | 8.0% |
3 | 63522 | 7.5% |
4 | 58398 | 6.9% |
5 | 53045 | 6.3% |
6 | 48222 | 5.7% |
7 | 43480 | 5.2% |
8 | 39367 | 4.7% |
9 | 35293 | 4.2% |
Other values (225) | 298110 |
Value | Count | Frequency (%) |
0 | 67412 | |
1 | 69458 | |
2 | 67906 | |
3 | 63522 | |
4 | 58398 |
Value | Count | Frequency (%) |
242 | 1 | |
241 | 1 | |
240 | 1 | |
239 | 1 | |
238 | 1 |
issue__neid
Categorical
Distinct | 82540 |
---|---|
Distinct (%) | 9.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.4 MiB |
nepdem-2022-07-18 | 206 |
---|---|
nepger-2015-08-30 | 176 |
nepgen-2020-05-11 | 167 |
nepisf-2021-08-30 | 166 |
nepisf-2021-08-16 | 160 |
Other values (82535) |
Unique
Unique | 4405 ? |
---|---|
Unique (%) | 0.5% |
Sample
1st row | nepenv-2022-12-05 |
---|---|
2nd row | nepenv-2022-12-05 |
3rd row | nepenv-2022-12-05 |
4th row | nepenv-2022-12-05 |
5th row | nepenv-2022-12-05 |
Common Values
Value | Count | Frequency (%) |
nepdem-2022-07-18 | 206 | < 0.1% |
nepger-2015-08-30 | 176 | < 0.1% |
nepgen-2020-05-11 | 167 | < 0.1% |
nepisf-2021-08-30 | 166 | < 0.1% |
nepisf-2021-08-16 | 160 | < 0.1% |
nepban-2022-03-21 | 153 | < 0.1% |
nepisf-2021-09-13 | 152 | < 0.1% |
nepsog-2016-09-18 | 152 | < 0.1% |
neppke-2016-05-08 | 151 | < 0.1% |
nepmfd-2015-03-05 | 147 | < 0.1% |
Other values (82530) | 842583 |
paper__pid
Categorical
Distinct | 380892 |
---|---|
Distinct (%) | 45.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.4 MiB |
eguwpaper/1009 | 13 |
---|---|
hhscesisp/0291 | 12 |
pramprapa/108717 | 12 |
diwdiwwpp/dp1371 | 12 |
csawpaper/2014-25 | 12 |
Other values (380887) |
Unique
Unique | 135249 ? |
---|---|
Unique (%) | 16.0% |
Sample
1st row | bdrborrec/1218 |
---|---|
2nd row | fipfedhwp/95078 |
3rd row | cesceswps/_5f10053 |
4th row | fipfedgfe/2022-73 |
5th row | fipfedgfe/2022-68 |
Common Values
Value | Count | Frequency (%) |
eguwpaper/1009 | 13 | < 0.1% |
hhscesisp/0291 | 12 | < 0.1% |
pramprapa/108717 | 12 | < 0.1% |
diwdiwwpp/dp1371 | 12 | < 0.1% |
csawpaper/2014-25 | 12 | < 0.1% |
eguwpaper/2205 | 12 | < 0.1% |
cesceswps/_5f9198 | 12 | < 0.1% |
izaizadps/dp8097 | 12 | < 0.1% |
halwpaper/hal-00639049 | 11 | < 0.1% |
bdiwptemi/td_5f960_5f14 | 11 | < 0.1% |
Other values (380882) | 844094 |