Overview

Dataset statistics

Number of variables: 3
Number of observations: 844213
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 19.3 MiB
Average record size in memory: 24.0 B
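The two memory figures above are consistent with each other. A quick check, assuming "MiB" here means 2**20 bytes and that the total is simply the record count times the average record size:

```python
# Values copied from the dataset statistics above.
n_observations = 844_213
avg_record_bytes = 24.0

total_bytes = n_observations * avg_record_bytes
total_mib = total_bytes / 2**20  # assumption: 1 MiB = 1,048,576 bytes

print(f"{total_mib:.1f} MiB")
```

The same arithmetic applied to one 8-byte column (844213 * 8 bytes) gives roughly the 6.4 MiB reported per variable further down.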

Variable types

Numeric: 1
Categorical: 2

Alerts

issue__neid has a high cardinality: 82540 distinct values (High cardinality)
paper__pid has a high cardinality: 380892 distinct values (High cardinality)
ind has 67412 (8.0%) zeros (Zeros)

Reproduction

Analysis started: 2023-05-04 15:12:38.891182
Analysis finished: 2023-05-04 15:12:41.209575
Duration: 2.32 seconds
Software version: ydata-profiling v4.1.2
Download configuration: config.json
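The reported duration follows directly from the two timestamps. A small check, using only the values printed above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-05-04 15:12:38.891182")
finished = datetime.fromisoformat("2023-05-04 15:12:41.209575")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```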

Variables

ind
Real number (ℝ)

Distinct: 235
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.492714516
Minimum: 0
Maximum: 242
Zeros: 67412
Zeros (%): 8.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.4 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 3
Median: 6
Q3: 13
95-th percentile: 30
Maximum: 242
Range: 242
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 11.30564171
Coefficient of variation (CV): 1.190980904
Kurtosis: 42.42301256
Mean: 9.492714516
Median Absolute Deviation (MAD): 4
Skewness: 4.510915418
Sum: 8013873
Variance: 127.8175345
Monotonicity: Not monotonic
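Several of the statistics for ind are derived from one another. The sketch below recomputes them from the reported values only (the raw data is not used), assuming the usual definitions: mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean, IQR = Q3 - Q1, range = maximum - minimum.

```python
# Values copied from the report for the variable `ind`.
n = 844_213
total = 8_013_873
std = 11.30564171
q1, q3 = 3, 13
minimum, maximum = 0, 242

mean = total / n                 # Sum divided by the number of observations
variance = std ** 2              # variance is the squared standard deviation
cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"mean={mean:.6f} variance={variance:.4f} cv={cv:.6f} "
      f"iqr={iqr} range={value_range}")
```

A CV above 1 together with a mean (about 9.49) well above the median (6) is in line with the strong right skew the report shows (skewness about 4.51).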
Histogram with fixed size bins (bins=50)
Common values

Value               Count    Frequency (%)
1                   69458    8.2%
2                   67906    8.0%
0                   67412    8.0%
3                   63522    7.5%
4                   58398    6.9%
5                   53045    6.3%
6                   48222    5.7%
7                   43480    5.2%
8                   39367    4.7%
9                   35293    4.2%
Other values (225)  298110   35.3%
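A table of this shape (the most frequent values plus one aggregated "Other values" row) can be built from raw values as sketched below. This is an illustration with the standard library, not ydata-profiling's own implementation; the function name `common_values` and the toy data are hypothetical.

```python
from collections import Counter

def common_values(values, top_n=10):
    """Return (value, count, percent) rows for the top_n most frequent
    values, followed by one aggregated 'Other values (k)' row if needed."""
    counts = Counter(values)
    total = sum(counts.values())
    top = counts.most_common(top_n)
    rows = [(value, count, 100 * count / total) for value, count in top]
    n_other = len(counts) - len(top)  # distinct values not shown individually
    if n_other > 0:
        other_count = total - sum(count for _, count in top)
        rows.append((f"Other values ({n_other})", other_count,
                     100 * other_count / total))
    return rows

# Toy data, not taken from the dataset.
sample = [1, 1, 1, 2, 2, 0, 3, 4]
for value, count, pct in common_values(sample, top_n=2):
    print(f"{value}\t{count}\t{pct:.1f}%")
```

In the report's table the same bookkeeping holds: 235 distinct values minus the 10 listed leaves 225, and the listed counts plus 298110 add up to the 844213 observations.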
Minimum 5 values

Value   Count    Frequency (%)
0       67412    8.0%
1       69458    8.2%
2       67906    8.0%
3       63522    7.5%
4       58398    6.9%

Maximum 5 values

Value   Count    Frequency (%)
242     1        < 0.1%
241     1        < 0.1%
240     1        < 0.1%
239     1        < 0.1%
238     1        < 0.1%

issue__neid
Categorical

Distinct: 82540
Distinct (%): 9.8%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 MiB

Value                 Count
nepdem-2022-07-18     206
nepger-2015-08-30     176
nepgen-2020-05-11     167
nepisf-2021-08-30     166
nepisf-2021-08-16     160
Other values (82535)  843338

Unique

Unique: 4405
Unique (%): 0.5%
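"Distinct" and "Unique" measure different things in this report: distinct counts how many different values occur, while unique counts the values that occur exactly once. A minimal sketch of that distinction on toy data (the helper name `distinct_and_unique` is hypothetical):

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for count in counts.values() if count == 1)
    return n_distinct, n_unique

# Toy data: "a" and "c" repeat, "b" and "d" appear once.
toy = ["a", "a", "b", "c", "c", "d"]
print(distinct_and_unique(toy))  # (4, 2)
```

Read this way, 4405 of the 82540 distinct issue__neid values appear in a single row only.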

Sample

1st row: nepenv-2022-12-05
2nd row: nepenv-2022-12-05
3rd row: nepenv-2022-12-05
4th row: nepenv-2022-12-05
5th row: nepenv-2022-12-05

Common Values

Value                 Count    Frequency (%)
nepdem-2022-07-18     206      < 0.1%
nepger-2015-08-30     176      < 0.1%
nepgen-2020-05-11     167      < 0.1%
nepisf-2021-08-30     166      < 0.1%
nepisf-2021-08-16     160      < 0.1%
nepban-2022-03-21     153      < 0.1%
nepisf-2021-09-13     152      < 0.1%
nepsog-2016-09-18     152      < 0.1%
neppke-2016-05-08     151      < 0.1%
nepmfd-2015-03-05     147      < 0.1%
Other values (82530)  842583   99.8%

paper__pid
Categorical

Distinct: 380892
Distinct (%): 45.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 MiB

Value                  Count
eguwpaper/1009         13
hhscesisp/0291         12
pramprapa/108717       12
diwdiwwpp/dp1371       12
csawpaper/2014-25      12
Other values (380887)  844152

Unique

Unique: 135249
Unique (%): 16.0%

Sample

1st row: bdrborrec/1218
2nd row: fipfedhwp/95078
3rd row: cesceswps/_5f10053
4th row: fipfedgfe/2022-73
5th row: fipfedgfe/2022-68

Common Values

Value                    Count    Frequency (%)
eguwpaper/1009           13       < 0.1%
hhscesisp/0291           12       < 0.1%
pramprapa/108717         12       < 0.1%
diwdiwwpp/dp1371         12       < 0.1%
csawpaper/2014-25        12       < 0.1%
eguwpaper/2205           12       < 0.1%
cesceswps/_5f9198        12       < 0.1%
izaizadps/dp8097         12       < 0.1%
halwpaper/hal-00639049   11       < 0.1%
bdiwptemi/td_5f960_5f14  11       < 0.1%
Other values (380882)    844094   > 99.9%