Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.0 KiB |
Average record size in memory | 41.3 B |
Variable types
NUM | 5 |
---|
Reproduction
Analysis started | 2020-03-12 06:05:00.839849 |
---|---|
Analysis finished | 2020-03-12 06:05:06.158345 |
Version | pandas-profiling v2.5.0 |
Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
Download configuration | config.yaml |
Distinct count | 100 |
---|---|
Unique (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.5217951022108731 |
---|---|
Minimum | 0.014636384701227967 |
Maximum | 0.9982151573421486 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 928.0 B |
Quantile statistics
Minimum | 0.0146363847 |
---|---|
5-th percentile | 0.07577299662 |
Q1 | 0.3273466411 |
median | 0.5130468168 |
Q3 | 0.7195448121 |
95-th percentile | 0.9625653707 |
Maximum | 0.9982151573 |
Range | 0.9835787726 |
Interquartile range (IQR) | 0.392198171 |
Descriptive statistics
Standard deviation | 0.268149702 |
---|---|
Coefficient of variation (CV) | 0.5138984649 |
Kurtosis | -0.9704911598 |
Mean | 0.5217951022 |
Median Absolute Deviation (MAD) | 0.2229684747 |
Skewness | -0.06856949839 |
Sum | 52.17951022 |
Variance | 0.07190426268 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01463638 0.99821516], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
0.1651171697 | 1 | 1.0% | |
0.1480252679 | 1 | 1.0% | |
0.5940523526 | 1 | 1.0% | |
0.5595833432 | 1 | 1.0% | |
0.6808084366 | 1 | 1.0% | |
0.5472695527 | 1 | 1.0% | |
0.8730897958 | 1 | 1.0% | |
0.1140745295 | 1 | 1.0% | |
0.9624279049 | 1 | 1.0% | |
0.4948157306 | 1 | 1.0% | |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) | |
0.0146363847 | 1 | 1.0% | |
0.04021255319 | 1 | 1.0% | |
0.05132556595 | 1 | 1.0% | |
0.05783541219 | 1 | 1.0% | |
0.06253453379 | 1 | 1.0% |
Value | Count | Frequency (%) | |
0.9982151573 | 1 | 1.0% | |
0.9831905776 | 1 | 1.0% | |
0.9780570719 | 1 | 1.0% | |
0.9687640232 | 1 | 1.0% | |
0.965177221 | 1 | 1.0% |
Distinct count | 100 |
---|---|
Unique (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.5365352442023252 |
---|---|
Minimum | 0.01416011603044065 |
Maximum | 0.9659268412154978 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 928.0 B |
Quantile statistics
Minimum | 0.01416011603 |
---|---|
5-th percentile | 0.05136493031 |
Q1 | 0.3077798149 |
median | 0.5405829856 |
Q3 | 0.7880516894 |
95-th percentile | 0.9482418331 |
Maximum | 0.9659268412 |
Range | 0.9517667252 |
Interquartile range (IQR) | 0.4802718744 |
Descriptive statistics
Standard deviation | 0.2838002408 |
---|---|
Coefficient of variation (CV) | 0.5289498573 |
Kurtosis | -1.100279928 |
Mean | 0.5365352442 |
Median Absolute Deviation (MAD) | 0.241870727 |
Skewness | -0.06310923135 |
Sum | 53.65352442 |
Variance | 0.0805425767 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01416012 0.90568316 0.96592684], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
0.9595152223 | 1 | 1.0% | |
0.2857310937 | 1 | 1.0% | |
0.9154759747 | 1 | 1.0% | |
0.01416011603 | 1 | 1.0% | |
0.7396548084 | 1 | 1.0% | |
0.4224950534 | 1 | 1.0% | |
0.8341035428 | 1 | 1.0% | |
0.6108854553 | 1 | 1.0% | |
0.4995651186 | 1 | 1.0% | |
0.243781658 | 1 | 1.0% | |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) | |
0.01416011603 | 1 | 1.0% | |
0.01863789784 | 1 | 1.0% | |
0.01872780253 | 1 | 1.0% | |
0.02721137234 | 1 | 1.0% | |
0.03444677325 | 1 | 1.0% |
Value | Count | Frequency (%) | |
0.9659268412 | 1 | 1.0% | |
0.9621690043 | 1 | 1.0% | |
0.9595152223 | 1 | 1.0% | |
0.955758438 | 1 | 1.0% | |
0.9525463492 | 1 | 1.0% |
Distinct count | 100 |
---|---|
Unique (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.47793724653487835 |
---|---|
Minimum | 0.012540225290272433 |
Maximum | 0.9918251007879255 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 928.0 B |
Quantile statistics
Minimum | 0.01254022529 |
---|---|
5-th percentile | 0.03652304945 |
Q1 | 0.2310268368 |
median | 0.4725896425 |
Q3 | 0.7273476852 |
95-th percentile | 0.9170778933 |
Maximum | 0.9918251008 |
Range | 0.9792848755 |
Interquartile range (IQR) | 0.4963208484 |
Descriptive statistics
Standard deviation | 0.2865663632 |
---|---|
Coefficient of variation (CV) | 0.5995899362 |
Kurtosis | -1.168478967 |
Mean | 0.4779372465 |
Median Absolute Deviation (MAD) | 0.2464973509 |
Skewness | 0.08584619993 |
Sum | 47.79372465 |
Variance | 0.0821202805 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01254023 0.9918251 ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
0.3277253874 | 1 | 1.0% | |
0.7233180123 | 1 | 1.0% | |
0.2280619618 | 1 | 1.0% | |
0.2014470165 | 1 | 1.0% | |
0.09211328288 | 1 | 1.0% | |
0.606460476 | 1 | 1.0% | |
0.6184725859 | 1 | 1.0% | |
0.05900069075 | 1 | 1.0% | |
0.5687168232 | 1 | 1.0% | |
0.6700752913 | 1 | 1.0% | |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) | |
0.01254022529 | 1 | 1.0% | |
0.02244242453 | 1 | 1.0% | |
0.02421501622 | 1 | 1.0% | |
0.02525429794 | 1 | 1.0% | |
0.03624962806 | 1 | 1.0% |
Value | Count | Frequency (%) | |
0.9918251008 | 1 | 1.0% | |
0.9836072115 | 1 | 1.0% | |
0.9829415195 | 1 | 1.0% | |
0.9508000727 | 1 | 1.0% | |
0.9295051978 | 1 | 1.0% |
Distinct count | 100 |
---|---|
Unique (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.4735091751153356 |
---|---|
Minimum | 0.01005692358975685 |
Maximum | 0.9703035898477324 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 928.0 B |
Quantile statistics
Minimum | 0.01005692359 |
---|---|
5-th percentile | 0.08863658133 |
Q1 | 0.2284389804 |
median | 0.477830546 |
Q3 | 0.6501625767 |
95-th percentile | 0.8837619216 |
Maximum | 0.9703035898 |
Range | 0.9602466663 |
Interquartile range (IQR) | 0.4217235963 |
Descriptive statistics
Standard deviation | 0.2460331541 |
---|---|
Coefficient of variation (CV) | 0.5195953258 |
Kurtosis | -0.8098688778 |
Mean | 0.4735091751 |
Median Absolute Deviation (MAD) | 0.1988813546 |
Skewness | -0.0005005975408 |
Sum | 47.35091751 |
Variance | 0.06053231292 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01005692 0.97030359], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
0.4776485534 | 1 | 1.0% | |
0.5331229471 | 1 | 1.0% | |
0.4096778008 | 1 | 1.0% | |
0.7295297924 | 1 | 1.0% | |
0.2763906535 | 1 | 1.0% | |
0.3314203812 | 1 | 1.0% | |
0.9703035898 | 1 | 1.0% | |
0.5873387461 | 1 | 1.0% | |
0.5759141427 | 1 | 1.0% | |
0.4875775633 | 1 | 1.0% | |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) | |
0.01005692359 | 1 | 1.0% | |
0.01116948476 | 1 | 1.0% | |
0.06907844988 | 1 | 1.0% | |
0.07544364778 | 1 | 1.0% | |
0.08839065659 | 1 | 1.0% |
Value | Count | Frequency (%) | |
0.9703035898 | 1 | 1.0% | |
0.9624569582 | 1 | 1.0% | |
0.9497318499 | 1 | 1.0% | |
0.9222317551 | 1 | 1.0% | |
0.900087811 | 1 | 1.0% |
Distinct count | 100 |
---|---|
Unique (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.5288583631413339 |
---|---|
Minimum | 0.018348094296209205 |
Maximum | 0.9747194521275142 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 928.0 B |
Quantile statistics
Minimum | 0.0183480943 |
---|---|
5-th percentile | 0.03994839296 |
Q1 | 0.2905421171 |
median | 0.5350562462 |
Q3 | 0.7825261465 |
95-th percentile | 0.9666249084 |
Maximum | 0.9747194521 |
Range | 0.9563713578 |
Interquartile range (IQR) | 0.4919840294 |
Descriptive statistics
Standard deviation | 0.2954455543 |
---|---|
Coefficient of variation (CV) | 0.5586477871 |
Kurtosis | -1.194133776 |
Mean | 0.5288583631 |
Median Absolute Deviation (MAD) | 0.2546875694 |
Skewness | -0.1145953326 |
Sum | 52.88583631 |
Variance | 0.08728807553 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.01834809 0.96769561 0.97471945], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
0.4311678549 | 1 | 1.0% | |
0.4231599667 | 1 | 1.0% | |
0.07479239725 | 1 | 1.0% | |
0.0183480943 | 1 | 1.0% | |
0.8207956025 | 1 | 1.0% | |
0.7404067816 | 1 | 1.0% | |
0.8911488286 | 1 | 1.0% | |
0.8369862828 | 1 | 1.0% | |
0.3987414924 | 1 | 1.0% | |
0.173272151 | 1 | 1.0% | |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) | |
0.0183480943 | 1 | 1.0% | |
0.02818261854 | 1 | 1.0% | |
0.03006211426 | 1 | 1.0% | |
0.03368911438 | 1 | 1.0% | |
0.03599039722 | 1 | 1.0% |
Value | Count | Frequency (%) | |
0.9747194521 | 1 | 1.0% | |
0.9726062249 | 1 | 1.0% | |
0.97230883 | 1 | 1.0% | |
0.9694933881 | 1 | 1.0% | |
0.9688852726 | 1 | 1.0% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
a | b | c | d | e | |
---|---|---|---|---|---|
0 | 0.551362 | 0.389897 | 0.133736 | 0.075444 | 0.225405 |
1 | 0.697229 | 0.663928 | 0.044779 | 0.130970 | 0.628501 |
2 | 0.353078 | 0.200953 | 0.568717 | 0.221570 | 0.230711 |
3 | 0.493304 | 0.473321 | 0.787721 | 0.761571 | 0.447126 |
4 | 0.416997 | 0.250560 | 0.826592 | 0.361298 | 0.074792 |
5 | 0.669116 | 0.243782 | 0.586026 | 0.465262 | 0.398741 |
6 | 0.366836 | 0.955758 | 0.407857 | 0.164965 | 0.772414 |
7 | 0.823817 | 0.762897 | 0.375249 | 0.461134 | 0.820796 |
8 | 0.796029 | 0.836707 | 0.059001 | 0.882903 | 0.324417 |
9 | 0.508174 | 0.381737 | 0.162696 | 0.604456 | 0.868987 |
Last rows
a | b | c | d | e | |
---|---|---|---|---|---|
90 | 0.637922 | 0.590236 | 0.898878 | 0.219186 | 0.235753 |
91 | 0.204678 | 0.948015 | 0.036537 | 0.754006 | 0.752359 |
92 | 0.162262 | 0.287105 | 0.201447 | 0.462256 | 0.463122 |
93 | 0.423546 | 0.945319 | 0.638775 | 0.749892 | 0.076021 |
94 | 0.820633 | 0.499565 | 0.670075 | 0.775760 | 0.133355 |
95 | 0.114075 | 0.580827 | 0.188434 | 0.336427 | 0.256029 |
96 | 0.449047 | 0.421929 | 0.991825 | 0.720248 | 0.789838 |
97 | 0.998215 | 0.248163 | 0.630692 | 0.731906 | 0.173272 |
98 | 0.464312 | 0.078101 | 0.415281 | 0.617187 | 0.541404 |
99 | 0.820212 | 0.296863 | 0.279386 | 0.414434 | 0.018348 |