1 Introduction
The EDA Report provides exploratory data analysis information on data.frame objects and on objects that inherit from data.frame.
1.1 Information of Dataset
The dataset that generated the EDA Report is a ‘data.frame’ object. It consists of 14 observations and 15 variables.
1.2 Information of Variables
The variable information of the dataset that generated the EDA Report is shown in the following table:
variables | types | missing_count | missing_percent | unique_count | unique_rate |
---|---|---|---|---|---|
Densidade | numeric | 0 | 0 | 14 | 1.0000000 |
Percentagem.floresta | numeric | 0 | 0 | 12 | 0.8571429 |
Coelho | numeric | 0 | 0 | 14 | 1.0000000 |
Raposa | numeric | 0 | 0 | 13 | 0.9285714 |
Precipitação | numeric | 0 | 0 | 14 | 1.0000000 |
Humidade | numeric | 0 | 0 | 9 | 0.6428571 |
Declive | numeric | 0 | 0 | 4 | 0.2857143 |
Insectos | numeric | 0 | 0 | 10 | 0.7142857 |
Agricola | numeric | 0 | 0 | 10 | 0.7142857 |
Ratos | numeric | 0 | 0 | 14 | 1.0000000 |
Temperatura | numeric | 0 | 0 | 11 | 0.7857143 |
Estradas | numeric | 0 | 0 | 13 | 0.9285714 |
Prox.casas | numeric | 0 | 0 | 12 | 0.8571429 |
Tipo.solo | numeric | 0 | 0 | 1 | 0.0714286 |
Agua | numeric | 0 | 0 | 2 | 0.1428571 |
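The unique_rate column in the table above is consistent with unique_count divided by the 14 observations. The short Python sketch below recomputes it from the counts in the table and flags variables with a single distinct value; the names `unique_counts`, `unique_rate`, and `constant_vars` are illustrative and not part of the report.

```python
# Counts copied from the variable table; N_OBS is the reported number of observations.
N_OBS = 14
unique_counts = {
    "Densidade": 14, "Percentagem.floresta": 12, "Coelho": 14, "Raposa": 13,
    "Precipitação": 14, "Humidade": 9, "Declive": 4, "Insectos": 10,
    "Agricola": 10, "Ratos": 14, "Temperatura": 11, "Estradas": 13,
    "Prox.casas": 12, "Tipo.solo": 1, "Agua": 2,
}


def unique_rate(unique_count, n_obs):
    """Share of observations that carry a distinct value."""
    return unique_count / n_obs


rates = {name: unique_rate(k, N_OBS) for name, k in unique_counts.items()}

# A variable with one distinct value is constant across all observations.
constant_vars = [name for name, k in unique_counts.items() if k == 1]

for name, rate in rates.items():
    print(f"{name:22s} {rate:.7f}")
print("constant:", constant_vars)
```

Tipo.solo is the only constant variable, which is presumably why it does not appear in the normality tests, the correlation table, or the linear models later in the report.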
The target variable of the data is ‘Densidade’, and the data type of the variable is numeric.
1.3 About EDA Report
The EDA Report provides information and visualization results that support the EDA process. In particular, it provides a variety of information for understanding the relationship between the target variable and the other variables of interest.
2 Univariate Analysis
2.1 Descriptive Statistics
15 Variables 14 Observations
Densidade
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 14 | 1 | 76.74 | 19.69 | 52.57 | 54.46 | 63.72 | 78.20 | 91.18 | 96.97 | 98.04 |
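Value | 51.4 | 53.2 | 57.4 | 62.8 | 66.5 | 72.1 | 74.8 | 81.6 | 83.2 | 88.1 | 92.2 | 94.8 | 97.9 | 98.3 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Frequency | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |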
Percentagem.floresta
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 12 | 0.996 | 7.171 | 6.459 | 0.20 | 0.26 | 2.00 | 8.20 | 10.57 | 10.77 | 14.19 |
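Value | 0.2 | 0.4 | 1.9 | 2.3 | 6.3 | 6.8 | 9.6 | 10.5 | 10.6 | 10.7 | 10.8 | 20.5 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Frequency | 2 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.143 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.143 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |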
Coelho
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 14 | 1 | 26.99 | 9.113 | 16.54 | 17.32 | 18.93 | 28.80 | 33.58 | 35.74 | 36.53 |
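Value | 15.3 | 17.2 | 17.6 | 17.8 | 22.3 | 26.3 | 28.2 | 29.4 | 29.6 | 30.2 | 34.7 | 35.6 | 35.8 | 37.9 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Frequency | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |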
Raposa
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 13 | 0.998 | 14.39 | 7.113 | 8.065 | 8.830 | 10.600 | 11.400 | 19.700 | 23.430 | 25.265 |
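Value | 6.7 | 8.8 | 8.9 | 10.6 | 10.8 | 11.1 | 11.7 | 11.9 | 18.5 | 20.1 | 20.7 | 24.6 | 26.5 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Frequency | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.071 | 0.071 | 0.071 | 0.143 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |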
Precipitação
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 14 | 1 | 11.96 | 8.991 | 3.75 | 5.53 | 6.20 | 9.00 | 17.75 | 22.06 | 24.24 |
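Value | 0.5 | 5.5 | 5.6 | 5.9 | 7.1 | 7.8 | 8.0 | 10.0 | 13.1 | 14.3 | 18.9 | 20.8 | 22.6 | 27.3 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Frequency | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |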
Humidade
n | missing | distinct | Info | Mean | Gmd |
---|---|---|---|---|---|
14 | 0 | 9 | 0.954 | 1.264 | 1.522 |
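Value | 0.0 | 0.5 | 1.0 | 1.2 | 1.5 | 2.2 | 2.4 | 2.7 | 4.7 |
---|---|---|---|---|---|---|---|---|---|
Frequency | 5 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 |
Proportion | 0.357 | 0.071 | 0.071 | 0.071 | 0.143 | 0.071 | 0.071 | 0.071 | 0.071 |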
Declive
n | missing | distinct | Info | Mean | Gmd |
---|---|---|---|---|---|
14 | 0 | 4 | 0.516 | 0.3929 | 0.6956 |
Value | 0.0 | 1.1 | 1.9 | 2.5 |
---|---|---|---|---|
Frequency | 11 | 1 | 1 | 1 |
Proportion | 0.786 | 0.071 | 0.071 | 0.071 |
Insectos
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 10 | 0.956 | 7.907 | 12.79 | 0.00 | 0.00 | 0.00 | 1.40 | 4.45 | 31.05 | 37.39 |
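Value | 0.0 | 0.9 | 1.2 | 1.6 | 2.0 | 2.5 | 5.1 | 20.9 | 35.4 | 41.1 |
---|---|---|---|---|---|---|---|---|---|---|
Frequency | 5 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.357 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |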
Agricola
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 10 | 0.956 | 2.386 | 3.007 | 0.00 | 0.00 | 0.00 | 1.60 | 4.45 | 6.64 | 6.77 |
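Value | 0.0 | 0.7 | 1.5 | 1.7 | 1.8 | 2.5 | 5.1 | 6.5 | 6.7 | 6.9 |
---|---|---|---|---|---|---|---|---|---|---|
Frequency | 5 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.357 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |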
Ratos
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 14 | 1 | 33.35 | 22.14 | 9.62 | 15.28 | 22.32 | 25.70 | 49.80 | 58.90 | 59.78 |
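Value | 0.0 | 14.8 | 16.4 | 21.9 | 23.6 | 23.7 | 24.0 | 27.4 | 42.0 | 44.1 | 51.7 | 57.5 | 59.5 | 60.3 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Frequency | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |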
Temperatura
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 11 | 0.989 | 12.18 | 14.18 | 0.000 | 0.000 | 1.025 | 9.300 | 19.025 | 29.910 | 33.845 |
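Value | 0.0 | 0.5 | 2.6 | 6.9 | 9.3 | 11.5 | 13.7 | 20.8 | 26.2 | 31.5 | 38.2 |
---|---|---|---|---|---|---|---|---|---|---|---|
Frequency | 3 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.214 | 0.071 | 0.071 | 0.071 | 0.143 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |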
Estradas
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 13 | 0.998 | 12.49 | 9.119 | 4.195 | 5.700 | 6.675 | 9.650 | 18.675 | 22.440 | 24.620 |
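Value | 1.4 | 5.7 | 6.5 | 7.2 | 7.9 | 8.5 | 10.8 | 13.8 | 15.0 | 19.9 | 21.6 | 22.8 | 28.0 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Frequency | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.071 | 0.143 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |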
Prox.casas
n | missing | distinct | Info | Mean | Gmd | .05 | .10 | .25 | .50 | .75 | .90 | .95 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14 | 0 | 12 | 0.991 | 27.59 | 9.1 | 17.43 | 18.10 | 19.33 | 29.00 | 34.40 | 36.44 | 37.23 |
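Value | 16.2 | 18.1 | 23.0 | 27.1 | 28.6 | 29.4 | 29.9 | 30.8 | 35.6 | 36.3 | 36.5 | 38.6 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Frequency | 1 | 3 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Proportion | 0.071 | 0.214 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 | 0.071 |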
Tipo.solo
n | missing | distinct | Info | Mean | Gmd |
---|---|---|---|---|---|
14 | 0 | 1 | 0 | 2 | 0 |
Value | 2 |
---|---|
Frequency | 14 |
Proportion | 1 |
Agua
n | missing | distinct | Info | Sum | Mean | Gmd |
---|---|---|---|---|---|---|
14 | 0 | 2 | 0.754 | 7 | 0.5 | 0.5385 |
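The report does not state how the Gmd, Info, and quantile columns are computed. As a minimal sketch, the Python below assumes that Gmd is Gini's mean difference (the mean absolute difference over all pairs of observations), that quantiles use linear interpolation between order statistics, and that Info equals (1 - Σ p_i³) / (1 - 1/n²), where p_i is the relative frequency of each distinct value. The function names are illustrative. The data are rebuilt from the value and frequency listings for Densidade and Declive.

```python
from collections import Counter


def mean(xs):
    return sum(xs) / len(xs)


def gini_mean_difference(xs):
    """Mean absolute difference over all pairs of observations."""
    s = sorted(xs)
    n = len(s)
    # For sorted data the pairwise sum collapses to a weighted sum.
    weighted = sum((2 * i - n - 1) * x for i, x in enumerate(s, start=1))
    return 2.0 * weighted / (n * (n - 1))


def quantile(xs, p):
    """Quantile by linear interpolation between order statistics (assumed rule)."""
    s = sorted(xs)
    h = (len(s) - 1) * p
    lo = int(h)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (h - lo) * (s[hi] - s[lo])


def info(xs):
    """Assumed definition: (1 - sum(p_i^3)) / (1 - 1/n^2)."""
    n = len(xs)
    cubes = sum((count / n) ** 3 for count in Counter(xs).values())
    return (1.0 - cubes) / (1.0 - 1.0 / n ** 2)


# Rebuilt from the value/frequency listings in this section.
densidade = [51.4, 53.2, 57.4, 62.8, 66.5, 72.1, 74.8,
             81.6, 83.2, 88.1, 92.2, 94.8, 97.9, 98.3]
declive = [0.0] * 11 + [1.1, 1.9, 2.5]

print("Densidade mean:", round(mean(densidade), 2))
print("Densidade Gmd: ", round(gini_mean_difference(densidade), 2))
print("Densidade .25 and .50:", quantile(densidade, 0.25), quantile(densidade, 0.50))
print("Declive Info:", round(info(declive), 3))
print("Declive Gmd: ", round(gini_mean_difference(declive), 4))
```

Under these assumptions the results agree, up to rounding, with the values listed above for Densidade (Mean 76.74, Gmd 19.69, median 78.20) and Declive (Info 0.516, Gmd 0.6956).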
2.2 Normality Test of Numerical Variables
2.2.1 Statistics and Visualization of (Sample) Data
[ Percentagem.floresta ]
normality test : Shapiro-Wilk normality test
statistic : 0.88657, p-value : 0.0721477
type | skewness | kurtosis |
---|---|---|
original | 0.5451584 | 3.010174 |
log transformation | -0.9635486 | 2.412474 |
sqrt transformation | -0.3083150 | 1.980650 |
[ Coelho ]
normality test : Shapiro-Wilk normality test
statistic : 0.91539, p-value : 0.188675
type | skewness | kurtosis |
---|---|---|
original | -0.1889050 | 1.662889 |
log transformation | -0.4467146 | 1.744336 |
sqrt transformation | -0.3207935 | 1.688338 |
[ Raposa ]
normality test : Shapiro-Wilk normality test
statistic : 0.86375, p-value : 0.0343213
type | skewness | kurtosis |
---|---|---|
original | 0.7189554 | 2.092522 |
log transformation | 0.3143344 | 1.896306 |
sqrt transformation | 0.5350215 | 1.927766 |
[ Precipitação ]
normality test : Shapiro-Wilk normality test
statistic : 0.93057, p-value : 0.310781
type | skewness | kurtosis |
---|---|---|
original | 0.5619844 | 2.225319 |
log transformation | -1.6825470 | 6.141213 |
sqrt transformation | -0.1928128 | 2.668658 |
[ Humidade ]
normality test : Shapiro-Wilk normality test
statistic : 0.85871, p-value : 0.0292212
type | skewness | kurtosis |
---|---|---|
original | 1.0686247 | 3.624844 |
log transformation | NaN | NaN |
sqrt transformation | 0.0249135 | 1.718327 |
[ Declive ]
normality test : Shapiro-Wilk normality test
statistic : 0.54992, p-value : 1.5011E-05
type | skewness | kurtosis |
---|---|---|
original | 1.753977 | 4.477464 |
log transformation | NaN | NaN |
sqrt transformation | 1.507591 | 3.433156 |
[ Insectos ]
normality test : Shapiro-Wilk normality test
statistic : 0.62191, p-value : 6.40237E-05
type | skewness | kurtosis |
---|---|---|
original | 1.625946 | 3.980389 |
log transformation | NaN | NaN |
sqrt transformation | 1.136235 | 2.872752 |
[ Agricola ]
normality test : Shapiro-Wilk normality test
statistic : 0.7974, p-value : 0.00463533
type | skewness | kurtosis |
---|---|---|
original | 0.7457069 | 1.926734 |
log transformation | NaN | NaN |
sqrt transformation | 0.1441511 | 1.555480 |
[ Ratos ]
normality test : Shapiro-Wilk normality test
statistic : 0.92353, p-value : 0.247053
type | skewness | kurtosis |
---|---|---|
original | 0.080069 | 1.864322 |
log transformation | NaN | NaN |
sqrt transformation | -1.078792 | 4.243522 |
[ Temperatura ]
normality test : Shapiro-Wilk normality test
statistic : 0.87729, p-value : 0.0531884
type | skewness | kurtosis |
---|---|---|
original | 0.8171775 | 2.471805 |
log transformation | NaN | NaN |
sqrt transformation | -0.0173183 | 1.796110 |
[ Estradas ]
normality test : Shapiro-Wilk normality test
statistic : 0.92612, p-value : 0.269029
type | skewness | kurtosis |
---|---|---|
original | 0.5663759 | 2.153198 |
log transformation | -0.8893821 | 3.805399 |
sqrt transformation | 0.0370818 | 2.225574 |
[ Prox.casas ]
normality test : Shapiro-Wilk normality test
statistic : 0.91092, p-value : 0.162526
type | skewness | kurtosis |
---|---|---|
original | -0.1499692 | 1.663703 |
log transformation | -0.4049146 | 1.714642 |
sqrt transformation | -0.2808116 | 1.676968 |
[ Agua ]
normality test : Shapiro-Wilk normality test
statistic : 0.6463, p-value : 0.000108276
type | skewness | kurtosis |
---|---|---|
original | 0 | 1 |
log transformation | NaN | NaN |
sqrt transformation | 0 | 1 |
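The skewness and kurtosis columns above are consistent with moment-based definitions: skewness m3 / m2^1.5 and non-excess kurtosis m4 / m2^2, where m_k is the k-th central moment divided by n. The sketch below, with illustrative function names, applies these assumed definitions to Declive and Agua. It also shows why the log-transformation rows are NaN for every variable whose value listing contains 0: log(0) is negative infinity, so the mean and all central moments are undefined. The Agua values are assumed to be coded 0/1 with seven ones, which matches the reported Sum of 7 and Mean of 0.5.

```python
import math


def moment_skew_kurt(xs):
    """Return (skewness, kurtosis) from central moments divided by n."""
    n = len(xs)
    mu = sum(xs) / n
    m2 = sum((x - mu) ** 2 for x in xs) / n
    m3 = sum((x - mu) ** 3 for x in xs) / n
    m4 = sum((x - mu) ** 4 for x in xs) / n
    return m3 / m2 ** 1.5, m4 / m2 ** 2


def safe_log(x):
    # Use the convention log(0) = -inf instead of raising ValueError.
    return math.log(x) if x > 0 else float("-inf")


declive = [0.0] * 11 + [1.1, 1.9, 2.5]   # from the Declive frequency listing
agua = [0.0] * 7 + [1.0] * 7             # assumed 0/1 coding (Sum 7, Mean 0.5)

for name, data in (("Declive", declive), ("Agua", agua)):
    print(name, "original:", moment_skew_kurt(data))
    print(name, "log:     ", moment_skew_kurt([safe_log(x) for x in data]))
    print(name, "sqrt:    ", moment_skew_kurt([math.sqrt(x) for x in data]))
```

Because the log transformation is unusable whenever zeros are present, the square-root row is the only transformed alternative the report offers for those variables.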
3 Relationship Between Variables
3.1 Correlation Coefficient
3.1.1 Correlation Coefficient by Variable Combination
Variable1 | Variable2 | Correlation Coefficient |
---|---|---|
Prox.casas | Coelho | 0.9993479 |
Estradas | Precipitação | 0.9992197 |
Estradas | Coelho | -0.9650285 |
Precipitação | Coelho | -0.9627921 |
Prox.casas | Estradas | -0.9605550 |
Prox.casas | Precipitação | -0.9581716 |
Agua | Densidade | 0.8855958 |
Prox.casas | Densidade | 0.8544413 |
Coelho | Densidade | 0.8498895 |
Raposa | Percentagem.floresta | -0.8353497 |
Estradas | Densidade | -0.8205253 |
Precipitação | Densidade | -0.8133017 |
Agua | Prox.casas | 0.7962256 |
Agua | Estradas | -0.7917476 |
Agua | Coelho | 0.7864558 |
Agua | Precipitação | -0.7842829 |
Ratos | Agricola | 0.7310285 |
Percentagem.floresta | Densidade | 0.7004555 |
Agua | Percentagem.floresta | 0.6111027 |
Insectos | Raposa | 0.5842812 |
Raposa | Densidade | -0.5583191 |
Insectos | Percentagem.floresta | -0.5569839 |
Insectos | Densidade | -0.5459909 |
Declive | Coelho | -0.5161795 |
Prox.casas | Declive | -0.5069721 |
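For a linear model with a single predictor, the coefficient of determination equals the squared Pearson correlation between predictor and response. As a quick consistency check, the sketch below squares each correlation with Densidade from the table above and compares it with the Multiple R-squared reported for the same predictor in section 4.1.1. The list name `reported` and the tolerance are illustrative choices.

```python
# (predictor, correlation with Densidade, Multiple R-squared from section 4.1.1)
reported = [
    ("Agua", 0.8855958, 0.78428),
    ("Prox.casas", 0.8544413, 0.73007),
    ("Coelho", 0.8498895, 0.72231),
    ("Estradas", -0.8205253, 0.67326),
    ("Precipitação", -0.8133017, 0.66146),
    ("Percentagem.floresta", 0.7004555, 0.49064),
    ("Raposa", -0.5583191, 0.31172),
    ("Insectos", -0.5459909, 0.29811),
]

TOLERANCE = 1e-5  # R-squared is printed to five decimals

mismatches = []
for name, r, r_squared in reported:
    squared = r * r
    print(f"{name:22s} r^2 = {squared:.5f}   reported R^2 = {r_squared:.5f}")
    if abs(squared - r_squared) > TOLERANCE:
        mismatches.append(name)

print("mismatches:", mismatches)
```

Predictors whose correlation with Densidade does not appear in the table (for example Humidade or Temperatura) cannot be checked this way, since the table lists only the stronger correlations.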
3.1.2 Correlation Plot of Numerical Variables
4 Target-Based Analysis
4.1 Grouped Descriptive Statistics
4.1.1 Grouped Numerical Variables
[ Percentagem.floresta ]
Simple Linear Model Information
Residual standard error: 12 on 12 degrees of freedom
Multiple R-squared: 0.49064, Adjusted R-squared: 0.44819
F-statistic: 12 on 1 and 12 DF, p-value: 0.0052713157382547
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 62.32 | 5.36 | 11.62 | 0.00 |
Percentagem.floresta | 2.01 | 0.59 | 3.40 | 0.01 |
Visualization:
[ Coelho ]
Simple Linear Model Information
Residual standard error: 9 on 12 degrees of freedom
Multiple R-squared: 0.72231, Adjusted R-squared: 0.69917
F-statistic: 31 on 1 and 12 DF, p-value: 0.00011857639249599
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 27.75 | 9.10 | 3.05 | 0.01 |
Coelho | 1.81 | 0.32 | 5.59 | 0.00 |
Visualization:
[ Raposa ]
Simple Linear Model Information
Residual standard error: 14 on 12 degrees of freedom
Multiple R-squared: 0.31172, Adjusted R-squared: 0.25436
F-statistic: 5 on 1 and 12 DF, p-value: 0.0379835265092045
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 97.65 | 9.75 | 10.01 | 0.00 |
Raposa | -1.45 | 0.62 | -2.33 | 0.04 |
Visualization:
[ Precipitação ]
Simple Linear Model Information
Residual standard error: 10 on 12 degrees of freedom
Multiple R-squared: 0.66146, Adjusted R-squared: 0.63325
F-statistic: 23 on 1 and 12 DF, p-value: 0.000403586823335824
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 97.37 | 5.03 | 19.34 | 0 |
Precipitação | -1.73 | 0.36 | -4.84 | 0 |
Visualization:
[ Humidade ]
Simple Linear Model Information
Residual standard error: 17 on 12 degrees of freedom
Multiple R-squared: 0.05741, Adjusted R-squared: -0.02114
F-statistic: 1 on 1 and 12 DF, p-value: 0.409341450882033
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 80.37 | 6.17 | 13.02 | 0.00 |
Humidade | -2.87 | 3.36 | -0.85 | 0.41 |
Visualization:
[ Declive ]
Simple Linear Model Information
Residual standard error: 16 on 12 degrees of freedom
Multiple R-squared: 0.10738, Adjusted R-squared: 0.033
F-statistic: 1 on 1 and 12 DF, p-value: 0.252732445790891
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 79.31 | 4.85 | 16.34 | 0.00 |
Declive | -6.56 | 5.46 | -1.20 | 0.25 |
Visualization:
[ Insectos ]
Simple Linear Model Information
Residual standard error: 14 on 12 degrees of freedom
Multiple R-squared: 0.29811, Adjusted R-squared: 0.23961
F-statistic: 5 on 1 and 12 DF, p-value: 0.0434011022875512
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 81.85 | 4.48 | 18.29 | 0.00 |
Insectos | -0.65 | 0.29 | -2.26 | 0.04 |
Visualization:
[ Agricola ]
Simple Linear Model Information
Residual standard error: 15 on 12 degrees of freedom
Multiple R-squared: 0.22218, Adjusted R-squared: 0.15736
F-statistic: 3 on 1 and 12 DF, p-value: 0.0888660093518512
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 69.89 | 5.49 | 12.72 | 0.00 |
Agricola | 2.87 | 1.55 | 1.85 | 0.09 |
Visualization:
[ Ratos ]
Simple Linear Model Information
Residual standard error: 16 on 12 degrees of freedom
Multiple R-squared: 0.11018, Adjusted R-squared: 0.03603
F-statistic: 1 on 1 and 12 DF, p-value: 0.246268931809105
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 67.10 | 9.02 | 7.44 | 0.00 |
Ratos | 0.29 | 0.24 | 1.22 | 0.25 |
Visualization:
[ Temperatura ]
Simple Linear Model Information
Residual standard error: 17 on 12 degrees of freedom
Multiple R-squared: 0.00992, Adjusted R-squared: -0.07259
F-statistic: 0 on 1 and 12 DF, p-value: 0.734795419733537
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 78.34 | 6.51 | 12.03 | 0.00 |
Temperatura | -0.13 | 0.38 | -0.35 | 0.73 |
Visualization:
[ Estradas ]
Simple Linear Model Information
Residual standard error: 10 on 12 degrees of freedom
Multiple R-squared: 0.67326, Adjusted R-squared: 0.64603
F-statistic: 25 on 1 and 12 DF, p-value: 0.000323855046548262
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 98.21 | 5.06 | 19.41 | 0 |
Estradas | -1.72 | 0.35 | -4.97 | 0 |
Visualization:
[ Prox.casas ]
Simple Linear Model Information
Residual standard error: 9 on 12 degrees of freedom
Multiple R-squared: 0.73007, Adjusted R-squared: 0.70758
F-statistic: 32 on 1 and 12 DF, p-value: 9.959890031143e-05
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 26.38 | 9.16 | 2.88 | 0.01 |
Prox.casas | 1.83 | 0.32 | 5.70 | 0.00 |
Visualization:
[ Agua ]
Simple Linear Model Information
Residual standard error: 8 on 12 degrees of freedom
Multiple R-squared: 0.78428, Adjusted R-squared: 0.7663
F-statistic: 44 on 1 and 12 DF, p-value: 2.51894588648466e-05
Coefficients:
Term | Estimate | Std. Error | t value | Pr(>\|t\|) |
---|---|---|---|---|
(Intercept) | 62.60 | 3.03 | 20.68 | 0 |
Agua | 28.27 | 4.28 | 6.61 | 0 |
Visualization:
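The summary lines of the models in this section are linked by standard identities. With n = 14 observations and p = 1 predictor, adjusted R² = 1 - (1 - R²)(n - 1)/(n - p - 1) and F = (R²/p) / ((1 - R²)/(n - p - 1)), which gives the 12 residual degrees of freedom shown in every block. The sketch below recomputes both quantities from the reported Multiple R-squared values; the dictionary `models` and the function names are illustrative.

```python
N_OBS = 14   # observations in the dataset
N_PRED = 1   # each model has one predictor


def adjusted_r_squared(r2, n, p):
    return 1.0 - (1.0 - r2) * (n - 1) / (n - p - 1)


def f_statistic(r2, n, p):
    return (r2 / p) / ((1.0 - r2) / (n - p - 1))


# predictor: (Multiple R-squared, Adjusted R-squared, F-statistic) as reported
models = {
    "Percentagem.floresta": (0.49064, 0.44819, 12),
    "Coelho": (0.72231, 0.69917, 31),
    "Raposa": (0.31172, 0.25436, 5),
    "Precipitação": (0.66146, 0.63325, 23),
    "Humidade": (0.05741, -0.02114, 1),
    "Declive": (0.10738, 0.033, 1),
    "Insectos": (0.29811, 0.23961, 5),
    "Agricola": (0.22218, 0.15736, 3),
    "Ratos": (0.11018, 0.03603, 1),
    "Temperatura": (0.00992, -0.07259, 0),
    "Estradas": (0.67326, 0.64603, 25),
    "Prox.casas": (0.73007, 0.70758, 32),
    "Agua": (0.78428, 0.7663, 44),
}

for name, (r2, adj_reported, f_reported) in models.items():
    adj = adjusted_r_squared(r2, N_OBS, N_PRED)
    f_val = f_statistic(r2, N_OBS, N_PRED)
    print(f"{name:22s} adj R^2 = {adj:8.5f} (reported {adj_reported})  "
          f"F = {f_val:6.2f} (reported {f_reported})")
```

The F-statistics in the report are printed as whole numbers, so the recomputed values are useful when comparing weak predictors such as Humidade, Declive, and Ratos, which all display F = 1.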
4.1.2 Grouped Categorical Variables
There is no categorical variable.
4.2 Grouped Relationship Between Variables
4.2.1 Grouped Correlation Coefficient
Numerical target variables are not supported.
4.2.2 Grouped Correlation Plot of Numerical Variables
Numerical target variables are not supported.