1 Introduction

The EDA Report provides exploratory data analysis information on objects that inherit data.frame and data.frame.

1.1 Information of Dataset

The dataset that generated the EDA Report is an ‘data.frame’ object. It consists of 14 observations and 15 variables.

1.2 Information of Variables

The variable information of the data set that generated the EDA Report is shown in the following table.:
Information of Variables
variables types missing_count missing_percent unique_count unique_rate
Densidade numeric 0 0 14 1.0000000
Percentagem.floresta numeric 0 0 12 0.8571429
Coelho numeric 0 0 14 1.0000000
Raposa numeric 0 0 13 0.9285714
Precipitação numeric 0 0 14 1.0000000
Humidade numeric 0 0 9 0.6428571
Declive numeric 0 0 4 0.2857143
Insectos numeric 0 0 10 0.7142857
Agricola numeric 0 0 10 0.7142857
Ratos numeric 0 0 14 1.0000000
Temperatura numeric 0 0 11 0.7857143
Estradas numeric 0 0 13 0.9285714
Prox.casas numeric 0 0 12 0.8571429
Tipo.solo numeric 0 0 1 0.0714286
Agua numeric 0 0 2 0.1428571

The target variable of the data is ‘Densidade’, and the data type of the variable is numeric.

1.3 About EDA Report

EDA reports provide information and visualization results that support the EDA process. In particular, it provides a variety of information to understand the relationship between the target variable and the rest of the variables of interest.

2 Univariate Analysis

2.1 Descriptive Statistics

edaData

15 Variables 14 Observations

Densidade
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
14014176.7419.6952.5754.4663.7278.2091.1896.9798.04
 Value       51.4  53.2  57.4  62.8  66.5  72.1  74.8  81.6  83.2  88.1  92.2  94.8
 Frequency      1     1     1     1     1     1     1     1     1     1     1     1
 Proportion 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071
                       
 Value       97.9  98.3
 Frequency      1     1
 Proportion 0.071 0.071
 

Percentagem.floresta
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
140120.9967.1716.459 0.20 0.26 2.00 8.2010.5710.7714.19
 Value        0.2   0.4   1.9   2.3   6.3   6.8   9.6  10.5  10.6  10.7  10.8  20.5
 Frequency      2     1     1     1     1     1     2     1     1     1     1     1
 Proportion 0.143 0.071 0.071 0.071 0.071 0.071 0.143 0.071 0.071 0.071 0.071 0.071
 

Coelho
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
14014126.999.11316.5417.3218.9328.8033.5835.7436.53
 Value       15.3  17.2  17.6  17.8  22.3  26.3  28.2  29.4  29.6  30.2  34.7  35.6
 Frequency      1     1     1     1     1     1     1     1     1     1     1     1
 Proportion 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071
                       
 Value       35.8  37.9
 Frequency      1     1
 Proportion 0.071 0.071
 

Raposa
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
140130.99814.397.113 8.065 8.83010.60011.40019.70023.43025.265
 Value        6.7   8.8   8.9  10.6  10.8  11.1  11.7  11.9  18.5  20.1  20.7  24.6
 Frequency      1     1     1     2     1     1     1     1     1     1     1     1
 Proportion 0.071 0.071 0.071 0.143 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071
                 
 Value       26.5
 Frequency      1
 Proportion 0.071
 

Precipitação
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
14014111.968.991 3.75 5.53 6.20 9.0017.7522.0624.24
 Value        0.5   5.5   5.6   5.9   7.1   7.8   8.0  10.0  13.1  14.3  18.9  20.8
 Frequency      1     1     1     1     1     1     1     1     1     1     1     1
 Proportion 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071
                       
 Value       22.6  27.3
 Frequency      1     1
 Proportion 0.071 0.071
 

Humidade
image
nmissingdistinctInfoMeanGmd
14090.9541.2641.522
 Value        0.0   0.5   1.0   1.2   1.5   2.2   2.4   2.7   4.7
 Frequency      5     1     1     1     2     1     1     1     1
 Proportion 0.357 0.071 0.071 0.071 0.143 0.071 0.071 0.071 0.071
 

Declive
image
nmissingdistinctInfoMeanGmd
14040.5160.39290.6956
 Value        0.0   1.1   1.9   2.5
 Frequency     11     1     1     1
 Proportion 0.786 0.071 0.071 0.071
 

Insectos
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
140100.9567.90712.79 0.00 0.00 0.00 1.40 4.4531.0537.39
 Value        0.0   0.9   1.2   1.6   2.0   2.5   5.1  20.9  35.4  41.1
 Frequency      5     1     1     1     1     1     1     1     1     1
 Proportion 0.357 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071
 

Agricola
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
140100.9562.3863.0070.000.000.001.604.456.646.77
 Value        0.0   0.7   1.5   1.7   1.8   2.5   5.1   6.5   6.7   6.9
 Frequency      5     1     1     1     1     1     1     1     1     1
 Proportion 0.357 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071
 

Ratos
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
14014133.3522.14 9.6215.2822.3225.7049.8058.9059.78
 Value        0.0  14.8  16.4  21.9  23.6  23.7  24.0  27.4  42.0  44.1  51.7  57.5
 Frequency      1     1     1     1     1     1     1     1     1     1     1     1
 Proportion 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071
                       
 Value       59.5  60.3
 Frequency      1     1
 Proportion 0.071 0.071
 

Temperatura
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
140110.98912.1814.18 0.000 0.000 1.025 9.30019.02529.91033.845
 Value        0.0   0.5   2.6   6.9   9.3  11.5  13.7  20.8  26.2  31.5  38.2
 Frequency      3     1     1     1     2     1     1     1     1     1     1
 Proportion 0.214 0.071 0.071 0.071 0.143 0.071 0.071 0.071 0.071 0.071 0.071
 

Estradas
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
140130.99812.499.119 4.195 5.700 6.675 9.65018.67522.44024.620
 Value        1.4   5.7   6.5   7.2   7.9   8.5  10.8  13.8  15.0  19.9  21.6  22.8
 Frequency      1     2     1     1     1     1     1     1     1     1     1     1
 Proportion 0.071 0.143 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071
                 
 Value       28.0
 Frequency      1
 Proportion 0.071
 

Prox.casas
image
nmissingdistinctInfoMeanGmd.05.10.25.50.75.90.95
140120.99127.599.117.4318.1019.3329.0034.4036.4437.23
 Value       16.2  18.1  23.0  27.1  28.6  29.4  29.9  30.8  35.6  36.3  36.5  38.6
 Frequency      1     3     1     1     1     1     1     1     1     1     1     1
 Proportion 0.071 0.214 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071 0.071
 

Tipo.solo
nmissingdistinctInfoMeanGmd
1401020
 Value       2
 Frequency  14
 Proportion  1
 

Agua
nmissingdistinctInfoSumMeanGmd
14020.75470.50.5385

2.2 Normality Test of Numerical Variables

2.2.1 Statistics and Visualization of (Sample) Data

[ Percentagem.floresta ]

normality test : Shapiro-Wilk normality test

 statistic : 0.88657,  p-value : 0.0721477

skewness and kurtosis
type skewness kurtosis
original 0.5451584 3.010174
log transformation -0.9635486 2.412474
sqrt transformation -0.3083150 1.980650



[ Coelho ]

normality test : Shapiro-Wilk normality test

 statistic : 0.91539,  p-value : 0.188675

skewness and kurtosis
type skewness kurtosis
original -0.1889050 1.662889
log transformation -0.4467146 1.744336
sqrt transformation -0.3207935 1.688338



[ Raposa ]

normality test : Shapiro-Wilk normality test

 statistic : 0.86375,  p-value : 0.0343213

skewness and kurtosis
type skewness kurtosis
original 0.7189554 2.092522
log transformation 0.3143344 1.896306
sqrt transformation 0.5350215 1.927766



[ Precipitação ]

normality test : Shapiro-Wilk normality test

 statistic : 0.93057,  p-value : 0.310781

skewness and kurtosis
type skewness kurtosis
original 0.5619844 2.225319
log transformation -1.6825470 6.141213
sqrt transformation -0.1928128 2.668658



[ Humidade ]

normality test : Shapiro-Wilk normality test

 statistic : 0.85871,  p-value : 0.0292212

skewness and kurtosis
type skewness kurtosis
original 1.0686247 3.624844
log transformation NaN NaN
sqrt transformation 0.0249135 1.718327



[ Declive ]

normality test : Shapiro-Wilk normality test

 statistic : 0.54992,  p-value : 1.5011E-05

skewness and kurtosis
type skewness kurtosis
original 1.753977 4.477464
log transformation NaN NaN
sqrt transformation 1.507591 3.433156



[ Insectos ]

normality test : Shapiro-Wilk normality test

 statistic : 0.62191,  p-value : 6.40237E-05

skewness and kurtosis
type skewness kurtosis
original 1.625946 3.980389
log transformation NaN NaN
sqrt transformation 1.136235 2.872752



[ Agricola ]

normality test : Shapiro-Wilk normality test

 statistic : 0.7974,  p-value : 0.00463533

skewness and kurtosis
type skewness kurtosis
original 0.7457069 1.926734
log transformation NaN NaN
sqrt transformation 0.1441511 1.555480



[ Ratos ]

normality test : Shapiro-Wilk normality test

 statistic : 0.92353,  p-value : 0.247053

skewness and kurtosis
type skewness kurtosis
original 0.080069 1.864322
log transformation NaN NaN
sqrt transformation -1.078792 4.243522



[ Temperatura ]

normality test : Shapiro-Wilk normality test

 statistic : 0.87729,  p-value : 0.0531884

skewness and kurtosis
type skewness kurtosis
original 0.8171775 2.471805
log transformation NaN NaN
sqrt transformation -0.0173183 1.796110



[ Estradas ]

normality test : Shapiro-Wilk normality test

 statistic : 0.92612,  p-value : 0.269029

skewness and kurtosis
type skewness kurtosis
original 0.5663759 2.153198
log transformation -0.8893821 3.805399
sqrt transformation 0.0370818 2.225574



[ Prox.casas ]

normality test : Shapiro-Wilk normality test

 statistic : 0.91092,  p-value : 0.162526

skewness and kurtosis
type skewness kurtosis
original -0.1499692 1.663703
log transformation -0.4049146 1.714642
sqrt transformation -0.2808116 1.676968



[ Agua ]

normality test : Shapiro-Wilk normality test

 statistic : 0.6463,  p-value : 0.000108276

skewness and kurtosis
type skewness kurtosis
original 0 1
log transformation NaN NaN
sqrt transformation 0 1



3 Relationship Between Variables

3.1 Correlation Coefficient

3.1.1 Correlation Coefficient by Variable Combination

Table of correlation coefficients (0.5 or more)
Variable1 Variable2 Correlation Coefficient
Prox.casas Coelho 0.9993479
Estradas Precipitação 0.9992197
Estradas Coelho -0.9650285
Precipitação Coelho -0.9627921
Prox.casas Estradas -0.9605550
Prox.casas Precipitação -0.9581716
Agua Densidade 0.8855958
Prox.casas Densidade 0.8544413
Coelho Densidade 0.8498895
Raposa Percentagem.floresta -0.8353497
Estradas Densidade -0.8205253
Precipitação Densidade -0.8133017
Agua Prox.casas 0.7962256
Agua Estradas -0.7917476
Agua Coelho 0.7864558
Agua Precipitação -0.7842829
Ratos Agricola 0.7310285
Percentagem.floresta Densidade 0.7004555
Agua Percentagem.floresta 0.6111027
Insectos Raposa 0.5842812
Raposa Densidade -0.5583191
Insectos Percentagem.floresta -0.5569839
Insectos Densidade -0.5459909
Declive Coelho -0.5161795
Prox.casas Declive -0.5069721

3.1.2 Correlation Plot of Numerical Variables

4 Target based Analysis

4.1 Grouped Descriptive Statistics

4.1.1 Grouped Numerical Variables

[ Percentagem.floresta ]

Simple Linear Model Information

 Residual standard error: 12 on 12 degrees of freedom
 Multiple R-squared:  0.49064,  Adjusted R-squared:  0.44819
 F-statistic: 12 on 1 and 12 DF,  p-value: 0.0052713157382547

Coefficients:
Simple Linear Model coefficients : Percentagem.floresta
Estimate Std. Error t value Pr(>|t|)
(Intercept) 62.32 5.36 11.62 0.00
Percentagem.floresta 2.01 0.59 3.40 0.01


Visualization:




[ Coelho ]

Simple Linear Model Information

 Residual standard error: 9 on 12 degrees of freedom
 Multiple R-squared:  0.72231,  Adjusted R-squared:  0.69917
 F-statistic: 31 on 1 and 12 DF,  p-value: 0.00011857639249599

Coefficients:
Simple Linear Model coefficients : Coelho
Estimate Std. Error t value Pr(>|t|)
(Intercept) 27.75 9.10 3.05 0.01
Coelho 1.81 0.32 5.59 0.00


Visualization:




[ Raposa ]

Simple Linear Model Information

 Residual standard error: 14 on 12 degrees of freedom
 Multiple R-squared:  0.31172,  Adjusted R-squared:  0.25436
 F-statistic: 5 on 1 and 12 DF,  p-value: 0.0379835265092045

Coefficients:
Simple Linear Model coefficients : Raposa
Estimate Std. Error t value Pr(>|t|)
(Intercept) 97.65 9.75 10.01 0.00
Raposa -1.45 0.62 -2.33 0.04


Visualization:




[ Precipitação ]

Simple Linear Model Information

 Residual standard error: 10 on 12 degrees of freedom
 Multiple R-squared:  0.66146,  Adjusted R-squared:  0.63325
 F-statistic: 23 on 1 and 12 DF,  p-value: 0.000403586823335824

Coefficients:
Simple Linear Model coefficients : Precipitação
Estimate Std. Error t value Pr(>|t|)
(Intercept) 97.37 5.03 19.34 0
Precipitação -1.73 0.36 -4.84 0


Visualization:




[ Humidade ]

Simple Linear Model Information

 Residual standard error: 17 on 12 degrees of freedom
 Multiple R-squared:  0.05741,  Adjusted R-squared:  -0.02114
 F-statistic: 1 on 1 and 12 DF,  p-value: 0.409341450882033

Coefficients:
Simple Linear Model coefficients : Humidade
Estimate Std. Error t value Pr(>|t|)
(Intercept) 80.37 6.17 13.02 0.00
Humidade -2.87 3.36 -0.85 0.41


Visualization:




[ Declive ]

Simple Linear Model Information

 Residual standard error: 16 on 12 degrees of freedom
 Multiple R-squared:  0.10738,  Adjusted R-squared:  0.033
 F-statistic: 1 on 1 and 12 DF,  p-value: 0.252732445790891

Coefficients:
Simple Linear Model coefficients : Declive
Estimate Std. Error t value Pr(>|t|)
(Intercept) 79.31 4.85 16.34 0.00
Declive -6.56 5.46 -1.20 0.25


Visualization:




[ Insectos ]

Simple Linear Model Information

 Residual standard error: 14 on 12 degrees of freedom
 Multiple R-squared:  0.29811,  Adjusted R-squared:  0.23961
 F-statistic: 5 on 1 and 12 DF,  p-value: 0.0434011022875512

Coefficients:
Simple Linear Model coefficients : Insectos
Estimate Std. Error t value Pr(>|t|)
(Intercept) 81.85 4.48 18.29 0.00
Insectos -0.65 0.29 -2.26 0.04


Visualization:




[ Agricola ]

Simple Linear Model Information

 Residual standard error: 15 on 12 degrees of freedom
 Multiple R-squared:  0.22218,  Adjusted R-squared:  0.15736
 F-statistic: 3 on 1 and 12 DF,  p-value: 0.0888660093518512

Coefficients:
Simple Linear Model coefficients : Agricola
Estimate Std. Error t value Pr(>|t|)
(Intercept) 69.89 5.49 12.72 0.00
Agricola 2.87 1.55 1.85 0.09


Visualization:




[ Ratos ]

Simple Linear Model Information

 Residual standard error: 16 on 12 degrees of freedom
 Multiple R-squared:  0.11018,  Adjusted R-squared:  0.03603
 F-statistic: 1 on 1 and 12 DF,  p-value: 0.246268931809105

Coefficients:
Simple Linear Model coefficients : Ratos
Estimate Std. Error t value Pr(>|t|)
(Intercept) 67.10 9.02 7.44 0.00
Ratos 0.29 0.24 1.22 0.25


Visualization:




[ Temperatura ]

Simple Linear Model Information

 Residual standard error: 17 on 12 degrees of freedom
 Multiple R-squared:  0.00992,  Adjusted R-squared:  -0.07259
 F-statistic: 0 on 1 and 12 DF,  p-value: 0.734795419733537

Coefficients:
Simple Linear Model coefficients : Temperatura
Estimate Std. Error t value Pr(>|t|)
(Intercept) 78.34 6.51 12.03 0.00
Temperatura -0.13 0.38 -0.35 0.73


Visualization:




[ Estradas ]

Simple Linear Model Information

 Residual standard error: 10 on 12 degrees of freedom
 Multiple R-squared:  0.67326,  Adjusted R-squared:  0.64603
 F-statistic: 25 on 1 and 12 DF,  p-value: 0.000323855046548262

Coefficients:
Simple Linear Model coefficients : Estradas
Estimate Std. Error t value Pr(>|t|)
(Intercept) 98.21 5.06 19.41 0
Estradas -1.72 0.35 -4.97 0


Visualization:




[ Prox.casas ]

Simple Linear Model Information

 Residual standard error: 9 on 12 degrees of freedom
 Multiple R-squared:  0.73007,  Adjusted R-squared:  0.70758
 F-statistic: 32 on 1 and 12 DF,  p-value: 9.959890031143e-05

Coefficients:
Simple Linear Model coefficients : Prox.casas
Estimate Std. Error t value Pr(>|t|)
(Intercept) 26.38 9.16 2.88 0.01
Prox.casas 1.83 0.32 5.70 0.00


Visualization:




[ Agua ]

Simple Linear Model Information

 Residual standard error: 8 on 12 degrees of freedom
 Multiple R-squared:  0.78428,  Adjusted R-squared:  0.7663
 F-statistic: 44 on 1 and 12 DF,  p-value: 2.51894588648466e-05

Coefficients:
Simple Linear Model coefficients : Agua
Estimate Std. Error t value Pr(>|t|)
(Intercept) 62.60 3.03 20.68 0
Agua 28.27 4.28 6.61 0


Visualization:




4.1.2 Grouped Categorical Variables

There is no categorical variable.

4.2 Grouped Relationship Between Variables

4.2.1 Grouped Correlation Coefficient

Numerical target variables are not supported.

4.2.2 Grouped Correlation Plot of Numerical Variables

Numerical target variables are not supported.