Sumários
Data mining
21 Maio 2024, 13:00 • António Ferreira
Data mining: overview; infrastructure; typical tasks; how it is being used; virtuous cycle; classification with decision trees and neural networks; hierarchical agglomerative and k-means clustering; association rules; evaluation of classification models; difficulties in using data mining.
Physical design of data warehouses
14 Maio 2024, 13:00 • António Ferreira
Physical design of data warehouses: motivation; tree-based, hash-based, and bitmap indexes; clustered and multi-attribute indexes; materialized views; data compression; data partitioning; distributed storage.
ETL system
7 Maio 2024, 13:00 • António Ferreira
ETL system: staging steps of a data warehouse; conceptual ETL plan; logical data map; ETL build sequence; metadata; flat files vs. databases; data quality screens; conforming data; loading data into dimensions; handling SCD type 2 changes; loading data into facts; loading snapshot fact tables; indexes during the ETL processing; outwitting the database log; increasing ETL throughput.