Three basic process of data mining
1. Data mining tasks
There are many non-trivial tasks involved in the data mining process: these include data pre-processing, rule or model induction, model validation and result presentation.
2. Data volume
Many modern data mining applications are faced with growing volumes (in
bytes) of data to be analysed. Some of the larger data sets comprise millions of entries and
require gigabytes or terabytes of storage.
3. Data complexity
This dimension has two aspects. First, the phenomena analysed in complex application scenarios are captured by increasingly complex data structures and types, including natural language text, images, time series, multi-relational and object data types. Second, data are increasingly located in geographically distributed data placements and cannot be gathered centrally for technological (e.g. large data volumes), data privacy (Clifton et al., 2002), security, legal or other reasons.
(Stankovski et al., 2004).
- 232 reads