Data mining is a process of discovering patterns in large data setsinvolving methods at the intersection of machine learning,statistics, and database systems. Data mining is aninterdisciplinary subfield of computer science and statistics withan overall goal to extract information (with intelligent methods)from a data set and transform the information into acomprehensible structure for further use. Data mining is theanalysis step of the "knowledge discovery in databases" process,or KDD. Aside from the raw analysis step, it also involvesdatabase and data management aspects, data pre-processing,model and inference considerations, interestingness metrics,complexity considerations, post-processing of discoveredstructures, visualization, and online updating.
Published Date: 2020-12-28; Received Date: 2020-11-28