KLI

Data Mining and Analysis of Supervised & Unsupervised Learning Algorithms

Metadata Downloads
Abstract
Data mining is a process of investigating large pre-existing databases to gather new information. Data mining is a connection between computer science and statistics used to discover patterns in the data. The main objective of the data mining process is to mine the useful information from the data and formulate it into an understandable/logical structure for further use. The large data is sorted into sets to categorize patterns and create relationships to resolve problems through data analysis. Supervised (Classification) and unsupervised (Clustering) learning techniques are discussed in this research work.
Supervised machine learning is a method of machine learning. It involves allocating the specific data in such a way that a specific type of pattern or function can be extracted from that labeled data. Classification is defined as the function of learning in which provided data items are mapped into more than a few classes that are predefined.
Unsupervised techniques are essentially initiated from the sets of unlabeled data so, these are directly associated to figure out the unfamiliar properties in clusters. Clustering is a technique and process of unsupervised learning, used for the analysis of the statistical data exploited in several fields.
The analysis of Supervised (Classification) and Unsupervised (Clustering) learning techniques are based on accuracy and time studied in this research work. The Classification algorithms K Nearest Neighbor (KNN), Backpropagation (BP), Naïve Bayes, and Support Vector Machine (SVM) are compared by using different datasets through the testing tool Weka 3.8. The clustering algorithms K-means and Expectation-Maximization (EM) are also compared based upon accuracy and time by using Rapid miner and Weka 3.8 tools. The results show that the classification algorithm back-propagation performs with good accuracy as compared to the remaining classification algorithms. KNN performs timely executions as compared to other classification algorithms in supervised learning techniques. The clustering algorithm k-means shows good accuracy as compared to Expectation-Maximization (EM). K-means algorithm produces quality clusters as compared to Expectation-Maximization (EM).
Author(s)
요사프 아짐
Issued Date
2022
Awarded Date
2022-08
Type
dissertation
URI
https://oak.ulsan.ac.kr/handle/2021.oak/9720
http://ulsan.dcollection.net/common/orgView/200000640936
Alternative Author(s)
Yousaf Azeem
Affiliation
울산대학교
Department
산업대학원 스마트IT융합
Advisor
정의필
Degree
Master
Publisher
울산대학교 산업대학원 스마트IT융합
Language
eng
Rights
울산대학교 논문은 저작권에 의해 보호 받습니다.
Appears in Collections:
Industry > Smart IT Convergence Engineering
공개 및 라이선스
  • 공개 구분공개
파일 목록

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.