Analysis of ensemble machine learning classification comparison on the skin cancer MNIST dataset 
	Poetri Lestari Lokapitasari Belluano, Reyna Aprilia Rahma, Herdianti Darwis, Abdul Rachman Manga 
	
			
		Abstract 
		
		This study aims to analyze the performance of various ensemble machine learning methods, such as Adaboost, Bagging, and Stacking, in the context of skin cancer classification using the skin cancer MNIST dataset. We also evaluate the impact of handling dataset imbalance on the classification model’s performance by applying imbalanced data methods such as random under sampling (RUS), random over sampling (ROS), synthetic minority over-sampling technique (SMOTE), and synthetic minority over-sampling technique with edited nearest neighbor (SMOTEENN). The research findings indicate that Adaboost is effective in addressing data imbalance, while imbalanced data methods can significantly improve accuracy. However, the selection of imbalanced data methods should be carefully tailored to the dataset characteristics and clinical objectives. In conclusion, addressing data imbalance can enhance skin cancer classification accuracy, with Adaboost being an exception that shows a decrease in accuracy after applying imbalanced data methods.
		
		 
	
			
		Keywords 
		
		Ensemble machine learning; Imbalanced data; Performance comparison; Skin cancer
		
		 
	
				
			
	
	
							
		
		DOI: 
https://doi.org/10.11591/csit.v5i3.p235-242 																				
Refbacks 
				There are currently no refbacks. 
	 
		
Computer Science and Information Technologies  2722-323X,  e-ISSN: 2722-3221 This journal is published by the Institute of Advanced Engineering and Science (IAES)  in collaboration  with Universitas Ahmad Dahlan (UAD) .
<a title="Web Analytics" href="https://statcounter.com/" target="_blank"<img class="statcounter" src="https://c.statcounter.com/11992001/0/5fa2f457/0/" alt="Web Analytics"</div> <br> CSIT Visitor Stats 
Creative Commons Attribution-ShareAlike 4.0 International License .