ACS Applied Computer Science

  • Increase font size
  • Default font size
  • Decrease font size

Applied Computer Science Volume 20, Number 4, 2024

STUDY ON DEEP LEARNING MODELS FOR VR SICKNESS LEVELS CLASSIFICATION

Print

Virtual Reality (VR) sickness is often accompanied by symptoms such as nausea and dizziness, and a prominent theory explaining this phenomenon is the sensory conflict theory. Recently, studies have used Deep Learning to classify VR sickness levels; however, there is a paucity of research on Deep Learning models that utilize both visual information and motion data based on sensory conflict theory. In this paper, the authors propose a parallel merging of a Deep Learning model (4bay) to classify the level of VR sickness by utilizing the user's motion data (HMD, controller data) and visual data (rendered image, depth image) based on sensory conflict theory. The proposed model consists of a visual processing module, a motion processing module, and an FC-based VR sickness level classification module. The performance of the proposed model was compared with that of the developed models at the time of design. As a result of the comparison, it was confirmed that the proposed model performed better than the single model and the merged (2bay) model in classifying the user's VR sickness level.

  • APA 7th style
Na, H., & Kim, Y. S. (2024). Study on deep learning models for the classification of VR sickness levels. Applied Computer Science, 20(4), 1–13. https://doi.org/10.35784/acs-2024-37
  • Chicago style
Na, Haechan, and Yoon Sang Kim. “Study on Deep Learning Models for the Classification of VR Sickness Levels”. Applied Computer Science 20, no. 4 (2024): 1–13. https://doi.org/10.35784/acs-2024-37.
  • IEEE style
H. Na and Y. S. Kim, “Study on deep learning models for the classification of VR sickness levels”, Applied Computer Science, vol. 20, no. 4, pp. 1–13, doi: 10.35784/acs-2024-37.
  • Vancouver style
Na H, Kim YS. Study on deep learning models for the classification of VR sickness levels. Applied Computer Science. 2024; 20(4):1–13.

ENHANCING TOMATO LEAF DISEASE DETECTION THROUGH MULTIMODAL FEATURE FUSION

Print

The need for an ensemble classifier is driven by better accuracy; reduced overfitting, increased robustness that copes with noisy data and reduced variance of individual models, combining the advantages and overcoming the drawbacks of the individual classifier. A comparison of different classifiers like Support Vector Machine (SVM), XGBoost, Random Forest (RF), Naive Bayes (NB), Convolutional Neural Network (CNN) and proposed Ensemble method used in the classification task was conducted. Among all the classifiers evaluated, CNN was found to be the most accurate having an accuracy rate of 93.7%. This indicates that CNN can identify complex data patterns that are also important for photo recognition and classification tasks. Nonetheless, NB and SVM only achieved medium results with accuracy rates of 82.66% and 85.6% respectively. These could have been due to either the complexity of data being handled or underlying assumptions made. RF and XGBoost demonstrated remarkable performances by employing ensemble learning methods as well as gradient-boosting approaches with accuracies of 83.33% and 90.7% respectively. The Ensemble method presented in this paper outperformed all individual models at an accuracy level of 95.5%, indicating that more than one technique is better when classifying correctly based on various resource allocations across techniques employed thereby improving such outcomes altogether by combining them. These results display the pros and cons of every classifier on the Plant Village dataset, giving vital data to improve plant disease classification and guide further research into precision farming and agricultural diagnostics.

  • APA 7th style
Saraf, P., Patil, J., & Wagh, R. (2024). Enhancing tomato leaf disease detection through multimodal feature fusion. Applied Computer Science, 20(4), 14–38. https://doi.org/10.35784/acs-2024-38
  • Chicago style
Saraf, Puja, Jayantrao Patil, and Rajnikant Wagh. “Enhancing Tomato Leaf Disease Detection through Multimodal Feature Fusion”. Applied Computer Science 20, no. 4 (2024): 14–38.
  • IEEE style
P. Saraf, J. Patil, and R. Wagh, “Enhancing tomato leaf disease detection through multimodal feature fusion”, Applied Computer Science, vol. 20, no. 4, pp. 14–38, doi: 10.35784/acs-2024-38.
  • Vancouver style
Saraf P, Patil J, Wagh R. Enhancing tomato leaf disease detection through multimodal feature fusion. Applied Computer Science. 2024; 20(4):14–38.

NOVEL MULTI-MODAL OBSTRUCTION MODULE FOR DIABETES MELLITUS CLASSIFICATION USING EXPLAINABLE MACHINE LEARNING

Print

Diabetes Mellitus (DM) is a persistent metabolic disorder which is characterized by increased blood glucose level in the blood stream. Initially, DM occurs while the insulin secretion in the pancreas has a disability to secrete or to use hormone for the metabolic process. Moreover, there are different types of DM depending on the physiological process, and the types include Type1 DM, Type2 DM and Gestational DM. Electrocardiography (ECG) waves are used to detect the abnormal heartbeats and cannot directly detect DM, but the wave abnormality can indicate the possibility and presence of DM. Whereas the Photoplethysmography (PPG) signals are a non-invasive method used to detect changes in  blood volume that can monitor BG changes. Furthermore, the detection and classification of DM using PPG and ECG can involve analyzing the functional performance of these modalities. By extracting the features like R wave (W1) and QRS complex (W2) in the ECG signals and Pulse Width (S1) and Pulse Amplitude Variation (S2) can detect DM and can be classified into DM and Non-DM. The authors propose a Novel architecture in the basis of Encoder Decoder structure named as Obstructive Encoder Decoder module. This module extracts the specific features and the proposed novel Obstructive Erasing Module remove the remaining artifacts and then the extracted features are fed into the Multi-Uni-Net for the fusion of the two modalities and the fused image is classified using EXplainable Machine Learning (EX-ML). From this classification the performance metrics like Accuracy, Precision, Recall, F1-Score and AUC can be determined.

  • APA 7th style
Shaik, R., & Siddique, I. (2024). Novel multi-modal obstruction module for diabetes mellitus classification using explainable machine learning. Applied Computer Science, 20(4), 39–62. https://doi.org/10.35784/acs-2024-39
  • Chicago style
Shaik, Reehana, and Ibrahim Siddique. “Novel Multi-Modal Obstruction Module for Diabetes Mellitus Classification Using Explainable Machine Learning”. Applied Computer Science 20, no. 4 (2024): 39–62.
  • IEEE style
R. Shaik and I. Siddique, “Novel multi-modal obstruction module for diabetes mellitus classification using explainable machine learning”, Applied Computer Science, vol. 20, no. 4, pp. 39–62, doi: 10.35784/acs-2024-39.
  • Vancouver style
Shaik R, Siddique I. Novel multi-modal obstruction module for diabetes mellitus classification using explainable machine learning. Applied Computer Science. 2024; 20(4):39–62.

COMPUTATIONAL SYSTEM FOR EVALUATING HUMAN PERCEPTION IN VIDEO STEGANOGRAPHY

Print

This paper presents a comprehensive computational system designed to evaluate the undetectability of video steganography from human perspective. The system assesses the perceptibility of steganographic modifications to the human eye while simultaneously determining the minimum encoding level required for successful automated decoding of hidden messages. The proposed architecture comprises four subsystems: steganogram database preparation, human evaluation, automated decoding, and comparative analysis. The system was tested using example steganographic techniques applied to a dataset of video files. Experimental results revealed the thresholds of human-level undetectability and automated decoding for each technique, enabling the identification of critical differences between human and algorithmic detection capabilities. This research contributes to the field of steganography by offering a novel framework for evaluating the trade-offs between human perception and automated decoding in video-based information hiding. The system serves as a tool for advancing the development of more secure and reliable video steganographic techniques.

  • APA 7th style
Pery, M., & Waszkowski, R. (2024). A computational system for evaluating human perception in video steganography. Applied Computer Science, 20(4), 63–76. https://doi.org/10.35784/acs-2024-40
  • Chicago style
Pery, Marcin, and Robert Waszkowski. “A Computational System for Evaluating Human Perception in Video Steganography”. Applied Computer Science 20, no. 4 (2024): 63–76. 
  • IEEE style
M. Pery and R. Waszkowski, “A computational system for evaluating human perception in video steganography”, Applied Computer Science, vol. 20, no. 4, pp. 63–76, doi: 10.35784/acs-2024-40.
  • Vancouver style
Pery M, Waszkowski R. A computational system for evaluating human perception in video steganography. Applied Computer Science. 2024; 20(4):63–76.

PUPIL DIAMETER AND MACHINE LEARNING FOR DEPRESSION DETECTION: A COMPARATIVE STUDY WITH DEEP LEARNING MODELS

Print

According to the World Health Organization, the Global Mental Health Report estimated that between 251 and 310 million individuals worldwide experienced depression during the first year of the COVID-19 pandemic. Most methods for detecting depression rely on clinical diagnoses and surveys. However, the American Psychiatric Association reports that over 50% of patients do not receive appropriate treatment. This study aims to utilize machine learning and pupil diameter features to identify depression and evaluate the accuracy of these classifiers in comparison to our previous deep learning model. While limited research has explored the use of pupillary diameter as a classification tool for distinguishing between individuals with and without depression, several studies have focused on EEG signals for this purpose. The study involved 58 participants, with 29 classified as depressed and 29 as healthy. The classification was based on statistical features extracted from the Hilbert-Huang Transform. Results showed a significant improvement in average accuracy compared to the authors’ prior work, with the current study achieving 77.72% accuracy, compared to 64.78% in their previous research. Machine learning methods, particularly Bagging, outperformed deep learning models such as AlexNet when classifying data from the left and right eyes individually (90.91% vs. 78.57% for the left eye; 90.91% vs. 71.43% for the right eye). However, when combining data from both eyes, deep learning using AlexNet demonstrated superior performance (98.28% accuracy compared to 93.75% using Bagging with statistical features from both eyes). Despite the higher accuracy of deep learning, machine learning is recommended for its faster execution times.

  • APA 7th style
Mohamed, I., El-Wakad, M., Abbas, K., Aboamer, M., & Mohamed, N. A. R. (2024). Pupil diameter and machine learning for depression detection: A comparative study with deep learning models. Applied Computer Science, 20(4), 77–99. https://doi.org/10.35784/acs-2024-41
  • Chicago style
Mohamed, Islam, Mohamed El-Wakad, Khaled Abbas, Mohamed Aboamer, and Nader A. Rahman Mohamed. “Pupil Diameter and Machine Learning for Depression Detection: A Comparative Study with Deep Learning Models”. Applied Computer Science 20, no. 4 (2024): 77–99.
  • IEEE style
I. Mohamed, M. El-Wakad, K. Abbas, M. Aboamer, and N. A. R. Mohamed, “Pupil diameter and machine learning for depression detection: a comparative study with deep learning models”, Applied Computer Science, vol. 20, no. 4, pp. 77–99, doi: 10.35784/acs-2024-41.
  • Vancouver style
Mohamed I, El-Wakad M, Abbas K, Aboamer M, Mohamed NAR. Pupil diameter and machine learning for depression detection: a comparative study with deep learning models. Applied Computer Science. 2024; 20(4):77–99. 

CLASSIFICATION AND PREDICTION OF BENTHIC HABITAT BASED ON SCIENTIFIC ECHOSOUNDER DATA: APPLICATION OF MACHINE LEARNING ALGORITHMS

Print

This study aims to map three main benthic habitats (coral, seagrass, and sand) in Kapota Atoll (Wakatobi, Indonesia) using a single-beam echosounder (SBES) Simrad EK15. The acoustic data were processed using Sonar5-Pro software. Eight acoustic parameters were used as input for the classification and prediction of benthic habitats, including depth (D), five acoustic parameters of the first echo (BD, BP, AttSv1, DecSv1, and AttDecSv1), and cumulative energy of the second and third echoes (AttDecSv2 and AttDecSv3). The classification and prediction process of benthic habitats uses two machine learning algorithms, Random Forest (RF) and Support Vector Machine (SVM), in XLSTAT Basic+ software. The study results show that 49 combinations of acoustic parameters produce benthic habitat maps that meet the minimum accuracy standards for benthic habitat mapping (≥60%). Using eight acoustic parameters produces a more accurate benthic habitat map than using only two main SBES parameters (DecSv1 and AttDecSv2 parameters or E1 and E2 in the RoxAnn system indicating the roughness and hardness indices). The RF and SVM algorithms produce benthic habitat maps with the highest accuracy of 79.33% and 78.67%, respectively. Each acoustic parameter has a different importance for the classification of benthic habitats, where the order of importance of each acoustic parameter in the overall classification follows the following order: AttDecSv2 > D > DecSv1 > BD > AttDecSv3 > AttSv1 > AttDecSv1 > BP. Overall, using more acoustic parameters can significantly improve the accuracy of benthic habitat maps.

  • APA 7th style
Hamuna, B., Pujiyati, S., Gaol, J. L., & Hestirianoto, T. (2024). Classification and prediction of benthic habitat from scientific echosounder data: Application of machine learning algorithms. Applied Computer Science, 20(4), 100–116. https://doi.org/10.35784/acs-2024-42
  • Chicago style
Hamuna, Baigo, Sri Pujiyati, Jonson Lumban Gaol, and Totok Hestirianoto. “Classification and Prediction of Benthic Habitat from Scientific Echosounder Data: Application of Machine Learning Algorithms”. Applied Computer Science 20, no. 4 (2024): 100–116. 
  • IEEE style
B. Hamuna, S. Pujiyati, J. L. Gaol, and T. Hestirianoto, “Classification and prediction of benthic habitat from scientific echosounder data: application of machine learning algorithms”, Applied Computer Science, vol. 20, no. 4, pp. 100–116, doi: 10.35784/acs-2024-42.
  • Vancouver style
Hamuna B, Pujiyati S, Gaol JL, Hestirianoto T. Classification and prediction of benthic habitat from scientific echosounder data: application of machine learning algorithms. Applied Computer Science. 2024; 20(4):100–116.

ENHANCEMENT OF ARTIFICIAL IMMUNE SYSTEMS FOR THE TRAVELING SALESMAN PROBLEM THROUGH HYBRIDIZATION WITH NEIGHBORHOOD IMPROVEMENT AND PARAMETER FINE-TUNING

Print

This research investigates the enhancement of Artificial Immune Systems (AIS) for solving the Traveling Salesman Problem (TSP) through hybridization with Neighborhood Improvement (NI) and parameter fine-tuning. Two main experiments were conducted: Experiment A identified the optimal integration points for NI within AIS, revealing that position 2 (AIS+NIpos2) improved solution quality by an average of 27.78% compared to other positions. Experiment B benchmarked AIS performance with various enhancement techniques. Using symmetric and asymmetric TSP datasets, the results showed that integrating NI at strategic points and fine-tuning parameters boosted AIS performance by up to 46.27% in some cases. The hybrid and fine-tuned version of AIS (AIS-th) consistently provided the best solution quality, with up to a 50.36% improvement, though it required more computational time. These findings emphasize the importance of strategic combinations and fine-tuning for creating effective optimization algorithms.

  • APA 7th style
Thapatsuwan, P., Thapatsuwan, W., & Kulworatit, C. (2024). Enhancement of artificial immune systems for the traveling salesman problem through hybridization with neighborhood improvement and parameter fine-tuning. Applied Computer Science, 20(4), 117–137. https://doi.org/10.35784/acs-2024-43
  • Chicago style
Thapatsuwan, Peeraya, Warattapop Thapatsuwan, and Chaichana Kulworatit. “Enhancement of Artificial Immune Systems for the Traveling Salesman Problem through Hybridization with Neighborhood Improvement and Parameter Fine-Tuning”. Applied Computer Science 20, no. 4 (2024): 117–137.
  • IEEE style
P. Thapatsuwan, W. Thapatsuwan, and C. Kulworatit, “Enhancement of artificial immune systems for the traveling salesman problem through hybridization with neighborhood improvement and parameter fine-tuning”, Applied Computer Science, vol. 20, no. 4, pp. 117–137, doi: 10.35784/acs-2024-43.
  • Vancouver style
Thapatsuwan P, Thapatsuwan W, Kulworatit C. Enhancement of artificial immune systems for the traveling salesman problem through hybridization with neighborhood improvement and parameter fine-tuning. Applied Computer Science. 2024; 20(4):117–137.

EVALUATING LARGE LANGUAGE MODELS FOR MEDICAL INFORMATION EXTRACTION: A COMPARATIVE STUDY OF ZERO-SHOT AND SCHEMA-BASED METHODS

Print

This study investigates the application of large language models, particularly ChatGPT, in the extraction and structuring of medical information from free-text patient reports. The authors explore two distinct methods: a zero-shot extraction approach and a schema-based extraction approach. The dataset, consisting of 1230 anonymized French medical reports from the Department of Neonatology of the Mohammed VI University Hospital, served as the basis for these experiments. The findings indicate that while ChatGPT demonstrates a significant capability in structuring medical data, certain challenges remain, particularly with complex and non-standardized text formats. The authors evaluate the model's performance using precision, recall, and F1 score metrics, providing a comprehensive assessment of its applicability in clinical settings.

  • APA 7th style
Kaddari, Z., El Hachmi, I., Berrich, J., Amrani, R., & Bouchentouf, T. (2024). Evaluating large language models for medical information extraction: A comparative study of zero-shot and schema-based methods. Applied Computer Science, 20(4), 138–148. https://doi.org/10.35784/acs-2024-44
  • Chicago style
Kaddari, Zakaria, Ikram El Hachmi, Jamal Berrich, Rim Amrani, and Toumi Bouchentouf. “Evaluating Large Language Models for Medical Information Extraction: A Comparative Study of Zero-Shot and Schema-Based Methods”. Applied Computer Science 20, no. 4 (2024): 138–148.
  • IEEE style
Z. Kaddari, I. El Hachmi, J. Berrich, R. Amrani, and T. Bouchentouf, “Evaluating large language models for medical information extraction: a comparative study of zero-shot and schema-based methods”, Applied Computer Science, vol. 20, no. 4, pp. 138–148, doi: 10.35784/acs-2024-44.
  • Vancouver style
Kaddari Z, El Hachmi I, Berrich J, Amrani R, Bouchentouf T. Evaluating large language models for medical information extraction: a comparative study of zero-shot and schema-based methods. Applied Computer Science. 2024; 20(4):138–148.

EXPLORING THE EXPEDIENCY OF BLOCKCHAIN-BASED SOLUTIONS: REVIEW AND CHALLENGES

Print

A distributed type of database where digital data is stored as blocks chained together is called a Blockchain. Each block consists of several transactions, authenticated using cryptographic keys, and approved by a group of validators. Hundreds of different Blockchain solutions have been proposed over the years, proving that they attract research and business interest. In this article, the authors present a generic vocabulary for unifying different terminologies used by various researchers in the field, followed by a review and evaluation of several research works presenting Blockchain-based solutions. A set of criteria regarding usefulness and suitability of adopting a Blockchain application are distinguished in these works. A method to examine their applicability is also discussed. Cryptocurrencies and supply chains, the two most well-known Blockchain uses, are considered and examined to assess how important these criteria are in these two use cases.

  • APA 7th style
Moreno Arboleda, F. J., Garani, G., & Arboleda Zuluaga, S. A. (2024). Exploring the expediency of blockchain-based solutions: Review and challenges. Applied Computer Science, 20(4), 149–174. https://doi.org/10.35784/acs-2024-45
  • Chicago style
Moreno Arboleda, Francisco Javier, Georgia Garani, and Sergio Andrés Arboleda Zuluaga. “Exploring the Expediency of Blockchain-Based Solutions: Review and Challenges”. Applied Computer Science 20, no. 4 (2024): 149–174. 
  • IEEE style
F. J. Moreno Arboleda, G. Garani, and S. A. Arboleda Zuluaga, “Exploring the expediency of blockchain-based solutions: review and challenges”, Applied Computer Science, vol. 20, no. 4, pp. 149–174, doi: 10.35784/acs-2024-45.
  • Vancouver style
Moreno Arboleda FJ, Garani G, Arboleda Zuluaga SA. Exploring the expediency of blockchain-based solutions: review and challenges. Applied Computer Science. 2024; 20(4):149–174.

FEASIBILITY OF USING LOW-PARAMETER LOCAL LLMS IN ANSWERING QUESTIONS FROM ENTERPRISE KNOWLEDGE BASE

Print

This paper evaluates the feasibility of deploying locally-run Large Language Models (LLMs) for retrieval-augmented question answering (RAG-QA) over internal knowledge bases in small and medium enterprises (SMEs), with a focus on Polish-language datasets. The study benchmarks eight popular open-source and source-available LLMs, including Google’s Gemma-9B and Speakleash’s Bielik-11B, assessing their performance across closed, open, and detailed question types, with metrics for language quality, factual accuracy, response stability, and processing efficiency. The results highlight that desktop-class LLMs, though limited in factual accuracy (with top scores of 45% and 43% for Gemma and Bielik, respectively), hold promise for early-stage enterprise implementations. Key findings include Bielik's superior performance on open-ended and detailed questions and Gemma's efficiency and reliability in closed-type queries. Distribution analyses revealed variability in model outputs, with Bielik and Gemma showing the most stable response distributions. This research underscores the potential of offline-capable LLMs as cost-effective tools for secure knowledge management in Polish SMEs.

  • APA 7th style
Badurowicz, M., Skulimowski, S., & Laskowski, M. (2024). Feasibility of using low-parameter local LLMs in answering questions from enterprise knowledge base. Applied Computer Science, 20(4), 175–191. https://doi.org/10.35784/acs-2024-46
  • Chicago style
Badurowicz, Marcin, Stanisław Skulimowski, and Maciej Laskowski. “Feasibility of Using Low-Parameter Local LLMs in Answering Questions from Enterprise Knowledge Base”. Applied Computer Science 20, no. 4 (2024): 175–191.
  • IEEE style
M. Badurowicz, S. Skulimowski, and M. Laskowski, “Feasibility of using low-parameter local LLMs in answering questions from enterprise knowledge base”, Applied Computer Science, vol. 20, no. 4, pp. 175–191, doi: 10.35784/acs-2024-46.
  • Vancouver style
Badurowicz M, Skulimowski S, Laskowski M. Feasibility of using low-parameter local LLMs in answering questions from enterprise knowledge base. Applied Computer Science. 2024; 20(4):175–191.

SHARPNESS IMPROVEMENT OF MAGNETIC RESONANCE IMAGES USING A GUIDED-SUBSUMED UNSHARP MASK FILTER

Print

Magnetic resonance imaging (MRI) is a key method for imaging human tissues and organs. The accuracy of medical diagnosis is greatly affected by the quality of MRI images. Sometimes, MRI images are obtained blurry due to various inevitable constraints related to the imaging equipment, which affects the detection of important features in the image. Several sharpening methods were introduced, but not all were successful in this task, as artifacts may be introduced, contrast may be changed, and high complexity may be involved. Thus, this paper introduces a guided-subsumed unsharp mask filter (GSUM) to improve the sharpness of MRI images. The GSUM utilizes an improved guided filter instead of the low-pass Gaussian filter and a dynamic sharpening parameter. The improved guided filter employs a hybrid procedure instead of the mean filter in the smoothing process and relies on an adaptive regularization parameter. The applied modifications eliminated the overshooting and halo effects of the original unsharp masking and the guided filter, resulting in better-quality images. The GSUM was tested with real-blurry MRI images, evaluated using three no-reference metrics, and compared with six other algorithms. The metric scores indicate that the proposed filter can surpass existing methods, as it produced better results with average readings of 24.2074 in PIQE, 0.6878 in BLUR, and 5.7944 in FISH. It also scored a fast computation time, averaging 0.3384 seconds.

  • APA 7th style
Al-Abaji, M., & Al-Ameen, Z. (2024). Sharpness improvement of magnetic resonance images using a guided-subsumed unsharp mask filter. Applied Computer Science, 20(4), 192–210. https://doi.org/10.35784/acs-2024-47
  • Chicago style
Al-Abaji, Manar, and Zohair Al-Ameen. "Sharpness Improvement of Magnetic Resonance Images Using a Guided-Subsumed Unsharp Mask Filter". Applied Computer Science 20, no. 4 (2024): 192–210. 
  • IEEE style
M. Al-Abaji and Z. Al-Ameen, “Sharpness improvement of magnetic resonance images using a guided-subsumed unsharp mask filter”, Applied Computer Science, vol. 20, no. 4, pp. 192–210, doi: 10.35784/acs-2024-47.
  • Vancouver style
Al-Abaji M, Al-Ameen Z. Sharpness improvement of magnetic resonance images using a guided-subsumed unsharp mask filter. Applied Computer Science. 2024; 20(4):192–210.

FUZZY REGION MERGING WITH HIERARCHICAL CLUSTERING TO FIND OPTIMAL INITIALIZATION OF FUZZY REGION IN IMAGE SEGMENTATION

Print

One of the most important goals in image segmentation is the process of separating the object parts from the image background. Image segmentation is also a fundamental stage in the development of other image applications such as object recognition, target tracking, computer vision, and biomedical image processing. Interactive image segmentation methods with additional user interaction are still popular in research. Interactive image segmentation aims to provide additional information through simple interactions, especially in images with complex objects. Interactive image segmentation with region merging processes has drawbacks, one of which is suboptimal region splitting due to soft color shades, blurred contours, and uneven lighting, referred to in this study as ambiguous regions. However, in the fuzzy region initialization stage after obtaining values from the marker process, there is a possibility of missing or suboptimal determination of fuzzy regions. This is because it only takes the highest gray level value for the background marker and the lowest gray level value for the object marker. In this study, fuzzy region merging using hierarchical clustering is proposed to find optimal initialization for fuzzy regions in image segmentation. Based on the experimental results, the proposed method can achieve optimal segmentation with an average misclassification error value of 2.62% for Natural Images and 9.33% for Dental Images.

  • APA 7th style
Gunawan, W. (2024). Fuzzy region merging with hierarchical clustering to find optimal initialization of fuzzy region in image segmentation. Applied Computer Science, 20(4), 211–220. https://doi.org/10.35784/acs-2024-48
  • Chicago style
Gunawan, Wawan. “Fuzzy Region Merging with Hierarchical Clustering to Find Optimal Initialization of Fuzzy Region in Image Segmentation”. Applied Computer Science 20, no. 4 (2024): 211–220.
  • IEEE style
W. Gunawan, “Fuzzy region merging with hierarchical clustering to find optimal initialization of fuzzy region in image segmentation”, Applied Computer Science, vol. 20, no. 4, pp. 211–220, doi: 10.35784/acs-2024-48.
  • Vancouver style
Gunawan W. Fuzzy region merging with hierarchical clustering to find optimal initialization of fuzzy region in image segmentation. Applied Computer Science. 2024; 20(4):211–220.