International Journal of Advances in Intelligent Informatics
Not a member yet
235 research outputs found
Sort by
Region-based convolutional neural networks for occluded person re-identification
In a variety of applications, including intelligent surveillance systems, targeted tracking, and assistive human-following robots, the ability to accurately identify individuals even when they are partially obscured is imperative. Such Continuous person tracking is complicated by the close similarity between the appearance of people and target occlusions. This study addresses this significant challenge by proposing a two-step, detection-first approach that uses a region-based convolutional neural network (R-CNN) as the re-identification (re-ID)solution. The model is specifically trained to detect occluded persons at different levels of occlusion before forwarding the image for the re-ID process. Three occluded-specific datasets are selected to evaluate the model's effectiveness in detecting occluded people. There are 379 distinct people in total, and each has five images obstructed from different angles. A sample of the data is taken to simulate various environment settings, and new data points are generated with different degrees of occlusion to assess how well the model performs under varying levels of obstruction. The findings demonstrate that the proposed person re-ID model is reliable in most circumstances, correctly re-identifying at 74% (Rank-1) and 90% (Rank-5). Although there is a decrease in accuracy as the number of distinctive people in the dataset increases, this does not significantly impact the tracking performance in various applications, which are expected to recognize a single person or a small group of individuals. Future works will explore refining similarity matching algorithms by delving into robust image comparison techniques, thereby addressing the challenges presented by occlusions. A critical aspect is to assess the model under diverse lighting conditions and investigate scenarios with multiple individuals in a frame. It is also beneficial to exploit high-resolution datasets, such as DukeMTMC-reID, and integrate finer contextual details, like clothing or carried objects. These collective efforts are essential for optimizing the model’s efficacy in practical applications and advancing person re-ID technologies
Evaluation of sleep stage classification using feature importance of EEG signal for big data healthcare
Sleep analysis is widely and experimentally considered due to its importance to body health care. Since its sufficiency is essential for a healthy life, people often spend almost a third of their lives sleeping. In this case, a similar sleep pattern is not practiced by every individual, regarding pure healthiness or disorders such as insomnia, apnea, bruxism, epilepsy, and narcolepsy. Therefore, this study aims to determine the classification patterns of sleep stages, using big data for health care. This used a high-dimensional FFT extraction algorithm, as well as a feature importance and tuning classifier, to develop accurate classification. The results showed that the proposed method led to more accurate classification than previous techniques. This was because the previous experiments had been conducted with the feature selection model, with accuracy implemented as a performance evaluation. Meanwhile, the EEG Sleep Stages classification model in this present report was composed of the feature selection and importance of the extraction stage. The previous and present experiments also reached the highest values of accuracy, with the Random Forest and SVM models using 2000 and 3000 features (87.19% and 89.19%, respectively. In this article, we proposed an analysis that the feature importance subsequently influenced the model's accuracy. This was because the proposed method was easily fine-tuned and optimized for each subject to improve sensitivity and reduce false negative occurrences
An automatic lip reading for short sentences using deep learning nets
One study whose importance has significantly grown in recent years is lip-reading, particularly with the widespread of using deep learning techniques. Lip reading is essential for speech recognition in noisy environments or for those with hearing impairments. It refers to recognizing spoken sentences using visual information acquired from lip movements. Also, the lip area, especially for males, suffers from several problems, such as the mouth area containing the mustache and beard, which may cover the lip area. This paper proposes an automatic lip-reading system to recognize and classify short English sentences spoken by speakers using deep learning networks. The input video extracts frames and each frame is passed to the Viola-Jones to detect the face area. Then 68 landmarks of the facial area are determined, and the landmarks from 48 to 68 represent the lip area extracted based on building a binary mask. Then, the contrast is enhanced to improve the quality of the lip image by applying contrast adjustment. Finally, sentences are classified using two deep learning models, the first is AlexNet, and the second is VGG-16 Net. The database consists of 39 participants (32 males and 7 females). Each participant repeats the short sentences five times. The outcomes demonstrate the accuracy rate of AlexNet is 90.00%, whereas the accuracy rate for VGG-16 Net is 82.34%. We concluded that AlexNet performs better for classifying short sentences than VGG-16 Net
Enhanced feature clustering method based on ant colony optimization for feature selection
The popular modified graph clustering ant colony optimization (ACO) algorithm (MGCACO) performs feature selection (FS) by grouping highly correlated features. However, the MGCACO has problems in local search, thus limiting the search for optimal feature subset. Hence, an enhanced feature clustering with ant colony optimization (ECACO) algorithm is proposed. The improvement constructs an ACO feature clustering method to obtain clusters of highly correlated features. The ACO feature clustering method utilizes the ability of various mechanisms, such as local and global search to provide highly correlated features. The performance of ECACO was evaluated on six benchmark datasets from the University California Irvine (UCI) repository and two deoxyribonucleic acid microarray datasets, and its performance was compared against that of five benchmark metaheuristic algorithms. The classifiers used are random forest, k-nearest neighbors, decision tree, and support vector machine. Experimental results on the UCI dataset show the superior performance of ECACO compared with other algorithms in all classifiers in terms of classification accuracy. Experiments on the microarray datasets, in general, showed that the ECACO algorithm outperforms other algorithms in terms of average classification accuracy. ECACO can be utilized for FS in classification tasks for high-dimensionality datasets in various application domains such as medical diagnosis, biological classification, and health care systems
Deep learning mango fruits recognition based on tensorflow lite
Agricultural images such as fruits and vegetables have previously been recognised and classified using image analysis and computer vision techniques. Mangoes are currently being classified manually, whereby mango sellers must laboriously identify mangoes by hand. This is time-consuming and tedious. In this work, TensorFlow Lite was used as a transfer learning tool. Transfer learning is a fast approach in resolving classification problems effectively using small datasets. This work involves six categories, where four mango types are classified (Harum Manis, Langra, Dasheri and Sindhri), categories for other types of mangoes, and a non-mango category. Each category dataset comprises 100 images, and is split 70/30 between the training and testing set, respectively. This work was undertaken with a mobile-based application that can be used to distinguish various types of mangoes based on the proposed transfer learning method. The results obtained from the conducted experiment show that adopted transfer learning can achieve an accuracy of 95% for mango recognition. A preliminary user acceptance survey was also carried out to investigate the user’s requirements, the effectiveness of the proposed functionalities, and the ease of use of its proposed interfaces, with promising results
Multi-granularity active learning based on the three-way decision
The reliance on data and the high cost of data labelling are the main problems facing deep learning today. Active learning aims to make the best model with as few training samples as possible. Previous query strategies for active learning have mainly used the uncertainty and diversity criteria, and have not considered the data distribution's multi-granularity. To extract more valid information from the samples, we use three-way decisions to select uncertain samples and propose a multi-granularity active learning method (MGAL). The model divides the unlabeled samples into three parts: positive, negative and boundary region. Through active iterative training samples, the decision delay of the boundary domain can reduce the decision cost. We validated the model on five UCI datasets and the CIFAR10 dataset. The experimental results show that the cost of three-way decisions is lower than that of two-way decisions. The multi-granularity active learning achieves good classification results, which validates the model. In this case study, the reader can learn about the ideas and methods of the three-way decision theory applied to deep learning
Automatic plant recognition using convolutional neural network on malaysian medicinal herbs: the value of data augmentation
Herbs are an important nutritional source for humans since they provide a variety of nutrients. Indigenous people have employed herbs, in particular, as traditional medicines since ancient times. Malaysia has hundreds of plant species; herb detection may be difficult due to the variety of herb species and their shape and color similarities. Furthermore, there is a scarcity of support datasets for detecting these plants. The main objective of this paper is to investigate the performance of convolutional neural network (CNN) on Malaysian medicinal herbs datasets, real data and augmented data. Malaysian medical herbs data were obtained from Taman Herba Pulau Pinang, Malaysia, and ten kinds of native herbs were chosen. Both datasets were evaluated using the CNN model developed throughout the research. Overall, herbs real data obtained an average accuracy of 75%, whereas herbs augmented data achieved an average accuracy of 88%. Based on these findings, herbs augmented data surpassed herbs actual data in terms of accuracy after undergoing the augmentation technique
Detection of code smells using machine learning techniques combined with data-balancing methods
Code smells are prevalent issues in software design that arise when implementation or design principles are violated. These issues manifest as symptoms or anomalies in the source code. Timely identification of code smells plays a crucial role in enhancing software quality and facilitating software maintenance. Previous studies have shown that code smell detection can be accomplished through the utilization of machine learning (ML) methods. However, despite their increasing popularity, research suggests that the suitability of these methods are not always appropriate due to the problem of imbalanced data. Consequently, the effectiveness of ML models may be negatively affected. This study aims to propose a novel method for detecting code smells by employing five ML algorithms, namely decision tree (DT), k-nearest neighbors (K-NN), support vector machine (SVM), XGboost (XGB), and multi-layer perceptron (MLP). Additionally, to tackle the challenge of imbalanced data, the proposed method incorporates the random oversampling technique. Experiments were conducted in this study using four datasets that encompassed code smells, specifically god-class, data-class, long-method, and feature-envy. The experimental outcomes were evaluated and compared using various performance metrics. Upon comparing the outcomes of our models on both the balanced and original datasets, we found that the XGB model achieved the highest accuracy of 100% for detecting the data class and long method on the original datasets. In contrast, the highest accuracy of 100% was obtained for the data class and long method using DT, SVM, and XGB models on the balanced datasets. According to the empirical findings, there is significant promise in using ML techniques for the accurate prediction of code smells
Sentiment classification from reviews for tourism analytics
User-generated content is critical for tourism destination management as it could help them identify their customers' opinions and come up with solutions to upgrade their tourism organizations as it could help them identify customer opinions. There are many reviews on social media and it is difficult for these organizations to analyse the reviews manually. By applying sentiment classification, reviews can be classified into several classes and help ease decision-making. The reviews contain noisy contents, such as typos and emoticons, which could affect the accuracy of the classifiers. This study evaluates the reviews using Support Vector Machine and Random Forest models to identify a suitable classifier. The main phases in this study are data collection, data preparation, data labelling and modelling phases. The reviews are labelled into three sentiments; positive, neutral, and negative. During pre-processing, steps such as removing the missing value, tokenization, case folding, stop words removal, stemming, and applying n-grams are performed. The result of this research is evaluated by looking at the performance of the models based on accuracy where the result with the highest accuracy is chosen as the solution. In this study, data is data from TripAdvisor and Google reviews using web scraping tools. The findings show that the Support Vector Machine model with 5-fold cross-validation the most suitable classifier with an accuracy of 67.97% compared to Naive Bayes with 61.33% accuracy and Random Forest classifier with 63.55% accuracy. In conclusion, the result of this paper could provide important information in tourism besides determining the suitable algorithm to be used for Sentiment Analysis related to the tourism domain
Multidisciplinary classification for Indonesian scientific articles abstract using pre-trained BERT model
Scientific articles now have multidisciplinary content. These make it difficult for researchers to find out relevant information. Some submissions are irrelevant to the journal's discipline. Categorizing articles and assessing their relevance can aid researchers and journals. Existing research still focuses on single-category predictive outcomes. Therefore, this research takes a new approach by applying a multidisciplinary classification for Indonesian scientific article abstracts using a pre-trained BERT model, showing the relevance between each category in an abstract. The dataset used was 9,000 abstracts with 9 disciplinary categories. On the dataset, text preprocessing is performed. The classification model was built by combining the pre-trained BERT model with Artificial Neural Network. Fine-tuning the hyperparameters is done to determine the most optimal hyperparameter combination for the model. The hyperparameters consist of batch size, learning rate, number of epochs, and data ratio. The best hyperparameter combination is a learning rate of 1e-5, batch size 32, epochs 3, and data ratio 9:1, with a validation accuracy value of 90.8%. The confusion matrix results of the model are compared with the confusion matrix results by experts. In this case, the highest accuracy result obtained by the model is 99.56%. A software prototype used the most accurate model to classify new data, displaying the top two prediction probabilities and the dominant category. This research produces a model that can be used to solve Indonesian text classification-related problems