eISSN : 3022-7011
ISSUER : KIPS
 
After the Korea Information Processing Society (KIPS) Transactions journal was founded in 1994, it was reorganized into the KIPS Transactions: Computer and Communication Systems(2287-5891/2734-049X ) and the KIPS Transactions: Software and Data Engi neering(2287-5905/2734-0503) in 2012. Through the KIPS official meeting on January 8th, 2024, the new KIPS Transaction journal was founded by integrating two KIPS Journals, KIPS Transactions: Computer and Communication Systems and KIPS Transactions: Software and Data Engineering. The new journal aims to realize social value and contribute to the development of South Korea’s science and technology with support from the lottery fund of the Ministry of Strategy and Finance and the science/technology promotion fund of the Ministry of Science and ICT. It is indexed in the Korea Science Academic Database, Korea Citation Index (KCI), and EBSCO.

HighlightsMore

Smart City Framework Based on Geospatial Information Standards

Eunbi Ko  Guk Sik Jeong  Kyoung Cheol Koo

Modern cities are actively adopting smart city services to address various urban challenges. Geospatial information acts as the foundational infrastructure of smart cities, promoting the sustainable development of urban areas. Consequently, as the st...

Performance Evaluation and Analysis on Single and Multi-Network Virtualization Systems with Virtio and SR-IOV

Jaehak Lee  Jongbeom Lim  Heonchang Yu

As functions that support virtualization on their own in hardware are developed, user applications having various workloads are operating efficiently in the virtualization system. SR-IOV is a virtualization support function that takes direct access t...

Technique to Reduce Container Restart for Improving Execution Time of Container Workflow in Kubernetes Environments

Taeshin Kang  Heonchang Yu

The utilization of container virtualization technology ensures the consistency and portability of data-intensive and memory volatile workflows. Kubernetes serves as the de facto standard for orchestrating these container applications. Cloud users of...

Model-Based Intelligent Framework Interface for UAV Autonomous Mission

Son Gun Joon  Lee Jaeho

Recently, thanks to the development of artificial intelligence technologies such as image recognition, research on unmanned aerial vehicles is being actively conducted. In particular, related research is increasing in the field of military drones, w...

Latest Publication   (Vol. 13, No. 10, Oct.  2024)

Adaptive Secure Firmware Over The Air Update Mechanism for Lightweight Internet of Things
Seung Eun Lee  Jin Min Lee  Il Gu Lee
As Internet of Things (IoT) technology is being used in all industries, the importance of secure and convenient firmware update technology is increasing. However, conventional FOTA (Firmware Over-The-Air) technology has a problem because the security is weak when updating firmware with a single path, and strong encryption technology cannot be utilized. Therefore, this study proposes a secure FOTA (S-FOTA) mechanism for lightweight IoT and adaptive S-FOTA ARQ (Automatic Repeat Request) mechanism. This adaptive S-FOTA ARQ mechanism considers the case where the original file cannot be recovered because of the increase in lost files due to the congested channel state and compares and analyzes the conventional method in terms of security, complexity, and transmission speed. Experimental results show that S-FOTA with 40 encrypted files reduced the attacker’s attack success rate by at least 62.58% and up to 99.99%, and S-FOTA with 40% of the total number of encrypted file segments takes at least 996.39% more time on average and up to 3374.99% more time than conventional FOTA. In addition, the transmission speed of the adaptive S-FOTA ARQ mechanism was at least 63.16% and up to 2736.36% higher than that of the conventional S-FOTA, and at least 53.89% and up to 70.89% higher than that of the conventional ARQ mechanism.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 475-480, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.475
Firmware Over-The-Air Shamir’s secret sharing Internet of Things Adaptive FOTA
Hierarchical Watermarking Technique Combining Error Correction Codes
Do-Eun Kim  So-Hyun Park  Il-Gu Lee
Digital watermarking is a technique for embedding information into digital content. Digital watermarking has attracted attention as a technique to combat piracy and identify artificially generated content, but it is still not robust in various situations. In this paper, we propose a frequency conversion-based hierarchical watermarking technique capable of attack detection, error correction, and owner identification. By embedding attack detection and error correction signatures in hierarchical watermarking, the proposed scheme maintains invisibility and outperforms the existing methods in capacity and robustness. We also proposed a framework to evaluate the performance of the image quality and error correction according to the type of error correction signature and the number of signature embeddings. We compared the visual quality and error correction performance of the conventional model without error correction signature and the conventional model with hamming and BCH signatures. We compared the quality by the number of signature embeddings and found that the quality deteriorates as the number of embeddings increases but is robust to attacks. By analyzing the quality and error correction ability by error correction signature type, we found that hamming codes showed better error correction performance than BCH codes and 41.31% better signature restoration performance than conventional methods.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 481-491, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.481
Digital watermarking Copyright Security error correction code Noise Attack Otsu Algorithm
A Deep Learning System for Emotional Cat Sound Classification and Generation
Joo Yong Shim  SungKi Lim  Jong-Kook Kim
Cats are known to express their emotions through a variety of vocalizations during interactions. These sounds reflect their emotional states, making the understanding and interpretation of these sounds crucial for more effective communication. Recent advancements in artificial intelligence has introduced research related to emotion recognition, particularly focusing on the analysis of voice data using deep learning models. Building on this background, the study aims to develop a deep learning system that classifies and generates cat sounds based on their emotional content. The classification model is trained to accurately categorize cat vocalizations by emotion. The sound generation model, which uses deep learning based models such as SampleRNN, is designed to produce cat sounds that reflect specific emotional states. The study finally proposes an integrated system that takes recorded cat vocalizations, classify them by emotion, and generate cat sounds based on user requirements.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 492-496, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.492
Audio Classification Audio Generation Animal Emotion Recognition SampleRNN Deep Learning System
Mitigating Mode Collapse using Multiple GANs Training System
Joo Yong Shim  Jean Seong Bjorn Choe  Jong-Kook Kim
Generative Adversarial Networks (GANs) are typically described as a two-player game between a generator and a discriminator, where the generator aims to produce realistic data, and the discriminator tries to distinguish between real and generated data. However, this setup often leads to mode collapse, where the generator produces limited variations in the data, failing to capture the full range of the target data distribution. This paper proposes a new training system to mitigate the mode collapse problem. Specifically, it extends the traditional two-player game of GANs into a multi-player game and introduces a peer-evaluation method to effectively train multiple GANs. In the peer-evaluation process, the generated samples from each GANs are evaluated by the other players. This provides external feedback, serving as an additional standard that helps GANs recognize mode failure. This cooperative yet competitive training method encourages the generators to explore and capture a broader range of the data distribution, mitigating mode collapse problem. This paper explains the detailed algorithm for peer-evaluation based multi-GANs training and validates the performance through experiments.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 497-504, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.497
generative models generative adversarial networks mode collapse Peer-evaluation Multi-Model Training
Implementation of Virtual Touch Service Using Hand Gesture Recognition
A-Ra Cho  Seung-Bae Yoo  Byeong-Hun Yun  Hyung-Ju Cho
As the need for hygiene management increases due to COVID-19, the importance of non-contact services is gaining attention. Hands, a tool for expressing intentions and conveying information, are emerging as an alternative to computer input devices such as the keyboard and mouse. In this study, we propose a method to address public health problems that arise when using unmanned ordering machines by controlling a computer using hand gestures detected through a camera. The focus is on identifying frequently used hand gestures, especially the bending of the index finger. To this end, we develop a non-contact input device using the MediaPipe framework and the long short-term memory (LSTM) model. This approach can identify hand gestures in three-dimensional space and provides scenarios that can be applied to the fields of virtual reality (VR) and augmented reality (AR). It offers improved public health and user experience by presenting methods that can be applied to various situations such as navigation systems and unmanned ordering machines.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 505-512, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.505
Camera-based Recognition LSTM MediaPipe Contactless Hand Gesture Recognition
CoNSIST: Consist of New Methodologies on AASIST for Audio Deepfake Detection
Jae Hoon Ha  Joo Won Mun  Sang Yup Lee
Advancements in artificial intelligence(AI) have significantly improved deep learning-based audio deepfake technology, which has been exploited for criminal activities. To detect audio deepfake, we propose CoNSIST, an advanced audio deepfake detection model. CoNSIST builds on AASIST, which a graph-based end-to-end model, by integrating three key components: Squeeze and Excitation, Positional Encoding, and Reformulated HS-GAL. These additions aim to enhance feature extraction, eliminate unnecessary operations, and incorporate diverse information. Our experimental results demonstrate that CoNSIST significantly outperforms existing models in detecting audio deepfakes, offering a more robust solution to combat the misuse of this technology
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 513-519, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.513
AASIST ASVspoof Audio Deepfake Graph Attention Network
Exploring Aesthetic Values and Technical Elements Through Comparison of AI and Artist Creations
Kim Min Kyu  Park Jae Wan
This study explores the differences in technical aspects and beauty between artworks generated by artificial intelligence (AI) and humans, assessing the characteristics, potential, and limitations of AI art, as well as the role of artists in depth. The results demonstrate that artworks generated by AI possess a level of technical proficiency and aesthetic value that can compete with human art, earning high appreciation among the general public. Specifically, with regard to emotional transmission and impression, AI can provide an artistic resonance comparable to that of humans; however, in artworks depicting natural landscapes, the subtle emotions and techniques of human artists surpass those of AI. This indicates that while AI can play a significant role in the field of artistic creation, AI also exhibits limitations in specific genres. The study is expected to provide deeper insights into the social acceptance and the position of AI art within the art community
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 520-528, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.520
Artificial Intelligence Art Generative AI Technological Elements Aesthetic Values Survey
3D Object State Extraction Through Adjective Analysis from Informal Requirements Specs
Ye Jin Jin  Chae Yun Seo  Ji Hoon Kong  R. Young Chul Kim
Recent advancements in AI technology have led to its application across various fields. However, the lack of transparency in AI operations makes it challenging to guarantee the quality of its outputs. Therefore, we integrate requirements engineering in software engineering with conversational AI technology to ensure procedural fairness. Traditional requirements engineering research uses grammar-centered analysis, which often fails to fully interpret the semantic aspects of natural language. To solve this, we suggest combining Noam Chomsky's syntactic structure analysis with Charles Fillmore's semantic role theory. Additionally, we extend our previous research by analyzing adjectives in informal requirement sentence structures. This enables precise emotional analysis of the main characters in comics. Based on the results of the analysis, we apply the emotional states of the objects to the states in the UML state diagram. Then, we create the 3D object with Three.js based on the object that reflects the emotional states in the state diagram. With this approach, we expect to represent the emotional state of a 3D object.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 529-536, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.529
Informal Requirements UML State Diagram 3D Object Model Adjective Analysis
Development of an AutoML Web Platform for Text Classification Automation
Ha-Yoon Song  Jeon-Seong Kang  Beom-Joon Park  Junyoung Kim  Kwang-Woo Jeon  Junwon Yoon  Hyun-Joon Chung
The rapid advancement of artificial intelligence and machine learning technologies is driving innovation across various industries, with natural language processing offering substantial opportunities for the analysis and processing of text data. The development of effective text classification models requires several complex stages, including data exploration, preprocessing, feature extraction, model selection, hyperparameter optimization, and performance evaluation, all of which demand significant time and domain expertise. Automated machine learning (AutoML) aims to automate these processes, thus allowing practitioners without specialized knowledge to develop high-performance models efficiently. However, current AutoML frameworks are primarily designed for structured data, which presents challenges for unstructured text data, as manual intervention is often required for preprocessing and feature extraction. To address these limitations, this study proposes a web-based AutoML platform that automates text preprocessing, word embedding, model training, and evaluation. The proposed platform substantially enhances the efficiency of text classification workflows by enabling users to upload text data, automatically generate the optimal ML model, and visually present performance metrics. Experimental results across multiple text classification datasets indicate that the proposed platform achieves high levels of accuracy and precision, with particularly notable performance when utilizing a Stacked Ensemble approach. This study highlights the potential for non-experts to effectively analyze and leverage text data through automated text classification and outlines future directions to further enhance performance by integrating Large language models.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 537-544, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.537
Text Classification Automated Machine Learning (AutoML) web platform Natural Language Processing H2O
Development of IoT Sensor Data Generation Emulator for Smart Marine Logistics
Park Chae Rim  Kim Tae Hoon  Lee Eun Kyu
As the 4th Industrial Revolution progresses, the shipping logistics sector is becoming smarter by utilizing various core technologies such as AI, IoT, and Bigdata. In particular, the collected marine Bigdata plays a significant role in providing various services like vessel operation monitoring analysis and greenhouse gas emission evaluation, and it is also essential in shipping logistics. Although this maritime Bigdata is collected during actual vessel operations, there are instances where data is lost due to temporal and environmental factors. While It is important to identify and address the fundamental cause of such losses, it is also necessary to generate data through the utilization and analysis of the collected data. This paper develops an Emulator that repeatedly generates new location data, speed values, etc., using maritime transport data collected through empirical tests. The location data is generated by calculating the standard deviation from the collected position information, and the speed values are extracted from the generated location data. The generated data is accumulated by being inserted into the database in real-time. To demonstrate the performance of the Emulator, evperiments were conducted using 5 routes, providing its excellence.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 545-552, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.545
BigData emulator Generate Data Smart Container Smart Logistics
Syllable-Level Lightweight Korean POS Tagger using Transformer Encoder
Suyoung Min  Youngjoong Ko
Morphological analysis involves segmenting morphemes, the smallest units of meaning or grammatical function in a language, and assigning part-of-speech tags to each morpheme. It plays a critical role in various natural language processing tasks, such as named entity recognition and dependency parsing. Much of modern natural language processing relies on deep learning-based language models, and Korean morphological analysis can be broadly categorized into sequence-to-sequence methods and sequential labeling methods. This study proposes a morphological analysis approach using the transformer encoder for sequential labeling to perform syllable-level part-of-speech tagging, followed by morpheme restoration and tagging through a pre-analyzed dictionary. Additionally, the CBOW method was used to extract syllable-level embeddings in lower dimensions, designing a lightweight morphological analyzer model with reduced parameters. The proposed model achieves fast inference speed and low parameter usage, making it efficient for use in resource-constrained environments.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 553-558, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.553
Transformer encoder Morphological Analysis Part of Speech Tagging Sequential-Labeling
A Study on the Design of Embedded System-Based Wheel Drive Robots for Overcoming the Terrain
Kim Min Gyu  Seon Ji Ho  Jeong Se Jin  Kim Sang Hoon
The purpose of this paper is to design and implement a wheel-driven small intelligent robot with intelligent sensor signal processing and various driving methods to overcome non-flat terrain such as slopes and steps and avoid obstacles. An eccentric gear structure was proposed to overcome non-flat terrain, optimal sensor signal processing was applied to maintain real-time balance, and an omnidirectional driving method that enables obstacle recognition and escape from a narrow space using a LiDAR sensor was proposed and designed to overcome obstacles. An optimal embedded system was designed and constructed to implement and control the intelligent elements of the robot.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 559-567, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.559
Overcoming Terrain Wheel-Driven Robot Embedded System
Design and Implementation of 2D Image-Based Implant Placement Guide System
Minwoo Kang  Jiwoo Shin  Seongmin Lee  Soungjun Yoon  Jinman Jung
decision and visual obstructions caused by various factors can lead to errors during the procedure. This paper proposes a 2D image-based real-time implant placement guiding system that predicts the implant position using 2D surgical video without the need for preoperative oral scans or 3D model generation. In the initial phase of the surgical video, two segmentation models are employed to measure prior statistics of the occlusal and incisal surfaces for each tooth. Subsequently, a single segmentation model is used to separate the occlusal and incisal surfaces, and the implant placement is predicted and guided based on the axis and length of adjacent teeth as well as the center of the prosthesis to be implanted. The system was designed and implemented using a dental phantom model, which replicates the oral structure of an actual human. The algorithm’s average execution time for guiding implant placement on 10 images was measured to be 12.14 ms, demonstrating its feasibility for real-time application in surgical video.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 568-573, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.568
segmentation Implant Real-Time Implant Guide System Tooth Structure 2D Video
Image Processing Acceleration using WebGPU and WebAssembly
Hyunwoo Nam  Myungho Lee  Neungsoo Park
JavaScript is slow for high-performance image processing in web browsers and cannot directly utilize the GPU. Therefore, web plugin technology or server-based processing methods have been used. However, since web plugins are no longer supported by the latest web browsers and server processing methods become increasingly expensive as the number of users grows. In this paper, an image processing acceleration method is proposed using the latest web standards such as WASM and WebGPU in a client environment, instead of plugins or server-based methods. The final experimental results confirmed that the WASM+WebGPU-based code, which utilizes both the CPU and GPU, improved execution performance by up to 10 times compared to traditional javaScript.
The Transactions of the Korea Information Processing Society, Vol. 13, No. 10, pp. 574-578, Oct. 2024
https://doi.org/10.3745/TKIPS.2024.13.10.574
Web Assembly WebGPU image processing