Fast Independent Component Analysis (FastICA) in Separating Vocals and Instruments in the Art of Geguntangan (pdf)

Article PDF cannot be displayed. You can download it here:

https://ojs.unud.ac.id/index.php/jlk/article/download/51725/33336

Fast Independent Component Analysis (FastICA) in Separating Vocals and Instruments in the Art of Geguntangan

Jurnal Elektronik Ilmu Komputer Udayana Volume 8, No 3. February 2020 p-ISSN: 2301-5373 e-ISSN: 2654-5101 Fast Independent Component Analysis (FastICA) in Separating Vocals and Instruments in the Art of Geguntangan a1 Angga Pramana Putra , I Gede Arta Wibawa a2 a Informatics Department, Faculty of Mathematics and Natural Sciences, Udayana University, Jalan Raya Kampus Unud, Jimbaran, Bali, 80361, Indonesia 1 Abstract Geguntangan is pesantian in religious ceremonies in Bali accompanied by gamelan music. The human sense of hearing tends to have limitations, which causes not all vocals mixed with gamelan to be heard clearly. Therefore we need a system that can be used to separate vocals with gamelan in the geguntangan. Separation of sound sources is categorized as Blind Source Separation (BSS) or also called Blind Signal Separation, which means an unknown source. The algorithm used to handle BSS is the Fast Independent Component Analysis (FastICA) algorithm with a focus on separating the sound signal in a wav-format sound file. FastICA algorithm is used for the sound separation process with the value parameter used is Mean Square Error (MSE). From the simulation results show the results of MSE calculations using the mixing matrix [0.3816, 0.8678], [0.8534, -0.5853] obtained the -5 -6 results for the FastICA method, the MSE value is 3.60 x 10 for the vocal and 1.71 x 10 for the instrument. Keywords: Blind Sources Separation, Fast Independent Component Analysis, Audio Signal Processing, Mean Square Error 1. Introduction Geguntangan is pesantian in religious ceremonies in Bali accompanied by gamelan. Geguntangan often used in religious traditions to accompany the ceremony and also for public entertainment. However, the voice mixed between vocal and gamelan causes difficulty in learning the vocal in the geguntangan. The human sense of hearing tends to have limitations, which causes not all vocals mixed with gamelan to be heard clearly. Therefore we need a system that can be used to separate vocals with gamelan in the geguntangan. Separation of sound sources is categorized as Blind Source Separation (BSS) or also called Blind Signal Separation, which means an unknown source [1]. BSS is a way to separate the mixed signals into several forming signals, without information about the number of signal sources, or the process of mixing the signals. BSS utilizes the different signal characteristics before the sensor is detected and the information obtained due to differences in the angle of arrival and distance of the sensor. The sensor used is a microphone. There are two techniques in recording sound, namely, single-channel and multichannel. Single-channel is a recording technique that uses a single sensor, and multichannel is a recording technique that uses more than one sensor. The algorithm used to handle BSS is the Fast Independent Component Analysis (FastICA) algorithm. This algorithm focuses on the separation of sound signals in a Wav format sound file that has two or more sounds mixed so that separate sound results are recorded. The reason for using the wav format is that the wav format file contains sounds that are not compressed [2]. This research uses the FastICA Algorithm. FastICA algorithm can distinguish the elements or components of the signal mixture independently [3], which in this research uses only two sound sources, namely vocal sound signal and instrument. Before doing the FastICA process, the first thing to do is to do some pre-processing. Pre-processing in question is centering and whitening. 219 Putra & Wibawa Fast Independent Component Analysis (FastICA) in Separating Vocals and Instruments in the Art of Geguntangan Besides that, there is a Non-negative Matrix Factorization (NMF) algorithm, which can also be used in Blind Source Separation (BSS). NMF can process data on a large scale using a matrix factorization model compared to the classical algorithm. Some of the advantages of NMF are easy implementation, good interpretability of decomposition results, and small storage space [4]. Most of the research on the NMF method to overcome the limitations of the NMF method, namely the fact that NMF is a temporal magnitude model only [5]. Another algorithm commonly used to handle the Blind Source Separation (BSS) approach for signal processing problems is the Sparse Component Analysis (SCA) algorithm. SCA utilizes less signal to extract the source and has higher precision in terms of signal separation. If the signal source has a Gaussian distribution, which is a noise model that follows a standard normal distribution with zero averages, the SCA can still extract rare sources effectively. SCA is also a promising approach for BSS when there are fewer sensors from the source [6]. But, FastICA can sparate more effectively than SCA because the FastICA algorithm can define how many and what specifically source will be separate by using mixing matrix. The FastICA algorithm is used for the sound separation process with the value parameter used is Mean Square Error (MSE) or see the similarity between the output results with the input selected to test the sound output results. In speech signal recognition, the process of verbally listening to the output results compared to voice input is used to measure the parameters of success. Besides that, by comparing the input and output signals, it can be seen the results of the sound separation process. 2. Related Work Blind Source Separation is one way to blindly separate a mixed-signal into several forming signals [7]. BSS is one of the techniques used to obtain sources from blind mixing. Because every mixed component can be reconstructed again into its constituent signals, many algorithms can be used to solve problems in Blind Source Separation. FastICA algorithm in its testing of Blind Source Separation is better than the ability of the PCA algorithm, and NMF [8]. In testing the algorithm to determine a superior algorithm between FastICA, PCA, and NMF using the parameters of signal to interference ratio (SIR), signal to distortion ratio (SDR) and signal to artifact ratio (SAR). The greater the value of the parameter, the better the algorithm used for the BSS method, and vice versa. By using the three parameters, the FastICA method is superior to the other two methods tested. There are two types of sound sources used in the FastICA Algorithm, namely single-channel and multichannel. FastICA can run using both kinds of sound sources by modifying it with FSS-Kernel (Finite Support Samples Kernel), where nonlinear functions are replaced with PDF (Probability Density Function). The results of the study are that FastICA with FSS-Kernel modification can effectively separate sound sources with more than one mixed-signal [9]. FastICA can be used to improve sound quality by reducing noise or noise in an audio signal. This can be done by separating noise with sound signals. Another method that is usually used to improve the qu (...truncated)