Physically Informed Signal Processing Methods for Piano Sound Synthesis: A Research Overview (pdf)

Article PDF cannot be displayed. You can download it here:

https://link.springer.com/content/pdf/10.1155%2FS1110865703304093.pdf

Physically Informed Signal Processing Methods for Piano Sound Synthesis: A Research Overview

EURASIP Journal on Applied Signal Processing 2003:10, 941–952 c 2003 Hindawi Publishing Corporation Physically Informed Signal Processing Methods for Piano Sound Synthesis: A Research Overview Balázs Bank Department of Measurement and Information Systems, Faculty of Electronical Engineering and Informatics, Budapest University of Technology and Economics, H-111 Budapest, Hungary Email: Federico Avanzini Department of Information Engineering, University of Padova, 35131 Padua, Italy Email: Gianpaolo Borin Dipartimento di Informatica, University of Verona, 37134 Verona, Italy Email: Giovanni De Poli Department of Information Engineering, University of Padova, 35131 Padua, Italy Email: Federico Fontana Department of Information Engineering, University of Padova, 35131 Padua, Italy Email: Davide Rocchesso Dipartimento di Informatica, University of Verona, 37134 Verona, Italy Email: Received 31 May 2002 and in revised form 6 March 2003 This paper reviews recent developments in physics-based synthesis of piano. The paper considers the main components of the instrument, that is, the hammer, the string, and the soundboard. Modeling techniques are discussed for each of these elements, together with implementation strategies. Attention is focused on numerical issues, and each implementation technique is described in light of its eﬃciency and accuracy properties. As the structured audio coding approach is gaining popularity, the authors argue that the physical modeling approach will have relevant applications in the field of multimedia communication. Keywords and phrases: sound synthesis, audio signal processing, structured audio, physical modeling, digital waveguide, piano. 1. INTRODUCTION Sounds produced by acoustic musical instruments can be described at the signal level, where only the time evolution of the acoustic pressure is considered and no assumptions on the generation mechanism are made. Alternatively, source models, which are based on a physical description of the sound production processes [1, 2], can be developed. Physics-based synthesis algorithms provide semantic sound representations since the control parameters have a straightforward physical interpretation in terms of masses, springs, dimensions, and so on. Consequently, modification of the parameters leads in general to meaningful results and allows more intuitive interaction between the user and the virtual instrument. The importance of sound as a primary vehicle of information is being more and more recognized in the multimedia community. Particularly, source models of sounding objects (not necessarily musical instruments) are being explored due to their high degree of interactivity and the ease in synchronizing audio and visual synthesis [3]. The physical modeling approach also has potential applications in structured audio coding [4, 5], a coding scheme 942 where, in addition to the parameters, the decoding algorithm is transmitted to the user as well. The structured audio orchestral language (SAOL) became a part of the MPEG-4 standard, thus it is widely available for multimedia applications. Known problems in using physical models for coding purposes are primarily concerned with parameter estimation. Since physical models describe specific classes of instruments, automatic estimation of the model parameters from an audio signal is not a straightforward task: the model structure which is best suited for the audio signal has to be chosen before actual parameter estimation. On the other hand, once the model structure is determined, a small set of parameters can describe a specific sound. Casey [6] and Serafin et al. [7] address these issues. In this paper, we review some of the strategies and algorithms of physical modeling, and their applications to piano simulation. The piano is a particularly interesting instrument, both for its prominence in western music and for its complex structure [8]. Also, its control mechanism is simple (it basically reduces to key velocity), and physical control devices (MIDI keyboards) are widely available, which is not the case for other instruments. The source-based approach can be useful not only for synthesis purposes but also for gaining a better insight into the behavior of the instruments. However, as we are interested in eﬃcient algorithms, the features modeled are only those considered to have audible eﬀects. In general, there is a trade-oﬀ between the accuracy and the simplicity of the description. The optimal solution may vary depending on the needs of the user. The models described here are all based on digital waveguides. The waveguide paradigm has been found to be the most appropriate for real-time synthesis of a wide range of musical instruments [9, 10, 11]. As early as in 1987, Garnett [12] presented a physical waveguide piano model. In his model, a semiphysical lumped hammer is connected to a digital waveguide string and the soundboard is modeled by a set of waveguides, all connected to the same termination. In 1995, Smith and Van Duyne [13, 14] presented a model based on commuted synthesis. In their approach, the soundboard response is stored in an excitation table and fed into a digital waveguide string model. The hammer is modeled as a linear filter whose parameters depend on the hammer-string collision velocity. The hammer filter parameters have to be precalculated and stored for all notes and hammer velocities. This precalculation can be avoided by running an auxiliary string model connected to a nonlinear hammer model in parallel, and, based on the force response of the auxiliary model, designing the hammer filters in real time [15]. The original motivation for commuted synthesis was to avoid the high-order filter which is needed for high quality soundboard modeling. As low-complexity methods have been developed for soundboard modeling (see Section 5), the advantages of the commuted piano with respect to the direct modeling approach described here are reduced. Also, due to the lack in physical description, some eﬀects, such as the restrike (ribattuto) of the same string, cannot be precisely modeled with the commuted approach. Describing the com- EURASIP Journal on Applied Signal Processing muted synthesis in detail is beyond the scope of this paper, although we would like to mention that it is a comparable alternative to the techniques described here. As part of a collaboration between the University of Padova and Generalmusic, Borin et al. [16] presented a complete real-time piano model in 1997. The hammer was treated as a lumped model, with a mass connected in parallel to a nonlinear spring, and the strings were simulated using digital waveguides, all connected to a single-lumped load. Bank [17] introduced in 2000 a similar physical model, based on the same functional blocks, but with slightly diﬀerent implementation. An alternative approach was used for the solution of the hammer diﬀerential equation. Independent string models wer (...truncated)