|
ヒライ シゲユキ
HIRAI SHIGEYUKI
平井 重行 所属 京都産業大学 情報理工学部 情報理工学科 職種 教授 |
|
| 発表年月日 | 2023/03/30 |
| 発表テーマ | Synthesis of Explosion Sounds from Utterance Voice of Onomatopoeia using Transformer |
| 会議名 | ACM IUI2023 |
| 主催者 | ACM |
| 学会区分 | 国際学会 |
| 発表形式 | ポスター |
| 単独共同区分 | 共同 |
| 開催地名 | Sydney |
| 開催期間 | 2023/03/27~2023/03/31 |
| 発表者・共同発表者 | Riki Takizawa and Shigeyuki Hirai |
| 概要 | Sound creators use knowledge, techniques, and experience to create sound effects for media works, ensuring that these sound effects are suitable for different situations and dramatic presentations. This is a challenging task for inexperienced creators and beginners, but it is relatively easy for anyone to imagine desired sounds and express them as onomatopoeic utterances. Therefore, we propose a novel technique to easily create a desired sound effect by synthesizing the sound from voice articulation, rhythm, and intonation. In this research, we focus on explosion sounds, a kind of sound effect that has various representations. The proposed technique uses a Transformer, which trains the conversion from speech to explosion sounds. This paper describes the synthesizer model with Transformer and datasets for training the model and some results in current. |