業績リスト (河原英紀)
学術雑誌等
- 榊原 健一、河原 英紀、水町 光徳、利用価値の高い音声データの録音手順、日本音響学会誌、Vol.76, No.6, pp.343-350, (2020) (解説:招待)
- 河原 英紀、インパルス応答の基本概念、日本音響学会誌、Vol.76, No.3, pp.148-155, (2020) (解説:招待)
- Sara Popham, Dana Boebinger, Dan P. W. Ellis, Hideki Kawahara & Josh H. McDermott, Inharmonic speech reveals the role of harmonicity in the cocktail party problem,
Nature Communicationsvolume 9, Article number: 2122 (2018), ( DOI:10.1038/s41467-018-04551-8 )
- Hideki Kawahara, Application of time-frequency representations of aperiodicity and instantaneous frequency for detailed analysis of filled pauses, Journal of the Phonetic Society of Japan, Vol.21, No.3, pp.63-73, 2017.
- 河原 英紀,ディジタル信号処理の落とし穴,日本音響学会誌,Vol.73, No.9, 2017. (link to PDF)
- Matsui T, Irino T, Nagae M, Kawahara H, Patterson RD. The Effect of Peripheral Compression on Syllable Perception Measured with a Hearing Impairment Simulator. InPhysiology, Psychoacoustics and Cognition in Normal and Impaired Hearing 2016 (pp. 307-314). Springer International Publishing. (Link to open access page)
- 溝渕 翔平、西村 竜一、松井 淑恵、入野 俊夫、河原 英紀、声道形状と声帯音源特性の操作に基づいたグロウル系歌唱の印象付与法、電子情報通信学会 論文誌D,Vol.J99-D,No.3,pp.283-292,Mar. 2016.
- 河原 英紀、音声分析変換合成基盤ソフトウェアSTRAIGHTとその応用、コンピュータソフトウェア、日本ソフトウェア科学会、Vol.32, No.3, pp.3_23-3_28, (2015)
PDF
- 河原 英紀, 音声の実時間表示とモーフィングで探る声の多様性, 音声研究, Vol.18, No.3, pp.43-52 (2014)
PDF
- 森本 隆司, 入野 俊夫, 西村 竜一, 河原 英紀, 劣化音声認識における単語の音響的連続性とモーラ遷移情報の影響の評価,
日本音響学会誌, Vol.70, No. 11, pp.578-588, 2014.
- Schweinberger, S. R., Kawahara, H., Simpson, A. P., Skuk, V. G. and Zäske, R. , Speaker perception. WIREs Cogn Sci, 5(1), pp.15-25, 2014. (DOI: 10.1002/wcs.1261)
- Taiki Nishi, Ryuichi Nisimura, Toshio Irino and Hideki Kawahara,
Controlling linguistic information and filtered sound identity for a new cross-synthesis vocoder,
Acousti. Sci. and Tech., 34(4),Pp.287-288, 2013.
- Toshio Irino, Yoshie Aoki, Hideki Kawahara, Roy. D. Patterson,
Comparison of performance with voiced and whispered speech in word recognition and mean-formant-
frequency discrimination, Speech Communication,54(9), pp.998-1013, 2012.
- Hideki Kawahara and Masanori Morise,
Technical foundations of TANDEM-STRAIGHT, a speech analysis, modification and synthesis framework,
SADHANA - Academy Proceedings in Engineering Sciences, Vol.36, Part 5, pp.713-722, 2011.
(PDF)
-
赤桐隼人,森勢将雅,入野俊夫,河原英紀, ``スペクトルピークを強調したF0適応型スペクトル包絡抽出法の最適化と評価,'' 電子情報通信学会 論文誌A,Vol.J94-A, No.8, pp.557-567, 2011.
- Erika Okamoto, Toshio Irino, Ryuichi Nisimura and Hideki Kawahara: Evaluation of voice morphing using vocal tract length normalization based on auditory filterbank, Journal of Signal Processing, Vol.15, No.4, pp.283-286, July, 2011.
- 河原 英紀:音声分析合成技術の動向、日本音響学会誌、Vol.67, No.1, pp.40-45 (2011). [解説:再掲]
- 森勢 将雅、河原 英紀、西浦 敬信:基本波検出に基づく高SNR の音声を対象とした高速なF0 推定法、電子情報通信学会論文誌D, Vol.J93-D, No.2, pp.109-117 (2010).
- Romi Zäske, Stefan R. Schweinberger, Hideki Kawahara, Voice aftereffects of adaptation to speaker identity, Hearing Research, Volume 268, Issues 1-2, 1 September 2010, Pages 38-45,
DOI: 10.1016/j.heares.2010.04.011
- Laetitia Bruckert, Patricia Bestelmeyer, Marianne Latinus, Julien Rouger, Ian Charest, Guillaume A. Rousselet, Hideki Kawahara, Pascal Belin, Vocal Attractiveness Increases by Averaging, Current Biology, Volume 20, Issue 2, 116-120, 26 (January 2010)
DOI: 10.1016/j.cub.2009.11.034
- Romi Zäske, Stefan R. Schweinberger, Jürgen M. Kaufmann, Hideki Kawahara:
In the ear of the beholder: neural correlates of adaptation to voice gender,
European Journal of Neuro Science, Vol.30, No.3, pp.527-534 (August 2009)
DOI: 10.1111/j.1460-9568.2009.06839.x
- Osamu Fujimura, Kiyoshi Honda, Hideki Kawahara, Yasuyuki Konparu, Masanori Morise and J.C. Williams, Noh Voice Quality, J. Logopedics Phoniatrics Vocology,34(4), 157-170 (04 June 2009)
DOI: 10.1080/14015430903002288
- 河原、森勢:TANDEM-STRAIGHTと音声モーフィング:感情音声と歌唱研究への応用、音声研究、Vol.13, No.1, pp.29-39 (2009) [解説:再掲]
- 河原英紀:音声モーフィングの背景と可能性、音声言語医学、50(2), pp.131-135, (2009). [解説:再掲]
- 森勢 将雅、高橋 徹、入野 俊夫、河原 英紀、分析時刻に依存しない周期信号のパワースペクトル推定法を用いた音声分析、電子情報通信学会、Vol.J92-A,No.3,pp.163-171,Mar. 2009.
- Stefan R. Schweinberger, Christoph Casper, Nadine Hauthal, Jürgen M. Kaufmann, Hideki Kawahara, Nadine Kloth, David M.C. Robertson, Adrian P. Simpson and Romi Zäske,
Auditory Adaptation in Voice Perception, Current Biology 18(9), 684-688, May 6, (2008).
DOI: 10.1016/j.cub.2008.04.015
- 河原 英紀,生駒 太一,森勢 将雅,高橋 徹,豊田 健一,片寄 晴弘, モーフィングに基づく歌唱デザインインタフェースの提案と初期検討,
情報処理学会論文誌,Vol.48,No.12, pp.3637-3648 (2007)
- 森勢 将雅, 高橋 徹,河原 英紀, 入野 俊夫, 窓関数による分析時刻の影響を受けにくい周期信号のパワースペクトル推定法,
電子情報通信学会誌D, Vol.J90-D, No.12, pp.3265-3267 (2007)
- 森勢 将雅, 入野 俊夫, 河原 英紀, 測定用信号として音声を用いたクロススペクトル法によるインパルス応答推定の誤差評価,
電子情報通信学会論文誌A, Vol.J90-A, No.7, pp.559-566(2007)
- 河原英紀:Vocoderのもう一つの可能性を探る--音声分析変換合成システムSTRAIGHTの背景と展開--,
日本音響学会誌,Vol.63,No.8,pp.442-449 (2007).[招待:解説:再掲]
- Hideki Banno, Hiroaki Hata, Masanori Morise, Toru Takahashi, Toshio Irino and Hideki Kawahara,
"Implementatioin of realtime STRAIGHT speech manipulation system: Report on its first implementation,"
Acoustic Science and Technology, Vol.28, No.3, pp.140-146 (2007)
DOI: 10.1250/ast.28.140
- 橋田 光代, 長田 典子, 河原 英紀, 片寄 晴弘, 複数旋律音楽に対する演奏表情付けモデルの構築(演奏認識/合成,
情報処理学会論文誌,Vol.48, No.1,pp. 248-257 (2007)
- Hideki Kawahara: STRAIGHT, Exploration of the other aspect of VOCODER:
Perceptually isomorphic decomposition of speech sounds,
Acoustic Science and Technology, Vol.27, No.6, pp.349-353 (2006).[招待:解説:再掲]
DOI: 10.1250/ast.27.349
- Toshio Irino, Roy D. Patterson, and Hideki Kawahara, "Speech
segregation using an auditory vocoder with event-synchronous
enhancements," IEEE Trans. Speech and Audio Process.,
Vol.27, Issue 6, pp.2212-2221 (2006).
DOI: 10.1109/TASL.2006.872611
- 森勢 将雅, 入野 俊夫, 坂野 秀樹, 河原 英紀, "暗騒音と高調波歪みに
頑健なインパルス応答測定用信号:Warped-TSP," 電子情報通信学会論文誌,
Vol.J89-A,No.1,pp.-(Jan. 2006)
- David R. R. Smith, Roy D. Patterson, Richard Turner Hideki Kawahara and Toshio Irino,
The processing and perception of size information in speech sounds,
Journal of the Acoustical Society of America, Vol.117, No.1, pp.305-318 (2005)
DOI: 10.1121/1.1828637
- 貫名真澄、河原英紀, "発話時の頭部周辺での音声の伝達特性について ",日本音響学会誌、Vol.59, No.5, (2003).
- Alain de Cheveigné,Hideki Kawahra, YIN, "a fundamental frequency estimator for speech and music" Journal of the Acoustical Society of America, Vol.111, No.4, pp.1917-1930 (2002)
- 河原英紀,片寄晴弘,"高品質音声分析変換合成システムSTRAIGHTを用いたスキャット生成研究の提案" 情報処理学会論文誌 Vol.43 No.2 pp.208-218 Feb.2002.
- 坂野秀樹,陸金林,中村哲,鹿野清宏,河原英紀,"時間領域平滑化郡遅延を用いた短時間位相の効率的表現方法" 電子情報通信学会論文誌,D-II, Vol.J84-D-II, No.4, pp.621-628, Apr.2001.
- 坂野秀樹,陸金林,中村哲,鹿野清宏,河原英紀 "時間領域平滑化群遅延による位相制御を用いた声質制御方式",電子情報通信学会誌,J83-DII,11,pp.2276--2282,
(2000).
- 阿竹義徳, 入野俊夫, 河原英紀 , 陸金林, 中村哲, 鹿野清宏 "調波成分の瞬時周波数を用いた基本周波数推定方法",
電子情報通信学会誌, D-II, J83-D-II, 11, pp.2077--2086, (2000)
- Alain de Cheveigne,Hideki Kawahara,"Missing-data Model of Vowel Identification" J.Acoust.Soc.Am., Vol.105, pp.3497-3508, 1999.
- Hideki Kawahara, Ikuyo Masuda-Katsuse and Alain de Cheveigne:
Restructuring speech representations using a pitch-adaptive time-frequency
smoothing and an instantaneous-frequency-based F0 extraction:
Possible role of a reptitive structure in sounds, Speech Communication,
27, 3, pp.187-207 (1999).
- Alain de Cheveigne and Hideki Kawahara, " Multiple period estimation and pitch perception model",
Speech Communication,
27, 3, pp.175-186 (1999).
- Ikuyo Masuda-Katsuse and Hideki Kawahara, "Dynamic sound stream formation based on continuity of spectral change",
Speech Communication,
27, 3, pp.235-259 (1999).
- Hiroko Kato and Hideki Kawahara: ``An Application of the
Bayesian Time Series Model and Statistical System Analysis for
F0 Control'', Speech Communication, 24 (4), pp.325-339(1998),
- Alain DE CHEVEIGNE (CNRS), Hideki KAWAHARA, Minoru TSUZAKI,
and Kiyoaki AIKAWA: ''Concurrent Vowel Identification I: Effects
of relative Amplitude and F0 Differences,'' J. Acoust. Soc. Am.,
Vol.101, pp.2839-2847 (1997.5)
- Reiko AKAHANE-YAMADA, Takahiro ADACHI and Hideki KAWAHARA:
Second language production training using spectrographic representations
as feedback, J. Acoust. Soc. Jpn(E), 18, pp.341-343 (1997).
- 相川清明,河原英紀: ''複合周波数変化音追跡神経演算モデル,'' 日本音響学会誌, 53巻, 2号, pp.95-102
(1997.2)
- 相川清明,津崎 実,河原英紀,東倉洋一: ''周波数変化音追跡の動特性,'' 日本音響学会誌, 52巻, 10号,
pp.741-751 (1996.10)
- Kiyoaki AIKAWA, Harald SINGER (ATR-ITL), Hideki KAWAHARA
and Yoh'ichi TOHKURA: ''Cepstral Representation of Speech Motivated
by Time-Frequency Masking: An Application to Speech Recognition,''
J. Acoust. Soc. Am. , Vol.100, No.1, pp.603-614 (1996.7)
- Alain DE CHEVEIGNE (CNRS/Universite Paris 7), Hideki KAWAHARA,
Kiyoaki AIKAWA and Andrew P. LEA : ``Speech Separation for Speech
Recognition,'' J. de Physique IV, C5, pp.545-548 (1994.5)
- Hideki KAWAHARA : ``Interactions between Speech Produciton
and Perception under Auditory Feedback Perturbations on Fundamental
Frequencies,'' J. Acoust. Soc. Jpn. (E), Vol.15, No.3, pp.201-202
(1994.5),
- 小原和昭、相川清明、河原英紀 : ``時間周波数マスキング特性を模擬した聴覚フィルタモデルによる音声認識,'' 日本音響学会誌,
50巻, 5号, pp.345-352 (1994.5)
- Toshio Irino and Hideki Kawahara : ``Signal Reconstruction
from Modified Auditory Wavelet Transform,'' IEEE Trans. Signal
Processing, Vol.41, No.12, pp.3549-3554 (1993.12),
- 相川清明、河原英紀、東倉洋一 : ``順向マスキングの時間周波数特性を模擬した動的ケプストラムを用いた音韻認識,''
電子情報通信学会論文誌 A, Vol.J76-A, No.11, pp.1514-1521 (1993.11)
- 鈴木紳、河原英紀: ``平均曲率に基づいた神経回路網の評価基準''、 電子情報通信学会誌D-II、Vol.J75-D-II、No.3、pp.673-645、(1992.3)
- Toshio Irino and Hideki Kawahara : ``A Method for Designing
Neural Networks Using Nonlinear Multivariate Analysis: Application
to Speaker-Independent Vowel Recognition,'' {\em Neural Computation},
vol.2, pp.386-397 (1990)
- 入野俊夫、河原英紀 : ``多層神経回路網の非線形多変量解析による構成法''、 電子情報通信学会誌D-II、Vol.J72-D-II、No.8、pp.1187-1193、(1989.8)
- 河原英紀、筧一彦 : ``音声のラウドネスに対する周波数貢献度の推定''、 日本音響学会誌、Vol.37、No.1、pp.26-32、(1981.1)
- 河原英紀、栃内香次、永田邦一 : ``小区間の線形予測分析とその誤差評価''、 日本音響学会誌、Vol.33、No.9、pp.470-479、(1977.9)
国際会議等(査読, Proceedings有, 招待)
2019年
- Hideki Kawahara, Ken-Ichi Sakakibara, Eri Haneishi, and Kaori Hagiwara.
Real-time and interactive tools for vocal training based on an analytic signal with a cosine series envelope,
In Proc. APSIPA ASC, pp. 907-910. (2019).
- Hideki Kawahara, Ken-IchiSakakibara, Mitsunori Mizumachi, Hideki Banno, Masanori Morise, and Toshio Irino,
Frequency domain variant of Velvet noise and its application to acoustic measurements.
In Proc. APSIPA ASC, pp. 1523-1532, (2019).
- Hiroko Terasawa, Wakasa, K., Kawahara, H., Sakakibara, K.
Investigating the Physiological and Acoustic Contrasts Between Choral and Operatic Singing.
Proc. Interspeech 2019, 2025-2029, DOI: 10.21437/Interspeech.2019-1864.
2018年
- Hideki Kawahara, Masanori Morise, Kanru Hua, Revisiting spectral envelope recovery from speech sounds generated by periodic excitation,
APSIPA Annual Summit and Conference 2018, TH-P1-8.4, Hawaii USA, 12-15 November, (2018) (Accepted)
- Hemant A. Patil, Hideki Kawahara, Voice Conversion: Challenges and Opportunities,
APSIPA Annual Summit and Conference 2018 Tutorial, T3, Hawaii USA, 12-15 November, (2018) (Accepted)
- Hideki Kawahara, Frequency domain velvet noise: A flexible building block for hearing and acoustic research,
Tohoku Universal Acoustical Communication Month 2018, Sendai Japan, 22-27 October, (2018) (Accepted)
- Hideki Kawahara, Making Speech Tangible for a Better Understanding of Human Speech Communication, S4P Summer School on Speech Signal Processing 2018,
Gandhinagar India, 9-11 September (2018). (Invited talk) ( S4P site )
- Hideki Kawahara, Ken-Ichi Sakakibara, Masanori morise, Hideki Banno, Tomoki Toda, Toshio Irino,
Frequency domain variants of velvet noise and their application to speech processing and synthesis, Interspeech 2018, 2-6 Sept. 2018, Hyderabad, India,
pp.2027-2031, (2018). ( DOI:Interspeech.2018-43)
2017年
- H Kawahara, KI Sakakibara, M Morise, H Banno, T Toda, Accurate estimation of fo and aperiodicity based on periodicity detector residuals and deviations of phase derivatives, APSIPA Annual Summit and Conference 2017, FA-P5.8
- H Kawahara, E Haneishi, K Hagiwara, Realtime feedback of singing voice information for assisting students learning music therapy, 2017 International Conference on Orange Technologies, 99-102, 2017.
- Hideki Kawahara, Making speech tangible for better understanding of human speech communication, The 21th International Conference on Asian Language Processing, Dec. 2017. [Keynote Speech]
- H. Kawahara, K. Sakakibara, Characterization of subharmonic voices using phase derivatives,
Proc. PEVOC, Aug.-Sept. 2017.
- H. Kawahara, K. Sakakibara, H. Banno, M. Morise, T. Toda, T. Irino,
A new cosine series antialiasing function and its application to aliasing-free glottal source models for speech and singing synthesis,
Interspech 2017, Aug. pp.1358-1362, 2017. , (Link to arXiV)
- H. Kawahara, K. Sakakibara, H. Banno, M. Morise, T. Toda,
A modulation property of time-frequency derivatives of filtered phase and its application to aperiodicity and fo estimation,
Interspech 2017, Aug. pp.424-428, 2017.
- T. Matsui, T. Irino, K. Yamamoto, H. Kawahara, R. D. Patterson,
The effect of spectral tilt on size discrimination of voiced speech sounds,
Interspech 2017, Aug. pp.601-605, 2017.
2016年
- H. Kawahara, Y. Agiomyrgiannakis, H. Zen, Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis, ISCA SSW9, Sept. 2016. (Link to arXiV)
- H. Kawahara, SparkNG: Interactive Matlab tools for introduction to speech production, perception and processing fundamentals and application of the aliasing-free L-F model component, Interspeech 2016 (Show and tell), Sept. 2016.
- M. Morise, H. Kawahara, TUSK: A framework for overviewing the performance of F0 estimators, Interspeech2016, Sept. 2016.
2015年
- H. Kawahara, K. Sakakibara, H. Banno, M. Morise, T. Toda and T. Irino, "Aliasing-free implementation of discrete-time
glottal source models and their applications to
speech synthesis and F0 extractor evaluation", APSIPA ASC 2015, 520-529, Dec. 2015.
- Haneishi, E., H. Kawahara, K. Hagiwara, R. Oribe, H. Takemoto, and K. Honda. "a preliminary study on diaphragm motions and vocal tract configurations during singing: analyses of real-time MRI and acoustic data." In PAN EUROPEAN VOICE CONFERENCE ABSTRACT BOOK, p. 120., Sep.2015.
- K. Yamamoto, T. Irino, R. Nisimura, Ryuichi, H. Kawahara, R. D. Patterson, "How the slope of the speech spectrum affects the perception of speaker size", In INTERSPEECH-2015, 1556-1560, Sep. 2015.
- Toshie Matsui, Toshio Irino, Misaki Nagae, Hideki Kawahara, and Roy D. Patterson, "The effect of peripheral compression on syllable perception measured with a hearing impairment simulator," 17th International Symposium on Hearing , Groningen, Netherlands,15-19 June 2015.
2014年
- Hideki Kawahara, Speech Analysis Modification and Synthesis tool STRAIGHT and extended voice morphing,
in ToolShop:Workshop for Auditory Research Software, ARO midwinter meeting, Baltimore USA,
21-25, Feb. 2014. (Invited)
- Hideki Kawahara, Masanori Morise, Ken-Ichi Sakakibara, Tomoki Toda, Hideki Banno Ryuichi Nisimura, Toshio Irino,
Excitation source design for high-quality speech manipulation systems based on a temporally static group delay representation of periodic signals, APSIPA ASC 2014, Siem Reap, Cambodia, 9-12 Dec., 2014.
- Misaki Nagae, Toshio Irino, Ryuichi Nisimura, Hideki Kawahara, Roy D. Patterson,
Hearing impairment simulator based on compressive gammachirt filter,
APSIPA ASC 2014, Siem Reap, Cambodia, 9-12 Dec., 2014.
- Hideki Kawahara, STRAIGHT speech analysis, Tutorial of APSIPA ASC 2014, Siem Reap, Cambodia, 9-12 Dec., 2014.
- Hideki Kawahara, Masanori Morise, Tomoki Toda, Hideki Banno Ryuichi Nisimura, Toshio Irino,
"Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay
with minimum phase response compensation,"
Prod. Interspeech 2014, Singapore, 14-18, Sept. 2014. (Accepted)
- Hideki Kawahara, Tatsuya Kitamura, Hironori Takemoto, Ryuichi Nisimura, Toshio Irino,
"Vocal tract length estimation based on vowels
using a database consisting of 385 speakers and
a database with MRI-based vocal tract shape information,"
Prod. Interspeech 2014, Singapore, 14-18, Sept. 2014. (Accepted)
- Minori Matsuyama, Ryuichi Nisimura, Hideki Kawahara, Junnosuke Yamada,
and Toshio Irino, "Development of a mobile application for
crowdsourcing the data collection of environmental sounds," in Sakae
Yamamoto (Ed.): Human Interface and the Management of Information,
Information and Knowledge Design and Evaluation, Lecture Notes in
Computer Science, Volume 8521, pp. 514-524,Springer International
Publishing, (2014), presented at HCI International 2014, Heraklion,
Crete, Greece, June 22-27,2014.(25 June, 2014),
(link)
- Ryuichi Nisimura, Kazuki Hashimoto, Hideki Kawahara, and Toshio Irino,
"Proposal for an Interactive 3D Sound Playback Interface Controlled by
User Behavior," in Constantine Stephanidis (Ed.): HCI International
2014 - Posters' Extended Abstracts, Part I, Communications in Computer
and Information Science, Volume 434, pp. 446-450, Springer
International Publishing, (2014), presented at HCI International 2014
(Poster), Heraklion, Crete, Greece, June 22-27,2014.
(link)
- Eri Haneishi, Reiji Oribe, Hironori Takemoto, Hideki Kawahara,
Kiyoshi Honda, Takeshi Saitou, Kaori Hagiwara,
Hiroko Kishimoto, Attempts of Visualization of Singing Techniques: MRI Motion
Imaging of Diaphragm Activities and Acoustic Features during
Singing, 43rd Annual Symposium of the Voice Foundation, Philadelphia PA USA, June 1, p.21, 2014.
2013年
- H. Kawahara, M. Morise, K. Sakakibara:
Temporally fine F0 extractor applied for frequency modulation power spectral analysis of singing voices,
Proc. MAVEBA 2013, Firenze, Italy, 16-18 December, pp.125-128, 2013. (17/Dec./2013)
- M. Sakaguchi, M. Kobayashi, R. Nisimura, T. Irino, H. Kawahara:
Spectrally estimated vocal tract lengths of singing voices and their contributing factors,
Proc. MAVEBA 2013, Firenze, Italy, 16-18 December, pp.121-124, 2013. (17/Dec./2013)
- Hideki Kawahara, Masanori Morise, Hideki Banno, and Verena G. Skuk:
Temporally variable multi-aspect N-way morphing based on interference-free speech representations,
Proc. APSIPA ASC 2013, Kaohsiung, Taiwan, OS.28-SLA.9, 2013. (31/Oct./2013)
- Toshio Irino, Erika Okamoto, Ryuichi Nisimura, and Hideki Kawahara,
Vocal Tract Length Estimation for Voiced and Whispered Speech Using Gammachirp Filterbank,
Proc. APSIPA ASC 2013, Kaohsiung, Taiwan, OS.13-SLA.5, 2013. (30/Oct./2013)
- Masanori Morise, Hideki Kawahara and Kenji Ozawa:
Periodicity extraction for voiced sounds with multiple periodicity,
Proc. Interspeech2013, Lyon, 25-29 August, pp.1921-1925, 2013. (28/Aug./2013)
- Yuri Nishigaki, Ken-Ichi Sakakibara, Masanori Morise, Ryuichi Nisimura, Toshio Irino and Hideki Kawahara:
Controlling "shout" expression in a Japanese POP singing performance: analysis and suppression study,
Proc. Interspeech2013, Lyon, 25-29 August, pp.2905-2909, 2013. (28/Aug./2013)
- Hideki Kawahara, Masanori Morise, Tomoki Toda, Ryuichi Nisimura and Toshio Irino:
Beyond bandlimited sampling of speech spectral envelope imposed by the harmonic structure of voiced sounds,
Proc. Interspeech2013, Lyon, 25-29 August, pp.34-38, 2013. (26/Aug./2013)
- Hideki Kawahara, Masanori Morise and Ken-Ichi Sakakibara:
Interference-free observation of temporal and spectral features in "shout" singing voices and their perceptual roles,
Proc. SMAC-SMC 2013, Stockholm, 30 July-3 August, pp.256-263, 2013. (1/Aug./2013)
- Mayuko Kobayashi, Ryuichi Nisimura, Toshio Irino, and Hideki Kawahara,
Estimated relative vocal tract lengths from vowel spectra based on fundamental frequency adaptive analyses and their relations to relevant physical data of speakers,
ASA/ICA 2013, Montreal, 5aSCb44, 2013. (7/June/2013)
- Tomofumi Fukawatase, Toshio Irino, Ryuichi Nisimura, Hideki Kawahara, and Roy Patterson,
Optimizing the simultaneous estimation of frequency selectivity and compression using notched-noise maskers with asymmetric levels,
ASA/ICA 2013, Montreal, 1pPPb3, 2013. (3/June/2013)
- Hideki Kawahara, Masanori Morise, Ryuichi Nisimura and Toshio Irino:
Higher order waveform symmetry measure and its application to periodicity detectors for speech and singing with fine temporal resolution,
ICASSP2013, Vancouver Canada, 26-31 May, pp.6797-6801, 2013. (30/May/2013).
- Hideki Kawahara,
New dimensions of voice attributes modification for exploratory research on person related information,
7th PPRU Workshop, Jena Germany, 26, April, 2013. (Invited)
2012年
- Hideki Kawahara, Masanori Morise, Ryuichi Nisimura and Toshio Irino:
An interference-free representation of group delay
for periodic signals,
Proc. APSIPA, 3-6 December, OS.17-SLA 8, 2012 Calfornia, USA. (4/Dec./2012)
- Ryuichi Nisimura, Shoko Miyamori, Erika Okamoto, Hideki Kawahara, and Toshio Irino:
Detecting child speaker based on
auditory feature vectors for VTL estimation,
Proc. APSIPA, 3-6 December, PS.5-SLA18, 2012 Calfornia, USA. (6/Dec./2012)
- Taiki Nishi, Ryuichi Nisimuray, Toshio Irinoy and Hideki Kawahara:
Modulation transfer function design for a flexible
cross synthesis VOCODER based on F0 adaptive
spectral envelope recovery,
Proc. APSIPA, 3-6 December, PS.5-SLA18, 2012 Calfornia, USA. (6/Dec./2012)
- Hideki Kawahara, Masanori Morise:
Simplified aperiodicity representation for
high-quality speech manipulation systems,
Proc. ICSP2012, Beijing, pp.579-584, 2012. (22/Oct./2012)
- Zhengqi Wen, Hideki Kawahara, Jianhua Tao: Pitch-Scaled Analysis based Residual Reconstruction for Speech Analysis and Synthesis,
Interspeech2012, 2012. (10/Sept./2012)
- Hideki Kawahara, Masanori Morise, Ryuichi Nisimura, Toshio Irino:
Deviation measure of waveform symmetry and its
application to high-speed and temporally-fine F0
extraction for vocal sound texture manipulation,
Interspeech2012, 2012. (10/Sept./2012)
- Josh H. McDermott, Daniel P. W. Ellis and Hideki Kawahara:
Inharmonic Speech: A Tool for the Study of Speech Perception and Separation,
SAPA-Scale Conference 2012, 2012. (9/Sept./2012)
- Hideki Kawahara: Excitation source structural analysis of Japanese traditional singing voices,
Acoustics 2012, Hong Kong China, 14-18 May 2012. (Invited talk:16/May)
- Hideki Kawahara: A new dimension of voice quality manipulation,
LISTA workshop 2012, Edinburgh UK, 2-3 May 2012. (Invited talk:3/May)
2011年
- Hideki Kawahara, Masanori Morise, Toshio Irino,
Analysis and synthesis of strong vocal expressions: Extension and application of audio texture features to singing voice,
ICASSP2012, pp.5389-5392, March 2012.
- Erika Okamoto, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara,
Auditory filterbank improves voice morphing,
Proc. Interspeech 2011, pp.2517-2520, August 2011.
- R. Nisimura, S. Miyamori, L. Kurihara, H. Kawahara, and T. Irino,
Development of Web-Based Voice Interface to Identify Child Users Based on Automatic Speech Recognition System,
HCI 2011, part 4, pp.607-616, July 2011.
- Hideki Kawahara, Toshio Irino and Masanori Morise,
An interference-free representation of instantaneous frequency of periodic signals and its application to F0 extraction,
Proc. ICASSP 2011, pp.5420-5423, May 2011.
2010年
- Hideki Kawahara, In search of perceptually relevant speech representations, - STRAIGHT, TANDEM-STRAIGHT and beyond -,
NCSP2011, pp.252-155, 2011. (Plenary Talk, 2nd March, 2011)
- Yoshika Wada, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara, A new formulation of a multiple periodicity extractor for expressive and pathological voices, NCSP2011, pp.336-339, 2011. (3rd March, 2011).
- Erika Okamoto, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara, Evaluation of voice morphing using vocal tract length normalization based on auditory filterbank, NCSP2011, pp.187-190, 2011. (2nd March, 2011)
- Yoshika Wada, Masanori Morise, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara, Optimization of a Multiple Local Periodicity Detector for Vocal Excitation Structure Analysis, Proceedings of the Second APSIPA Annual Summit and Conference, pp.518-521, Biopolis, Singapore, 14-17 December 2010.
- Kawahara, H.; Morise, M.; , "Tolerance of FO adaptive time-frequency analysis for spectrographic representations," 2010 IEEE 10th International Conference on Signal Processing (ICSP), pp.601-604, 24-28 Oct. 2010
DOI: 10.1109/ICOSP.2010.5655859
- Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Ryuichi Nisimura, Toshio Irino, Simplification and Extension of Non-Periodic Excitation Source Representations for High-Quality Speech Manipulation Systems, Interspeech 2010, Makuhari Japan, 27 September, 2010.
- Hideki Kawahara : Exploration of the other aspect of Vocoder revisited, -- A-Z STRAIGHT, TANDEM-STRAIGHT and morphing --, 7th ISCA Speech Synthesis Workshop (SSW7), Kyoto Japan, 22 September, 2010.
- Hideki Kawahara, Hanae Itagaki, Yoshika Wada, Masanori Morise, Ryuichi Nisimura, Toshio Irino, 20th International Congress on Acoustics (ICA2010), Sydney Australia, 26, August, 2010.
- Hayato Akagiri, Masanori Morise, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara, Evaluation and optimization of F0‐adaptive spectral envelope estimation based on spectral smoothing with peak emphasis, 20th International Congress on Acoustics (ICA2010), Sydney Australia, 24, August, 2010.
2009年
- Hideki Kawahara, Ryuichi Nisimura, Toshio Irino, Masanori Morise, Toru Takahashi and Hideki Banno:
High-quality and light-weight voice transformation enabling extrapolation without perceptual and objective breakdown,
ICASSP2010, Dallas Texas, pp.4818-4821 (March, 2010)
- Ayanori Arakawa, Yoshinori Uchimura, Hideki Banno, Fumitada Itakura and Hideki Kawahara:
High quality voice manipulation method based on the vocal tract area function obtained from sub-band STRAIGHT spectrum,
ICASSP2010, Dallas Texas, pp.4834-4837 (March, 2010)
- Hideki Kawahara: Speech morphing based on biologically relevant signal representations, Proc. MAVEBA09, 14-16 Dec. Firenze Italy pp.83-86 (Dec., 2009).
- Hanae Itagaki, Masanori Morise, Ryuichi Nisimura, Toshio Irino and Hideki Kawahara:
A bottom-up procedure to extract periodicity structure of voiced sounds and its application to represent and restoration of pathological voices, Proc. MAVEBA09, 14-16 Dec. Firenze Italy, pp.115-118 (Dec., 2009).
- Masanori Morise, Hideki Kawahara and Takanobu Nishiura: Rapid F0 estimation for high-SNR speech, Proc, WESPAC2009, Beijing, China, CD-ROM, Sept. 21-23, Beijing (2009).
- Hideki Kawahara, Toru Takahashi, Masanori Morise and Hideki Banno: Development of exploratory research tools based on TANDEM-STRAIGHT, Proc. APSIPA, Sapporo, pp.111-120 (2009).
- Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Ryuichi Nisimura and Toshio Irino: Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion, Proc. Interspeech2009, Brighton, pp.2647-2650 (2009).
- Masanori Morise, Masato Onishi, Hideki Kawahara and Haruhiro Katayose: v.morish’09: A Morphing-based Singing Design ICEC 2009, Paris, pp.185-190 (2009).
- Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Ryuichi Nisimura and Toshio Irino: Vocoder-based morphing tool demonstrations for flexible voice manipulations, AES 14th Regional Convention, Tokyo (2009).
- H. Kawaahra, R. Nisimura, T. Irino, M. Morise, T. Takahashi, B. Banno, Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown, Proc. ICASSP, Taipei, Taiwan, 19-24 (2009).
DOI: 10.1109/ICASSP.2009.4960481
2008年
- Y. Yoshida, R. Nisimura, T. Irino, H. Kawahara, Vowel-based voice conversion and its application to singing-voice manipulation, Proc. AES 35th International conference audio for games, London, UK, 11-13 (2009).
- M. Morise, H. Kawahara, H. Katayose, Fast and reliable F0 estimation methos based on the period extraction of vocal fold vibration of singing voice and speech, Proc. AES 35th International conference audio for games, London, UK, 11-13 (2009).
- R. Nisimura, J. Miyake, H. Kawahara, T. Irino: Speech-to-text input method for web system using JavaScript, SLT 2008, Goa, India, 15-18 December, 2008.
DOI: 10.1109/SLT.2008.4777877
- Masato Onishi, Toru Takahashi, Toshio Irino and Hideki Kawahara, Vowel-based frequency alignment function design and recognition-based time alignment for automatic speech morphing, SLT 2008, Goa, India, 15-18 December, 2008.
DOI: 10.1109/SLT.2008.4777831
- Hideki Kawahara, Looking into the past: Power spectral representation of periodic signals, sampling theories and fundamental frequency estimation for remaking speech, The 6th ISCSLP 2008, Kunming, Chine, 16-19 December, 2008. [Keynote lecture]
- Hideki Kawahara, Masanori Morise, Hideki Banno, Toru Takahashi, Ryuichi Nisimura and Toshio Irino, Spectral Envelope Recovery beyond the Nyquist Limit for High-Quality Manipulation of Speech Sounds, Interspeech 2008, Brisbane, Australia, 22-26 September 2008, pp.650-653 (2008).
- Yoshinori Uchimura, Hideki Banno, Fumitada Itakura and Hideki Kawahara, STUDY ON MANIPULATION METHOD OF VOICE QUALITY BASED ON THE VOCAL TRACT AREA FUNCTION, Interspeech 2008, Brisbane, Australia, 22-26 September 2008, pp.1084-1087 (2008).
- Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nisimura, Hideki Banno, Toshio Irino,
A unified approach for F0 extraction and aperiodicity estimation based on a temporally stable power spectral representation,
ISCA Tutorial and Research Workshop (ITRW) on "Speech Analysis and Processing for Knowledge Discovery",
Aalborg, 4-6 June (2008)
- Hideki Kawaahra,
TANDEM-STRAIGHT, a research tool for L2 study enabling flexible manipulations of prosodic information,
Speech Prosody 2008, Campinas Brazil, May 6-9 2008. [Keynote lecture]
- Donna Erickson, Takaaki Shoichi, Caroline Menzes, Hideki Kawahara and Ken-Ichi Sakakibara, Some non-F0 cues to emotional speech: An experiment with morphing, SP2008, Campinas Brazil, 6-9 May 2008
- Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nisimura, Toshio Irino, Hideki Banno,
Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation,
Proc. ICASSP 2008, Las Vegas,pp.3933-3936(2008)
DOI: 10.1109/ICASSP.2008.4518514
2007年
- Hideki Kawahara,
Remaking Speech Revisited -- STRAIGHT and TANDEM-STRAIGHT and Their Implications,
Asian Workshop on Speech Science and Technology, Tokyo, Japan, Mar 20, 2008
[Invited]
- Masato Onishi, Toru Takahashi, Masanori Morise, Toshio Irino, Hideki Kawahara, "Vowel-based voice conversion and its objective evaluation", Proc. 2008 RISP International Workshop on Nonlinear Circuits and Signal Processing (NCSP'08), pp.275-278, Gold Coast, Australia, 6-8 Mar. 2008.
- Yoshie Aoki, Toshio Irino, Hideki Kawahara, Roy D. Patterson, "Speaker size discrimination for acoustically scaled versions of naturally spoken words", in Abstracts of ARO 31th Midwinter meeting, Phoenix, AZ, USA, 16-21 Feb. 2008.
- Hideki Kawahara,
Application and Extensions of STRAIGHT-based Morphing for Singing Voice Manipulations
based on Vowel Centerd Approach,
Proc. ICA2007, Madrid (2007)[Invited].
- Toshio Irino, Yoshie Aoki, Yoshie Hayashi, Hideki Kawahara, Roy Patterson,
Discription and Recognition of Scaled Word Sound,
Proc. Interspeech 2005, Antwerp (2007).
- Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Toshio Irino, Osamu Fujimura,
Group delay for acoustic event representation and its application for speech aperiodicity analysis,
Proc. EUSIPCO (2007).
2006年
- Ryuichi Nisimura, Souji Omae, Hideki Kawahara and Toshio Irino,
"Analyzing dialogue data for real-world emotional speech classification",
Proc. Interspeech-2006, CD-ROM (2006).
- Toru Takahashi, Masashi Nishi, Toshio Irino and Hideki Kawahara,
"Automatic assignment of anchoring points on vowel templates for defining correspondence between time-frequency representations of speech samples",
Proc. Interspeech-2006, CD-ROM (2006).
- Masanori Morise, Toshio Irino and Hideki Kawahara,
"Logarithmic temporal processing applied to accurate empirical transfer function measurements in vocal sound propagation",
Proc.EUSIPCO, CD-ROM (2006).
- Toru Takahashi, Hideki Banno, Toshio Irino and Hideki Kawahara,
"Speech style conversion based on the statistics of vowel spectrogram and nonlinear frequency mapping", Proc.EUSIPCO, CD-ROM (2006).
- Toru Takahashi, Toshio Irino, Hideki Kawahara, "General Framework for Flexible Speech Style Manipulation and Synthesis", Proc. of WESPAC IX 2006 (9th Western Pacific Acoustics Conference), Seoul, Republic of Korea, June 26-28, (2006).
- Ryuichi Nisimura, Aki Hashizume, Toshio Irino, Hideki Kawahara, "Human-robot interaction interface using GMM-based noise recognition", Proc. of WESPAC IX 2006 (9th Western Pacific Acoustics Conference), Seoul, Republic of Korea, June 26-28, (2006).
2005年
- Toru Takahashi, Hideki Banno, Ryuichi Nisimura, Toshio Irino and Hideki Kawahara,
Spectral Fluctuation Mapping Model for Japanese Speech Style Conversion based on Statistics in Emotional Speech Database,
Cocosda 2005, Jakarta, Indonesia, 6-8 Dec. 2005.
- Hideki Kawahara, Alain de Cheveigne, Hideki Banno, Toru Takahashi and Toshio Irino,
Nearly Defect-free F0 Trajectory Extraction for Expressive Speech Modifications based on STRAIGHT,
Proc. Interspeech2005, Lisboa, pp.537-540, Sept. 2005.
- Toshio Irino, Satoru Satou, Shunsuke Nomura, Hideki Banno,
Hideki Kawahara,"Speech intelligibility derived from
time-frequency and source smearing,"
Interspeech 2005,pp.1737-1740, Lisbon, Portugal, Sept. 2005.
- Toru Takahashi, Takeshi Fujii, Masashi Nishi, Hideki Banno,
Toshio Irino, Hideki Kawahara,"Voice and Emotional Expression
Transformation based on Statistics of Vowel Parameters in an
Emotional Speech Database,"
Interspeech 2005, pp.1853-1856, Lisbon, Portugal, Sept. 2005.
- Masanori Morise, Toshio Irino, Hideki Banno, and Hideki
Kawahara,"A test signal robust against background noise in the
measurement of acoustic impulse responses: Warped-TSP,"
The 34th International Congress and Exposition on
Noise Control Engineering (Internoise 2005) ,
Rio de Janeiro, Brazil, 7-10 Aug. 2005.
2004年
- Hideki Kawahara, Yumi Hirachi, Masanori Morise and Hideki Banno, Procedure “senza vibrato”: a
key component for morphing singing, Proc. ICSLP2004, vol.5, pp.934-937, 2004.
- Jiang Jin, Hideki Banno, Hideki Kawahara, and Toshio Irino, Intelligibility of degraded speech from
smeared STRAIGHT spectrum, Proc. ICSLP2004, vol.4, pp.530-533, 2004.
- Yuki DENDA Takanobu NISHIURA Hideki KAWAHARA and Toshio IRINO,
A Design of Audio-Visual Talker Tracking System Based On
CSP Analysis and Frame Difference in Real Noisy Environments,
International workshop on Multimedia Signal Processing, Siena, OW2, September 29-Oct.1, 2004.
- Masanori Morise and Hideki Kawahara,
Loudspeaker equalization based on multi-location observation with
reliable time-frequency region selection and its evaluation using
sound propagation measurement,
Proc. EUSIPCO'2004 Vienna, pp.1995-1998, 2004.
- Yuki Denda, Takanobu Nishiura, Hideki Kawahara, Toshio Irino, "Effectiveness of wavelet spectral subtraction in noisy speech recognition," Seventh International Conference on Signal Processing (ICSP'04) , Beijing, China, 31 Aug. - 4 Sept., 2004.
- Hideki Kawahara, Hideki Banno, Masanori Morise, Yumi Hirachi, A cappella synthesis demonstrations
using RWC music database, Proc. NIME04, pp.130-131, 2004.
- Hideki Kawahara, Hideki Banno, Toshio Irino and Parham Zolfaghari,
ALGORITHM AMALGAM:
MORPHING WAVEFORM BASED METHODS, SINUISOIDAL MODELS AND STRAIGHT,
Proc. ICASSP'2004, Montreal Canada, pp.13-16, 2004.
- Hideki Kawahara,
Computational basis of illusionary pitch perception,
International Congress on Acoustics Kyoto, vol.2, pp.1081-1084, 2004. (Invited talk)
- Masanori Morise,
A new acoustic measurement and compensation method based on logarithmic
transformation of the time axis and multi-location acquisition
International Congress on Acoustics Kyoto, vol.1, pp.721-724, 2004.
- Minoru Tsuzaki, Hideki Kawahara,
Effects of Group Delay Diffusion in Pulse Trains on Timbre:
A Periodicity Cue in Auditory Images,
International Congress on Acoustics Kyoto, vol.2, pp.1803-1806, 2004.
- Reiko Akahane-Yamada, Hiroaki Kato, Takahiro Adachi, Hideyuki Watanabe,
Ryo Komaki, Rieko Kubo, Tomoko Takada, Yuko Ikuma, Hiroaki Tagawa,
Keiichi Tajima and Hideki Kawahara,
ATR CALL: A speech perception/production training system
utilizing speech technology,
International Congress on Acoustics Kyoto, vol.3, pp.2319-2322, 2004.
- Toshio Irino, Roy D. Patterson and Hideki Kawahara,
Speech segregation using an auditory VOCODER with event-synchronous enhancement,
International Congress on Acoustics Kyoto, vol.4, 3025-3028, 2004.
- Masato Nakayama, Yuki Denda, Takanobu Nishiura, Hideki Kawahara, and Toshio Irino,
An Evaluation of In-CAR Speech Enhancement Techniques with Microphone Array Steering,
International Congress on Acoustics Kyoto, vol.4, 3041-3044, 2004.
2003年
- Hideki Kawahara and Toshio Irino,
Underlying principles of a high-quality speech manipulation system STRAIGHT and its application to speech segregation, Perspectives on Speech Separation ― a Workshop (NSF), October 31 - November 2, 2003,Montreal, Quebec (2003) [招待]
- Yuki Denda, Takanobu Nishiura, and Hideki Kawahara, "Noisy Speech Recognition with Microphone Array Steering and Fourier/Wavelet Spectral Subtraction, " Proc. IEEE International Workshop on Statistical Signal Processing (SSP), St Louis, Missouri, pp. 573--576, U.S.A, Sep.-Oct. 2003.
- Masato Nakayama, Takanobu Nishiura, Hideki Kawahara, "Adaptive Beamformer Based on Average Vowel / Consonant Spectrum with Phoneme Identification, " International Workshop on Acoustic Echo and Noise Control (IWAENC2003), Kyoto, Japan, Sep. 2003.
- Hideki Kawahara, Exemplar-based Voice Quality Analysis and Control using a High Quality Auditory Morphing Procedure based on STRAIGHT,
VOQUAL'03, ISCA Tutorial and Research Workshop, Geneva, August 27-29, 2003, pp.109-114.
- Hiroaki Kato, Masumi Nukina, Hideki Kawahara, Reiko Akahane-Yamada,
Influence of Recording Equipment on the Identification of Second Language Phoneme Contrasts.
Prod. Eurospeech'03, pp. 3157-3160, 2003.
- Toshio Irino, Roy D. Patterson, Hideki Kawahara,
Speech Segregation Based on Fundamental Event Information Using an Auditory Vocoder
Prod. Eurospeech'03, pp. 553-556, 2003.
- Hisami Matsui, Hideki Kawahara,
Investigation of Emotionally Morphed Speech Perception and its Structure Using a High Quality Speech Manipulation System
Prod. Eurospeech'03, pp. 2113-2116, 2003.
- Yuki Denda, Takanobu Nishiura, Hideki Kawahara,
Speech Enhancement with Microphone Array and Fourier / Wavelet Spectral Subtraction in Real Noisy Environments,
Prod. Eurospeech'03, pp. 2153-2156, 2003.
- Parham Zolfaghari, Tomohiro Nakatani, Toshio Irino, Hideki Kawahara, Fumitada Itakura,
Glottal Closure Instant Synchronous Sinusoidal Model for High Quality Speech Analysis/Synthesis,
Prod. Eurospeech'03, pp. 2441-2444, 2003.
- Hideki Kawahara: Exemplar-based Voice Quality Analysis and Control
using a High Quality Auditory Morphing Procedure based on STRAIGHT,
VOQUAL'03, ISCA Tutorial and Research Workshop, Geneva, August 27-29, 2003, pp.109-114.
- Yuko Sogabe, Kazuhiko Kakehi and Hideki Kawahara,
Psychological evaluation of emotional speech using a new morphing method,
4th ICCS International Conference on Cognitive Science, Sydney Australia,
13-17 July, 2003.
- Hideki Kawahara and Hisami Matsui: AUDITORY MORPHING BASED
ON AN ELASTIC PERCEPTUAL DISTANCE METRIC IN AN INTERFERENCE-FREE
TIME-FREQUENCY REPRESENTATION, Proc. ICASSP'2003, vol.I, pp.256-259,
2003.
- Toshio Irino, Roy Patterson and Hideki Kawahara: SPEECH SEGREGATION
USING EVENT SYNCHRONOUSAUDITORY VOCODER, ICASSP'2003, Hong Kong,
6-10 April 2003.
2002年
- Hideki Kawahara, Parham Zolfaghari and Alain de Cheveigne, "On F0 Trajectory for very high-quality speech manipulation" ICSLP'2002,
- Toshio Irino, Roy D. Patterson , and Hideki Kawahara, "Auditory vocoder to playback sound from an auditory Mellin representation," Dynamics of Speech Production and Perception, NATO Advanced Study Institute , Il Ciocco, Itary, 24 June - 6 July, 2002.
- Toshio Irino, Roy D. Patterson , and Hideki Kawahara, "Auditory VOCODER: Speech resynthesis from an auditory Mellin representation," Proc. International Conference on Acoust. Signal, and Speech Processing, ICASSP 2002 , vol. II, pp.1921-1924, Orlando, Florida, USA., 13-17 May 2002.
2001年
- H. Kawahara, Jo Estill and O. Fujimura: Aperiodicity extraction
and control using mixed mode excitation and group delay manipulation
for a high quality speech analysis, modification and synthesis
system STRAIGHT, MAVEBA 2001, Sept.13-15, Firentze Italy, 2001.
- H. Kawahara and P Zolfaghari: Systematic F0 glitches around
vowel nasal transitions, EUROSPEECH'2001, pp.2459-2462, 2001.
- Toshio Irino, Roy D. Patterson , and Hideki Kawahara, "Sound resynthesis from Auditory Mellin Image using STRAIGHT," CRAC (Consistent and Reliable Acoustic Cues for sound analysis) workshop , Aalborg, Denmark,
2nd Sept. 2001
2000年
- Hideki Kawahara, Yoshinori Atake and Parham Zolfaghari: Accurate
vocal event detection method based on a fixed-point to weighted
average group delay, ICSLP-2000, Beijing, pp.664-667 2000.
- Parham Zolfaghari, Yoshinori Atake, Kiyohiro Shikano, Hideki Kawahara "Investigation of analysis and synthesis parameters of STRAIGHT by
subjective evaluation" ICSLP-2000, Beijin (2000)
1999年
- Hideki Kawahara, Haruhiro Katayose, Alain de Cheveigne, Roy
D. Patterson: Fixed Point Analysis of Frequency to Instantaneous
Frequency Mapping for Accurate Estimation of F0 and Periodicity
, Proc. EUROSPEECH'99, Volume 6, Page 2781-2784 (1999).
- H. Katayose and H. Kawahara : Applying STRAIGHT toward Music Systems - Accurate F0 Estimation and -Application for Data-driven Synthesis -, Proc.Intl. Computer Music Conf., pp.514-517 (1999)
1998年
- A. K. Barros, H. Kawahara and N. Ohnishi, "Heart Rate
Variability Calculation: A Non conventional Approach for Saving
Memory" Proc. of Computers in Cardiology (1998.9), Cleveland,
USA,
- Hideki Kawahara, Alain de Cheveigne and Roy D. Patterson:
``An instantaneous-frequency-based pitch extraction method for
high-quality speech transformation: revised TEMPO in the STRAIGHT-suite'',
Proc. 5th Int. Conf. on Spoken Language Processing (ICSLP '98),
Sudney, (1998.12).
- Reiko Akahane-Yamada, Erik McDermot, Takahiro Adachi, Hideki
Kawahara and Jecsica Pruitt: ``Computer-based second language
production training by using spectrographic representation and
HMM-based speech recognition scores'', Proc. 5th Int. Conf. on
Spoken Language Processing (ICSLP '98), Sudney, (1998.12).
- John S Pruitt, Hideki Kawahara, Reiko Akahane-Yamada and
Rieko Kubo:` `Methods of enhancing speech stimuli for perceptual
training: Exaggerated articulation, context truncation, and "STRAIGHT"
resynthesis'', ESCA workshop STiLL (Speech Technology in Language
Learning) , Stokholm, (1998.5).
- Reiko Akahane-Yamada, Takahiro Adachi, Hideki Kawahara, John
S Pruitt and Erik McDermott: ``Toward the optimization of computer-based
second-language production training'', ESCA workshop STiLL (Speech
Technology in Language Learning) , Stokholm, (1998.5).
- Hideki Banno, J. Ju, Satoshi Nakamura, Kiyohiro Shikano and
Hideki Kawahara: ``Efficient Representation of Short-time Phase
Based on Group Delay'', ICASSP'98, Seattle, vol.2, pp.861-864 (1998.5).
1997年
- Hideki Kawahara: ``STRAIGHT-TEMPO: A Universal Tool to Manipulate
Linguistic and Paralinguistic Speech Information,'' Proc. System
Man and cybernetics 97, Olrand Florida, vol.2, pp.1620-1625 (1997.10)
- Alain de Cheveigne and Hideki Kawahara: ``Modeling the perception
of multiple pitches,'' IJCAI-CASA workshop on Auditory Scene
Analysis, Nagoya, (1997.8).
- Ikuyo Masuda-Katsuse, Hideki Kawahara and Kiyoaki Aikawa:
``Speech segregation based on continuity of spectral shapes,''
IJCAI-CASA workshop on Auditory Scene Analysis, Nagoya, (1997.8).
- Hideki Kawahara, Ikuyo Masuda-Katsuse and Alain de Cheveigne:
``Restructuting speech representations using STRAIGHT-TEMPO:
Possible role of a repetitive structure in sounds,'' IJCAI-CASA
workshop on Auditory Scene Analysis, Nagoya, (1997.8).
- Hideki Kawahara: ''Speech Representation and Transformation
using Adaptive Interpolation of Weighted Spectrum: VOCODER Revisited,''
Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing
(ICASSP '97) , vol.2, pp.1303-1306 (1997.4)
- Hideki Kawahara: ''Visual Representations of Interactions
between Speech Perception and Production,'' Int. Symposium on
Simulation, Visualization and Auralization for Acoustic Research
and Education (ASVA'97) (1997.4)
- Reiko Akahane-Yamada, Takahiro Adachi and Hideki Kawahara:
``Toward the optimezation of second language speech training,''
Int. Symposium on Simulation, Visualization and Auralization
for Acoustic Research and Education (ASVA'97) (1997.4)
1996年
- Kiyoaki AIKAWA (NTT), Minoru TSUZAKI and Hideki KAWAHARA:
''Dynamic Perception of Frequency-Modulated Tones,'' J. Acoust.
Soc. Am., Vol.100, No.4, Pt.2, 4aPP2, p,2750 (1996.10) Proc.
ASA/ASJ Third Joint Meeting, 4aPP2, pp.691-694 (1996.12)
- Hideki KAWAHARA and Kiyoaki AIKAWA (NTT): ''Contributions
of Auditory Feedback Frequency Components on F0 Fluctuations,''
J. Acoust. Soc. Am., Vol.100, No.4, Pt.2, 5aSC12, p.2825 (1996.10)
Proc. ASA/ASJ Third Joint Meeting, 5aSC12, pp.1177-1182 (1996.12)
- Hideki KAWAHARA, Hiroko KATO and Julia C. WILLIAMS (Ohio
State Univ.): ''Effects of Auditory Feedback on F0 Trajectory
Generation,'' Proc. 4th Int. Conf. on Spoken Language Processing
(ICSLP '96), pp.287-290 (1996.10)
- Kiyoaki AIKAWA, Hideki KAWAHARA and Minoru TSUZAKI: ''A Neural
Matrix Model for Active Tracking of Frequency-Modulated Tones,''
Proc. 4th Int. Conf. on Spoken Language Processing (ICSLP '96),
pp.578-581 (1996.10)
- Hiroko KATO and Hideki KAWAHARA: ''For the Relationship of
Human Speech Production and Perception,'' Proc. 2nd IFMBE-IMIA
Int. Workshop on Biosignal Interpretation (BSI96), pp.191-194
(1996.9)
- Hideki KAWAHARA and Julia C. WILLIAMS (Ohio State Univ.):
''Effects of Auditory Feedback on Voice Pitch Trajectories: Characteristic
Responses to Pitch Perturbations,'' Pamela J. Davis, Neville
H. Fletcher (eds.), Vocal Fold Physiology, Chapter 18, pp.263-278,
Singular Publishing Group, Inc. (1996.9)
- Hideki KAWAHARA: ''Auditory Effects on Speech Production:
An Alternative Approach to Pitch Perception Mechanisms,'' Proc.
ESCA Workshop on the Auditory Basis of Speech Perception, pp.144-147
(1996.7)
1995年
- Hideki KAWAHARA : ``Hearing Voice: Transformed Auditory Feedback
Effects on Voice Pitch Control,'' {\em Proc. IJCAI'95 Workshop
on Computational Auditory Scene Analysis }, pp. 143-148 (1995.8),
- Hideki KAWAHARA and J. C. WILLIAMS : ``Effects of Auditory
Feedback on Voice Pitch Trajectories: Response Characteristics
to Pitch Perturbations,'' {\em The 9th Vocal Fold Physiology
Symposium}, (1995.5)
1994年以前
- Hideki KAWAHARA : ``Effects of Natural Auditory Feedback
on Fundamental Frequency Control,'' {\em Proc. 3rd Int. Conf.
on Spoken Language Processing (ICSLP'94)}, Vol.3, S24-2, pp.1399-1402
(1994.9)
- Kiyoaki AIKAWA, Harald SINGER, Hideki KAWAHARA and Yoh'ichi
TOHKURA : ``A Dynamic Cepstrum Incorporating Time-Frequency Masking
and Its Application to Continuous Speech Recognition,'' {\em
Proc. IEEE International Conference on Acoustics, Speech and
Signal Processing (ICASSP'93)}, Vol.II, pp.668-671 (1993.4),
- Toshio Irino and Hideki Kawahara : ``Signal Reconstruction
from Modified Wavelet Transform - An Application to Auditory
Signal Processing,'' Proc. IEEE International Conference
on Acoustics, Speech and Signal Processing (ICASSP'92), Vol.I,
pp.85-88 (1992.3),
- Kazuhiko Kakehi and Hideki Kawahara : ``Neural Network Research
in Japan'', Presented at the Lyon Conference: NEURAL NETWORKS
- Biological Computers of Electronic Brains, Lyon (France), (1990.3),
- Toshio Irino and Hideki Kawahara : ``Vowel-feature Extraction
from Cochlear Vibration using Neural Networks,'' {\em The first
meeting of the International Neural Networks Society (Neural
Networks)}, Vol.1, Suppl. 1, p.300, (1988.9),
その他国際会議、講演等
2013年
- Hideki Kawahara:
Interference-free representations of periodic signals for exprolatory research on speech communication,
BRAMS Open talk: International Laboratory for Brain, Music, and Sound Research, Center, Montreal, CA, 4, June 2013. (invited talk)
- Hideki Kawahara:
Interference-free representations of periodic signals for exprolatory research on speech communication,
HRC Seminar: Boston University Hearing Research, 24, May 2013. (invited talk)
2012年
- Hideki Kawahara:
Reading Matlab code of TANDEM-STRAIGHT and morphing procedures for signal processing engineers,
CSTR open talk series, April, 2012. (invited lecture)
- Hideki Kawahara:
Generating stimulus continuum for testing linguistic and para-linguistic distinction,
CSTR open talk series, April, 2012. (invited lecture)
2008年
- Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nisimura, Hideki Banno, Toshio Irino, A temporally stable representation of power spectra of periodic signals and its application to F0 and periodicity estimation, Acoustics'08 Paris, IpSCc24 (2008)
2007年
- Caroline Menzes, Donna Erickson, Kikuo Maekawa, Hideki Kawahara,
Experimental paradigm influence subjects's perception of attitudes,
the 154th ASA meeting, 3aSCb8, New Orleans (2007).
2006年
- Hideki KAWAHARA and Reiko Akahane-YAMADA:
STRAIGHT as a research tool for L2 study: How to manipulate segmental and supra-segmental features.
ASA and ASJ joint meeting, Hawaii, 2pSCb2 (2006) [invited]
- Hideki Kawahara, Osamu Fujimura and Yasuyuki Konparu:
Voice quality of artistic expression in Noh: An analysis-synthesis study on source-related parameters,
ASA and ASJ joint meeting, Hawaii, 1pMU1 (2006) .
- Reiko Akahane-Yamada, Takahiro Adachi and Hideki Kawahara:
Tools for speech perception, production, and training studies: Web-based second language training system, and a speech resynthesis system,
ASA and ASJ joint meeting, Hawaii, 2aED9 (2006) .
- Makio Kashino, Hideki Kawahara and Hiroshi Riquimaroux:
Wonders in perception and manipulation of speech,
ASA and ASJ joint meeting, Hawaii, 2aED12 (2006) .
- Masanori Morise, Toshio Irino, Hideki Banno and Hideki Kawahara:
Warped-time-stretched pulse: An acoustic test signal robust against ambient noise,
ASA and ASJ joint meeting, Hawaii, 4AAA1 (2006) .
- Hideki Kawahara:
A precursor to ecologically relevant speech science,
Proc. WESPAC IX 2006, 26-28 June, Seoul (June 2006) [Plenary lecture]
2005年
- Hideki Kawahara,
STRAIGHT: A high-quality speech manipulation system,
CAHR TDU, Copenhagen, Denmark, 14 September (2005). [invited lecture]
- Y. Denda, T. Nishiura, H. Kawahara, T. Irino: A study of talker localization based on subband CSP analysis in real noisy environments, Nonlinear Signal and Image Processing, 2005. NSIP 2005. Abstracts. IEEE-Eurasip, 18-20 May, pp.27-27 (2005)
DOI: 10.1109/NSIP.2005.1502265
- Hideki Kawahara,
Recent developments in STRAIGHT,
CNBH, Cambridge University, United Kingdom, 9 September (2005). [invited lecture]
- Hideki Kawahara,
STRAIGHT: High-quality speech manipulation system,
Winter school in speech science, UFMG, Belo Horizonte, Brazil, 11 Augst (2005). [invited lecture]
- Hideki Kawahara, "Manipulating the pulse rate and resonance scale
in speech and animal calls," J. Acoust. Soc. Am. , 117(4),
Pt.2, p.2373, May 2005. [invited]
2004年
- Hideki Kawahara, Hideki Banno, Toshio Irino and Parham Zolfaghari,
Filling the gap between speech processing models,
Special Workshop in MAUI (SWIM), Maui Island, U.S.A, Session1.7, Jan. 2004.
- Yuki Denda, Takanobu Nishiura, Hideki Kawahara and Toshio Irino, "Performance Evaluation of Wavelet Spectral Subtraction in Noisy Speech Environment, " Special Workshop in MAUI (SWIM), Maui Island, U.S.A, Session 2.7, Jan. 2004.
2003年
- Masanori Morise and Hideki Kawahara
Logarithmic temporal axis manipulation and its application for measuring perceptually salient acoustic features of loudspeakers based on multiple observations
J. Acoust. Soc. Am. 114, 2460 (2003)
- Ryuichiro Yanaga and Hideki Kawahara
Logarithmic temporal axis manipulation and its application for measuring auditory contributions in F0 control using a transformed auditory feedback procedure
J. Acoust. Soc. Am. 114, 2458 (2003)
- Hideki Kawahara, Encoding and manipulating speaker size information with STRAIGHT,
CNBH workshop on Source Size Information in Speech and Music,
Cambridge UK, 8-10 September, 2003.
- Hideki Kawahara and Ryuichiro Yanaga, "Filtering on Non-Linear Time Axis and its Application for Measuring Perception to Production Transfer Functions in F0 Control," Speech Dynamics by Ear, Eye, Mouth and Machine, Kyoto, Japan, June 2003
2002年
- Hisami Matsui and Hideki Kawahara:
Auditorily motivated elastic spectral distance and its application
to emotional morphing of portrayal speech,
FIRST PAN-AMERICAN/IBERIAN MEETING ON ACOUSTICS, 2-6 December 2002, Cancun,
(J. Acoust. Soc. Am. 112, 2323 (2002)).
- Masumi Nukina and Hideki Kawahara, "Cross spectral measurement of head related speech transfer functions using speaker's own voice ",
FIRST PAN-AMERICAN/IBERIAN MEETING ON ACOUSTICS, 2-6 December 2002, Cancun,
(J. Acoust. Soc. Am. 112, 2323 (2002)).
- Masato Nakayama, Takanobu Nishiura, and Hideki Kawahara, "Adaptive Beamformer Based on Average Vowels/Consonant Spectrum Weights for Noisy Speech Recognition, "
FIRST PAN-AMERICAN/IBERIAN MEETING ON ACOUSTICS, 2-6 December 2002, Cancun,
(J. Acoust. Soc. Am. 112, 2324 (2002)).
- Hideki Kawahara, Systematic downgrading for investigating ``naturalness'' in synthesized singing using STRAIGHT: A high quality VOCODER" 143rd meeting of the Acoust. Soc. Amer., Pittsburgh, June 3-7,
(2002) (Invited Talk)
- Parham Zorfaghari, Hideki Banno and Fumitada Itakura and Hideki Kawahara: Event synchronous sinusoidal model based on frequency-to-instantaneous frequency mapping, 143rd meeting of the Acoust. Soc. Amer., Pittsburgh, June 3-7, Vol.111(5), p.2478
(2002).
2001年
- Hideki Kawahara: Extraction and generation of aperiodic component in speech sounds ,
The Journal of the Acoustical Society of America , Volume 110, Issue 5, p. 2775 (2001).
- Hideki Kawahara and Haruhiro Katayose: Scat singing generation using a versatile speech manipulation system, STRAIGHT ,
The Journal of the Acoustical Society of America, Volume 109, Issue 5, pp. 2425-2426 (2001).
- Alain de Cheveigne and Hideki Kawahara: Running autocorrelation method of F0 estimation ,
The Journal of the Acoustical Society of America, Volume 109, Issue 5, p. 2417 (2001).
1999年
- Hideki Kawahara: Applications of a high-quality sound manipulation algorithm STRAIGHT for animal voices,138th MEETING OF THE ACOUSTICAL SOCIETY OF AMERICA,Columbus Ohio (USA),Vol.106 (4), pp2129, (1999) (Invited Talk)
- Hideki Kawahara and Parham Zolfaghari: Comparative study of F0 extractors for high-quality speech synthesis ,
The Journal of the Acoustical Society of America, Volume 106, Issue 4, p. 2182 (1999).
- Rieko Kubo, Reiko Akahane-Yamada and Hideki Kawahara: Using resynthesized speech in /r/−/l/ production and perception training, The Journal of the Acoustical Society of America, Volume 106, Issue 4, p. 2150 (1999).
1998年
- Hideki Kawahara and Reiko Akahane-Yamada: Perceptual effects of spectral envelope and F0 manipulations using the STRAIGHT method, The Journal of the Acoustical Society of America, Volume 103, Issue 5, p. 2776 (1998).
- Yasuji Sawada and Hideki Kawahara:``Brain Creators: Japanese
Initiative to Create Computational Models of Brain Functions'',
ICONIP'98 Special Panel Session, Kitakyusyu Japan, (1998.10).
- Hideki Kawahara and Reiko Akahane-Yamada: ``Perceptual effects
of spectral envelope and F0 manipulations using the STRAIGHT
method'', the ICA/ASA `98 meeting,1aSC27, Seattle, (1998.6).
- Hideki Kawahara: ``An extremely high-quality VOCODER for
speech and auditory perception research'', NATO-ASI meeting on
Computational Hearing, (1998.7).
1996年
- Hideki Kawahara and Kiyoaki Aikawa: Contributions of auditory feedback frequency components on F0,
The Journal of the Acoustical Society of America, Volume 100, Issue 4, p. 2825 (1996.10).
- Kiyoaki Aikawa and Hideki Kawahara: Dynamic perception of frequency-modulated tones,
The Journal of the Acoustical Society of America, Volume 100, Issue 4, p. 2750 (1996.10).
- Kiyoaki AIKAWA and Hideki KAWAHARA: ''A Neural Computational
Model for Tracking of Multiple Frequency-Modulated Tones,'' J.
Acoust. Soc. Am., Vol.99, No.4, Pt.2, p.2490 (1996.4)
1995年
- Kiyoaki AIKAWA, Minoru TSUZAKI, Hideki KAWAHARA and Yoh'ichi
TOHKURA: ''Pitch Ringing Induced by Frequency-Modulated Tones,''
J. Acoust. Soc. Am., Vol.98, No.5, Pt.2, p.2926 (1995.11)
- Kiyoaki AIKAWA, Minoru TSUZAKI and Hideki KAWAHARA: ''Psychoacoustic
Analysis of the Sweep Tone Tracking Process,'' IBRO Satellite
Symposium"Processing in Auditory and Language Cortex: Katsuki
Memorial" Program and Abstract, p.29 (1995.7)
- Hideki KAWAHARA and J. C. WILLIAMS (OSU): ''Hearing Voice:
Investigating Auditory Functions through Interactions between
Speech Perception and Production,'' IBRO Satellite Symposium"
Processing in Auditory and Language Cortex: Katsuki Memorial"
Program and Abstract, p.36 (1995.7)
1993年以前
- Hideki KAWAHARA : Transformed Auditory Feedback: Effects
of Fundamental Frequency Perturbation, Journal of the Acoustical
Society of America, Vol.94, No.3, Pt.2, p.1883 (1993.10),
- Kazuaki OBARA, Kiyoaki AIKAWA and Hideki KAWAHARA : ``Speaker-Independent
Speech Recognition Using an Auditory Model Front End Incorporating
the Spectro-Temporal Masking Effect,'' Journal of the Acoustical
Society of America, Vol. 93, No.4, Pt.2, p.2319 (1993.5),
- Kazuaki OBARA, Kiyoaki AIKAWA and Hideki KAWAHARA : ``Word
Recognition Using an Auditory Model Front-End Incorporating Spectrotemporal
Masking Effect,'' Journal of the Acoustical Society of America,
Vol. 92, No.4, Pt.2, p.2476 (1992.10),
- Kiyoaki AIKAWA, Hideki KAWAHARA and Yoh'ichi TOHKURA : ``Dynamic
Cepstral Parameter Incorporating Time-Frequency Masking and Its
Application to Speech Recognition,'' Journal of the Acoustical
Society of America, Vol.92. No.4, Pt.2, p.2476 (1992.10),
- Hideki Kawahara : ``On Burst Detectability in Synthetic and
Natural Speech,'' Journal of the Acoustical Society of America,
Vol.80, Suppl.1, (1986.12),
解説論文等
- 河原 英紀:音声分析合成技術の動向、日本音響学会誌、Vol.67, No.1, pp.40-45 (2011).
- 鹿野、河原、西村他、総合報告:ユーザ負担のない話者・環境適応性を実現する自然な音声対話処理技術の総合開発、電子情報通信学会誌、Vol.92, No6, pp.475-491 (2009)
- 河原英紀:Vocoderのもう一つの可能性を探る--音声分析変換合成システムSTRAIGHTの背景と展開--,
日本音響学会誌,Vol.63,No.8,pp.442-449 (2007).
- Hideki Kawahara: STRAIGHT, Exploration of the other aspect of VOCODER:
Perceptually isomorphic decomposition of speech sounds,
Acoustic Science and Technology, Vol.27, No.6, (2006).[invited]
- 河原英紀、"聴覚フィードバックの発声への影響--ヒトは自分の話声を聞いているのか?", 日本音響学会誌、Vol.59, No.11,(2003).
- 千住真理子, 橘 秀樹, 小野隆彦, 河原英紀,"演奏者にとっての「実感」 −心の通い合う演奏を求めて−" 日本音響学会誌56巻5号, pp.367-371, May.2000
- 河原英紀:``自然性の極めて高い音声分析変換合成法''、音声研究、2巻,2号、pp. 28-36 (1998.8).
- 河原英紀:``聴覚の情景分析が生んだ高品質VOCODER: STRAIGHT''、日本音響学会誌、54巻、7号、pp.521-526
(1998.7).
- 河原英紀:``聴覚脳を創る--聴覚の機能を生態学的立場から再構築してみよう'',人工知能学会誌, Vol.11,
No.1, pp.43-44 (1998.1).
- 河原英紀:``声を使って聴覚を探る,''日本音響学会誌、53巻、9号、 pp.731-737 (1997.9)
- A. S. Bregman(McGill大学)(訳および概説:河原英紀): ''小特集―聴覚の情景分析―,'' 日本音響学会誌,
50巻, 12号, pp.1006-1010 (1994.12)
- 山田光穂(NHK)、伊藤崇之 (NHK)、河原英紀、大塚作一 : ``視聴覚技術,'' テレビジョン学会誌, Vol.
48, No.7, pp.821-827 (1994.7)
- 河原英紀 : ``音声の生成と知覚,'' テレビジョン学会誌, Vol.47, No.12, pp.1573-1578
(1993.12)
- 河原英紀 : ``聴覚の工学的表現,'' 電子情報通信学会誌, Vol.76, No.11, pp.1197-1202
(1993.11)
- 河原英紀 : ``ウェーブレット解析の聴覚研究への応用,'' 日本音響学会誌, 47巻, 6号, pp.424-429
(1991.6)
- 筧一彦、河原英紀 : ``ニューラルネットワークは音声認識に何をもたらすか''、 人工知能学会誌、Vol.4、No.2、pp.453-460、(1989.3)
- 河原英紀 : ``コネクショニズムの展望(IV)パターン処理の観点からの期待''、 情報処理学会誌、Vol.29、No.9、pp.1004-1008、(1988.9)
- 河原英紀 : ``神経回路網モデルと音声認識''、 人工知能学会誌、Vol.3、No.4、pp.453-460、(1988.7)
特許
- 特許開 昭55-114064 秘書電話装置 河原
- 特許願 平03-243725 信号処理方法 入野、河原
- 特許願 平03-265197 信号特徴点の抽出方法 河原、入野
- 特許願 平04-167832 音声認識方法 相川、河原、東倉
- 特許願 平06-299559 音質改善装置 相川、東倉、河原
- 特許願 平08-200845,08-344247 周期信号変換方法および音変換方法 河原、増田
- 特許願 平08- 平滑化スペクトログラムを用いる周期信号変換方法 増田、河原
- 特許願 平09-017505 信号分析方法および信号分析装置 河原、増田
- 特許開 平09-113550 時間変化パターン追跡方法 相川、河原
- 特許 3251555 信号分析装置、河原
- 特許開2001-249674 駆動信号分析装置、河原
著書等
- Hideki Kawahara, Verena G. Skuk, Voice Morphing, In Oxford Handbook of voice perception (Eds.) Pascal Berin and Sascha Fruehholz.
Oxford University Press, pp.685-706 (2018). (ISBN: 9780198743187)
- Hideki Kawahara:
Temporally Variable Multi attribute Morphing of Arbitrarily Many Voices for Exploratory Research of Speech Prosody,
in Speech Prosody in Speech Synthesis: Modeling and generation of prosody for high quality and flexible speech synthesis,
(eds. Keikichi Hirose and Jianhua Tao),
Springer Berlin Heidelberg, pp.109-120, 2015.
(DOI:10.1007/978-3-662-45258-5_8 ).
- Hideki Kawahara, Masanori Morise, Toru Takahashi, Ryuichi Nisimura, Hideki Banno, Toshio Irino,:
STRAIGHT, a framework for speech analysis, modification and synthesis, in
Computer processing of Asian spoken languages, (eds. Shuichi Itahashi and Chiu-yu Tseng),
Consideration Books, Los Angeles, March 2010.
- Kazuhiko Kakehi, Yuko Sogabe, Hideki Kawahara,
Research on Emotional Perception of voices based on a morphing method,
in Emotions in the Human Voice: Culture And Perception, K. Izdebski (Ed.) Plural Publishing, San Diego, pp.1-14 (2008.7)
- Hideki Kawahara and Toshio Irino,
Underlying principles of a high-quality speech manipulation system STRAIGHT and its application to speech segregation,
in Speech separation by human and machines, ed. Pierre Divenyi,
Kluwer Academic Publisher (2005.1).
- R. E. Turner, M. A. Al-Hames, D. R. R. Smith, H. Kawahara, T. Irino, and R. D. Patterson ,
Vowel normalization: Time-domain processing of the internal dynamics of speech,
(eds. P. Divenyi),NATO Science Series (2004.10)
- Hidek Kawahara, STRAIGHT: An extremely high-quality VOCODER for auditory and speech perception research, in Computational Models of Auditory Function (Eds. Greenberg and Slaney), IOS Press, pp.343-354 (2001)
- デザイン情報学入門、9章「音のデザイン」、日本規格協会(2000年)
- 脳を知る・創る・守る,3章1. 「聴覚の情景分析と計算論」, クバプロ, pp.82-93 (1999年)
- 視覚認知と聴覚認知、1章「聴覚心理」、オーム社(1998年)
- Computational Auditory Scene Analysis、22章、「Hearing Voice:
Transformed Auditory Feesback Effects on Voice Pitch Control」、Laurence
Erlbaum、pp.335-350 (1998年).
- 脳科学ハンドブック、5章「聴覚フィードバック」、11章「音声」、朝倉書店 (1999)
- Volcal Fold Physiology、18章、「Effects of Auditory Feedback
on Voice Pitch」、Singular Publishing Group、pp.263-278、(1996年)
- AI奇想曲、2章「やわらかいコンピュータ」
、NTT出版、pp.28-43、(1992)(PDFは、2001年の第2刷から)
その他
- 河原英紀, 「ゲーデル, エッシャー, バッハーあるいは不思議の環」, ダグラスR.ホフスタッター著, 野崎昭弘,はやしはじめ,柳瀬尚紀訳, 白揚社, 1985年(私のすすめるこの一冊), 日本音響学会誌, Vol.61, No.10, pp.618-619, 2005.
(PDF )
- 河原英紀: ''1996年音声言語処理国際会議(会議報告):「聴覚神経モデル」,'' 日本音響学会誌, 53巻,
3号, P.244 (1997.3)
- 河原英紀: ''音声知覚の聴覚的基礎(Auditory Basis of Speech Perception)ワークショップ,''
日本音響学会誌, 53巻, 1号, P.71 (1997.1)
- 河原英紀: ''聴覚の情景分析―聴覚の能動性と異種感覚との融合―,'' 超音波TECHNO, Vol.8, No.12,
pp.15-18 (1996.12)
- Hideki KAWAHARA: ''Recent Topics in Auditory Information
Processing -- Auditory Organization and its Implication --'',
Technical Digest of the 14th Sensor Symposium, B2-1, pp.87-92
(1996.6).
- 河原英紀: ''第1回計算機による聴覚的情景分析ワークショップ参加報告,'' 日本音響学会誌, 52巻, 1号,
p.68 (1996.1)
- 河原英紀: ''会議報告:「音声知覚・生成における生物学観点に関するATRワークショップ」,'' 日本音響学会誌,
51巻, 3号, p.240 (1995.3)
- 音声知覚研究支援用ソフト開発、(後に NTTアドバンスドテクノロジー(株)より 「音声工房」として発売)(1986-1987)
- 河原英紀、小坂直敏、筧一彦、磯部成二、佐藤征四郎 : ``サービス品質評価システム''、 研究実用化報告書(電電公社通研)、Vol.30、No.4、pp.949-968、(1981.4)
- 河原英紀、筧一彦 : ``ラウドネス客観計算法の検討''、 研究実用化報告書(電電公社通研)、Vol.30、No.4、pp.911-922、(1981.4)
ソフトウェア作品等
- STRAIGHT:聴覚の情景分析に基づく音声・音響処理システム。
戦略的基礎研究推進事業の中核システムとしてMatlab上に実装。(1997年)
- Spark:PC9801上に実装された音声知覚研究支援環境。後に「音声工房」としてNTTアドバンステクノロジより発売。
(1986年)
- お絵描き:当時3才と5才だった娘たちのために作成したペインティングプログラム。
JUNETのPDSとして公開。(1986年)
研究会、大会等
表彰
- 日本音響学会
佐藤記念論文賞 (1997)
- 電気通信協会 テレコム技術論文賞 (1997)
- ATR論文・発明表彰 (1998)
- EURASIP 1998,1999 best paper award (2000)
Last update:Sat Sep 29 07:33:25 JST 2018