Skip to main content

Kazuhito Koishida

Principal Lead Scientist

Kazuhito Koishida
Kazuhito Koishida

I am a Principal Lead Scientist at Applied Sciences Group in Experiences + Devices organization. I have been with Microsoft since 2000. My area of interests is in signal processing and machine learning for audio, speech, computer vision, and other sensor data.

Past projects

  • Audio and voice compression: Bitrate/bandwidth scalable codec, MELP codec at 1.2kbps, and Windows Media Audio and Voice codec
  • Audio matching: Voice note application and music recognition service
  • Microphone array processing: Beamforming and sound source localization
  • Audio/voice detection and recognition: Keyword spotting and speaker identification
  • Speech enhancement: Audio/visual fusion and bandwidth expansion

Education

  • B.S degree in Electrical Engineering from the Tokyo Institute of Technology, Japan, in 1994
  • M.S. degree in Electrical Engineering from the Tokyo Institute of Technology, Japan, in 1995
  • Ph.D. degree in Electrical Engineering from the Tokyo Institute of Technology, Japan, in 1998. Dissertation title: Speech Coding Based on Mel-Generalized Cepstral Analysis
  • Post doctoral researcher at Signal Compression Lab in the University of California, Santa Barbara, 1998-2000
  • The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) June, 2020
  • Proceedings of the 37th International Conference on Machine Learning (ICML), Vienna, Austria July, 2020
    By Saeed Amizadeh, Kazuhito Koishida, Hamid Palangi, Oleksandr Polozov, Yichen Huang
  • Adversarial Training for Speech Super-Resolution
    IEEE Journal of Selected Topics in Signal Processing May, 2019 Vol. 13, No. 2 Pages 347-358
    By Sefik Emre Eskimez, Kazuhito Koishida, Zhiyao Duan
  • Speech Super Resolution Generative Adversarial Network
    Proceedings, 2019 International Conference on Acoustics, Speech and Signal Processing (ICASSP) May, 2019 Pages 3717-3721
    By Kazuhito Koishida, Sefik Emre Eskimez
  • Text Independent Speaker Verification Based on Triplet Convolutional Neural Network Embeddings
    IEEE/ACM Transactions on Audio, Speech, and Language Processing September, 2018 Vol. 26, No. 9 Pages 1633-1644
    By Chunlei Zhang, Kazuhito Koishida, John H. L. Hansen
  • End-to-End Text-Independent Speaker Verification with Flexibility in Utterance Duration
    Proceedings, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) December, 2017 Pages 584-590
    By Kazuhito Koishida, Chunlei Zhang
  • End-to-End Text-Independent Speaker Verification with Triplet Loss on Short Utterances
    Proceedings, Interspeech 2017 2017 Pages 1487-1491
    By Kazuhito Koishida, Chunlei Zhang
  • Hybrid Low Bitrate Audio Coding Using Adaptive Gain Shape Vector Quantization
    Proceedings, 2008 IEEE 10th Workshop on Multimedia Signal Processing October, 2008 Pages 927-932
    By Kazuhito Koishida, Sanjeev Mehrotra, Wei-ge Chen, Naveen Thumpudi
  • A 1200/2400 BPS Coding Suite Based on MELP
    Proceedings, 2002 IEEE Workshop on Speech Coding October, 2002 Pages 90-92
    By Kazuhito Koishida, Tian Wang, Vladimir Cuperman, Allen Gersho, J.S. Collura
  • Vector Quantization of Speech Spectral Parameters Using Statistics of Static and Dynamic Features
    IEICE Transactions on Information and Systems October, 2001 Vol. E84-D, No. 10 Pages 1427-1434
    By Kazuhito Koishida, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi
  • A 16 kb/s Wideband CELP-based Speech Coder Using Mel-Generalized Cepstral Analysis
    IEICE Transactions on Information and Systems April, 2000 Vol. E83-D, No. 4 Pages 876-883
    By Kazuhito Koishida, Gou Hirabayashi, Keiichi Tokuda, Takao Kobayashi
  • A 16-kbit/s Bandwidth Scalable Audio Coder Based on the G.729 Standard
    Proceedings, 2000 International Conference on Acoustics, Speech and Signal Processing (ICASSP) June, 2000 Pages 1149-1152
    By Kazuhito Koishida, Vladimir Cuperman, Allen Gersho
  • A 1200 BPS Speech Coder Based on MELP
    Proceedings, 2000 International Conference on Acoustics, Speech and Signal Processing (ICASSP) June, 2000 Pages 1375-1378
    By Kazuhito Koishida, Tian Wang, Vladimir Cuperman, Allen Gersho, J.S. Collura
  • Enhancing MPEG-4 CELP by Jointly Optimized Inter/Intra-frame LSP Predictors
    Proceedings, 2000 IEEE Workshop on Speech Coding September, 2000 Pages 90-92
    By Kazuhito Koishida, Tian Wang, Vladimir Cuperman, Allen Gersho, Jan Lindén
  • CELP Speech Coding Based on Mel-Generalized Cepstral Analysis
    IEICE Transactions on Information and Systems February, 1998 Vol. J81-A, No. 2 Pages 252-260
    By Kazuhito Koishida, Keiichi Tokuda, Satoshi Imai, Takao Kobayashi
  • A 16 kbit/s Wideband CELP Coder Using Mel-Generalized Cepstral Analysis and Its Subjective Evaluation
    Proceedings, 5th International Conference on Spoken Language Processing (ICSLP '98) 1998 Vol. 6 Pages 2583-2586
    By Kazuhito Koishida, Gou Hirabayashi, Keiichi Tokuda, Takao Kobayashi
  • A Wideband CELP Speech Coder at 16 kbit/s Based on Mel-Generalized Cepstral Analysis
    Proceedings, 1998 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) May, 1998 Vol. 1 Pages 161-164
    By Kazuhito Koishida, Gou Hirabayashi, Keiichi Tokuda, Takao Kobayashi
  • Low Bit Rate Speech Coding Based on Mel-Generalized Cepstral Analysis
    Tokyo Institute of Technology 1998
  • Spectral Representation of Speech Based on Mel-Generalized Cepstral Coefficients and Its Properties
    IEICE Transactions on Information and Systems November, 1997 Vol. J80-A, No. 11 Pages 1999-2006
    By Kazuhito Koishida, Keiichi Tokuda, Satoshi Imai, Takao Kobayashi
  • Spectral Quantization Using Statistics of Static and Dynamic Features
    Proceedings, 1997 IEEE Workshop on Speech Coding for Telecommunications September, 1997 Pages 19-20
    By Kazuhito Koishida, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi
  • Efficient Encoding of Mel-Generalized Cepstrum for CELP Coders
    Proceedings, 1997 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) April, 1997 Vol. 2 Pages 1355-1358
    By Kazuhito Koishida, S. Imai, Keiichi Tokuda, Takao Kobayashi
  • CELP Coding System Based on Mel-Generalized Cepstral Analysis
    Proceedings, 4th International Conference on Spoken Language Processing (ICSLP '96) October, 1996 Vol. 1 Pages 314-317
    By Kazuhito Koishida, S. Imai, Keiichi Tokuda, Takao Kobayashi
  • CELP Coding System Based on Mel-Cepstral Analysis
    Proceedings, 1995 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) October, 1995 Vol. 1 Pages 33-36
    By Kazuhito Koishida, S. Imai, Keiichi Tokuda, Takao Kobayashi

Contact