Introduction to the Special Section on Large-Scale Optimization for Audio, Speech, and Language Processing

  • Dimitri Kanevsky
  • Xiaodong He
  • Georg Heigold
  • Haizhou Li
  • Stephen Wright

IEEE Transactions on Audio, Speech, and Language Processing

Pattern recognition in audio, speech, and language processing requires estimating the parameters of statistical models by optimizing a training criterion. Formulating these optimization problems is far from straightforward. For example, likelihood criteria are usually inadequate when the training data do not represent all possible variations in the patterns. Discriminative training criteria have brought significant progress in pattern recognition, but overtraining remains a danger. An important formulation device is a regularization term in the optimization objective that captures the prior information available about parameter values and their relationships.
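
As a rough illustration of the regularization device mentioned above, a regularized training objective is often written in the generic form below. The symbols are illustrative assumptions rather than notation from the special section: $\theta$ denotes the model parameters, $(x_n, y_n)$ the $N$ training pairs, $L$ a per-example training loss (likelihood-based or discriminative), $R$ the regularizer, and $\lambda$ its weight.

```latex
\hat{\theta} \;=\; \arg\min_{\theta} \;
  \sum_{n=1}^{N} L\bigl(x_n, y_n; \theta\bigr) \;+\; \lambda\, R(\theta),
\qquad \lambda \ge 0
```

Under these assumptions, taking $L$ to be the negative log-likelihood and $\lambda R(\theta) = -\log p(\theta)$ for a prior $p(\theta)$ gives maximum a posteriori estimation. The common choice $R(\theta) = \lVert \theta \rVert_2^2$ then corresponds to a zero-mean Gaussian prior on the parameters.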