Portrait of Achraf Chalabi

Achraf Chalabi

Principal Research Software Development Engineer Manager


I’m an Architect at Microsoft Research, I’m particularly interested in:

  • Natural Language Processing
  • Information Extraction
  • Machine Translation
  • Speech Processing

I joined Microsoft Innovation Laboratory in Cairo (CMIC) in October 2008 as a Dev Lead. Prior to joining Microsoft, I had a pretty long and productive journey at Sakhr Software, the leading company in Arabic technologies including NLP, OCR, Speech and Search. At Sakhr, I gained an extensive experience in Arabic/English NLP, starting with the implementation of the first Arabic Morphological Analyzer, ending with the R&D of a two-way Arabic-English speech-to-speech translation system, and in between covering a broad spectrum of NLP tasks targeting both Arabic and English languages, including the Lexicalizer, Parser, POS tagger, Analysis and Transfer Grammars, and many of their direct applications such as the Diacritizer, Transliterator, Text Mining and Arabic<>English Machine Translation.
I was granted a US Patent in the field of NLP and more precisely in Word Sense Disambiguation. I was born in France, grew up in Zaire, and graduated from the computer engineering department at Ain Shams university, Cairo.


Arabic Toolkit Service (ATKS)

Established: December 12, 2013

Natural Language Processing (NLP) is a foundational infrastructure for processing written text. This processing revolves around text analysis and understanding. NLP serves a multitude of sophisticated tasks such as Text Search, Document Management, Automatic Translation, Proofreading, Text Summarization and many more. The Advanced Technology Lab in Cairo has developed the Arabic Toolkit Service (ATKS) as a set of NLP components targeting Arabic language. ATKS Components The component suite includes a full-fledged morphological analyzer (SARF), a spell-checker, an auto…










Professional Organizations

  • Member of the Scientific Committee for the Workshop “Second Workshop on Advances in Text Input Methods” in COLING, Mumbai 2012
  • Member of the program committee for the workshop “Arabic & Local Languages” at LREC2008 Marakech.
  • Member of the program committee for “The 6th International Conference on Informatics and Systems”, 2008, EGYPT
  • Member of the program committee for the workshop “Computational Approaches to Semitic Languages, ACL  2007” .
  • Member of the program committee for MT SUMMIT 2005 and 2007
  • Member of the Program Committee for the Workshop : Arabic Language Processing Text & Speech, JEP-TALN 2004
  • Member of the scientific committee at NEMLAR workshop2004, Cairo
  • Member of the Program Committee for the Workshop : MT for Semitic Languages, MT Summit 2003
  • Member of the Scientific Committee for the Workshop : Arabic Language Resources (LR) and
  • Evaluation , LREC 2002 Status and Prospects
  • Member of ACL
  • Participated in AMTA 2004, Georgetown University, Washington DC
  • Participated in NIST 2005,2006 and 2008 for MT Evaluation, Washington DC