What is the difference between System.Speech.Recognition and Microsoft.Speech.Recognition?

The short answer is that Microsoft.Speech.Recognition uses the Server version of SAPI, while System.Speech.Recognition uses the Desktop version of SAPI.

The APIs are mostly the same, but the underlying engines are different. Typically, the Server engine is designed to accept telephone-quality audio for command & control applications; the Desktop engine is designed to accept higher-quality audio for both command & control and dictation applications.

You can use System.Speech.Recognition on a server OS, but it’s not designed to scale nearly as well as Microsoft.Speech.Recognition.

The differences are that the Server engine won’t need training, and will work with lower-quality audio, but will have a lower recognition quality than the Desktop engine.

Leave a Comment