Microsoft’s Project Oxford Gives Developers Access To Facial, Image And Speech-Recognition APIs


Microsoft quietly launched a set of new machine-learning APIs in beta under the “Project Oxford” moniker yesterday. These new APIs allow developers to add face detection and recognition features to their apps, as well as speech recognition with the ability to understand the speaker’s intent. The project also features a vision API for automatically categorizing images and creating smart image crops that keep the subject at the center of the cropped image.

These three services are now available as a public beta. There’s also a fourth API that lets developers build custom language understanding into their applications.

Previously, Microsoft offered a set of similar APIs under the Bing brand. Bing offers a speech and a translator API, for example, but for the most part these Bing services are more basic and search-focused than the Project Oxford tools.

To showcase Project Oxford’s Face API, Microsoft built this site…

