Speechmatics’ AI deal a success in any language

16 Jan, 2025
Newsdesk
Cambridge speech technology company Speechmatics is collaborating with Santa Clara-based Ambarella – an edge AI semiconductor company – to bring AI-powered natural language interactions to edge applications.
Credit – jittawit21 / Shutterstock.com

Speechmatics’ technology is recognised for its ability to accurately understand speech in over 50 languages, regardless of accents or dialects. With the recent launch of Flow, the company has now moved into the world of voice-powered AI interactions.

The partnership with Ambarella has significant implications for multiple applications, including advanced robotics, autonomous driving, automotive in-cabin systems, smart cities, security and customer service.

Autonomous warehouse robots could combine visual object recognition with natural voice commands, allowing for more efficient and dynamic workflows. Similarly, in customer-facing scenarios, kiosks and smart assistants could respond to both verbal and visual cues to provide a more personalised and engaging experience.

Other applications include voice-activated assistants in remote locations, adaptive smart cameras that respond to voice and visual commands, as well as in-vehicle voice commands and verbal feedback.

Speechmatics’ technology running on Ambarella’s robust, low-power portfolio of CVflow® AI system-on-chips (SoCs) provides machines with groundbreaking capabilities to process complex speech and visual inputs on the fly.

The companies showcased the technology during CES in Las Vegas earlier this month, running locally on Ambarella’s AI SoCs without an internet connection. By combining Ambarella’s edge AI SoCs with Speechmatics’ foundational speech technology, users can now experience seamless, natural device interactions – even in environments without internet connectivity, the companies say.

Katy Wigdahl, CEO of Speechmatics, said: “Speechmatics’ conversational AI product, Flow, supports a wide range of speech-to-speech deployments, from on-camera to robotics and larger on-premise deployments in smart city use cases.

“This means users can benefit from the low latency and privacy intrinsic to edge computing, whilst still gaining the huge value of natural language interactions.

“It also gives users tight control over costs, which can be unpredictable with cloud deployments. This collaboration will redefine what’s possible in the fields of autonomous machines, smart cities and customer service.

“This partnership marks an exciting step forward for human-machine interaction. Speechmatics is supported on Ambarella’s entire portfolio of CVflow AI SoCs, which enables a huge range of devices with voice interactivity. We’re thrilled to work together to drive innovation in the edge AI space.”

Amit Badlani, Director of Generative AI and Robotics at Ambarella, added: “Our partnership with Speechmatics opens a new world of possibilities for natural language understanding at the edge.

“This is just the beginning. Ambarella is committed to advancing edge AI technologies, and we see this partnership as a launchpad for creating smarter, more adaptive solutions across robotics, industrial automation and smart cities.”