Meet Amazon’s New Voice Recognition Technology

Not content with knowing what you eat for breakfast, Amazon launched three new products on Thursday that can tell what you, a human, are saying; work out exactly what you meant by it; and translate it into over a hundred languages.

Taking the stage at AWS re:Invent, Amazon CTO Werner Vogels revealed Amazon Transcribe, Translate and Comprehend to a packed auditorium of developers.

The trio will be available soon as a preview, and developers will be able to integrate the new products into their own apps and services sometime in 2018. Here’s exactly what Amazon’s new services do, and how your business can use them.

It starts with Transcribe

Audio goes in, words come out. At a massive scale.

Amazon Transcribe can take just about any MP3, WAV or FLAC file – even low-bitrate files – and spit out accurate text. And for the first time, Amazon’s machine learning means that we get pretty accurate punctuation too.

The uses for a good transcription API are million-fold, and the service is a huge step forward for audio automation – and bad news for the call-centre workers who just got replaced by 22 lines of code. Low-bitrate support is code for telephone-quality audio support, so now your company can generate context-aware, timestamped logs of every customer support call.
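To make "audio goes in, words come out" a little more concrete, here is a minimal Python sketch of how a support-call recording might be submitted for transcription. It is an illustration under assumptions, not Amazon's reference code: the parameter names follow the `start_transcription_job` call in the boto3 SDK, which may differ from what was offered in the preview, and the helper names, bucket and file names are hypothetical.

```python
from pathlib import PurePosixPath

# The three formats named in the article. Mapping file extensions to
# MediaFormat strings this way is an assumption made for illustration.
SUPPORTED_FORMATS = {".mp3": "mp3", ".wav": "wav", ".flac": "flac"}


def build_transcription_request(job_name, media_uri, language_code="en-US"):
    """Build the keyword arguments for a start_transcription_job call."""
    suffix = PurePosixPath(media_uri).suffix.lower()
    if suffix not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported audio format: {suffix or 'none'}")
    return {
        "TranscriptionJobName": job_name,
        "LanguageCode": language_code,
        "MediaFormat": SUPPORTED_FORMATS[suffix],
        "Media": {"MediaFileUri": media_uri},
    }


def start_job(client, job_name, media_uri):
    """Submit the job through any client exposing start_transcription_job."""
    request = build_transcription_request(job_name, media_uri)
    return client.start_transcription_job(**request)
```

With the boto3 package installed and AWS credentials configured, `start_job(boto3.client("transcribe"), "call-0001", "s3://example-bucket/calls/call-0001.wav")` would queue the job. Transcription is asynchronous, so the text itself has to be fetched once the job completes.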

Marketing videos – especially those aimed at commuters using mobile devices – can now have their subtitles generated automatically. Expect to see digital asset management services (like Overcast) make hay of this in 2018.

Digital advertising is set for new heights of amazingness as marketers will soon be able to target you based not just on what you type, but what you say and what you listen to. Context-aware audio marketing just became a real possibility as machine learning can understand emotions like excitement, hesitancy and acquiescence.

Amazon Translate

Past audio-to-text software had one huge functionality gap for big companies – it worked well only in English, and ignored the swathes of customers calling in to complain about their phone bill in Spanish, Portuguese or Chinese. Now, Amazon says Translate’s “neural machine translation”, which uses machine learning and deep learning models, can efficiently translate huge amounts of text into over a hundred languages. And because it’s context-aware, the results are pretty great.

While the likes of Google Translate are great for asking directions to the nearest library on your school trip to Paris, Amazon Translate can localise websites and applications to the individual user. So that advert for 6,000 pairs of socks on Alibaba.com will finally make sense to the rest of us.
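Localising an application for an individual user could look something like the sketch below, which translates a dictionary of interface strings into the user's language. It assumes a client with a `translate_text` method shaped like the one in the boto3 SDK; the `localise_strings` helper and the string keys are hypothetical.

```python
def localise_strings(client, strings, target_language, source_language="en"):
    """Translate each UI string and return {key: translated_text}."""
    localised = {}
    for key, text in strings.items():
        if target_language == source_language:
            # Nothing to translate, so skip the API call entirely.
            localised[key] = text
            continue
        response = client.translate_text(
            Text=text,
            SourceLanguageCode=source_language,
            TargetLanguageCode=target_language,
        )
        localised[key] = response["TranslatedText"]
    return localised
```

In practice you would probably cache the results per language rather than call the service on every page view, but that is a design choice outside what the launch covered.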

Amazon Comprehend

In increasing order of amazingness, I present: Amazon Comprehend.

Amazon Comprehend is a natural language processing (NLP) service that can understand the context of a piece of text, draw insights from it, and even understand that text’s relationship with another text. This has huge implications for aggregation of insights into customer support incidents, product reviews, news articles, your social media feed – anything you can commit to text.
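As a rough sketch of what pulling insights out of a single piece of text might involve, the function below asks for sentiment, entities and key phrases and keeps only the confident results. It assumes a client exposing `detect_sentiment`, `detect_entities` and `detect_key_phrases` with the response shapes used by the boto3 SDK; the `analyse_text` helper and the 0.8 confidence cut-off are hypothetical choices.

```python
def analyse_text(client, text, language_code="en", min_score=0.8):
    """Summarise one document: overall sentiment, entities and key phrases."""
    sentiment = client.detect_sentiment(Text=text, LanguageCode=language_code)
    entities = client.detect_entities(Text=text, LanguageCode=language_code)
    phrases = client.detect_key_phrases(Text=text, LanguageCode=language_code)
    return {
        "sentiment": sentiment["Sentiment"],
        # Keep only results the service is reasonably confident about.
        "entities": [
            (item["Text"], item["Type"])
            for item in entities["Entities"]
            if item["Score"] >= min_score
        ],
        "key_phrases": [
            item["Text"]
            for item in phrases["KeyPhrases"]
            if item["Score"] >= min_score
        ],
    }
```

Run over a support ticket or a product review, the returned dictionary is the kind of record you could aggregate across thousands of documents.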

Moving up to a wider-angle lens, Amazon Comprehend can analyse a collection of documents, organise them by topic, and then deliver personalised content to the end user. Those newspaper digest emails suddenly got way more relevant.

At the launch, Amazon pressed hard on the applications for voice-to-text in customer service workflows, and demonstrated how it can give a company a holistic view of the customer experience before customers get as far as needing support.

graph of AWS Transcribe applications for customer support

If you’re into digital asset management, Amazon Comprehend can plug in better search results by enabling search to index keywords, entities and – scariest of all – sentiments.

graph of amazon search

Imagine your marketplace website sells products that customers can review – now your search function can return results based not just on 1-to-5-star ratings, but also the sentiment of attached customer comments. All on the fly.
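One way to picture a search function that weighs both signals is a blended score per product. The sketch below is a hypothetical ranking scheme, not anything Amazon demonstrated: it assumes each review already carries a positive-sentiment score between 0 and 1 (for example from a sentiment API), and the 50/50 weighting is an arbitrary starting point.

```python
from dataclasses import dataclass


@dataclass
class Review:
    product: str
    stars: int             # the 1-to-5 star rating
    positive_score: float  # 0.0 to 1.0, sentiment of the written comment


def product_scores(reviews, sentiment_weight=0.5):
    """Average a blend of star rating and comment sentiment per product."""
    blended = {}
    for review in reviews:
        star_part = (review.stars - 1) / 4  # rescale 1..5 to 0..1
        score = (
            (1 - sentiment_weight) * star_part
            + sentiment_weight * review.positive_score
        )
        blended.setdefault(review.product, []).append(score)
    return {name: sum(vals) / len(vals) for name, vals in blended.items()}


def rank_products(reviews, sentiment_weight=0.5):
    """Return product names, best blended score first."""
    scores = product_scores(reviews, sentiment_weight)
    return sorted(scores, key=scores.get, reverse=True)
```

A five-star review with a grumbling comment can then rank below a four-star review with a glowing one, which is exactly the kind of nuance star ratings alone miss.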

graph of aws semantic search

Ready to join the world of tomorrow? Amazon’s new triumvirate is available for Preview Signup, with accounts expected to be rolled out in the coming weeks.

OvercastHQ just released our first product – our Video and Digital Asset Review and Approval tool – on Amazon Web Services. Check it out here.

