2

After the user invokes an app on Alexa , Is there a way to get the query as a voice stream/audio file of a user? Through alexa I want to send the stream to a webservice/lambda that the invoked app will call and analyze the intent there.

We have some proprietary code that we want to use for analyzing intent hence we cant do it on the alexa side Since I am sending the query after the user has invoked the app and through the app there are no privacy concerns(hopefully)

Thanks

1 Answer 1

2

No, that is not possible, and I don't think it will be.

Echo devices connect to Amazon only, and Amazon uses Lex (which is also available via AWS) to parse speech files. As a skill developer, you will only receive the parsed results: intent, slots - and maybe, when Amazon implements user differentiation, an anonymous ID for the speaker.

There is no way to access the original speech audio in your skill. As every file is also used by Amazon to train their speech recognition, I doubt they will open their ecosystem accordingly.

Only option I see currently: build your own Echo with e.g. a Raspberry Pi, then you have full control. But you can't leverage the install base of Echo.

Same applies to Google Home and Microsoft Cortana, so it's not just Amazon.

3
  • Hmm Interesting, Can we build a Alexa Voice Service on Top of Echo?
    – Rrrrr
    Commented Jan 8, 2018 at 17:31
  • @Rrrrr Not as far as I know, and not with an out-of-the-box Echo. Maybe there are some hacks that jail-break an Echo out there on the Internet somewhere... A regular Echo can use skills implemented with ASK, and if you have your own hardware (processor, microphone, software, the works) then you can leverage AVS for speech recognition. But I understand from your question that this is where your own code comes into play, so you have your own variant of AVS, correct? Commented Jan 10, 2018 at 13:49
  • In the video at the bottom of the page here it seems like it is possible if you hook up the alexa device to a custom server. Any idea how they are doing this?
    – Corey Cole
    Commented Apr 15, 2019 at 21:57

Not the answer you're looking for? Browse other questions tagged or ask your own question.