[ aws . lexv2-runtime ]

recognize-utterance

Description

Sends user input to Amazon Lex. You can send text or speech. Clients use this API to send text and audio requests to Amazon Lex at runtime. Amazon Lex interprets the user input using the machine learning model built for the bot.

See also: AWS API Documentation

See ‘aws help’ for descriptions of global parameters.

Synopsis

  recognize-utterance
--bot-id <value>
--bot-alias-id <value>
--locale-id <value>
--session-id <value>
[--session-state <value>]
[--request-attributes <value>]
--request-content-type <value>
[--response-content-type <value>]
[--input-stream <value>]
<outfile>

Options

--bot-id (string)

The identifier of the bot that should receive the request.

--bot-alias-id (string)

The alias identifier in use for the bot that should receive the request.

--locale-id (string)

The locale where the session is in use.

--session-id (string)

The identifier of the session in use.

--session-state (string)

Sets the state of the session with the user. You can use this to set the current intent, attributes, context, and dialog action. Use the dialog action to determine the next step that Amazon Lex should use in the conversation with the user.

--request-attributes (string)

Request-specific information passed between the client application and Amazon Lex

The namespace x-amz-lex: is reserved for special attributes. Don’t create any request attributes for prefix x-amz-lex: .

--request-content-type (string)

Indicates the format for audio input or that the content is text. The header must start with one of the following prefixes:

  • PCM format, audio data must be in little-endian byte order.

    • audio/l16; rate=16000; channels=1

    • audio/x-l16; sample-rate=16000; channel-count=1

    • audio/lpcm; sample-rate=8000; sample-size-bits=16; channel-count=1; is-big-endian=false

  • Opus format

    • audio/x-cbr-opus-with-preamble;preamble-size=0;bit-rate=256000;frame-size-milliseconds=4

  • Text format

    • text/plain; charset=utf-8

--response-content-type (string)

The message that Amazon Lex returns in the response can be either text or speech based on the responseContentType value.

  • If the value is text/plain;charset=utf-8 , Amazon Lex returns text in the response.

  • If the value begins with audio/ , Amazon Lex returns speech in the response. Amazon Lex uses Amazon Polly to generate the speech using the configuration that you specified in the requestContentType parameter. For example, if you specify audio/mpeg as the value, Amazon Lex returns speech in the MPEG format.

  • If the value is audio/pcm , the speech returned is audio/pcm at 16 KHz in 16-bit, little-endian format.

  • The following are the accepted values:

    • audio/mpeg

    • audio/ogg

    • audio/pcm (16 KHz)

    • audio/* (defaults to mpeg)

    • text/plain; charset=utf-8

--input-stream (blob)

User input in PCM or Opus audio format or text format as described in the requestContentType parameter.

outfile (string) Filename where the content will be saved

See ‘aws help’ for descriptions of global parameters.

Output

inputMode -> (string)

Indicates whether the input mode to the operation was text or speech.

contentType -> (string)

Content type as specified in the responseContentType in the request.

messages -> (string)

A list of messages that were last sent to the user. The messages are ordered based on the order that you returned the messages from your Lambda function or the order that the messages are defined in the bot.

interpretations -> (string)

A list of intents that Amazon Lex determined might satisfy the user’s utterance.

Each interpretation includes the intent, a score that indicates how confident Amazon Lex is that the interpretation is the correct one, and an optional sentiment response that indicates the sentiment expressed in the utterance.

sessionState -> (string)

Represents the current state of the dialog between the user and the bot.

Use this to determine the progress of the conversation and what the next action might be.

requestAttributes -> (string)

The attributes sent in the request.

sessionId -> (string)

The identifier of the session in use.

inputTranscript -> (string)

The text used to process the request.

If the input was an audio stream, the inputTranscript field contains the text extracted from the audio stream. This is the text that is actually processed to recognize intents and slot values. You can use this information to determine if Amazon Lex is correctly processing the audio that you send.

audioStream -> (blob)

The prompt or statement to send to the user. This is based on the bot configuration and context. For example, if Amazon Lex did not understand the user intent, it sends the clarificationPrompt configured for the bot. If the intent requires confirmation before taking the fulfillment action, it sends the confirmationPrompt . Another example: Suppose that the Lambda function successfully fulfilled the intent, and sent a message to convey to the user. Then Amazon Lex sends that message in the response.