Behavior of Speech and DTMF inputs (STT call module)

Katrin Geske
Katrin Geske
  • Updated

When using the Speech to Text (STT) module, customers can mix speech and DTMF inputs (if this feature is enabled). For your service, it might be relevant to log how a customer made such an input. In order to achieve this, you can use a set of variables to find out what the input was:

  • {app.speechToText.myVar} = returns the final result
  • {app.speechToText.myVar.transcript} = includes e.g. digits which did not match the minimum or maximum number of inputs
  • {app.speechToText.myVar.input_type} = returns whether the input was voice or DTMF
  • {app.speechToText.myVar.confidence} = returns the confidence we had in the customer's input
  • {app.speechToText.myVar.valid} = checks whether the input was valid

There are some general rules when it comes to mixing speech and DTMF:

  • Whenever a customer starts typing an input via the dialpad, the audio channel is closed
    • This means, even if the customer starts saying something, this will not be recorded by babelforce
    • Also, if a customer started saying something and then began to type, only the typed input will be kept
    • This will be reflected in the type which will then be dtmf
  • All DTMF inputs always have a confidence of 1

 

Numeric scenarios

Below, we give an example test row of expected behavior as the system is setup as of first quarter of 2023:

  • Speech model = Command and search
  • DTMF = true
  • Numeric = true
  • Min 2
  • Max 5
  • barge-In = true

Scenario

Variable

Transcript

InputType

Confidence

Valid

Comment

Say 1 2

12

12

voice

0.91

true

 

Say 1 2, then type 3 4

34

34

DTMF

1

true

 

Say 1 2 3 4 5 6

12345 

123456

voice

0.88

true

 

type 1 2 3 4 5 6

Say 1 2 3

12345 

12345 

DTMF

1

true

As soon as the “match” for dtmf inputs is reached (max 5 in this case) audio is interrupted. The 6th digit is not filled, voice input is not possible

 

Note: The Speech to Text (STT) feature will incur additional third-party costs. These charges are determined by the duration of the caller's input and the pricing.

For detailed pricing information or assistance with cost estimation, please contact our support team at support@babelforce.com.

Related to

Was this article helpful?

/

Comments

0 comments

Please sign in to leave a comment.