Behavior of Speech and DTMF inputs (STT call module)

When using the Speech to Text (STT) module, customers can mix speech and DTMF inputs (if this feature is enabled). For your service, it might be relevant to log how a customer made such an input. In order to achieve this, you can use a set of variables to find out what the input was:

{app.speechToText.myVar} = returns the final result
{app.speechToText.myVar.transcript} = includes e.g. digits which did not match the minimum or maximum number of inputs
{app.speechToText.myVar.input_type} = returns whether the input was voice or DTMF
{app.speechToText.myVar.confidence} = returns the confidence we had in the customer's input
{app.speechToText.myVar.valid} = checks whether the input was valid

There are some general rules when it comes to mixing speech and DTMF:

Whenever a customer starts typing an input via the dialpad, the audio channel is closed
- This means, even if the customer starts saying something, this will not be recorded by babelforce
- Also, if a customer started saying something and then began to type, only the typed input will be kept
- This will be reflected in the type which will then be dtmf
All DTMF inputs always have a confidence of 1

Numeric scenarios

Below, we give an example test row of expected behavior as the system is setup as of first quarter of 2023:

Speech model = Command and search
DTMF = true
Numeric = true
Min 2
Max 5
barge-In = true

Scenario	Variable	Transcript	InputType	Confidence	Valid	Comment
Say 1 2	12	12	voice	0.91	true
Say 1 2, then type 3 4	34	34	DTMF	1	true
Say 1 2 3 4 5 6	12345	123456	voice	0.88	true
type 1 2 3 4 5 6 Say 1 2 3	12345	12345	DTMF	1	true	As soon as the “match” for dtmf inputs is reached (max 5 in this case) audio is interrupted. The 6th digit is not filled, voice input is not possible

Note: The Speech to Text (STT) feature will incur additional third-party costs. These charges are determined by the duration of the caller's input and the pricing.

For detailed pricing information or assistance with cost estimation, please contact our support team at support@babelforce.com.

Related to

Behavior of Speech and DTMF inputs (STT call module)

Numeric scenarios

Scenario

Variable

Transcript

InputType

Confidence

Valid

Comment

Comments