Voice Assistant

Last edit: 2023.08.07

The in-vehicle Voice Personal Assistant (VPA) is a digital assistant that uses voice recognition to help users complete their daily tasks without being distracted while driving. All communication between the user and any voice assistant is either vocal only or a vocal and visual response.

TomTom Digital Cockpit allows you to integrate various voice assistants currently available on the market. This documentation describes how voice assistants could work in the system using the example of a built-in Alexa voice assistant.

The default configurations available for Alexa VPA are certification-ready as they have been built following the Alexa Auto Design Guidelines. However, any changes to the Alexa voice assistant in the system will need to adhere to the Alexa Auto Design Guidelines for certification.

Commands and wake word

Voice commands usually start with a voice assistant wake word, to initialize the speech dialog with a further request to the voice assistant from the user. Voice commands and wake words depend on voice assistants and their possibilities.

The wake word is the word customers can use to invoke a voice assistant. Another way to invoke voice assistants is by pressing a hardware or on-screen UI button to begin a speech dialogue (if the button is available in the vehicle).

For example, two primary ways to invoke Alexa in the vehicle are described in the Alexa Auto Design Guidelines. Both are required:

  • Saying the wake word "Alexa".
  • Pressing the Push to talk (PTT) button or an on-screen Tap to talk (TTT) button to directly invoke Alexa without the wake word.

Customization

Customization of the wake word depends on integrated voice assistance possibilities. For example, Alexa doesn't support wake word customization.

Voice chrome

Voice chrome is a visual indicator of the voice assistant's active state. The chrome could be visually different depending on its states, such as status (error, notification, etc.) or other possible states (listening, speaking, waiting, capturing, etc.). The states of the voice chrome depend on the voice assistant.

Chrome appears on a top system level in the Overlay panel.

Customization

Chrome could be customized based on integrated voice assistant guidelines, brand colors, VPA status, and states. For more information see the Alexa chrome guidelines.

navigation

Out of the box

Stock chrome of Alexa VPA.

communication

Customized

Customized chrome of Alexa VPA.

Status control

The status control indicates the voice assistant's activity status. The status control is optional and has reserved space in the Control center.

For example, Alexa's wake word could be disabled or enabled with status control. Also, the status control has a yellow indicator badge that shows whether Alexa notifications are available.

voice chrome status control

Voice personal assistant templates

All communication between the user and any voice assistant is either voice commands or voice commands together with a visual response (template).

Voice assistant templates are a visual way to communicate with the user, that is usually supported by voice responses from the voice assistant.

All templates are universal and can be used by any voice personal assistant, but the template's content must follow the format provided. Customers can create a template from scratch, based on provided generic templates, if additional functionality is required.

There are generic templates supported by TomTom Digital Cockpit:

  • Points of interest (POI) list
  • Points of interest (POI) details
  • Traffic
  • Weather
  • Image and text
  • List and buttons

Customization

VPA usually provides a visual response, depending on whether the user is driving. Whether or not to show a visual response is a customizable setting that depends on the customer's decision. However, we recommend avoiding visual responses while driving if it is possible to solve the user's problem by using only voice response.

customization
SectionUse
1. Full visual response (template)The complete visual template version is recommended to be used while parked.
2. Short visual responseShort version of the template.
3. Voice onlyNo visual response, except for the voice chrome, is recommended while driving.

When the user requests the voice assistant about POIs (points of interest), the voice and visual feedback template from the voice assistant appears on the screen. The template shows the list of available POIs nearby/along the way or in any other driving context. For example, if the user asks "Alexa, show me restaurants nearby", Alexa will show a template with a list of restaurants nearby. Users can interact with an index by referencing the position of the item in the list. For example, "Navigate to the first one" or selecting the item with a touch. After selecting one of the provided options, the user will start navigation to the POI, or the user will open the POI details template. This option is customizable.

navigation user flow navigation template
SectionUse
1. TitleThe title usually contains the answer or additional question from the voice assistant.
2. VPS logoOptional attribute, depends on voice assistant brand guidelines and certification requirements.
3. POI listInteractive list with short information about each POI, information depends on the provider.
4. Distance to POIOptional attribute, depends on the data provider.
5. Short POI infoOptional attribute, depends on the data provider. It could be a rating, a list of providers, or the current POI status (closed, open, permanently closed, etc.)

POI details

Voice assistants may provide specific POI details supported with visuals by the user's request.

navigation poi details user flow poi details template
SectionUse
1. TitleUsually contains the answer or an additional question from the voice assistant.
2. Short POI infoOptional attribute, depends on the data provider.
3. Main POI infoUsually contains a phone number and address, depending on the data provider.
4. Expandable listOptional attribute, depends on the data provider.
5. ImageOptional attribute, depends on the data provider.
6. ButtonsOptional touch actions that could also be performed by voice.

Traffic details

Voice assistants may provide specific traffic details during navigation to the POI.

traffic details user flow traffic details template
SectionUse
1. TitleUsually contains the answer or additional question from the voice assistant.
2. Icon/imageOptional attribute, should show the provider logo or any other supportive visual.
3. Arrival statusOptional attribute to display the traffic, destination/travel time, and distance indicator, if provided by the data source.
4. AddressUsually contains the location icon/image and full address, depending on the data provider.

Weather template

Voice assistants may provide specific weather information with a template by user request. The information provided depends on the voice assistant.

weather user flow weather template
SectionUse
1. TitleUsually contains the location, city name, country or region, etc.
2. Secondary infoUsually holds the date.
3. Weather status iconProvides information about the weather status (sunny, rainy, etc.).
4. Average temperatureThe average temperature during the day.
5. List itemOptional attribute, usually contains weather information (weather status and maximum/minimum temperatures) on the week.
6. Max/min temperatureThe maximum and minimum temperature during the day.

Generic content and image

The generic content and image template usually provides information requested by the user, which is read aloud by the voice assistant. The information depends on the provider.

generic content user flow generic content template
SectionUse
1. TitleUsually contains the asked question.
2. Secondary infoUsually contains data provider information.
3. ImageOptional attribute, to provide additional visual information.
4. TextThe information provided by the data source.

Generic list

The generic list usually provides possible options and depends on the voice assistant. It is only possible to interact with the list by voice. The amount of options in the list is customizable.

generic list flow generic list template
SectionUse
1. TitleUsually contains the list name.
2. List itemUsually contains a sequence number (optional) and text.

Voice assistant login

A login screen is shown inside a modal window when a voice assistant requires the user to log in before using it. The login flow depends on the voice assistant's guidelines and usually requires implementation by the customer. When login is complete, the user can interact with the voice assistant and see the VPA status (if available) in a control centre. The moment when the user needs to log in depends on the integrated voice assistant. For example, a login modal could be part of the onboarding flow or appear once the user says a wake word.

login

Customization

Currently, TomTom Digital Cockpit only supports the Alexa voice personal assistant. Any Alexa VPA customizations require certification from Amazon. Adding a new voice assistant requires integration work.