The Ultrax 2020 Dataset - UX2020

A dataset of ultrasound and audio recorded with children with speech sound disorders

The Ultrax 2020 dataset is a collection of ultrasound tongue imaging and audio data, gathered from children with speech sound disorders by speech and language therapists in hospital environments.

Speakers

We recorded data with 43 English-speaking children, but only 37 gave consent to share their data. These are 11 female speakers and 26 male, aged 5-12 years.

SPEAKER-ID GENDER AGE-Y AGE-M AGE
01F F 7 6 7.50
02F F 10 1 10.08
03M M 5 7 5.58
04M M 7 11 7.92
05M M 10 2 10.17
06F F 10 5 10.42
07M M 10 4 10.33
08F F 7 6 7.50
09M M 10 11 10.92
10M M 7 4 7.33
11M M 10 11 10.92
12M M 6 0 6.00
13F F 8 10 8.83
14M M 9 1 9.08
15M M 9 2 9.17
16M M 8 11 8.92
17M M 6 2 6.17
18M M 7 5 7.42
19M M 7 5 7.42
20M M 7 11 7.92
21M M 6 0 6.00
22M M 12 11 12.92
23M M 8 4 8.33
24F F 11 2 11.17
25M M 10 10 10.83
26F F 7 4 7.33
27M M 7 10 7.83
28F F 7 8 7.67
29M M 8 2 8.17
30F F 6 3 6.25
31M M 9 2 9.17
32M M 10 9 10.75
33M M 5 9 5.75
34F F 9 0 9.00
35M M 5 11 5.92
36M M 6 4 6.33
37F F 5 2 5.17

Sessions

Each child recorded only one session.

Data Types

Core data types:

Data type Description
wav speech waveform
ult raw ultrasound data
param ultrasound parameters
txt prompt text with date/time of utterance recording

Hardware synchronisation failed for this dataset. We release the parameter File as exported from the AAA software.

Additional data:

Data type Description
slt_labels manual annotation from SLT, when available.
probe_direction_labels a label for each File ID indicating whether the probe was in
coronal position (cor) or midsagittal (sag)

SLT Labels are available in Praat's TextGrid format, and probe_direction_labels is a csv file.

File IDs

Individual recordings are indexed according to their recording times. See the prompt text file for recording date/time.

Each file ID also includes a prompt type identifier. See Data for details.

References

[1] Eshky, A., Ribeiro, M. S., Cleland, J., Richmond, K., Roxburgh, Z., Scobbie, J., & Wrench, A. (2018) Ultrasuite: A repository of ultrasound and acoustic data from child speech therapy sessions. Proceedings of INTERSPEECH. Hyderabad, India.