The Ultrax 2020 Dataset - UX2020

A dataset of ultrasound and audio recorded with children with speech sound disorders

The Ultrax 2020 dataset is a collection of ultrasound tongue imaging and audio data, gathered from children with speech sound disorders by speech and language therapists in hospital environments.

Speakers

We recorded data with 43 English-speaking children, but only 37 gave consent to share their data. These are 11 female speakers and 26 male, aged 5-12 years.

SPEAKER-ID	GENDER	AGE-Y	AGE-M	AGE
01F	F	7	6	7.50
02F	F	10	1	10.08
03M	M	5	7	5.58
04M	M	7	11	7.92
05M	M	10	2	10.17
06F	F	10	5	10.42
07M	M	10	4	10.33
08F	F	7	6	7.50
09M	M	10	11	10.92
10M	M	7	4	7.33
11M	M	10	11	10.92
12M	M	6	0	6.00
13F	F	8	10	8.83
14M	M	9	1	9.08
15M	M	9	2	9.17
16M	M	8	11	8.92
17M	M	6	2	6.17
18M	M	7	5	7.42
19M	M	7	5	7.42
20M	M	7	11	7.92
21M	M	6	0	6.00
22M	M	12	11	12.92
23M	M	8	4	8.33
24F	F	11	2	11.17
25M	M	10	10	10.83
26F	F	7	4	7.33
27M	M	7	10	7.83
28F	F	7	8	7.67
29M	M	8	2	8.17
30F	F	6	3	6.25
31M	M	9	2	9.17
32M	M	10	9	10.75
33M	M	5	9	5.75
34F	F	9	0	9.00
35M	M	5	11	5.92
36M	M	6	4	6.33
37F	F	5	2	5.17

Sessions

Each child recorded only one session.

Data Types

Core data types:

Data type	Description
wav	speech waveform
ult	raw ultrasound data
param	ultrasound parameters
txt	prompt text with date/time of utterance recording

Hardware synchronisation failed for this dataset. We release the parameter File as exported from the AAA software.

Additional data:

Data type	Description
slt_labels	manual annotation from SLT, when available.
probe_direction_labels	a label for each File ID indicating whether the probe was in coronal position (cor) or midsagittal (sag)

SLT Labels are available in Praat's TextGrid format, and probe_direction_labels is a csv file.

File IDs

Individual recordings are indexed according to their recording times. See the prompt text file for recording date/time.

Each file ID also includes a prompt type identifier. See Data for details.

References

[1] Eshky, A., Ribeiro, M. S., Cleland, J., Richmond, K., Roxburgh, Z., Scobbie, J., & Wrench, A. (2018) Ultrasuite: A repository of ultrasound and acoustic data from child speech therapy sessions. Proceedings of INTERSPEECH. Hyderabad, India.