The Cleft Dataset

A dataset of ultrasound and audio recorded with children with cleft lip and palate

The cleft dataset is a collection of ultrasound tongue imaging and audio data, gathered from children with cleft lip and palate by a research speech and language therapist working in a hospital environment.

Speakers

We recorded data with 39 English-speaking children, but only 29 gave consent to share their data. These are 11 female speakers and 18 male, aged 7-11 years.

SPEAKER-ID	GENDER	AGE-Y	AGE-M	AGE	CLEFT-TYPE	OTHER-MEDICAL
01M	M	10	5	10.42	BCLP	no
03F	F	5	1	5.83	UCLP	no
05M	M	10	3	10.75	UCLP	no
06M	M	9	8	9.75	UCLP	no
07M	M	9	2	9.75	UCLP	no
09M	M	9	8	9.33	UCLP	yes
11M	M	4	5	4.42	CP	no
12F	F	5	1	5.42	CP	yes
14F	F	5	0	5.33	BCLP	no
15F	F	4	9	4.75	UCLP	yes
16M	M	9	7	9.33	UCLP	no
17F	F	4	4	4.42	BCLP	no
18F	F	5	1	5.25	UCLP	no
19F	F	3	9	3.58	CP	yes
20F	F	7	5	7.75	CP	no
21M	M	9	1	9.5	CP	no
24M	M	6	5	6.33	CP	yes
25M	M	4	1	4.33	CP	no
26M	M	4	4	4.67	BCLP	no
28F	F	8	9	8.58	BCLP	no
30F	F	7	7	7.42	CP	no
31F	F	5	4	5.42	CP	no
32M	M	5	8	5.5	UCLP	no
33M	M	6	4	6.42	UCLP	yes
34M	M	5	3	5.25	CP	yes
35M	M	3	7	3.42	UCLP	no
36M	M	5	0	5.58	BCLP	no
37M	M	7	0	7.58	BCLP	no
39M	M	7	0	7	CP	yes

Cleft Types

Data type	Description
CP	cleft palate only
UCLP	unilateral cleft lip and palate affecting one side of the lip and palate
BLP	bilateral cleft lip and palate affecting both sides

Sessions

Each child recorded an "Assessment" session, and two children recorded a "Therapy" session.

Data Types

Core data types:

Data type	Description
wav	speech waveform
ult	raw ultrasound data
param	ultrasound parameters
txt	prompt text with date/time of utterance recording

Hardware synchronisation failed for this dataset. We release the parameter File as exported from the AAA software. In [1] we provide automatically predicted synchronisation offsets.

Additional data:

Data type	Description
slt_labels	manual annotation from SLT, when available. See [2] for details
probe_direction_labels	a label for each utterance indicating whether the probe was in a coronal position (cor) or midsagittal right or let (sag_right, sag_left)

SLT Labels are available in Praat's TextGrid format, and probe_direction_labels is a csv file.

File IDs

Individual recordings are indexed for each session according to their recording times. See the prompt text file for recording date/time.

Each file ID also includes a prompt type identifier. See Data for details.

References

[1] Eshky, A.,Cleland, J., Ribeiro, M. S., Renals, S. Automatic audiovisual synchronisation for ultrasound tongue imaging. (Under revision).

[2] Cleland, J., Lloyd, S., Campbell, L., Crampin, L., Palo, J.-P., Sugden,E., Wrench, A., & Zharkova, N. (2020). The impact of real-time ar-ticulatory information on phonetic transcription: ultrasound-aidedtranscription in cleft lip and palate speech. Folia Phoniatrica etLogopaedica, 72, 120–130.