A repository of ultrasound and acoustic data from child speech therapy sessions
UltraSuite is a repository of ultrasound and acoustic data from child speech therapy sessions. The current release includes three data collections, one from typically developing children and two from children with speech sound disorders. It also includes a set of annotations, some manual and some automatically produced, and tools to process, transform and visualise the data. Read the paper here!
There are three datasets available in the repository:
Ultrax Typically Developing - UXTD. A dataset of 58 typically developing children. See here for further details.
Ultrax Speech Sound Disorders - UXSSD. A dataset of 8 children with speech sound disorders. See here for further details.
UltraPhonix - UPX. A second dataset of children with speech sound disorders, collected from 20 children. See here for further details.
- Ultrasuite Tools - Python library to process raw ultrasound data.
- Ultrasuite Kaldi (soon) - Recipes and other code to use UltraSuite data with the Kaldi Speech Recognition Toolkit.
We welcome user contributions to UltraSuite! We hope to keep UltraSuite in active development with help from the community. All contributions will be given proper credit. There are various ways to participate:
Contributing with data
The current release of UltraSuite has three datasets of ultrasound and audio from child speech, but we hope to include additional datasets from other modalities (e.g. MRI) and age groups (e.g. adults). If you'd like to share data that you collected through UltraSuite, please get in touch with any member of the Ultrax Speech project. Note that even though data is available through UltraSuite, we encourage users to cite the original authors.
Contributing with code
To contribute code or to help improve this documentation, please submit your changes as pull requests.
To report issues, use GitHub's Issue Tracker or contact any member of the Ultrax Speech project. Please submit code-related issues to the Issue Tracker of the respective repository. For issues found in the data, please contact us directly.
Datasets from UltraSuite are distributed under the Attribution-NonCommercial 4.0 International license (CC BY-NC 4.0). Code is available under the Apache License, Version 2.0.
If using data or code from UltraSuite, please provide appropriate web links and cite the following paper: