News Release

AI could soon detect early voice box cancer from the sound of your voice

Vocal fold lesions and early stages of laryngeal cancer alter acoustics of the voice, paving the way for AI recognition

Peer-Reviewed Publication

Frontiers

Cancer of the voice box or larynx is an important public health burden. In 2021, there were an estimated 1.1 million cases of laryngeal cancer worldwide, and approximately 100,000 people died from it. Risk factors include smoking, alcohol abuse, and infection with human papillomavirus. The prognosis for laryngeal cancer ranges from 35% to 78% survival over five years when treated, depending on the tumor’s stage and its location within the voice box.

Catching cancer early is key for a patient’s prospects. At present, laryngeal cancers are diagnosed through video nasal endoscopy and biopsies – onerous, invasive procedures. Getting to a specialist who can perform these procedures can take time, causing delays in diagnosis. But now, researchers have shown in Frontiers in Digital Health that abnormalities of the vocal folds can be detected from the sound of the voice. Such ‘vocal fold lesions’ can be benign, like nodules or polyps, but may also represent the early stages of laryngeal cancer. These proof-of-principle results open the door for a new application of AI: namely, to recognize the early warning stages of laryngeal cancer from voice recordings.

“Here we show that with this dataset we could use vocal biomarkers to distinguish voices from patients with vocal fold lesions from those without such lesions,” said Dr Phillip Jenkins, a postdoctoral fellow in clinical informatics at Oregon Health & Science University, and the study’s corresponding author.

Voice messages

Jenkins and his colleagues are members of the ‘Bridge2AI-Voice’ project within the US National Institute of Health’s ‘Bridge to Artificial Intelligence’ (Bridge2AI) consortium, a nationwide endeavor to apply AI to complex biomedical challenges. Here, they analyzed variations in tone, pitch, volume, and clarity within the first version of the public Bridge2AI-Voice dataset, with 12,523 voice recordings of 306 participants from across North America.

A minority were from patients with known laryngeal cancer, benign vocal fold lesions, or two other conditions of the voice box: spasmodic dysphonia and unilateral vocal fold paralysis.

The researchers focused on differences in a number of acoustic features of the voice: for example, the mean fundamental frequency (pitch); jitter, variation in pitch within speech; shimmer, variation of the amplitude; and the harmonic-to-noise ratio, a measure of the relation between harmonic and noise components of speech.

The researchers found marked differences in the harmonic-to-noise ratio and fundamental frequency between men without any voice disorder, men with benign vocal fold lesions, and men with laryngeal cancer. They didn’t find any informative acoustic features among women, but it is possible that a larger dataset would reveal such differences.

The authors concluded that especially variation in the harmonic-to-noise ratio can be helpful to monitor the clinical evolution of vocal fold lesions, and to detect laryngeal cancer at an early stage, at least in men.

“Our results suggest that ethically sourced, large, multi‑institutional datasets like Bridge2AI‑Voice could soon help make our voice a practical biomarker for cancer risk in clinical care,” said Jenkins.

Building a bridge to AI

Now that the proof-of-principle has been established,  the next step is to use these algorithms on more data and test them in clinical settings on patient voices.

“To move from this study to an AI tool that recognizes vocal fold lesions, we would train models using an even larger dataset of voice recordings, labeled by professionals. We then need to test the system to make sure it works equally well for women and men,” said Jenkins.

“Voice-based health tools are already being piloted. Building on our findings, I estimate that with larger datasets and clinical validation, similar tools to detect vocal fold lesions might enter pilot testing in the next couple of years," predicted Jenkins.


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.