September 14, 2024

Search
Close this search box.

We are creating some awesome events for you. Kindly bear with us.

Transforming Audio: SIT’s AI-Driven Noise-Cancellation Tech

Getting your Trinity Audio player ready...

At the Singapore Institute of Technology (SIT), Professor Ian McLoughlin and his team are pioneering an innovative approach to improving speech clarity using artificial intelligence. Over the past two years, they have developed advanced AI frameworks designed to eliminate background noise from recorded speech, leaving only clear and intelligible voice. This technology, known as deep denoising, represents a significant advancement in audio processing and promises to transform communication in noisy environments.

Image credits: SIT Photo: Keng Photography/Tan Eng Keng

Clear communication in noisy settings, like a phone call on a crowded train or in a busy café, is often disrupted by background noise, causing frustration. Traditional signal processing methods address this but struggle with unpredictable sounds. AI’s ability to manage these challenges makes it a game-changer for enhancing audio clarity.

Deep denoising uses deep learning models to separate speech from ambient noise during calls or recordings. By training AI to identify and remove various noise types while enhancing spoken words, this technology filters out distractions, delivering clear speech. Professor McLoughlin notes that while traditional methods handle predictable noise, AI’s ability to manage unpredictable sounds significantly improves audio quality.

The development of this technology involved a rigorous process. The research team trained multiple AI models using over 50,000 recordings of noisy speech, paired with clean, noise-free samples. This iterative process, known as back propagation, gradually refined the models’ ability to produce clear speech from noisy input. The AI frameworks were continuously tested and trained until they could reliably transform noisy speech into clear, intelligible audio.

This project, a collaboration between SIT and a Taiwanese electroacoustic company with AI Singapore’s support, involved developing nearly a hundred AI frameworks. The final model was optimised for compact, embedded systems, ensuring real-time performance without compromising quality. This allowed the technology to be used in various applications, from consumer electronics to professional communication tools.

One major challenge was adapting the AI to function efficiently on tiny embedded systems, like those in portable audio devices. While it performed well on desktop computers with powerful GPUs, scaling it down for smaller devices required significant adjustments.

The team simplified and rewrote several mathematical equations within the AI to reduce the number of instructions needed to denoise speech, a process Professor McLoughlin likened to “performing mathematical tricks” to maintain performance while reducing computational complexity.

The result is an AI-powered denoising system capable of operating in real-time with minimal latency, meaning the delay between input and output is imperceptible. This marks a major step forward in speech and audio technology, where the focus has shifted from merely making speech intelligible to improving the overall quality of sound.

Professor McLoughlin, who has worked in speech and audio technology since 1991, has seen the field evolve rapidly, particularly with the rise of AI. In the early days, the primary concern was ensuring that speech was intelligible, especially in critical situations like emergency calls. Today, the emphasis is on enhancing the quality of sound, making it clear and pleasant to listen to.

The potential applications for this technology are vast. Professor McLoughlin is in discussions with industry partners, including rail companies and manufacturers of emergency communication equipment, to explore how SIT’s AI-based denoising technology can be licensed and adopted. If successful, this technology could greatly improve communication in noisy environments, making it a valuable asset across various sectors.

As SIT continues to refine and advance this technology, the future of clear, intelligible speech in noisy environments looks promising. The AI-powered denoising system developed by Professor McLoughlin and his team has the potential to revolutionise global communication by significantly improving sound clarity. This technology could transform how people experience and interact with audio in various aspects of daily life.

PARTNER

Qlik’s vision is a data-literate world, where everyone can use data and analytics to improve decision-making and solve their most challenging problems. A private company, Qlik offers real-time data integration and analytics solutions, powered by Qlik Cloud, to close the gaps between data, insights and action. By transforming data into Active Intelligence, businesses can drive better decisions, improve revenue and profitability, and optimize customer relationships. Qlik serves more than 38,000 active customers in over 100 countries.

PARTNER

As a Titanium Black Partner of Dell Technologies, CTC Global Singapore boasts unparalleled access to resources.

Established in 1972, we bring 52 years of experience to the table, solidifying our position as a leading IT solutions provider in Singapore. With over 300 qualified IT professionals, we are dedicated to delivering integrated solutions that empower your organization in key areas such as Automation & AI, Cyber Security, App Modernization & Data Analytics, Enterprise Cloud Infrastructure, Workplace Modernization and Professional Services.

Renowned for our consulting expertise and delivering expert IT solutions, CTC Global Singapore has become the preferred IT outsourcing partner for businesses across Singapore.

PARTNER

Planview has one mission: to build the future of connected work. Our solutions enable organizations to connect the business from ideas to impact, empowering companies to accelerate the achievement of what matters most. Planview’s full spectrum of Portfolio Management and Work Management solutions creates an organizational focus on the strategic outcomes that matter and empowers teams to deliver their best work, no matter how they work. The comprehensive Planview platform and enterprise success model enables customers to deliver innovative, competitive products, services, and customer experiences. Headquartered in Austin, Texas, with locations around the world, Planview has more than 1,300 employees supporting 4,500 customers and 2.6 million users worldwide. For more information, visit www.planview.com.

SUPPORTING ORGANISATION

SIRIM is a premier industrial research and technology organisation in Malaysia, wholly-owned by the Minister​ of Finance Incorporated. With over forty years of experience and expertise, SIRIM is mandated as the machinery for research and technology development, and the national champion of quality. SIRIM has always played a major role in the development of the country’s private sector. By tapping into our expertise and knowledge base, we focus on developing new technologies and improvements in the manufacturing, technology and services sectors. We nurture Small Medium Enterprises (SME) growth with solutions for technology penetration and upgrading, making it an ideal technology partner for SMEs.

PARTNER

HashiCorp provides infrastructure automation software for multi-cloud environments, enabling enterprises to unlock a common cloud operating model to provision, secure, connect, and run any application on any infrastructure. HashiCorp tools allow organizations to deliver applications faster by helping enterprises transition from manual processes and ITIL practices to self-service automation and DevOps practices. 

PARTNER

IBM is a leading global hybrid cloud and AI, and consulting services provider, helping clients in more than 175 countries capitalize on insights from their data, streamline business processes, reduce costs and gain the competitive edge in their industries. Nearly 3,800 government and corporate entities in critical infrastructure areas such as financial services, telecommunications and healthcare rely on IBM’s hybrid cloud platform and Red Hat OpenShift to affect their digital transformations quickly, efficiently, and securely. IBM’s breakthrough innovations in AI, quantum computing, industry-specific cloud solutions and business services deliver open and flexible options to our clients. All of this is backed by IBM’s legendary commitment to trust, transparency, responsibility, inclusivity, and service. For more information, visit www.ibm.com