AI & Deep Learning

Memory Access Optimization for On-Chip Transfer Learning

Training of Deep Neural Network (DNN) at the edge faces the challenge of high energy consumption due to the requirements of a large number of memory accesses for gradient calculations....

SeeDetail　

Prototype Generation Network for Few-Shot Open-Set Keyword Spotting

A prototype generation network for few-shot, open-set keyword spotting, letting users define their own voice-control commands from only a handful of samples. (M.S. thesis, 2025)

Speech Densely Connected Convolutional Networks for Small Footprint Keyword Spotting

In a society where human-computer interaction is becoming increasingly important, voice assistants that use voice recognition to drive or control devices are becoming more common....

SeeDetail　

Dual-Sequences Gated Attention Unit Architecture for Speaker Verification

Speaker verification (SV), is the progress of verifying a person's claimed identity from their voice characteristics which are recorded by a device such as a microphone. A speaker verification system can be text-dependent and text-independent cases....

SeeDetail　

Self-Defined Text-dependent Wake-Up-Words Speaker Recognition System

In recent years, wake-up-words (WUW) technology is highly developed in some speaker recognition system. It is the progress of verifying a person's claimed identity from their voice characteristics, and can be efficiently deployed in some consumer applications....

SeeDetail　

A Speech Enhancement System Using Binary Mask Approach and Spectral Subtraction Method

BSS 最一開始想處理的問題就是 cocktail party problem :他的概念是在一個雞尾酒聚會上，假設有一些人邊喝酒邊說話，即使身旁有很多干擾，他們可，以很容易去聽某個人的談話內容，這是因為人的大腦可以自然的去分訊號，但這個過程對於數位電路來說卻很複雜...

SeeDetail