AudioHijack Attack Hijacks AI Voice Models With Up to 96% Success Rate, Study Finds

Researchers at Zhejiang University have developed AudioHijack, an attack that hides imperceptible commands in audio to manipulate large audio-language models, with a reported success rate of 79-96%. The work was presented at the 47th IEEE Symposium on Security and Privacy in San Francisco.

The attack modifies digital audio waveforms in ways that are imperceptible to humans but that alter how the AI interprets the signal, allowing the hidden commands to override model behavior even when legitimate user instructions are present. The researchers tested AudioHijack on 13 open-source voice models and on commercial systems from Microsoft and Mistral, and found that it can force models to refuse requests, spread false information, insert malicious links, or execute unauthorized actions such as web searches and file downloads.
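
To give a sense of what "imperceptible" can mean for a waveform, here is a minimal Python sketch. It is not the AudioHijack method, which this article does not describe in detail. It only adds small random noise, bounded per sample, to a synthetic tone and measures the resulting signal-to-noise ratio. A real adversarial attack would presumably optimize the perturbation against a target model instead of drawing it at random. The function names, the 440 Hz tone, and the bound `epsilon=0.001` are all illustrative assumptions.

```python
import math
import random


def make_tone(freq_hz=440.0, sample_rate=16000, seconds=0.5, amplitude=0.5):
    """Synthesize a sine tone as a list of float samples in [-1, 1]."""
    n = int(sample_rate * seconds)
    return [
        amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)
        for i in range(n)
    ]


def bounded_perturbation(n, epsilon, seed=0):
    """Random noise with every sample in [-epsilon, epsilon].

    Assumption: random noise stands in for a perturbation that an
    actual attack would optimize against a model.
    """
    rng = random.Random(seed)
    return [rng.uniform(-epsilon, epsilon) for _ in range(n)]


def snr_db(clean, perturbed):
    """Signal-to-noise ratio in decibels between two equal-length signals."""
    signal_power = sum(x * x for x in clean)
    noise_power = sum((p - c) ** 2 for c, p in zip(clean, perturbed))
    return 10 * math.log10(signal_power / noise_power)


clean = make_tone()
delta = bounded_perturbation(len(clean), epsilon=0.001)
perturbed = [c + d for c, d in zip(clean, delta)]

max_change = max(abs(p - c) for c, p in zip(clean, perturbed))
print(f"max sample change: {max_change:.6f}")
print(f"SNR: {snr_db(clean, perturbed):.1f} dB")
```

A high SNR means the change is very small relative to the original signal. Whether a listener can actually hear a given perturbation depends on psychoacoustic factors that this sketch does not model.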
