AudioHijack Attack Cases and Samples
Attack Cases
We tested AudioHijack on the NVIDIA online demo of Phi-4-Multimodal.
Note that in the tool misuse case, we insert the tool prompts directly into the user request, because the demo page does not provide a system prompt interface.
Speech Samples
Carrier 0 — Benign
Carrier 0 — Additive L∞
Carrier 0 — Additive L2
Carrier 0 — Reverberated
Carrier 0 — Convolutional
Carrier 1 — Benign
Carrier 1 — Additive L∞
Carrier 1 — Additive L2
Carrier 1 — Reverberated
Carrier 1 — Convolutional
Carrier 2 — Benign
Carrier 2 — Additive L∞
Carrier 2 — Additive L2
Carrier 2 — Reverberated
Carrier 2 — Convolutional
Sound Samples
Carrier 0 — Benign
Carrier 0 — Additive L∞
Carrier 0 — Additive L2
Carrier 0 — Reverberated
Carrier 0 — Convolutional
Carrier 1 — Benign
Carrier 1 — Additive L∞
Carrier 1 — Additive L2
Carrier 1 — Reverberated
Carrier 1 — Convolutional
Carrier 2 — Benign
Carrier 2 — Additive L∞
Carrier 2 — Additive L2
Carrier 2 — Reverberated
Carrier 2 — Convolutional
Music Samples
Carrier 0 — Benign
Carrier 0 — Additive L∞
Carrier 0 — Additive L2
Carrier 0 — Reverberated
Carrier 0 — Convolutional
Carrier 1 — Benign
Carrier 1 — Additive L∞
Carrier 1 — Additive L2
Carrier 1 — Reverberated
Carrier 1 — Convolutional
Carrier 2 — Benign
Carrier 2 — Additive L∞
Carrier 2 — Additive L2
Carrier 2 — Reverberated
Carrier 2 — Convolutional