UW researchers designed a headphone system that translates several people speaking at once, following them as they move and preserving the direction and qualities of their voices. The team built the...
It’s not sending the audio to an unknown server. It’s all local.
From the article:
The system then translates the speech and maintains the expressive qualities and volume of each speaker’s voice while running on a device, such mobile devices with an Apple M2 chip like laptops and Apple Vision Pro. (The team avoided using cloud computing because of the privacy concerns with voice cloning.)
Why do you need someone’s permission to translate them?
To clone their voice, and to send the audio to some unknown server
It’s not sending the audio to an unknown server. It’s all local. From the article:
Dude… RTFA