DeepL has acquired San Francisco-based Mixhalo, a live audio streaming startup, for an undisclosed sum. The deal gives the German AI translation company a foothold in real-time, low-latency audio delivery at large-scale live events, and its first office in San Francisco.
Mixhalo’s technology lets event attendees download an app, connect to the venue’s audio feed, and hear the same high-quality sound through their headphones regardless of where they are sitting. The startup has deployed its tech at Metallica and Sting concerts, MLB and NASCAR events, and with carriers Verizon and T-Mobile. It has raised nearly $40 million, with backers including Pharrell Williams, Founders Fund, Fortress Investment, and Cowboy Ventures.
The acquisition comes at a complicated moment for DeepL. The Cologne-based company, last valued at $2 billion, cut around 250 employees last month. Despite that, it appears to be doubling down on expanding its product scope, particularly into voice and live speech translation.
DeepL is best known for its AI text translation and writing tools, used by more than 200,000 businesses. It has been building out voice-to-voice translation more recently, and Mixhalo was already using DeepL as its primary translation provider before the acquisition. That existing relationship makes the deal a logical next step rather than a left-field move.
The company framed the rationale clearly: it wants to integrate what it calls “ultra-low-latency audio infrastructure” into its platform so that translated speech and captions can reach large audiences instantly without losing the natural pace of live conversation. That matters more than it might seem. Delayed or stilted translation breaks the experience at live events, especially when tens of thousands of people are listening at once.
DeepL founder and CEO Jarek Kutylowski said Mixhalo had “solved one of the hardest problems in live audio, which is delivering high-fidelity sound to thousands of people at once with basically zero latency.” He described the combined vision as building a “real-time Language AI layer for communication” covering everything from team meetings and customer calls to major international events.
Mixhalo was founded in 2016 by two musicians and a technologist, which partly explains its strong foothold in the live entertainment world. That entertainment credibility, combined with DeepL’s enterprise translation reach, gives the merged operation an interesting position in a market that has seen growing demand for multilingual live experiences at global sporting events, conferences, and concerts.
The broader trend here is worth noting. Real-time AI translation is moving fast, and the race is no longer just about accuracy in text. Speed, audio quality, and the ability to work at scale in noisy, live environments are becoming the new battleground. DeepL is betting that owning the audio delivery layer, not just the translation engine, is where that advantage will be built.




