Victor talks about Synthesia’s journey and the rapid progression of media technology, where video content demand now outpaces the speed of production. He unpacks the role of academia in the company, where PhDs and professors make up nearly 50% of staff.
The CEO goes over the evolution of text-to-speech in the last decade, from the appearance of deep learning and voice-cloning to multi-dimensional speech with emotions, pitch, and style. He also discusses the difficulty of extending voice into multiple languages when there is no data to support the neural network.
Victor reviews the success of video content over text and how this ties into working with global companies, which not only want to train and communicate with their employees but also improve the customer journey. He shares how he sees the localization and translation industries as partners and an integral part of creating multilingual content.
Victor talks about Synthesia’s funding rounds and shares the story behind connecting with their first investor, Mark Cuban, after struggling to find funding. He gives advice on finding the right type of investor and what to expect from venture capitalist’s unfamiliar with the tech space.
The podcast wraps up with Victor’s view on deepfakes and the impact on their approach toward harmful content, and the company’s vision of creating more storytelling rather than informative content.
First up, Florian and Esther go through the poll results from May 21, where respondents weighed in on Translation as a Subscription, with only 12% thinking of it as “the future.” The duo discuss the pros and cons of the subscription model and reference the Pricing and Procurement report, which highlights its simplicity and predictability.
Florian talks about KUDO’s latest public relations win as billionaire investor Bill Ackman tweeted about using the multilingual conferencing platform for an investor presentation. Ackman used KUDO to run the presentation in 11 languages, where he announced buying 10% of Universal Music Group from Vivendi SE.
For the third week in a row, RWS pops up in language industry news as it partners with speech recognition system CEDAT85 to launch a live subtitling and captioning solution for online meetings and events. Esther touches on possible competitors in the space, such as Ai-Media, Redbee, and Verbit.
Florian discusses SwissText’s 2021 conference competition, which saw Microsoft’s winning approach toward the recognition and translation of the Swiss-German dialect into standard German text.
Stream Slator webinars, workshops, and conferences on the Slator Video-on-Demand channel.