We're hiring!
*

WhisperSpeech makes its way to AI.dev

Mark Filion avatar

Mark Filion
December 07, 2023

Share this post:

Reading time:

Collabora is headed to San Jose, California, to take part in the inaugural edition of AI​.dev: Open Source GenAI & ML Summit, a new event which aims to bring together the brightest developers from around the world to shape the trajectory of open source AI.

Join us on Tuesday, December 12, as Jakub Piotr Cłapa dives into findings from WhisperSpeech, a new Open Source text-to-speech model developed by Collabora. Based entirely on properly licensed speech datasets and unrestricted Open Source code, the model's focus is to deliver the best natural-sounding Open Source speech synthesis solution for improved communication.

In this talk, Jakub will look at how Collabora scaled its models and training pipelines from hundreds to 80K+ hours of speech recordings, and will share lessons learned along the way. He'll also discuss some of the challenges encountered, including:

  • Gone in 16 minutes: the importance of small scale experiments.
  • Full throttle: is 100% GPU utilization enough?
  • Do you need a fancy framework? From single- to multi-GPU training.
  • Are SSDs fast enough? WebDataset brings a 10x improvement.
  • Does bigger always mean better? How to effortlessly scale AI models.
  • Clouds, enthusiasts or clusters? How to hunt down GPUs.
  • Defending moats. How is a gaming 4090 different from an H100?

If you plan on attending, please make sure to come say hello! Note that so you can also watch Jakub's talk remotely via the Room LL20D live stream.

Update: The video recording is now available, click on the link below to start watching!

Collabora @ AI​.dev: Open Source GenAI & ML Summit

Tricks Learned from Scaling WhisperSpeech Models to 80k+ Hours of Speech
Presented by Jakub Piotr Cłapa - Tuesday, December 12

 

Comments (0)


Add a Comment






Allowed tags: <b><i><br>Add a new comment:


 

Search the newsroom

Latest News & Events

Implementing DRM format modifiers in NVK

16/05/2024

This week we merged support for the VK_EXT_image_drm_format_modifier extension in NVK, the new open-source Vulkan driver for NVIDIA hardware.…

Kernel 6.9: Enable, test, repeat

14/05/2024

Collabora's engineers continue to be involved in the hardware enablement for a few different system-on-chips (SoCs) and platforms, and have…

SteamOS 3.6: How the Steam Deck atomic updates are improving

10/05/2024

Highlighting some of the key changes Collabora worked on with Valve to improve the system update tooling on SteamOS, including the move…

Open Since 2005 logo

We use cookies on this website to ensure that you get the best experience. By continuing to use this website you are consenting to the use of these cookies. To find out more please follow this link.

Collabora Ltd © 2005-2024. All rights reserved. Privacy Notice. Sitemap.