NLP - ML - Audio - Speech to text - Transcription
Customer data is often at the heart of any project start. But whereas textual and image data are typically easier to collect and label, audio data is often more scarce and tricky.
This typically results in a few rounds of “back and forth” at a project start with typical questions such as:
In this internship, we aim to answer those questions to as specific a degree as possible. For this, you will focus on creating a speech transcription engine that can identify and accurately transcribe your colleagues in various contexts. In this process, you’ll identify key relationships between data quantity, quality, type and model accuracy. You’ll then package all of this into a demo which can take various forms for your fellow agents and the world to use!
This, dear ML6 Intern agent, is your mission.
The goals of this internship are as follows: