1. Home
  2. >
  3. AI 🤖
Posted

OpenAI Unveils Whisper AI for Flawless Speech-to-Text Transcription

  • OpenAI's Whisper AI tool offers incredibly accurate speech-to-text transcription. It was trained on 680,000 hours of data and makes 50% fewer errors than other tools.

  • Whisper is open-sourced by OpenAI for developers and researchers to build applications, not aimed at end users yet.

  • Multiple models available with different vRAM requirements based on accuracy. Largest 10GB model is most accurate.

  • Can run Whisper locally by cloning Git repo if you have x86 machine, or in Google Colab. More powerful hardware speeds it up.

  • Great for transcribing interviews and videos. 25-minute interview transcribed flawlessly in testing. Also does translation.

xda-developers.com
Relevant topic timeline:
OpenAI plans to partner with Scale AI to make it easier for developers to fine-tune their AI models using custom data, allowing businesses to tailor models to specific tasks and customize responses to match brand voice and tone.
OpenAI has launched ChatGPT Enterprise, a business-focused version of its AI-powered chatbot app that offers enhanced privacy, data analysis capabilities, and customization options, aiming to provide an AI assistant for work that protects company data and is tailored to each organization's needs.
OpenAI offers ChatGPT plugins through its ChatGPT Plus subscription, providing access to a range of plugins that allow users to interact with external apps and services for various purposes such as travel arrangements, food delivery, job applications, and language learning. The article provides a step-by-step guide on how to access and use these plugins, along with a list of recommended plugins including AI Quest, A Review Summary, A-to-Z Video Summary, Calorie Coach, HiCollectors Finder, Kayak, Music, Podcast Search, Timeport, and What to Watch.
OpenAI's ChatGPT, a language processing AI model, continues to make strides in natural language understanding and conversation, showcasing its potential in a wide range of applications.
OpenAI's ChatGPT is expanding its capabilities by adding voice and image-based functionalities, allowing users to have voice conversations with the chatbot and search for answers using images.
Spotify is partnering with OpenAI to use artificial intelligence for translating podcasts into other languages, utilizing OpenAI's capabilities to generate "human-like audio" for the translated content while maintaining the original speaker's voice and style.
Google plans to integrate its Bard artificial intelligence chatbot into its voice assistant product on mobile phones in the coming months, following announcements from Amazon and OpenAI about their own conversational chatbots, as big tech companies race to develop more advanced voice assistants and determine how to monetize them.
OpenAI is exploring various options, including building its own AI chips and considering an acquisition, to address the shortage of powerful AI chips needed for its programs like the AI chatbot ChatGPT.
Users are engaging in hours-long conversations with OpenAI's ChatGPT AI assistant using its recently added voice features, echoing the concept of human-AI emotional connections depicted in the film "Her."