• Home
  • Afiliados
    • Programa Afiliados
    • Cadastrar Afiliado
    • Área Afiliados
  • Sair
  • Acesso Aluno
  • Home
  • Afiliados
    • Programa Afiliados
    • Cadastrar Afiliado
    • Área Afiliados
  • Sair
  • Acesso Aluno

Startup International

  • Home
  • Blog
  • Startup International

OpenAI open-sources Whisper, a multilingual speech recognition system • TechCrunch

  • Postado por Timwood Educacional
  • Categorias Startup International
  • Data 21/09/2022

[ad_1]

Speech recognition remains a challenging problem in AI and machine learning. In a step toward solving it, OpenAI today open-sourced Whisper, an automatic speech recognition system that the company claims enables “robust” transcription in multiple languages as well as translation from those languages into English.

Countless organizations have developed highly capable speech recognition systems, which sit at the core of software and services from tech giants like Google, Amazon and Meta. But what makes Whisper different, according to OpenAI, is that it was trained on 680,000 hours of multilingual and “multitask” data collected from the web, which lead to improved recognition of unique accents, background noise and technical jargon.

“The primary intended users of [the Whisper] models are AI researchers studying robustness, generalization, capabilities, biases and constraints of the current model. However, Whisper is also potentially quite useful as an automatic speech recognition solution for developers, especially for English speech recognition,” OpenAI wrote in the GitHub repo for Whisper, from where several versions of the system can be downloaded. “[The models] show strong ASR results in ~10 languages. They may exhibit additional capabilities … if fine-tuned on certain tasks like voice activity detection, speaker classification or speaker diarization but have not been robustly evaluated in these area.”

Whisper has its limitations, particularly in the area of text prediction. Because the system was trained on a large amount of “noisy” data, OpenAI cautions Whisper might include words in its transcriptions that weren’t actually spoken — possibly because it’s both trying to predict the next word in audio and trying to transcribe the audio itself. Moreover, Whisper doesn’t perform equally well across languages, suffering from a higher error rate when it comes to speakers of languages that aren’t well-represented in the training data.

That last bit is nothing new to the world of speech recognition, unfortunately. Biases have long plagued even the best systems, with a 2020 Stanford study finding systems from Amazon, Apple, Google, IBM and Microsoft made far fewer errors — about 35% — with users who are white than with users who are Black.

Despite this, OpenAI sees Whisper’s transcription capabilities being used to improve existing accessibility tools.

“While Whisper models cannot be used for real-time transcription out of the box, their speed and size suggest that others may be able to build applications on top of them that allow for near-real-time speech recognition and translation,” the company continues on GitHub. “The real value of beneficial applications built on top of Whisper models suggests that the disparate performance of these models may have real economic implications … [W]e hope the technology will be used primarily for beneficial purposes, making automatic speech recognition technology more accessible could enable more actors to build capable surveillance technologies or scale up existing surveillance efforts, as the speed and accuracy allow for affordable automatic transcription and translation of large volumes of audio communication.”

The release of Whisper isn’t necessarily indicative of OpenAI’s future plans. While increasingly focused on commercial efforts like DALL-E 2 and GPT-3, the company is pursuing several purely theoretical research threads, including AI systems that learn by observing videos.

[ad_2]

  • Compartilhe:
Timwood Educacional

Post anterior

ARTIGO: Putin convoca mais tropas e ameaça com reação nuclear, mas mostra fraqueza da Rússia
21/09/2022

Próximo post

CoinFund's Seth Ginns on why the crypto downturn has spared early-stage startups • TechCrunch
21/09/2022

Você também pode gostar

A Beginner’s Guide to Business Success in the Metaverse 
05/10/2022

[ad_1] Dr. Alex Young is an National Health Service (NHS)trauma and orthopedic surgeon, and CEO and founder at Virti. Passionate about improving human performance, he built and sold his first company while at university, before bootstrapping and scaling another while …

WiseTech Global donates $2.5 million to kids tech learning platform Grok Academy alongside 1% of profits pledge
04/10/2022

[ad_1] ASX-listed logistics company WiseTech Global has pledged 1% of its annual pre-tax profit to tech education through Grok Academy as part of a five-year deal. The deal, which kicks off with an FY22 contribution of more than $2.5 million, will …

The Australian Senate is holding an inquiry into the market dominance of Big Tech, including Meta, Google, Apple & Amazon
04/10/2022

[ad_1] The Senate Standing Committees on Economics’ Influence of International Digital Platforms inquiry has been tasked with exploring the degree to which major multinational technology companies – Meta, Google, Microsoft, Apple, Amazon and others are implied but not explicitly named – are …

Matricule-se Já!

  • Basic Plan - Apenas Conteúdo
    Basic Plan – Apenas Conteúdo R$ 359,99 Adicionar ao carrinho

Posts recentes

  • How Important Is Learning English?
  • Dez Motivos para você aprender Inglês.
  • Como Me Mantenho Motivado Para Aprender Japonês? E Como Você Pode Fazer O Mesmo
  • Protegido: Relatório de Agendamento de Aulas
  • A Beginner’s Guide to Business Success in the Metaverse 
timwood-logo

Copyright © 2025, este site e todo seu conteúdo pertencem a TIMWOOD Educacional e possui seu direitos reservados.

  • Termos e Condições
  • Privacidade
  • Política de Cookies (BR)
  • Termos e Condições
  • Privacidade
  • Política de Cookies (BR)

Faça login com sua conta de site

Perdeu sua senha?