Ai ASR - 搜索

约 1,910 个结果

在新选项卡中打开链接

时间不限

openai.com
https://openai.com › index › whisper
Introducing Whisper - OpenAI
2022年9月21日 · Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise and technical language.
openai.com
https://help.openai.com › en › articles
Whisper Audio API FAQ - OpenAI Help Center
How much does the Whisper ASR API cost to use? See our Pricing page for details. Is Whisper still free in the playground? Starting March 1st, 2023, with the Whisper API launch it is no longer free in the playground. What languages are supported?
openai.com
https://cdn.openai.com › papers › whisper.pdf
[PDF]
Robust Speech Recognition via Large-Scale Weak Supervision
ASR systems include some level of inverse text normaliza-tion, it is often simple or rule-based and still detectable from other unhandled aspects such as never including commas. We also use an audio language detector, which was created by fine-tuning a …
openai.com
https://openai.com › index › whisper
Presentamos a Whisper - OpenAI
Whisper es un sistema de reconocimiento automático del habla (ASR), entrenado con 680 000 horas de datos multilingües y multitarea supervisados, obtenidos de la web. Mostramos que el uso de un conjunto de datos tan grande y diverso mejora el rendimiento en términos de acentos, ruido de fondo y lenguaje técnico.
openai.com
https://platform.openai.com › docs › guides › speech-to-text
OpenAI Platform
Explore resources, tutorials, API docs, and dynamic examples to get the most out of OpenAI's developer platform.
openai.com
https://community.openai.com
GPT-4o text to speech and speech to text - API
2024年5月13日 · Currently using Azure AI Speech API for speech/text interfacing to chat model. The Microsoft API supports streaming on-demand and continuous recognition. Will GPT-4o audio support still be file-based or will it be able to replace the Microsoft API?
openai.com
https://community.openai.com › can-whisper-distinguish-two-speakers
API - OpenAI Developer Community - OpenAI API Community …
2023年7月8日 · It seems that Whisper can’t do timestamps itself and instead uses an external tool that tracks something like the length of time for each word, or the gap between words, something like that. It’s measured in seconds. Assembly AI on the other hand provides the actual timestamps in milliseconds for each word.
openai.com
https://community.openai.com › api-whisper-transcriptions-errors-solved
API Whisper transcriptions errors (SOLVED) - API - OpenAI API …
2024年2月12日 · I have seen many posts commenting on bugs and errors when using the openAI’s transcribe APIs (whisper-1). I also encountered them and came up with a solution for my case, which might be helpful for you as well. This is my app’s workflow: Form (video) → Conversion to .mp3 → Upload to cloud storage → Return the ID of the created audio (used …
openai.com
https://community.openai.com
API - OpenAI Developer Community - OpenAI API Community …
2023年12月20日 · I’m currently using the Whisper API for audio transcription, and the default 25 MB file size limit poses challenges, particularly in maintaining sentence continuity when splitting files. By default, the Whisper API only supports files that are less than 25 MB. If you have an audio file that is longer than that, you will need to break it up into chunks of 25 MB’s or less or …
openai.com
https://community.openai.com › whisper-language-recognition
Documentation - OpenAI Developer Community - OpenAI API …
2024年3月4日 · If there was any hint of intelligence to be found in the AI, even something that says “our interview with a non-native speaker of the German language conducted in German now continues.” (“Unser Interview mit einer Person, die Deutsch nicht als Muttersprache spricht und das ausschließlich auf Deutsch geführt wird, geht nun weiter.”
分页
- 1
- 2
- 3
- 4
- 下一页