Azure Speech To Text Rest Py
Skill AktivAzure Speech to Text REST API for short audio (Python). Use for simple speech recognition of audio files up to 60 seconds without the Speech SDK. Triggers: "speech to text REST", "short audio transcription", "speech recognition REST API", "STT REST", "recognize speech REST". DO NOT USE FOR: Long audio (>60 seconds), real-time streaming, batch transcription, custom speech models, speech translation. Use Speech SDK or Batch Transcription API instead.
To enable simple, SDK-free speech-to-text transcription of short audio files using the Azure REST API.
Funktionen
- Transcribe short audio files (up to 60 seconds)
- Utilize Azure Speech to Text REST API directly
- Support for WAV and OGG audio formats
- Provide basic and detailed response formats
- Enable pronunciation assessment capabilities
Anwendungsfälle
- Quickly transcribing short voice memos or notes.
- Integrating speech-to-text into applications without the overhead of the Speech SDK.
- Performing simple audio file analysis for content extraction.
Nicht-Ziele
- Transcribing audio longer than 60 seconds
- Real-time streaming transcription
- Batch transcription of multiple files
- Speech translation
- Using custom speech models
Trust
- warning:Issues AttentionThere are 19 open issues and 11 closed issues in the last 90 days, indicating a closure rate below 50% and potentially slow maintainer response.
Installation
Zuerst Marketplace hinzufügen
/plugin marketplace add microsoft/skills/plugin install azure-sdk-python@skillsQualitätspunktzahl
Vertrauenssignale
Ähnliche Erweiterungen
Azure Servicebus Py
100Azure Service Bus SDK for Python messaging. Use for queues, topics, subscriptions, and enterprise messaging patterns. Triggers: "service bus", "ServiceBusClient", "queue", "topic", "subscription", "message broker".
Azure Monitor Query Py
100Azure Monitor Query SDK for Python. Use for querying Log Analytics workspaces and Azure Monitor metrics. Triggers: "azure-monitor-query", "LogsQueryClient", "MetricsQueryClient", "Log Analytics", "Kusto queries", "Azure metrics".
Azure Container Registry SDK for Python
100Azure Container Registry SDK for Python. Use for managing container images, artifacts, and repositories. Triggers: "azure-containerregistry", "ContainerRegistryClient", "container images", "docker registry", "ACR".
Azure App Configuration SDK for Python
100Azure App Configuration SDK for Python. Use for centralized configuration management, feature flags, and dynamic settings. Triggers: "azure-appconfiguration", "AzureAppConfigurationClient", "feature flags", "configuration", "key-value settings".
Elevenlabs Stt
98ElevenLabs speech-to-text with Scribe models and forced alignment via inference.sh CLI. Models: Scribe v1/v2 (98%+ accuracy, 90+ languages). Capabilities: transcription, speaker diarization, audio event tagging, word-level timestamps, forced alignment, subtitle generation. Use for: meeting transcription, subtitles, podcast transcripts, lip-sync timing, karaoke. Triggers: elevenlabs stt, elevenlabs transcription, scribe, elevenlabs speech to text, forced alignment, word alignment, subtitle timing, diarization, speaker identification, audio event detection, eleven labs transcribe
Whisper
97OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.