A customizable iOS overlay that handles voice permission and converts speech to text using native speech recognition.
Voice Overlay for iOS is a Swift library that provides a ready-to-use overlay for capturing voice input and converting it to text in iOS applications. It requests microphone and speech recognition permissions on the app's behalf and presents a customizable interface built on native iOS speech recognition.
It is aimed at iOS developers building apps that need voice input, particularly those who want speech-to-text features without writing the permission handling and UI themselves.
Developers choose Voice Overlay because it shortens the work of adding voice features: it ships a complete overlay with extensive customization options and relies on Apple's native speech recognition rather than a third-party service.
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Automatically guides users through the microphone and speech recognition permission prompts with clear visual feedback, as illustrated by the permission screenshots and nopermission.gif in the README, which reduces the boilerplate developers have to write.
Offers extensive customization options for layout, colors, text, and behavior across all screens; the settings examples in the README include changing background colors and titles.
Uses Apple's SFSpeechRecognizer for speech-to-text conversion, so recognition runs through the platform's own API and needs no external dependencies.
Supports configurable auto-start and auto-stop recording based on voice activity with timeouts, simplifying user interaction through settings like autoStopTimeout as documented.
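To make the auto-stop behavior concrete, here is a minimal sketch of a silence timeout in plain Swift. It is not Voice Overlay's implementation: `SilenceAutoStop` and its methods are hypothetical names, and the sketch assumes the timeout is expressed in seconds, in the spirit of the `autoStopTimeout` setting mentioned above.

```swift
import Foundation

/// Hypothetical helper (not part of Voice Overlay): decides when recording
/// should stop after a period without new speech.
struct SilenceAutoStop {
    /// Seconds of silence tolerated before stopping (assumed unit).
    let timeout: TimeInterval
    /// Time of the most recent speech activity, or nil if nothing was heard yet.
    private(set) var lastSpeech: TimeInterval?

    init(timeout: TimeInterval) {
        self.timeout = timeout
        self.lastSpeech = nil
    }

    /// Call whenever the recognizer reports new speech.
    mutating func speechDetected(at time: TimeInterval) {
        lastSpeech = time
    }

    /// True once speech has been heard and at least `timeout` seconds have passed since.
    func shouldStop(at time: TimeInterval) -> Bool {
        guard let last = lastSpeech else { return false }
        return time - last >= timeout
    }
}

var autoStop = SilenceAutoStop(timeout: 2.0)
autoStop.speechDetected(at: 0.5)
print(autoStop.shouldStop(at: 1.0)) // false: only 0.5 s of silence so far
print(autoStop.shouldStop(at: 3.0)) // true: 2.5 s of silence exceeds the timeout
```

Timestamps are passed in explicitly rather than read from a clock, which keeps the logic easy to test; a real overlay would more likely drive this from a timer that is reset on each partial recognition result.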
The result screen is marked as beta in the README, indicating potential instability or incomplete functionality, which might not be suitable for production-critical applications.
Tied exclusively to iOS and Apple's speech recognition APIs, which rules out cross-platform use. Recognition in some languages requires an internet connection, and no alternative backends are available out of the box.
Requires manual Info.plist modifications for privacy descriptions, adding initial setup complexity and potential for errors if not properly configured, as detailed in the usage section.
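The two Info.plist entries iOS expects for this kind of feature are `NSMicrophoneUsageDescription` and `NSSpeechRecognitionUsageDescription`. The sketch below shows one way to catch a missing or empty entry early, for example in a debug build. `missingPrivacyKeys(in:)` is a hypothetical helper written for this illustration, not something Voice Overlay provides.

```swift
import Foundation

/// Privacy keys iOS requires before an app may use the microphone
/// and speech recognition.
let requiredPrivacyKeys = [
    "NSMicrophoneUsageDescription",
    "NSSpeechRecognitionUsageDescription",
]

/// Hypothetical helper (not part of Voice Overlay): returns the required
/// keys that are missing or blank in the given Info.plist dictionary.
func missingPrivacyKeys(in info: [String: Any]) -> [String] {
    return requiredPrivacyKeys.filter { key in
        guard let text = info[key] as? String else { return true }
        return text.trimmingCharacters(in: .whitespacesAndNewlines).isEmpty
    }
}

// In an app, pass `Bundle.main.infoDictionary ?? [:]` instead of this sample.
let sample: [String: Any] = [
    "NSMicrophoneUsageDescription": "Used to capture your voice query."
]
print(missingPrivacyKeys(in: sample)) // ["NSSpeechRecognitionUsageDescription"]
```

Running a check like this at launch in debug builds turns a forgotten key into an obvious log message instead of a crash the first time a permission is requested.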