Speech-to-Text Troubleshooting Guide
Overview
The speech-to-text (dictation) feature uses the Web Speech API, which is built into modern browsers (Chrome, Edge, Safari). This API requires an internet connection because it uses Google’s cloud-based speech recognition servers.
Common Issues
Network Error
Symptom: The microphone icon activates but immediately shows a “network error”
Cause: Something is blocking the connection to Google’s speech recognition servers
Solutions:
- Check Antivirus/Firewall
- Temporarily disable your antivirus software
- Check Windows Defender Firewall settings
- Look for any security software that might block web requests
- Check Browser Settings
- Ensure microphone permissions are granted
- Check that site permissions allow microphone access
- Try resetting browser permissions
- Network Configuration
- Disable VPN if you’re using one
- Try a different network (e.g., mobile hotspot)
- Check if you’re on a corporate network with restrictions
- Contact IT department if on managed network
- Browser Extensions
- Disable ad blockers
- Disable privacy/security extensions temporarily
- Try in incognito/private mode
No Microphone Permission
Symptom: Browser doesn’t ask for microphone permission
Solution:
- Go to browser settings
- Navigate to Site Permissions → Microphone
- Ensure the site has permission to use microphone
Microphone Not Working
Symptom: Permission granted but no audio detected
Solution:
- Check Windows Sound Settings
- Ensure microphone is not muted
- Test microphone in other apps
- Check microphone volume level
- Try selecting a different microphone if multiple are available
Testing Speech Recognition
To test if the issue is with your browser/system configuration:
- Visit Google’s official demo: https://www.google.com/intl/en/chrome/demos/speech.html
- If this doesn’t work, the issue is with your system, not our app
- If it works, there may be a configuration issue with our implementation
Browser Support
- ✅ Google Chrome (HIGHLY RECOMMENDED) - Full support, best performance
- ⚠️ Microsoft Edge - Known network error issues with speech recognition. Use Chrome instead.
- ✅ Safari (macOS) - Generally works well
- ❌ Firefox - Limited/no support for Web Speech API
- ❌ Opera - Limited support
Microsoft Edge Known Issue
Microsoft Edge has a known bug with the Web Speech API that causes immediate “network” errors when attempting to use speech recognition, even though:
- The microphone permission is granted
- The API is supported
- The same code works perfectly in Chrome
Solution: Use Google Chrome for speech-to-text dictation.
Why this happens: This appears to be an Edge-specific bug in how the browser communicates with Google’s speech recognition servers. Even though Edge and Chrome are both Chromium-based, they have different implementations of certain APIs.
Still Having Issues?
If you’ve tried all the above and still can’t get it working:
- Check the browser console (F12) for detailed error messages
- Look for
ERR_BLOCKED_BY_CLIENTor similar network errors - Check the Network tab in DevTools for failed requests to Google servers
- File an issue on GitHub with console logs and system details
Technical Details
The Web Speech API makes requests to:
www.google.com/speech-api/speech.googleapis.com
If these domains are blocked by your network or security software, speech recognition will not work.