
Google has introduced an AI-powered dictation app for iPhone that converts speech to text directly on the device, with no internet connection required.
The application, called Google AI Edge Eloquent, uses speech recognition models based on Google’s Gemma family of open models to convert spoken words into structured, readable text in real time.
The tool also includes optional cloud-based enhancement powered by Gemini, allowing users to refine generated text for clarity, grammar, and tone when an internet connection is available.
Brandspur Technology News Desk reports that the app has quietly appeared on the App Store without formal announcement, signalling Google’s continued expansion into lightweight, on-device artificial intelligence tools.
One of its key features is live transcription, which captures speech as it is spoken while automatically removing disfluencies such as filler words, pauses, and verbal stutters to produce more polished output.
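How the app performs this clean-up has not been described; it is presumably handled by the speech model itself. Purely as an illustration of the idea, the sketch below uses simple rules instead: it strips a short, assumed list of filler words and collapses immediately repeated words. The names `FILLERS` and `clean_transcript` are hypothetical and are not part of any Google API.

```python
import re

# Hypothetical filler list for illustration only; the app's real behaviour
# is not documented and is likely model-driven rather than rule-based.
FILLERS = ("um", "uh", "er", "erm")

# A filler word, optional trailing punctuation, and the whitespace after it.
_FILLER_RE = re.compile(
    r"\b(?:" + "|".join(FILLERS) + r")\b[,.]?\s*", re.IGNORECASE
)

# Immediate word repetitions such as "I I" or "it it" (a crude stand-in for
# stutters; it would also collapse legitimate repeats like "had had").
_REPEAT_RE = re.compile(r"\b(\w+)(?:\s+\1\b)+", re.IGNORECASE)


def clean_transcript(raw: str) -> str:
    """Return raw with filler words and repeated words removed."""
    text = _FILLER_RE.sub("", raw)
    text = _REPEAT_RE.sub(r"\1", text)
    # Normalise any leftover runs of whitespace.
    return re.sub(r"\s+", " ", text).strip()


if __name__ == "__main__":
    print(clean_transcript("um I I think we should uh ship it it today"))
    # prints: I think we should ship it today
```

A rule-based pass like this is cheap enough to run on every partial transcript, but it cannot tell a stutter from an intentional repetition, which is one reason a learned model is the more plausible choice in a shipping product.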
The system is designed to function fully offline, positioning it as a privacy-focused alternative to cloud-dependent dictation services currently dominating the productivity software market.
Industry observers note that the move places Google in direct competition with existing voice-to-text applications such as Wispr Flow, SuperWhisper, and Willow.
The Gemini integration remains strictly optional: it adds AI-driven refinement on top of the core transcription, which continues to work offline without it.
By prioritising on-device processing, the tool reduces reliance on cloud servers, making it more suitable for users seeking faster response times and improved data privacy.
Analysts suggest the launch reflects a broader industry shift toward embedded artificial intelligence, where smartphones increasingly handle complex language processing tasks locally rather than through remote data centres.
The introduction of Google AI Edge Eloquent fits a growing trend in mobile AI, with voice input gradually evolving into a primary interface for productivity, communication, and content creation.





