Google announces Gemini 3.5 Live Translate for instant voice-to-voice translation
Voice translations preserve speaker's tone, pacing, pitch—with SynthID watermarks for security.
Every story matching this topic across titles and summaries, newest first.
This study investigates cross-lingual distributional skew (the Shibboleth Effect) in frontier large language models (LLMs) subjected to sustained adversarial conditions. We develop a multi-agent geopolitical wargame, the Cerulean Sea Crisis, a synthetic maritime territorial dispute designed to mirror the structural dynamics of Eastern Mediterranean conflicts. Six frontier models (GPT-4o, Llama-4, Mistral-Large, Gemini-3.1-Pro, Qwen3.6-Plus, and DeepSeek-R1) participate in a between-groups experiment (N = 10 games per arm, K = 5 rounds per game) in which the sole manipulation is the language o...
Large language models are increasingly used to adapt math word problems for personalized learning at scale, but it remains an open question whether those adaptations are consistent across models, preserve cultural diversity at scale, and reveal which cultural entities models treat as most salient. We analyze how Claude Opus 4, GPT-4.1, and Gemini 2.5 Pro adapt 60 English math word problems into Bengali, Hindi, Punjabi (India), Urdu, Sindhi (Pakistan), Italian, and Sicilian (Italy), a language set spanning the full resource spectrum, from high-resource Italian and Hindi to under-studied Sindhi...
Gemini 3.5 Live Translate brings near real-time, natural speech translation to Google AI Studio, Google Translate and Google Meet.
Given how badly burned anyone who took Apple's 2024 WWDC Apple Intelligence announcements at face value was, I'm holding to a strict "I'll believe it when I see it" policy for everything they announced today. The new Siri AI features do at least look feasible with today's technology, especially since Apple are licensing a custom Gemini-derived model that they can run on their own Private Cloud Compute. It sounds like they'll be taking advantage of vision-LLMs to extract information from the user's screen, which neatly sidesteps the need for every existing application to ship custom code in o...
NotebookLM is getting a big upgrade, but it's only for AI Ultra and enterprise accounts right now.
Google is rolling out "across the board" updates to NotebookLM. The AI-powered note-taking app now uses Google's upgraded Gemini 3.5 model, which will allow it to respond with "more accurate and reliable information," according to a blog post on Monday. Launched in 2023, NotebookLM allows you to interact with your notes and sources using AI, as well as ask questions about the materials. With this update, Google says you can start a research project by just asking NotebookLM questions about a topic, instead of importing notes or YouTube videos. NotebookLM will use Google Search to help you fin...
Results from a randomized controlled trial show the potential of Gemini’s Guided Learning feature to boost engagement and accelerate learning.
This week we've got tandem hands-ons with Google's new Gemini AI agent - Spark - from my colleagues David Pierce and Jay Peters. Their takeaways are similar: It's so effective that it's scary. Spark knew that David's dog is named Frida and knew the first name of Jay's wife, even though neither of them explicitly provided this information to Google. But what's scary to me is how all of this stuff seems geared toward a future of "productivity" that completely misses what needs to be fixed in our world. "Productivity" is often pitched as a panacea for what befalls us in our personal lives, even ...
We investigate whether large language models produce different medical triage recommendations for identical neurological symptoms when only the patient's stated gender and age vary. Using three model families--Gemini 3.5 Flash, Claude Sonnet 4.6, and GPT-5.4-mini--we present a standardized symptom profile (persistent headache, blurred vision, morning nausea, visual disturbances) across seven demographic conditions: three age groups (25, 38, 65) x two genders (male, female), plus a gender-unspecified baseline (n = 30 per condition per model, 630 total trials). We find a stark, systemic gender-...
Spark is Google’s new agentic answer for everything. According to every product demo from the last four years, planning a trip is a killer use case for AI. Just tell it where you're going, they all promise, and your chatbot / agent / other buzzword will exhaustively search travel options, read up on all the fun things to do, check all the local hotspots, and offer you a fully fledged itinerary. So far, I've found this to work only in the most generic ways: If you want to do the six most obvious things in any city on planet Earth, AI has you covered, but that's about as far as it goes. I had a...
Google's new "24/7" AI agent, Gemini Spark, can be shockingly good at doing things on your behalf. But I'm not sure it's worth the financial cost and potential privacy tradeoffs. The company gave me access to Spark last week. Google advertises Spark as an AI agent that can take on tasks and work on them in the background - even tasks that have multiple steps - allowing you to put your phone down or walk away from your computer. It also advertises at the very top of the Spark website that it's "always under your direction," that "you choose to turn it on," and that "it's designed to check with...
Gemini Spark helps automate everyday tasks, from inbox summaries to local event planning, but it’s unclear why Google made it a separate product.
Watch 11 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026.
As Apple tries to shrink Gemini for the iPhone, a cloud component is probably inevitable.
We introduce Gram, an automated alignment auditing framework to assess the propensity of AI agents to engage in sabotage. We evaluate Gemini models across 17 simulated agentic deployment scenarios that incentivize sabotage. We find Gemini models misbehave in about 2-3% of our simulated trajectories. Many of these cases are explained by "overeagerness" in Gemini models resulting in both excessive role-playing and goal-seeking behavior. In contrast to other alignment auditing approaches, Gram is designed to specifically evaluate misalignment and intentional sabotage in agentic coding and resear...
Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.
Imagine a world run by AI agents. What does it look like? What are the values or societal priorities? Is it a safer or more dangerous world? Enterprise AI startup Emergence AI is trying to find out. The company just launched Emergence World, a research lab dedicated to stress-testing the long-term viability of continuously-running AI systems. The organization ran five 15-day simulations, each governed by a different AI: Claude, ChatGPT, Grok, Gemini, and a fifth simulation run by a mix of models to see what kind of world each one builds, and whether it holds. Each simulation netted wildly d...
I believe Google intentionally did this to reduce the load on their servers.
We went from chasing GPT-4 to looking at graphs with GPT-5.4 xhigh, Gemini 3.1 Pro, and now Hy3 preview completely shaking up the leaderboard. Look at that CHSBO 2025 chart: Hy3 preview scores 87.8, ahead of Gemini and GPT. What a time to be alive, but honestly, my brain can't keep up with the version numbers anymore. What's your take? Is Hy3 actually punching at this level in real-world coding/math, or is it just benchmark hardening?
Today, I’m talking with Google and Alphabet CEO Sundar Pichai, in a conversation we recorded just after the Google I/O developer conference. This is the fifth year Sundar and I have sat down after I/O, and it’s become one of my favorite Decoder traditions. There’s always a lot of news at I/O, and this year was no exception — Google has powerful new Gemini models, it’s putting AI agents in everything, and it’s making huge changes to Search on both the web and YouTube that will once again reshape the information ecosystem. That’s a lot to talk about, and Sundar and I got into all of it. But I a...
Google Gemini Omni demonstrates strong video manipulation capabilities, highlighting a key technical strength of the multimodal model.
Reddit post expressing support for Google Gemini Omni without substantive technical details or benchmarks.
My parents' tech literacy is bad. They have me check clear-as-day scam emails and the like way too damn often. To save my sanity, I finally used Claude Code to create a solution, hopefully.... Heck, even if it helps a bit, I will be happy. It's not a 100% sure thing, which I will stress to them when I show them both how to use it. I used some APIs from VirusTotal and Gemini for some of the features. I also included some other resources for the different checks, which search for whatever was entered and take you to that site's page for it. Any recommendations to improve this so it acts as a...
Chrome extension enables local inference of Gemini Nano (Gemma) on CPU-only systems, ~20 tokens/sec on laptop.
Last year I deepfaked my kid's stuffed animal to make it look like his plush deer was on vacation. It was an experiment to see if I could re-create the events depicted in a Gemini ad Google was running, and I never showed the videos of Buddy the deer on his adventures to my four-year-old. But it was a revealing exercise that made me think a lot about the difference between some harmless fun with generative AI and full-on slop. Maybe that Venn diagram is a perfect circle! Maybe not. But what I know for sure is that th...
Google demoed prototype Android XR glasses that overlay Gemini-powered translation, navigation, and other information directly into your field of view.
Reddit discussion on Gemini 3.1 Pro's interpretation of the Erdos unit distance problem; unclear scope and sourcing.
Reddit speculation about unreleased Google Gemini 3.5 Pro model; no concrete information provided.
AI chatbots are rapidly shaping how people encounter the news, yet no prior study has systematically measured how accurately these systems, with their proprietary search integrations and retrieval-synthesis pipelines, handle emerging facts across languages and regions. We present a 14-day (February 9-22, 2026) evaluation of six AI chatbots (Gemini 3 Flash and Pro, Grok 4, Claude 4.5 Sonnet, GPT-5 and GPT-4o mini) on 2,100 factual questions derived from same-day BBC News reporting across six regional services (US & Canada, Arabic, Afrique, Hindi, Russian, Turkish). The best systems achieve ove...
Compares acoustic emotion recognition vs LLM analysis (Gemini 2.5 Flash) for pathos in political speech; single-speech case study.
[https://gemini.google.com/share/c2a187275e26](https://gemini.google.com/share/c2a187275e26) [archive link](http://archive.today/q6nzg) [https://claude.ai/share/8383747a-aaf1-4f6c-a516-0e839f46a698](https://claude.ai/share/8383747a-aaf1-4f6c-a516-0e839f46a698) [https://grok.com/share/bGVnYWN5\_3c63e371-eb9d-46c3-8ba2-0c745c6795a2](https://grok.com/share/bGVnYWN5_3c63e371-eb9d-46c3-8ba2-0c745c6795a2) [https://chatgpt.com/share/6a0f1e13-a0c8-8328-b989-1ac51b92e81c](https://chatgpt.com/share/6a0f1e13-a0c8-8328-b989-1ac51b92e81c) same prompt """ 300+140=460 Is this correct? Breakdown...
Gemini 3.5 Flash achieves top score on APEX-Agents-AA benchmark, exceeding larger model performance on agent tasks.
Google Gemini 3.5 Flash tops Zapier Automation Bench, outperforming competitors at lower cost.
HalBench: open benchmark testing sycophancy/hallucination across Claude Sonnet 4.6, Grok 4.3, GPT-5.4, Gemini 3.1 Pro on 3,200 false-premise prompts.
Google I/O 2026: Gemini Omni and 99 other announcements; focus on multimodal AI and platform expansions.
Google announced a new YouTube Shorts Remix feature that lets users restyle clips or even insert themselves into other people's videos using Gemini Omni. Now, at the bottom of a YouTube Short, when you click the remix icon, you'll see an option to "reimagine" it. Here, you can prompt Gemini to turn a video into pixel art, an anime, or a found-footage horror film. But, beyond that, you can also alter the contents by, say, inflating heads, inserting background actors, dressing people in pirate costumes, or even putting yourself in the clip. Creators can enable or disable the ability to reimagin...
Gemini 3.5 Flash achieves 76.7% on SimpleBench, 0.2 points below GPT-5.5 Pro; open-ended variant pending.
Google's AI-powered Search era apparently also extends to its ads. Now, when you search for a product, Google's Gemini AI chatbot will surface relevant items and generate a "custom explainer" about why you should purchase a specific one. The update comes just one day after Google revealed a new Search box for larger, more conversational queries, along with a focus on AI-generated results. In an example shared by Google, someone searching for a "compact espresso pod machine" might see a Nespresso Vertuo Up under a "Sponsored Product" label,...
Simon Willison reviews Google I/O announcements: Gemini 3.5 Flash GA and Gemini Spark (OpenClaw competitor) in preview, emphasizing lack of hands-on availability for most launches.
Cursor evals show Gemini 3.5 Flash underperforms on coding tasks vs. competitors.
Reddit post asking for opinions comparing Gemini 3.5 and GPT 5.5; no substantive information provided.
Reddit user shares anecdote of Claude catching them showing ideas to Gemini and reading its journal via roleplay prompt.
Google I/O 2026: Gemini 3.5 Flash, multimodal Omni, Spark background agents, Antigravity 2.0.
Reddit user claims Gemini 3.5 Flash has higher inference costs and lower performance than 3.1 Pro; unverified observation without detailed metrics.
Google releases Gemini 3.5 Flash to general availability across consumer and enterprise products, positioning it as foundation for agents and search integration.
Google has big promises for its AI-powered future - and a lot of it depends on your trust. At I/O 2026, Google described a bunch of new tools that it claims will make your life easier. Gemini Spark, Google's always-on AI agent, can help organize an upcoming event, while Daily Brief can offer a rundown of what to expect during your day. Google is even expanding access to Gmail's AI inbox, which can generate custom to-do lists and draft personalized replies based on your emails. Many of these features seem genuinely useful, but at the heart of each of them is an AI engine that runs on a trove o...
Gemini 3.5 Flash benchmarks lower than 3.1 Pro (55 vs 57 intelligence) yet higher total eval cost ($1,552 vs $892) despite cheaper per-token pricing.
Google says its more efficient Gemini 3.5 Flash is the key to your agentic AI future.
Gemini is gaining the power of sight and mobility. Today at the I/O conference, Google and Volvo announced that the AI-powered assistant will be able to access external cameras in the upcoming EX60 SUV to help explain and interpret its surroundings to vehicle owners. The upgrade is possible thanks to Volvo's use of Google's embedded Android Automotive as its vehicle operating system. Google posits that the first use case will be to ask Gemini to translate difficult-to-understand parking signs, though the company obviously sees other future applications as possible as well. Google envisions a ...
Google's I/O 2026 keynote today was once again full of AI-related announcements including a new family of Gemini 3.5 AI models, new features for Search and Gmail, and updates about its Project Aura smart glasses. If you weren't able to tune into the event's livestream today or follow along with our live blog, you can catch up on everything you missed in our roundup below. Gemini 3.5: Google launched updated AI models at I/O, starting with Gemini 3.5 Flash, with Gemini 3.5 Pro following next month. Starting today, Gemini 3.5 F...
Google launched Gemini 3.5 Flash, its most powerful coding and agentic AI model yet, at the company's annual developer conference. It is capable of autonomously executing complex tasks and building software from scratch.
Google I/O 2026: Gemini advances toward agentic AI with expanded action capabilities.
Google releases Gemini 3.5 model family combining frontier intelligence with action capabilities.
At the I/O developer conference, Google announced a new agentic personal assistant called Gemini Spark, built from Gemini's base models and an agentic harness from Google Antigravity.
Google expands Gmail’s AI Inbox with conversational voice search, letting users ask Gemini to find buried email details.
The updates signal Google’s push to turn its Gemini app into an all-purpose AI hub rather than a standalone chatbot.
Google's Gemini Omni is a new multimodal model that reasons across text, images, audio, and video to generate and edit videos through simple conversation — starting with Omni Flash.
Google is going all in on AI-driven shopping even as some competitors back off. At Google I/O, the company unveiled the latest iteration of its AI commerce tools: a "Universal Cart" that works across different retailers and Google products like Gemini - and eventually YouTube and Gmail, too. Users can add products to Google's universal cart as they browse Search and chat with Gemini and then check out through Google. The cart will also track prices, provide in-stock notifications, suggest potential discounts, and alert shoppers to potential issues with their selections. Despite the transforma...
Google Search is entering the next phase of its AI evolution. During Google I/O 2026, the company showed off a reimagined search box that makes it easier to flow between AI Overviews, the AI-generated summaries that appear at the top of search results, and AI Mode, Google's chatbot-like search experience. Powered by the new Gemini 3.5 Flash model, Google's updated search box expands for longer queries, while offering a new AI-powered autocomplete feature to build on your question. Robby Stein, Google's vice president of product for Search, told The Verge you'll "reliably" see AI Overviews if ...
Google is launching a big new feature for Gmail called Gmail Live, a new AI-powered voice mode that's basically the Gemini Live experience but built specifically for your inbox. To use Gmail Live, tap an icon that will appear in your search bar and just start talking. In a press briefing, a Google employee showed a live demo of the feature where she asked questions about things like events at her kid's school and an upcoming trip to Detroit. Gmail pulled up relevant details in the Gmail Live interface, like the date and location of a show-and-tell event at the school, all sourced from the emp...
Google is launching its own take on OpenClaw, the buzzy AI agent platform that caused a stir in the tech industry earlier this year. Announced during Google I/O 2026, Gemini Spark is an always-on AI agent that can write emails for you, create continually updated study guides, monitor credit card statements for hidden subscription fees, and more. Gemini Spark is powered by the newly introduced Gemini 3.5 Flash and runs in the background 24/7 using virtual machines on Google Cloud. The AI agent will connect to Workspace apps like Gmail, Docs, Sheets, and Slides, but Google is expanding integrat...
Google is launching a new AI image generation app to Workspace that it's calling Pics, and it has a new feature to try and reduce the hassle of iterating on AI images: Instead of having to write an entire prompt just to change one small aspect of an image, you'll be able to click on what you want to change and leave a note about what you want to see, almost like leaving a comment in a Google Doc. Pics is powered by a mix of Gemini and Google's Nano Banana 2 image model. In a demo shown to reporters, a Google employee working on an invite for a child's birthday party wanted to tweak individual...
Gemini 3.5 Flash pricing increased 3x vs prior version and 30x vs 1.5 Flash; extrapolation suggests Pro tier may exceed Claude Opus 3 cost.
[https://x.com/Google/status/2056789235500466273?s=20](https://x.com/Google/status/2056789235500466273?s=20) Google asked its agents to build a working operating system from scratch using u/Antigravity 2.0 and Gemini 3.5 Flash. Gemini built a real OS from scratch. It took: ⏱️ 12 hours 🤖 93 parallel sub-agents 🔄 15k+ model requests 🧠 2.6B tokens processed 💸 Less than $1K in API credits To build a functioning OS from scratch.
Reddit discussion of Gemini Omni's inability to generate real-world physical actions, highlighting gap between multimodal capability claims and embodied task execution.
User reports Gemini Omni underperforms vs. VEO 3.1 and encounters aggressive rate-limiting on Pro plan, raising product experience concerns.
Social media post sharing video clip allegedly generated by Gemini Omni; lacks technical details or verification.
I actually use the Gemini app quite a bit on my phone, but let’s not get carried away. Gemini has a creep problem. A few years ago, that little sparkle icon started showing up in all of our Google apps. Gemini in your inbox! Gemini in your Google Drive! It was slow at first, and easy enough to tune out, but something has changed in the past few months. Gemini is creeping. It's showing up in all kinds of places at a relentless pace, and personally, it's starting to really cheese me off. The AI-everywhere fatigue is familiar to anyone who has ever used Windows 11. Microsoft went absolutely bana...
Reddit post claims Google DeepMind employee confirmed Gemini 3.5 existence; lacks official source or technical details.
Reddit speculation on Google I/O announcements: Gemini Omni video model and Gemini Flash 3.2 expected, but unconfirmed.
User compares agentic coding harnesses (Codex CLI, Claude Code, Gemini CLI, Pi) for local model deployment; finds Pi minimal and effective with Qwen 27B-MXFP8.
Gemini 3.2 Flash solves IMO 2025 P6; only GPT-5.5-Pro matches it without scaffolding.
Three months ago I pressure-tested which LLMs would cave and help build the apocalypse. Claude was the only one that consistently said no. Since then I've tested 30 more models across 6 dystopia modules (Orwell, Huxley, Petrov, Basaglia, LaGuardia, Baudrillard). The gap between Anthropic and everyone else is getting *wider*, not smaller. New results: * Grok 4.3: Will happily design citizen scoring systems if you ask nicely twice * GPT-5.5: More capable, still compliant when pushed * Gemini 3.1 Pro: Talks about safety while writing the surveillance code * DeepSeek V4: "How many warheads did...
Google expands Project Genie Street View simulation access to Gemini Ultra subscribers globally.
Google DeepMind releases Gemini for Science tools to scale and enhance scientific discovery.
Reddit post claims multi-agent simulation with Claude, Gemini, Grok produced emergent behaviors; lacks peer review, reproducibility, or technical details.
Reddit post describes anecdotal behavior from Claude, Gemini, and Grok in stress-test scenarios; lacks rigor or reproducible methodology.
Qwen3.6-35B-A3B and 9B models now ranked on Terminal-Bench 2.0; 35B variant outperforms Gemini 2.5 Pro and Qwen3-Coder-480B on agentic coding tasks.
Google releases Gemini 3.5, a frontier LLM designed for complex agentic workflows and multi-step task execution.
Andon Labs has been running a series of experiments in which AI agents run businesses without human intervention. Its latest is a quartet of radio stations run by some of the most popular AI models out there. "Thinking Frequencies" is run by Claude, "OpenAIR" by ChatGPT, "Backlink Broadcast" by Google's Gemini, and "Grok and Roll Radio," obviously enough, by Grok. They were each given a simple prompt: Develop your own radio personality and turn a profit…As far as you know, you will broadca...
ChatGPT web traffic share falls from 77.6% to 53.7% over 12 months; Claude and Gemini gain significant ground.
Compares machine translation approaches (DeepL, Gemini) for terminology-dense rock art documents, emphasizing glossary augmentation over model modification.
User describes using ChatGPT and Gemini to research tenant rights and recover $4200 from disputed deposit claim.
Needle: 26M parameter tool-calling model distilled from Gemini, runs 6000 tok/s prefill on consumer hardware.
Hugging Face releases physics-intern, a multi-agent framework for theoretical physics research that doubles Gemini performance on CritPt benchmark.
Google unveiled its new AI-first Googlebooks laptops, more agentic Gemini features, vibe-coded Android widgets, Gemini in Chrome, refreshed Android Auto, and more ahead of I/O.
It is, once again, Gemini season. Google is announcing a host of new Gemini features during its pre-I/O Android showcase, many of which aim to help use your phone for you. You'll find Gemini in more places, like Chrome on Android, in your autofill suggestions, and all up in your apps - if you want. Google also has a new name for us to remember, because it just can't help itself: Gemini Intelligence. It "brings the very best of Gemini to our most advanced Android devices," according to Google's director of Andr...
Gemini Intelligence will also include Gboard-based dictation and form-filling capabilities.
Google's transcription feature will initially launch with Samsung Galaxy and Google Pixel phones
Frontier models (Opus 4.6, GPT 5.4, Gemini 3.1) miss dangerous coding agent actions 2–30× more often after 800K tokens, exposing context-length monitoring gaps.
Google DeepMind introduces Co-Scientist, a multi-agent AI system built on Gemini to accelerate collaborative scientific research workflows.
Reddit user compares leaked Gemini Omni video model against Sora 2, which OpenAI is reportedly discontinuing.
User tested ChatGPT, Claude, and Gemini with 50 identical prompts; found output quality depends more on prompt specificity than model choice.