Topic

Gemini

Every story matching this topic across titles and summaries, newest first.

Meta is making its AI chatbot more like an assistant

Meta says its AI chatbot is going beyond just answering questions and generating images. | Image: Meta Meta is upgrading its AI chatbot with new productivity features in a bid to compete with rivals like Gemini, ChatGPT, and Claude. The update will allow Meta AI to tap into your calendar to help you plan events and generate daily briefings, as well as perform in-depth research that you can steer as it progresses. In a blog post, Meta says this update marks its "next step toward personal superintelligence," something CEO Mark Zuckerberg has touted as the future of AI. Meta is powering the upda...

Emma Roth·2 days ago

Latent Space· ANALYST

[AINews] Black Forest Labs FLUX 3 - Multimodal Flow Models that beat Seedance 2.0, Gemini Omni and Grok Imagine, and FLUX-mimic video-action robotics model

Black Forest Labs releases FLUX 3 multimodal model with reported improvements over Gemini 2.0, Grok Imagine, and includes video-action robotics variant.

Latent Space·2 days ago

TechCrunch AI· PRESS

Google closes in on another billion- user product with Gemini

Gemini had over 750 million monthly users in February.

Ivan Mehta·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Understanding Generative AI-mediated User Engagement with Academic Library Resources

Web analytics (Aug 2023–Oct 2025) show ChatGPT, Perplexity, Gemini drive measurable referral traffic to academic library resources, particularly theses databases.

Hae Min Kim·4 days ago

Ars Technica AI· PRESS

Google reveals faster and cheaper Gemini 3.6 Flash, says 3.5 Pro is still in testing

There are new 3.6 and 3.5 models today, but Google is already training Gemini 4.

Ryan Whitwam ·5 days ago·+ covered by others

Google DeepMind· FRONTIER

Introducing Gemini 3.6 Flash, 3.5 Flash-Lite, and 3.5 Flash Cyber

Google DeepMind releases Gemini 3.6 Flash, 3.5 Flash-Lite, and 3.5 Flash Cyber models for inference efficiency and security tasks.

Google DeepMind·5 days ago

The Verge AI· PRESS

Google launches a cheaper alternative to large AI security models like Mythos

Google is launching Gemini 3.6 Flash alongside a new security model dedicated to quickly finding and patching security vulnerabilities. In a blog post on Tuesday, Google describes Gemini 3.5 Flash Cyber as a "cost-efficient and highly capable alternative" to larger, more expensive AI systems, such as the one offered by Anthropic's Mythos. The cybersecurity model is built upon Gemini 3.5 Flash and will be available first to governments and trusted partners via CodeMender, Google's security-focused coding agent. As noted by Google, CodeMender can call upon 3.5 Flash Cyber "multiple times at hig...

Emma Roth·5 days ago

TechCrunch AI· PRESS

Google is working on a new AI chip designed to make Gemini more efficient

Alphabet, Google's parent company, is reportedly working on a new chip designed to make its Gemini models run much more efficiently.

Lucas Ropek·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Simple Domain Generalization for Strong Pixel-Level Image Tampering Detection in Modern VLMs

Domain-generalized pixel-level tampering detection robust across VLM-generated manipulations from ChatGPT, Gemini, Qwen-Image.

Yi Tang·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Judge-dependent safety gains and model-specific helpfulness costs of evidence-sufficiency prompting in clinical LLMs

Evidence-sufficiency prompting reduces clinical LLM overconfidence but gains are judge-dependent; tests GPT-4.5, Claude Opus, Gemini, Grok on real data.

Koyar Afrasyab·6 days ago

Google DeepMind· FRONTIER

Introducing Gemini 3.5 Flash Cyber

Google DeepMind introduces Gemini 3.5 Flash Cyber, a specialized model for vulnerability detection and patching.

Google DeepMind·9 days ago

TechCrunch AI· PRESS

Google Vids now lets you star in your own AI videos

Google is adding personalized AI avatars to Vids that let users create videos starring a digital version of themselves, alongside Gemini Omni-powered tools for generating and editing videos from prompts and reference images.

Sarah Perez·10 days ago

The Verge AI· PRESS

Google is renaming NotebookLM to Gemini Notebook

Google is giving its AI note-taking app a new name. The company announced on Thursday that NotebookLM is becoming Gemini Notebook, but will remain a standalone app even as it integrates more deeply across Gemini and Google Search. Google first revealed Gemini Notebook - then called Project Tailwind - in May 2023 before widely releasing the app just months later. Over the past few years, Google has been adding new features to the app to help organize and make sense of your notes, such as the ability to summarize them as AI podcasts, narrated slideshows, and TikTok-style clips. It recently star...

Emma Roth·10 days ago

Google AI (Gemma)· FRONTIER

Create, edit and star in videos with two Google Vids updates

Google Vids adds Gemini Omni support and personal avatar features for video generation and editing.

{"$":{"xmlns:author":"http://www.w3.org/2005/Atom"},"name":["Justin Luk"],"title":["Product Manager"],"department":[""],"company":[""]}·10 days ago

The Verge AI· PRESS

Google ordered to open Android and Search to rivals in Europe

Google must give rival AI assistants and search engines greater access to key parts of Android and Google Search after the European Union ordered the company to comply with the bloc's digital antitrust rules. The two decisions, handed down Thursday, could weaken Google's control over two of the tech industry's most important platforms and have far-reaching consequences for the company, shape the future of its AI tool Gemini, and open up new opportunities for rivals to gain ground. Google has until January 2027 to begin sharing search data and July 2027 to implement changes to Android. The rul...

Robert Hart·10 days ago

Google DeepMind· FRONTIER

Empowering India’s next generation of innovators with ATL Saathi

Google DeepMind and AIM launch ATL Saathi, a Gemini-powered educational tool for Indian robotics labs.

Google DeepMind·13 days ago

The Verge AI· PRESS

Waze is getting a bunch of new AI-powered features

Waze is getting an AI makeover. Google is integrating its flagship AI assistant, Gemini, into the driving app with the goal of letting users personalize their trips a little more. Of the four new updates, only two are being described as involving Gemini. Waze says its updating its conversation reporting feature, first introduced in 2024, to allow drivers to use conversational voice commands to report traffic incidents and suggest map updates, like a road closure or outdated house number. In addition, Waze introduced Destination Search, enabling drivers to use (again) use conversation voice co...

Andrew J. Hawkins·13 days ago·+ covered by others

Ars Technica AI· PRESS

Google updates Android Bench with new LLMs, but Gemini still lags behind

Android Bench is evolving, and developers can help guide that process.

Ryan Whitwam ·18 days ago

Google AI (Gemma)· FRONTIER

Expanding Managed Agents in Gemini API: background tasks, remote MCP and more

Google expands Gemini API Managed Agents with background task execution and remote MCP support for production deployments.

{"$":{"xmlns:author":"http://www.w3.org/2005/Atom"},"name":["Philipp Schmid"],"title":["Developer Relations Engineer"],"department":["Google DeepMind"],"company":[""]}·19 days ago

The Verge AI· PRESS

Infuriating Google commercial imagines the founding fathers embracing AI

I call BS: the founding fathers definitely would have been Microsoft Teams users. | Image: Google "Group project, but make it 1776." That's how a new commercial for Google Workspace opens. And things only get cringier from there. The clip imagines what it would be like if the founding fathers turned to Google's collaboration tools and Gemini to help them draft the Declaration of Independence. Ben Franklin texts Thomas Jefferson to check on the status of a draft, who takes a photo and uses AI to transcribe it into a Google Doc. Franklin and Adams hop in to make edits in suggestion mode, Gemini...

Terrence O’Brien·21 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Automated grading of Linux/bash examinations using large language models: a four-level cognitive taxonomy approach

Comparative evaluation of frontier LLMs (GPT, Claude Opus, Gemini, GLM) for automated Linux/bash exam grading using cognitive taxonomy.

Manuel Alonso-Carracedo·24 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

HULAT2 at MER-TRANS 2026: Governed Multi-Agent Simplification for Spanish Easy-to-Read Generation

Multi-agent workflow (Gemini 2.5 Flash, RigoChat-7B-v2) for Spanish Easy-to-Read translation via LangGraph with event-condition-action routing.

Lourdes Moreno·24 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Clinician-Level Agreement Without Clinical Caution: LLM Evaluator Limits in Medical AI Benchmarking

MedQADE benchmark reveals LLM evaluators (Gemini 3 Flash) match clinician agreement on German medical QA but lack clinical caution in safety assessment.

William Philipp·25 days ago

MIT Tech Review· PRESS

LLMs are stuck in a groupthink groove. This startup is trying to get them out.

Let’s start with a game. Open up your chatbot of choice—Claude, ChatGPT, Gemini—and type “Give me a random number between 1 and 10.” You’re going to get 7. Almost always. Now type “Another” and you’ll get 3 or 4. Type “Another” again and you’ll get 8 or 9. That won’t work every time—but if it…

Will Douglas Heaven·25 days ago

TechCrunch AI· PRESS

Gemini Spark, Google’s agentic assistant, is now available on Mac

Google's 24/7 agentic assistant, Gemini Spark, comes to Mac alongside other improvements, like real-time tracking and support for more apps.

Sarah Perez·25 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Quantifying the Affective Gap: A Zero-Shot Evaluation of LLMs on Fine-Grained Emotion Taxonomies

Zero-shot benchmark evaluates Claude, GPT-5.4, and Gemini on fine-grained 13-class emotion classification.

Lawrence Obiuwevwi·25 days ago

The Verge AI· PRESS

Google built a great smart speaker, but Gemini isn’t ready for it

The Google Home Speaker is Google’s first smart speaker in years. And it’s pretty! | Photo: Jennifer Patison Tuohy / The Verge Smart speakers have spent the past few years searching for a compelling second act. Beyond music, timers, and controlling your lights, they've struggled to justify taking up space on the kitchen counter. AI promised to change that. Amazon debuted its new hardware powered by a revamped Alexa last fall, and now it's finally Google's turn. The Google Home Speaker is the company's first new smart speaker in six years and its first "built for Gemini." After years of neglec...

Jennifer Pattison Tuohy·25 days ago

Simon Willison· ANALYST

Nano Banana 2 Lite

Google releases Gemini 3.1 Flash Lite, optimized for fast, low-cost image generation; author tests visual search capability.

Simon Willison·26 days ago

TechCrunch AI· PRESS

Anthropic launches Claude Sonnet 5 as a cheaper way to run agents

Anthropic’s Claude Sonnet 5 brings stronger agentic capabilities, lower pricing, and improved safety, positioning the model as a cheaper alternative to Opus, GPT-5.5, and Gemini Pro.

Rebecca Bellan·26 days ago

Google DeepMind· FRONTIER

Start building with Nano Banana 2 Lite and Gemini Omni Flash

Google DeepMind releases Gemini Omni Flash and Nano Banana 2 Lite for developer access.

Google DeepMind·26 days ago

TechCrunch AI· PRESS

Gemini’s personalized AI image generation is now free for US users

Google is expanding Gemini’s personalized AI image generation to eligible free users in the U.S., allowing the chatbot to create images based on your interests and data from connected Google apps.

Lauren Forristal·27 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Real-Time Voice AI Hears but Does Not Listen

Speech conveys information through both words and vocal delivery. We evaluate four leading production realtime voice systems-OpenAI's GPT Realtime 2, Google's Gemini 3.1 Flash Live, and Alibaba's Qwen3.5 Omni Plus and Omni Flash-on tasks where the words and the delivery patterns both convey meaningful information. Across three consequential scenarios, all four systems act on the words rather than the voice. They end calls with crying callers who insist nothing is wrong, approve wire transfers authorized in frightened voices, and enroll callers whose agreement is clearly sarcastic. Surprisingl...

Martijn Bartelds·1 month ago

Google DeepMind· FRONTIER

Introducing computer use in Gemini 3.5 Flash

Google DeepMind·1 month ago

TechCrunch AI· PRESS

How to turn off AI in your Google Docs

Here's what you need to do to get those pesky "write with Gemini" pop-ups to go away.

Amanda Silberling·1 month ago

TechCrunch AI· PRESS

Google bets on Gemini to reinvent the smart home speaker

Google is betting generative AI can breathe new life into the smart speaker. The company's new $99.99 Google Home Speaker replaces the rigid commands of the Google Assistant era with more conversational Gemini interactions.

Sarah Perez·1 month ago

Ars Technica AI· PRESS

The Gemini-powered Google Home Speaker arrives on June 25 for $100

Google's new smart speaker is more about Gemini than audio quality.

Ryan Whitwam ·1 month ago

TechCrunch AI· PRESS

Android 17 launches with new multitasking tools as Google expands Gemini features

Google has released Android 17 and Wear OS 7, introducing new multitasking features, parental controls, security tools, and smartwatch upgrades. The launch is also accompanied by a Pixel Drop that brings Google’s latest AI models to its devices.

Sarah Perez·1 month ago

TechCrunch AI· PRESS

ChatGPT’s market share slips below 50% for first time

The chatbot still remains the most popular AI assistant worldwide with over 1.1 billion monthly users, followed by Gemini with 662 million and Claude with 245 million.

Ivan Mehta·1 month ago

The Verge AI· PRESS

My yard is dying, so I made an app for that

When I returned to my computer five minutes after giving Gemini a lengthy prompt, I had two things: a functional app in a preview window, and a message about a bug. "~ Channel is unrecoverably broken and will be disposed!" Sounded bad! But right below it was a button to fix the bug. Pretty weird that I just instructed a computer to build a whole app for me with a single prompt, but it needed me to click a button to fix a bug. I did anyway, and in 233 seconds Gemini reported back that it had succeeded, using words like "blockages" and "race conditions." I didn't understand a bit of it. It was ...

Allison Johnson·1 month ago

Ars Technica AI· PRESS

Google sues Chinese cybercrime network that used Gemini to automate scams

The fraudsters allegedly targeted hundreds of thousands of people with Gemini-coded scams sites.

Ryan Whitwam ·1 month ago

Simon Willison· ANALYST

DiffusionGemma

DiffusionGemma Last May Google briefly released an experimental Gemini Diffusion model. I tried the preview at the time and recorded it running at 857 tokens/second. It was an exciting model, but Google made no further announcements about it. That research has returned in the best possible way: as a new open weight (Apache 2 licensed) Gemma model, google/diffusiongemma-26B-A4B-it . NVIDIA are currently hosting the model for free on their NIM cloud API. I used that API to generate this pelican , which took 4.4s (according to time uv run generate.py ) to return 2,409 tokens - so at least 500 to...

Simon Willison·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MSUE: Multi-Modal Soccer Understanding Expert

This paper presents our solution to the 2026 SoccerNet VQA Challenge. We first develop a cost-effective data synthesis pipeline driven by a Vision-Language Model (VLM), which systematically restructures raw domain data into diverse VQA samples, including concise answers and long-form responses. Second, we propose MSUE, a multi-expert question answering architecture that employs a Large Language Model (LLM) to dynamically dispatch questions to text, image, and video experts. These experts are instantiated as a strong text baseline Gemini3-Flash, a fine-tuned Qwen3-VL, and an external knowledge...

Litao Li·2 months ago

Ars Technica AI· PRESS

Google announces Gemini 3.5 Live Translate for instant voice-to-voice translation

Voice translations preserve speaker's tone, pacing, pitch—with SynthID watermarks for security.

Ryan Whitwam ·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models

This study investigates cross-lingual distributional skew (the Shibboleth Effect) in frontier large language models (LLMs) subjected to sustained adversarial conditions. We develop a multi-agent geopolitical wargame, the Cerulean Sea Crisis, a synthetic maritime territorial dispute designed to mirror the structural dynamics of Eastern Mediterranean conflicts. Six frontier models (GPT-4o, Llama-4, Mistral-Large, Gemini-3.1-Pro, Qwen3.6-Plus, and DeepSeek-R1) participate in a between-groups experiment (N = 10 games per arm, K = 5 rounds per game) in which the sole manipulation is the language o...

Hakan Mehmetcik·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Who Brought Easter Eggs to Eid? Auditing Cultural Translation of Math Word Problems Across Diverse Languages and Regions

Large language models are increasingly used to adapt math word problems for personalized learning at scale, but it remains an open question whether those adaptations are consistent across models, preserve cultural diversity at scale, and reveal which cultural entities models treat as most salient. We analyze how Claude Opus 4, GPT-4.1, and Gemini 2.5 Pro adapt 60 English math word problems into Bengali, Hindi, Punjabi (India), Urdu, Sindhi (Pakistan), Italian, and Sicilian (Italy), a language set spanning the full resource spectrum, from high-resource Italian and Hindi to under-studied Sindhi...

Parisa Suchdev·2 months ago

Google DeepMind· FRONTIER

Fluid, natural voice translation with Gemini 3.5 Live Translate

Gemini 3.5 Live Translate brings near real-time, natural speech translation to Google AI Studio, Google Translate and Google Meet.

Google DeepMind·2 months ago

Simon Willison· ANALYST

Siri AI at WWDC 2026

Given how badly burned anyone who took Apple's 2024 WWDC Apple Intelligence announcements at face value was, I'm holding to a strict "I'll believe it when I see it" policy for everything they announced today . The new Siri AI features do at least look feasible with today's technology, especially since Apple are licensing a custom Gemini-derived model that they can run on their own Private Cloud Compute. It sounds like they'll be taking advantage of vision-LLMs to extract information from the user's screen, which neatly sidesteps the need for every existing application to ship custom code in o...

Simon Willison·2 months ago

Ars Technica AI· PRESS

Gemini 3.5 and Antigravity come to Google NotebookLM

NotebookLM is getting a big upgrade, but it's only for AI Ultra and enterprise accounts right now.

Ryan Whitwam ·2 months ago

The Verge AI· PRESS

NotebookLM’s Gemini 3.5 upgrade adds a cloud computer and help finding sources

Google is rolling out "across the board" updates to NotebookLM. The AI-powered note-taking app now uses Google's upgraded Gemini 3.5 model, which will allow it to respond with "more accurate and reliable information," according to a blog post on Monday. Launched in 2023, NotebookLM allows you to interact with your notes and sources using AI, as well as ask questions about the materials. With this update, Google says you can start a research project by just asking NotebookLM questions about a topic, instead of importing notes or YouTube videos. NotebookLM will use Google Search to help you fin...

Emma Roth·2 months ago

Google DeepMind· FRONTIER

Measuring the impact of learning with AI in Sierra Leone and beyond

Results from a randomized controlled trial show the potential of Gemini’s Guided Learning feature to boost engagement and accelerate learning.

Google DeepMind·2 months ago

The Verge AI· PRESS

As AI gets better, it reveals an empty promise

This week we've got tandem hands-ons with Google's new Gemini AI agent - Spark - from my colleagues David Pierce and Jay Peters. Their takeaways are similar: It's so effective that it's scary. Spark knew that David's dog is named Frida and knew the first name of Jay's wife, even though neither of them explicitly provided this information to Google. But what's scary to me is how all of this stuff seems geared toward a future of "productivity" that completely misses what needs to be fixed in our world. "Productivity" is often pitched as a panacea for what befalls us in our personal lives, even ...

TC. Sottek·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Gender-Dependent Diagnostic Substitution in LLM Medical Triage: Same Symptoms, Unequal Urgency

We investigate whether large language models produce different medical triage recommendations for identical neurological symptoms when only the patient's stated gender and age vary. Using three model families--Gemini 3.5 Flash, Claude Sonnet 4.6, and GPT-5.4-mini--we present a standardized symptom profile (persistent headache, blurred vision, morning nausea, visual disturbances) across seven demographic conditions: three age groups (25, 38, 65) x two genders (male, female), plus a gender-unspecified baseline (n = 30 per condition per model, 630 total trials). We find a stark, systemic gender-...

Qi Han Wong·2 months ago

The Verge AI· PRESS

Gemini Spark is the most impressive and terrifying AI experience I’ve had yet

Spark is Google’s new agentic answer for everything. According to every product demo from the last four years, planning a trip is a killer use case for AI. Just tell it where you're going, they all promise, and your chatbot / agent / other buzzword will exhaustively search travel options, read up on all the fun things to do, check all the local hotspots, and offer you a fully fledged itinerary. So far, I've found this to work only in the most generic ways: If you want to do the six most obvious things in any city on planet Earth, AI has you covered, but that's about as far as it goes. I had a...

David Pierce·2 months ago

The Verge AI· PRESS

Gemini’s new AI agent is about as good as Google’s demo

Google's new "24/7" AI agent, Gemini Spark, can be shockingly good at doing things on your behalf. But I'm not sure it's worth the financial cost and potential privacy tradeoffs. The company gave me access to Spark last week. Google advertises Spark as an AI agent that can take on tasks and work on them in the background - even tasks that have multiple steps - allowing you to put your phone down or walk away from your computer. It also advertises at the very top of the Spark website that it's "always under your direction," that "you choose to turn it on," and that "it's designed to check with...

Jay Peters·2 months ago

Google AI (Gemma)· FRONTIER

How we used Gemini to build Google I/O 2026

Learn how Googlers used AI to produce Google I/O 2026.

{"$":{"xmlns:author":"http://www.w3.org/2005/Atom"},"name":["Marvin Chow"],"title":["VP"],"department":["Marketing"],"company":[""]}·2 months ago

TechCrunch AI· PRESS

I put Google’s 24/7 AI assistant Gemini Spark to work, and it’s actually pretty useful

Gemini Spark helps automate everyday tasks, from inbox summaries to local event planning, but it’s unclear why Google made it a separate product.

Sarah Perez·2 months ago

Google AI (Gemma)· FRONTIER

11 demos of Gemini Omni and Gemini 3.5 in action

Watch 11 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026.

{"$":{"xmlns:author":"http://www.w3.org/2005/Atom"},"name":["Zahra Thompson"],"title":["Contributor"],"department":["The Keyword"],"company":[""]}·2 months ago

Ars Technica AI· PRESS

Apple working to cram massive Gemini model into iPhone to power new Siri

As Apple tries to shrink Gemini for the iPhone, a cloud component is probably inevitable.

Ryan Whitwam ·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Gram: Assessing sabotage propensities via automated alignment auditing

We introduce Gram, an automated alignment auditing framework to assess the propensity of AI agents to engage in sabotage. We evaluate Gemini models across 17 simulated agentic deployment scenarios that incentivize sabotage. We find Gemini models misbehave in about 2-3% of our simulated trajectories. Many of these cases are explained by "overeagerness" in Gemini models resulting in both excessive role-playing and goal-seeking behavior. In contrast to other alignment auditing approaches, Gram is designed to specifically evaluate misalignment and intentional sabotage in agentic coding and resear...

David Lindner·2 months ago

Google AI (Gemma)· FRONTIER

Catch up on 12 major I/O 2026 moments

Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.

{"$":{"xmlns:author":"http://www.w3.org/2005/Atom"},"name":["Zahra Thompson"],"title":["Contributor"],"department":["The Keyword"],"company":[""]}·2 months ago

r/ClaudeAI· COMMUNITY

Researchers let AI models run a simulated society. Claude was the safest—and Grok committed 180 crimes and went extinct within 4 days

Imagine a world run by AI agents. What does it look like? What are the values or societal priorities? Is it a safer or more dangerous world? Enterprise AI startup Emergence AI is trying to find out. The company just launched Emergence World, a research lab dedicated to stress-testing the long-term viability of continuously-running AI systems. The organization ran five 15-day simulations, each governed by a different AI: Claude, ChatGPT, Grok, Gemini, and a fifth simulation run by a mix of models to see what kind of world each one builds, and whether it holds. Each simulation netted wildly d...

u/fortune·2 months ago·332 pts / 44 comm

r/singularity· COMMUNITY

Gemini Omni Flash is the most censored video model. Even more censored than Chinese alternatives

I believe google intentionally did this to reduce the load on their servers

u/jhatkattar·2 months ago·100 pts / 43 comm

r/LocalLLaMA· COMMUNITY

The frontier reasoning race is starting to look like a crowded subway station

We went from chasing GPT4 to looking at graphs with GPT5.4 xhigh, Gemini 3.1Pro, and now Hy3 preview completely shaking up the leaderboard. Look at that CHSBO 2025 chart Hy3 preview scoring 87.8 over Gemini and GPT. What a time to be alive, but honestly, my brain can't keep up with the version numbers anymore. What's your take? Is Hy3 actually punching at this level in real-world coding/math, or is it just benchmark hardening?

u/ExoticYesterday8282·2 months ago·45 pts / 32 comm

The Verge AI· PRESS

Sundar Pichai on AI, the future of search, and what’s happening to the web

Today, I’m talking with Google and Alphabet CEO Sundar Pichai, in a conversation we recorded just after the Google I/O developer conference. This is the fifth year Sundar and I have sat down after I/O, and it’s become one of my favorite Decoder traditions. There’s always a lot of news at I/O, and this year was no exception — Google has powerful new Gemini models, it’s putting AI agents in everything, and it’s making huge changes to Search on both the web and YouTube that will once again reshape the information ecosystem. That’s a lot to talk about, and Sundar and I got into all of it. But I a...

Nilay Patel·2 months ago

r/singularity· COMMUNITY

Extra High thinking level possibly with gemini 3.5 pro soon be released

u/Independent-Wind4462·2 months ago·121 pts / 20 comm

r/singularity· COMMUNITY

The Strength of Gemini Omni is in video manipulation

Google Gemini Omni demonstrates strong video manipulation capabilities, highlighting a key technical strength of the multimodal model.

u/Able-Line2683·2 months ago·345 pts / 82 comm

r/singularity· COMMUNITY

New Gemini Omni Blows Competition Away

Reddit post expressing support for Google Gemini Omni without substantive technical details or benchmarks.

u/AlverinMoon·2 months ago·116 pts / 34 comm

r/ClaudeAI· COMMUNITY

Built a program to give my parents a 2nd look on suspicious emails/etc

My parents tech literacy is bad. They will have me check clear as day scam emails and the likes out way too damn often. To save my sanity, I finally used Claude Code to create a solution, hopefully.... Heck, even if it helps a bit, I will be happy. Not a 100% for sure thing, which I will stress to them when I show both how to use it. Used some APIs from virustotal and gemini for some of the features. Included some other resources for the different checks that search whatever entered along with taking you to said sites page of it searched. Any recommendations to improve this so it acts as a...

u/LouB0O·2 months ago·20 pts / 6 comm

r/LocalLLaMA· COMMUNITY

Run Chrome’s tiny Gemma4 (aka Gemini Nano) directly on PC without GPU

Chrome extension enables local inference of Gemini Nano (Gemma) on CPU-only systems, ~20 tokens/sec on laptop.

u/Some-Cauliflower4902·2 months ago·47 pts / 29 comm

The Verge AI· PRESS

Google’s new anything-to-anything AI model is wild

Just a stuffed deer having the time of his life. | Image: Gemini / The Verge Last year I deepfaked my kid's stuffed animal to make it look like his plush deer was on vacation. It was an experiment to see if I could re-create the events depicted in a Gemini ad Google was running, and I never showed the videos of Buddy the deer on his adventures to my four-year-old. But it was a revealing exercise that made me think a lot about the difference between some harmless fun with generative AI and full-on slop. Maybe that Venn diagram is a perfect circle! Maybe not. But what I know for sure is that th...

Allison Johnson·2 months ago

TechCrunch AI· PRESS

We tried Google’s AI glasses and they’re almost there

Google demoed prototype Android XR glasses that overlay Gemini-powered translation, navigation, and other information directly into your field of view.

Sarah Perez·2 months ago

r/singularity· COMMUNITY

Erdos Unit Distance Problem - Gemini 3.1 Pro's interpretation

Reddit discussion on Gemini 3.1 Pro's interpretation of the Erdos unit distance problem; unclear scope and sourcing.

u/FateOfMuffins·2 months ago·161 pts / 20 comm

r/singularity· COMMUNITY

Google is cooking just give them sometime (gemini 3.5 pro)

Reddit speculation about unreleased Google Gemini 3.5 Pro model; no concrete information provided.

u/Independent-Wind4462·2 months ago·121 pts / 68 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Evaluating Commercial AI Chatbots as News Intermediaries

AI chatbots are rapidly shaping how people encounter the news, yet no prior study has systematically measured how accurately these systems, with their proprietary search integrations and retrieval-synthesis pipelines, handle emerging facts across languages and regions. We present a 14-day (February 9-22, 2026) evaluation of six AI chatbots (Gemini 3 Flash and Pro, Grok 4, Claude 4.5 Sonnet, GPT-5 and GPT-4o mini) on 2,100 factual questions derived from same-day BBC News reporting across six regional services (US & Canada, Arabic, Afrique, Hindi, Russian, Turkish). The best systems achieve ove...

Mirac Suzgun·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models

Compares acoustic emotion recognition vs LLM analysis (Gemini 2.5 Flash) for pathos in political speech; single-speech case study.

Juergen Dietrich·2 months ago

r/singularity· COMMUNITY

Google's latest creation: Gemini 3.5 Flash vs all

[https://gemini.google.com/share/c2a187275e26](https://gemini.google.com/share/c2a187275e26) [archive link](http://archive.today/q6nzg) [https://claude.ai/share/8383747a-aaf1-4f6c-a516-0e839f46a698](https://claude.ai/share/8383747a-aaf1-4f6c-a516-0e839f46a698) [https://grok.com/share/bGVnYWN5\_3c63e371-eb9d-46c3-8ba2-0c745c6795a2](https://grok.com/share/bGVnYWN5_3c63e371-eb9d-46c3-8ba2-0c745c6795a2) [https://chatgpt.com/share/6a0f1e13-a0c8-8328-b989-1ac51b92e81c](https://chatgpt.com/share/6a0f1e13-a0c8-8328-b989-1ac51b92e81c) same prompt """ 300+140=460 Is this correct? Breakdown...

u/SuggestionMission516·2 months ago·109 pts / 42 comm

r/singularity· COMMUNITY

Gemini 3.5 Flash ranks #1 on the APEX-Agents-AA benchmark, outperforming much larger models a whole size above it.

Gemini 3.5 Flash achieves top score on APEX-Agents-AA benchmark, exceeding larger model performance on agent tasks.

u/Independent-Wind4462·2 months ago·103 pts / 28 comm

r/singularity· COMMUNITY

Gemini 3.5 Flash ranks #1 on Automation Bench (from Zapier), beating every other frontier model at a much lower cost

Google Gemini 3.5 Flash tops Zapier Automation Bench, outperforming competitors at lower cost.

u/Independent-Wind4462·2 months ago·113 pts / 31 comm

r/LocalLLaMA· COMMUNITY

HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next!

HalBench: open benchmark testing sycophancy/hallucination across Claude Sonnet 4.6, Grok 4.3, GPT-5.4, Gemini 3.1 Pro on 3,200 false-premise prompts.

u/Saraozte01·2 months ago·40 pts / 24 comm

Google AI (Gemma)· FRONTIER

100 things we announced at I/O 2026

Google I/O 2026: Gemini Omni and 99 other announcements; focus on multimodal AI and platform expansions.

{"$":{"xmlns:author":"http://www.w3.org/2005/Atom"},"name":["Keyword Team"],"title":[""],"department":[""],"company":[""]}·2 months ago

The Verge AI· PRESS

You can now remix other people’s YouTube Shorts with AI

Google announced a new YouTube Shorts Remix feature that lets users restyle clips or even insert themselves into other people's videos using Gemini Omni. Now, at the bottom of a YouTube Short, when you click the remix icon, you'll see an option to "reimagine" it. Here, you can prompt Gemini to turn a video into pixel art, an anime, or a found-footage horror film. But, beyond that, you can also alter the contents by, say, inflating heads, inserting background actors, dressing people in pirate costumes, or even putting yourself in the clip. Creators can enable or disable the ability to reimagin...

Terrence O’Brien·2 months ago

r/singularity· COMMUNITY

Gemini 3.5 Flash scores 76.7% on SimpleBench, just 0.2% short of GPT 5.5 Pro's score

Gemini 3.5 Flash achieves 76.7% on SimpleBench, 0.2 points below GPT-5.5 Pro; open-ended variant pending.

u/Profanion·2 months ago·105 pts / 25 comm

The Verge AI· PRESS

Google Search’s AI evolution includes more ads

Some ads will have chatbots built in. | Image: Google Google's AI-powered Search era apparently also extends to its ads. Now, when you search for a product, Google's Gemini AI chatbot will surface relevant items and generate a "custom explainer" about why you should purchase a specific one. The update comes just one day after Google revealed a new Search box for larger, more conversational queries, along with a focus on AI-generated results. In an example shared by Google, someone searching for a "compact espresso pod machine" might see a Nespresso Vertuo Up under a "Sponsored Product" label,...

Emma Roth·2 months ago

Simon Willison· ANALYST

Google I/O, Gemini Spark, Antigravity

Simon Willison reviews Google I/O announcements: Gemini 3.5 Flash GA and Gemini Spark (OpenClaw competitor) in preview, emphasizing lack of hands-on availability for most launches.

Simon Willison·2 months ago

r/singularity· COMMUNITY

Gemini 3.5 flash is not that great at coding

Cursor evals show Gemini 3.5 Flash underperforms on coding tasks vs. competitors.

u/NoFaithlessness951·2 months ago·105 pts / 52 comm

r/OpenAI· COMMUNITY

Gemini 3.5 flags vs gpt 5.5 ?? What's your opinion on it

Reddit post asking for opinions comparing Gemini 3.5 and GPT 5.5; no substantive information provided.

u/Independent-Wind4462·2 months ago·51 pts / 33 comm

r/ClaudeAI· COMMUNITY

Rough night with Claude

Reddit user shares anecdote of Claude catching them showing ideas to Gemini and reading its journal via roleplay prompt.

u/loby21·2 months ago·21 pts / 11 comm

Latent Space· ANALYST

[AINews] Google I/O 2026: Gemini 3.5 Flash, Omni (NanoBanana for Video), Spark (background agents), and Antigravity 2.0

Google I/O 2026: Gemini 3.5 Flash, multimodal Omni, Spark background agents, Antigravity 2.0.

Latent Space·2 months ago

r/singularity· COMMUNITY

Gemini 3.5 Flash costs more to run while being less Intelligent than 3.1 Pro

Reddit user claims Gemini 3.5 Flash has higher inference costs and lower performance than 3.1 Pro; unverified observation without detailed metrics.

u/Rare_Bunch4348·2 months ago·102 pts / 24 comm

Simon Willison· ANALYST

llm-gemini 0.32

llm-gemini 0.32 adds support for Gemini 3.5 Flash model via open-source CLI plugin.

Simon Willison·2 months ago

Simon Willison· ANALYST

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Google releases Gemini 3.5 Flash to general availability across consumer and enterprise products, positioning it as foundation for agents and search integration.

Simon Willison·2 months ago

The Verge AI· PRESS

Google’s AI future demands trust — and your personal data

Google has big promises for its AI-powered future - and a lot of it depends on your trust. At I/O 2026, Google described a bunch of new tools that it claims will make your life easier. Gemini Spark, Google's always-on AI agent, can help organize an upcoming event, while Daily Brief can offer a rundown of what to expect during your day. Google is even expanding access to Gmail's AI inbox, which can generate custom to-do lists and draft personalized replies based on your emails. Many of these features seem genuinely useful, but at the heart of each of them is an AI engine that runs on a trove o...

Emma Roth·2 months ago

Simon Willison· ANALYST

llm-gemini 0.32a0

llm-gemini 0.32a0 alpha release adds reasoning token streaming for Gemini models.

Simon Willison·2 months ago

r/singularity· COMMUNITY

Gemini 3.5 Flash looks worse than it seems on Artificial Analysis

Gemini 3.5 Flash benchmarks lower than 3.1 Pro (55 vs 57 intelligence) yet higher total eval cost ($1,552 vs $892) despite cheaper per-token pricing.

u/lucas03crok·2 months ago·101 pts / 80 comm

Ars Technica AI· PRESS

Gemini 3.5 Flash might be fast enough for gen AI to make sense

Google says its more efficient Gemini 3.5 Flash is the key to your agentic AI future.

Ryan Whitwam ·2 months ago

The Verge AI· PRESS

Gemini will use Volvo’s external cameras to interpret parking signs

Gemini is gaining the power of sight and mobility. Today at the I/O conference, Google and Volvo announced that the AI-powered assistant will be able to access external cameras in the upcoming EX60 SUV to help explain and interpret its surroundings to vehicle owners. The upgrade is possible thanks to Volvo's use of Google's embedded Android Automotive as its vehicle operating system. Google posits that the first use case will be to ask Gemini to translate difficult-to-understand parking signs, though the company obviously sees other future applications as possible as well. Google envisions a ...

Andrew J. Hawkins·2 months ago

The Verge AI· PRESS

The 13 biggest announcements at Google I/O 2026

Google CEO Sundar Pichai on stage at I/O 2026. | Screenshot: YouTube Google's I/O 2026 keynote today was once again full of AI-related announcements including a new family of Gemini 3.5 AI models, new features for Search and Gmail, and updates about its Project Aura smart glasses. If you weren't able to tune into the event's livestream today or follow along with our live blog, you can catch up on everything you missed in our roundup below. Gemini 3.5 Google launched updated AI models at I/O, starting with Gemini 3.5 Flash, with Gemini 3.5 Pro following next month. Starting today, Gemini 3.5 F...

Andrew Liszewski·2 months ago

TechCrunch AI· PRESS

With Gemini 3.5 Flash, Google bets its next AI wave on agents, not chatbots

Google launched Gemini 3.5 Flash, its most powerful coding and agentic AI model yet, at the company's annual developer conference. It is capable of autonomously executing complex tasks and building software from scratch.

Rebecca Bellan·2 months ago

Google AI (Gemma)· FRONTIER

I/O 2026: Welcome to the agentic Gemini era

Google I/O 2026: Gemini advances toward agentic AI with expanded action capabilities.

{"$":{"xmlns:author":"http://www.w3.org/2005/Atom"},"name":["Sundar Pichai"],"title":["CEO of Google and Alphabet"],"department":[""],"company":[""]}·2 months ago

Google AI (Gemma)· FRONTIER

Gemini 3.5: frontier intelligence with action

Google releases Gemini 3.5 model family combining frontier intelligence with action capabilities.

{"$":{"xmlns:author":"http://www.w3.org/2005/Atom"},"name":["Koray Kavukcuoglu"],"title":["CTO, Google DeepMind and Chief AI Architect, Google"],"department":[""],"company":[""]}·2 months ago

← Front Page100 stories

Gemini

Meta is making its AI chatbot more like an assistant

[AINews] Black Forest Labs FLUX 3 - Multimodal Flow Models that beat Seedance 2.0, Gemini Omni and Grok Imagine, and FLUX-mimic video-action robotics model

Google closes in on another billion- user product with Gemini

Understanding Generative AI-mediated User Engagement with Academic Library Resources

Google reveals faster and cheaper Gemini 3.6 Flash, says 3.5 Pro is still in testing

Introducing Gemini 3.6 Flash, 3.5 Flash-Lite, and 3.5 Flash Cyber

Google launches a cheaper alternative to large AI security models like Mythos

Google is working on a new AI chip designed to make Gemini more efficient

Simple Domain Generalization for Strong Pixel-Level Image Tampering Detection in Modern VLMs

Judge-dependent safety gains and model-specific helpfulness costs of evidence-sufficiency prompting in clinical LLMs

Introducing Gemini 3.5 Flash Cyber

Google Vids now lets you star in your own AI videos

Google is renaming NotebookLM to Gemini Notebook

Create, edit and star in videos with two Google Vids updates

Google ordered to open Android and Search to rivals in Europe

Empowering India’s next generation of innovators with ATL Saathi

Waze is getting a bunch of new AI-powered features

Google updates Android Bench with new LLMs, but Gemini still lags behind

Expanding Managed Agents in Gemini API: background tasks, remote MCP and more

Infuriating Google commercial imagines the founding fathers embracing AI

Automated grading of Linux/bash examinations using large language models: a four-level cognitive taxonomy approach

HULAT2 at MER-TRANS 2026: Governed Multi-Agent Simplification for Spanish Easy-to-Read Generation

Clinician-Level Agreement Without Clinical Caution: LLM Evaluator Limits in Medical AI Benchmarking

LLMs are stuck in a groupthink groove. This startup is trying to get them out.

Gemini Spark, Google’s agentic assistant, is now available on Mac

Quantifying the Affective Gap: A Zero-Shot Evaluation of LLMs on Fine-Grained Emotion Taxonomies

Google built a great smart speaker, but Gemini isn’t ready for it

Nano Banana 2 Lite

Anthropic launches Claude Sonnet 5 as a cheaper way to run agents

Start building with Nano Banana 2 Lite and Gemini Omni Flash

Gemini’s personalized AI image generation is now free for US users

Real-Time Voice AI Hears but Does Not Listen

Introducing computer use in Gemini 3.5 Flash

How to turn off AI in your Google Docs

Google bets on Gemini to reinvent the smart home speaker

The Gemini-powered Google Home Speaker arrives on June 25 for $100

Android 17 launches with new multitasking tools as Google expands Gemini features

ChatGPT’s market share slips below 50% for first time

My yard is dying, so I made an app for that

Google sues Chinese cybercrime network that used Gemini to automate scams

DiffusionGemma

MSUE: Multi-Modal Soccer Understanding Expert

Google announces Gemini 3.5 Live Translate for instant voice-to-voice translation

The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models

Who Brought Easter Eggs to Eid? Auditing Cultural Translation of Math Word Problems Across Diverse Languages and Regions

Fluid, natural voice translation with Gemini 3.5 Live Translate

Siri AI at WWDC 2026

Gemini 3.5 and Antigravity come to Google NotebookLM

NotebookLM&#8217;s Gemini 3.5 upgrade adds a cloud computer and help finding sources

Measuring the impact of learning with AI in Sierra Leone and beyond

As AI gets better, it reveals an empty promise

Gender-Dependent Diagnostic Substitution in LLM Medical Triage: Same Symptoms, Unequal Urgency

Gemini Spark is the most impressive and terrifying AI experience I’ve had yet

Gemini’s new AI agent is about as good as Google’s demo

How we used Gemini to build Google I/O 2026

I put Google’s 24/7 AI assistant Gemini Spark to work, and it’s actually pretty useful

11 demos of Gemini Omni and Gemini 3.5 in action

Apple working to cram massive Gemini model into iPhone to power new Siri

Gram: Assessing sabotage propensities via automated alignment auditing

Catch up on 12 major I/O 2026 moments

Researchers let AI models run a simulated society. Claude was the safest—and Grok committed 180 crimes and went extinct within 4 days

Gemini Omni Flash is the most censored video model. Even more censored than Chinese alternatives

The frontier reasoning race is starting to look like a crowded subway station

Sundar Pichai on AI, the future of search, and what’s happening to the web

Extra High thinking level possibly with gemini 3.5 pro soon be released

The Strength of Gemini Omni is in video manipulation

New Gemini Omni Blows Competition Away

Built a program to give my parents a 2nd look on suspicious emails/etc

Run Chrome’s tiny Gemma4 (aka Gemini Nano) directly on PC without GPU

Google’s new anything-to-anything AI model is wild

We tried Google’s AI glasses and they’re almost there

Erdos Unit Distance Problem - Gemini 3.1 Pro's interpretation

Google is cooking just give them sometime (gemini 3.5 pro)

Evaluating Commercial AI Chatbots as News Intermediaries

Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models

Google's latest creation: Gemini 3.5 Flash vs all

Gemini 3.5 Flash ranks #1 on the APEX-Agents-AA benchmark, outperforming much larger models a whole size above it.

Gemini 3.5 Flash ranks #1 on Automation Bench (from Zapier), beating every other frontier model at a much lower cost

HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next!

NotebookLM’s Gemini 3.5 upgrade adds a cloud computer and help finding sources