home.social

Search

622 results for “9to5google”

  1. Gemini 3 Flash’s new ‘Agentic Vision’ improves image responses – 9to5google

    Gemini 3 Flash’s new ‘Agentic Vision’ improves image responses

    Abner Li | Jan 27 2026 – 11:40 am PT

    1 Comment

    Agentic Vision is a new capability for the Gemini 3 Flash model to make image-related tasks more accurate by “grounding answers in visual evidence.”

    Frontier AI models like Gemini typically process the world in a single, static glance. If they miss a fine-grained detail — like a serial number on a microchip or a distant street sign — they are forced to guess.

    This new approach “treats vision as an active investigation” by combining visual reasoning with code execution and other tools in the future.

    To answer prompts with images, Gemini 3 Flash will formulate “plans to zoom in, inspect and manipulate images step-by-step.” Specifically, Agentic Vision leverages a “Think, Act, Observe loop.”

    1. Think: the model analyzes the user query and the initial image, formulating a multi-step plan.
    2. Act: The model generates and executes Python code to actively manipulate images (e.g. cropping, rotating, annotating) or analyze them (e.g. running calculations, counting bounding boxes, etc).
    3. Observe: The transformed image is appended to the model’s context window. This allows the model to inspect the new data with better context before generating a final response.

    Instead of just describing an image it’s given, Gemini 3 Flash “can execute code to draw directly on the canvas to ground its reasoning.” One example of this image annotation in the Gemini app is asking “to count the digits on a hand.”

    To avoid counting errors, it uses Python to draw bounding boxes and numeric labels over each finger it identifies. This “visual scratchpad” ensures that its final answer is based on pixel-perfect understanding.

    Meanwhile, Gemini 3 Flash will zoom in when it detects fine-grained details in the image. Agentic Vision can also “parse high-density tables and execute Python code to visualize the findings.”

    Agentic Vision results in a “consistent 5-10% quality boost across most vision benchmarks” for Gemini 3 Flash.

    This is starting to roll out to the Gemini app with the Thinking model. For developers, it’s available today with the Gemini API in Google AI Studio and Vertex AI. 

    Continue/Read Original Article Here: Gemini 3 Flash’s new ‘Agentic Vision’ improves image responses

    #9to5GoogleCom #AgenticVision #ExecuteCode #Gemini #Gemini3Flash #GeminiApp #Google #ImageQuality #NewFromGemini
  2. @9to5google
    Ah, bullshit. I got that case "free" with my Pixel 7a because I didn't look closely at the thumbnail photo of it.

    It's an ugly little mofo, but it fits well.
    #immaterial

  3. @9to5google
    Ah, bullshit. I got that case "free" with my Pixel 7a because I didn't look closely at the thumbnail photo of it.

    It's an ugly little mofo, but it fits well.
    #immaterial

  4. 9to5google.com/2025/07/22/goog
    Google Tasks-Keep reminders migration will happen later in 2025

    Google KeepのリマインダーがTODOリストに移行となると自分は使うのをやめてしまうかな

    #GoogleKeep #GoogleTasks

  5. @Skylled @9to5google Following up on this story from last year. No update.

    Yet another unfinished product from the always-distracted Google. 📱 🚗

    Google Graveyard? How about a Hall of Incomplete Google Projects?

    9to5google.com/2023/05/31/hand

    #GooglePixel #DashCam #GoogleGraveyard #Android

  6. 9to5google.com/2023/11/17/goog

    "...Google Chat’s key advantage...is [how it is] built [for] the web and...mobile. All you need is...[a Gmail account]...[which] most people already have...If you want a...dedicated experience, you can...download a standalone app (Android and iOS).

    On paper, Google Chat is entirely capable of being anybody’s primary messaging service with a low barrier to entry and complete feature set....

    9to5google.com/2023/11/17/goog

    #communication
    #sharingideas
    #ideas
    #googlechat
    #collab
    #collaboration
    #messaging
    #webbasedmessaging
    #docs
    #gdocs
    #googledocs

  7. 9to5google.com/2023/11/17/goog

    "...Google Chat’s key advantage...is [how it is] built [for] the web and...mobile. All you need is...[a Gmail account]...[which] most people already have...If you want a...dedicated experience, you can...download a standalone app (Android and iOS).

    On paper, Google Chat is entirely capable of being anybody’s primary messaging service with a low barrier to entry and complete feature set....

    9to5google.com/2023/11/17/goog











  8. 9to5google.com/2023/11/17/goog

    "...Google Chat’s key advantage...is [how it is] built [for] the web and...mobile. All you need is...[a Gmail account]...[which] most people already have...If you want a...dedicated experience, you can...download a standalone app (Android and iOS).

    On paper, Google Chat is entirely capable of being anybody’s primary messaging service with a low barrier to entry and complete feature set....

    9to5google.com/2023/11/17/goog

    #communication
    #sharingideas
    #ideas
    #googlechat
    #collab
    #collaboration
    #messaging
    #webbasedmessaging

  9. 9to5google.com/2023/11/17/goog

    "...Google Chat’s key advantage...is [how it is] built [for] the web and...mobile. All you need is...[a Gmail account]...[which] most people already have...If you want a...dedicated experience, you can...download a standalone app (Android and iOS).

    On paper, Google Chat is entirely capable of being anybody’s primary messaging service with a low barrier to entry and complete feature set....

    9to5google.com/2023/11/17/goog








  10. @AAKL @9to5google @nexusben

    I liked #GoogleStadia for the reasons you mention. When I was younger, I upgraded my PC frequently, dealt with video card issues, etc., to play the latest and greatest games (usually on #Linux - thanks #LokiGames).

    Now, I'm too busy and have other interests (and expenses) in addition to the PC and gaming. Stadia was perfect for me.

    (Of course, I am also lucky to have decent broadband.)

  11. RE: mastodon.online/@9to5google/11

    Wow, Google has discovered icon strokes! :)

    The pendulum is finally swinging back from overly flat, illegible digital interfaces?

    #design #Android #iconDesign #ux

  12. RE: mastodon.online/@9to5google/11

    Wow, Google has discovered icon strokes! :)

    The pendulum is finally swinging back from overly flat, illegible digital interfaces?

    #design #Android #iconDesign #ux

  13. RE: mastodon.online/@9to5google/11

    Wow, Google has discovered icon strokes! :)

    The pendulum is finally swinging back from overly flat, illegible digital interfaces?

    #design #Android #iconDesign #ux

  14. RE: mastodon.online/@9to5google/11

    Wow, Google has discovered icon strokes! :)

    The pendulum is finally swinging back from overly flat, illegible digital interfaces?

    #design #Android #iconDesign #ux

  15. RE: mastodon.online/@9to5google/11

    Wow, Google has discovered icon strokes! :)

    The pendulum is finally swinging back from overly flat, illegible digital interfaces?

    #design #Android #iconDesign #ux

  16. Review of 22 by 9to5Google

    youtube.com/watch?v=r9eq4xqxev4

    (I would have linked to a privacy frontend instead of Youtube directly, but currently that seems to result in a bad viewing experience)