Latest news with #Gemini2.5Pro

Google's Gemini AI panics while playing Pokémon, takes 800 hours to finish game

Hindustan Times

10 hours ago

Hindustan Times

Google's Gemini AI panics while playing Pokémon, takes 800 hours to finish game

Artificial intelligence has made remarkable strides, but Google's latest chatbot is showing that even the smartest machines can crumble under pressure. A recent report by Google DeepMind reveals that its flagship model, Gemini 2.5 Pro, displayed signs of panic while playing Pokémon Blue—an old-school video game many children breeze through with ease. The findings came from a Twitch channel called Gemini_Plays_Pokemon, where independent engineer Joel Zhang put Gemini to the test. While Gemini is known for its advanced reasoning abilities and code-level understanding, its performance during this gaming challenge exposed unexpected behavioural quirks. Also read: 40-year-old man dies of cancer after doctors told him stomach ache was due to stress According to the DeepMind team, Gemini began to exhibit what they describe as 'Agent Panic.' The report states, 'Over the course of the playthrough Gemini 2.5 Pro gets into various situations which cause the model to simulate 'panic'. For example, when the Pokémon in the party's health or power points are low, the model's thoughts repeatedly reiterate the need to heal the party immediately or escape the current dungeon.' This behaviour didn't go unnoticed. Viewers on Twitch began identifying when the AI was panicking, with DeepMind noting, 'This behaviour has occurred in enough separate instances that the members of the Twitch chat have actively noticed when it is occurring.' Although AI doesn't experience stress or emotion like humans, the model's erratic decision-making in high-pressure situations mirrors how people behave under stress, making impulsive or inefficient choices. In the first full game run, Gemini took 813 hours to finish Pokémon Blue. After adjustments by Zhang, the AI completed a second playthrough in 406.5 hours. Still, this was far from efficient, especially compared to the time a child would take to complete the same game. Social media users were quick to mock the AI's anxious gameplay. 'If you read it's thoughts when reasoning it seems to panic just about any time you word something slightly off,' said one viewer. Another joked: 'LLANXIETY.' A third chimed in with a broader reflection: 'I'm starting to think the 'Pokémon index' might be one of our best indicators of AGI. Our best AIs still struggling with a child's game is one of the best indicators we have of how far we still have yet to go. And how far we've come.' Interestingly, these revelations come just weeks after Apple released a study arguing that most AI reasoning models don't truly reason at all. Instead, they rely heavily on pattern recognition and tend to fall apart when the task is tweaked or made more complex. Also read: Two fired after Michigan man receives $1.6 million salary in major payroll slip-up - Sakshi

Automate Your Browser with Gemini 2.5 Pro for Online Workflows

Geeky Gadgets

11 hours ago

Business
Geeky Gadgets

Automate Your Browser with Gemini 2.5 Pro for Online Workflows

What if your browser could do more than just display web pages? Imagine it seamlessly handling your repetitive tasks, managing complex workflows, and even collaborating with advanced AI models—all while you focus on what truly matters. Enter Gemini 2.5 Pro, the latest evolution in web automation technology. This isn't just another tool; it's a bold redefinition of what your browser can achieve. From automating intricate research projects to streamlining multi-agent workflows, Gemini 2.5 Pro transforms your browser into a dynamic productivity powerhouse. But how does it work, and what sets it apart from the rest? In this detailed report, World of AI explore how Gemini 2.5 Pro integrates innovative AI capabilities with the adaptability of open source frameworks to deliver unparalleled automation. You'll discover how its multi-agent task execution, dynamic adaptability, and robust privacy protections can transform your online workflows. Whether you're a professional looking to optimize your daily operations or a tech enthusiast eager to harness the power of AI, this guide will reveal how Gemini 2.5 Pro can be tailored to your unique needs. By the end, you might just rethink what your browser is truly capable of. Transforming Web Automation The Importance of Open source Flexibility At its core, Nano Browser is built on an open source framework, emphasizing customization and accessibility. This design allows users to integrate their own API keys or connect local AI models, tailoring the browser to specific requirements. For example, you can automate tasks like web research, data extraction, or social media management with ease. The open source nature eliminates dependency on proprietary software, granting you complete control over your automation processes. This flexibility ensures that Nano Browser can adapt to a wide range of use cases, empowering users to create solutions that align with their unique goals. Enhance Efficiency with Multi-Agent Workflows One of Nano Browser's standout features is its ability to handle multi-agent workflows, allowing multiple agents to execute tasks simultaneously. This functionality significantly improves efficiency by allowing parallel task execution. For instance, you could automate booking flights while simultaneously researching accommodations, all within the same browser session. By streamlining these processes, Nano Browser ensures that your workflows remain seamless, time-efficient, and highly productive. Automate Your Browser with Gemini 2.5 Pro Watch this video on YouTube. Advance your skills in Open source Web Automation by reading more of our detailed content. Seamless Integration with Advanced AI Models Nano Browser supports a wide array of AI models, including Gemini 2.5 Pro, the Cloud 4 series, and local models via Olama. This compatibility allows users to integrate innovative AI tools into their workflows, enhancing automation and decision-making capabilities. Whether you're extracting data, generating content, or optimizing complex processes, Nano Browser provides a powerful and adaptable platform for AI-driven solutions. This integration ensures that users can use the latest advancements in AI technology to achieve their objectives more effectively. Comprehensive Task Automation for Complex Processes Task automation lies at the heart of Nano Browser's functionality. The platform enables users to plan, execute, validate, and follow up on tasks with minimal manual intervention. For example, you can automate data scraping, validate the extracted information, and schedule follow-up actions—all within a single, cohesive workflow. This streamlined approach not only simplifies complex processes but also minimizes errors, saving you valuable time and effort. By automating repetitive or intricate tasks, Nano Browser helps users focus on higher-value activities. Interactive and User-Centric Interface Nano Browser features an intuitive interface designed to enhance the overall user experience. A real-time task visualization panel on the right-hand side allows users to monitor progress and make adjustments as needed. This interactive design ensures that you remain in control of your workflows, even as tasks are executed automatically. The interface is both user-friendly and highly functional, making it accessible to professionals across various industries. Advanced Speech-to-Text and Visual Analysis Tools To address more complex challenges, Nano Browser includes speech-to-text conversion and visual analysis capabilities. These tools are particularly useful for tasks such as solving CAPTCHAs or processing visual data. By integrating these advanced functionalities, Nano Browser ensures that even the most demanding tasks can be automated effectively. This makes it a valuable tool for users who require precision and adaptability in their workflows. Dynamic Task Handling for Adaptive Workflows Nano Browser excels in dynamic task handling through a process known as instruction decomposition. This feature breaks down tasks into actionable steps, allowing the browser to adapt to your strategies and requirements. For example, if you're conducting a multi-step research project, Nano Browser can dynamically adjust its actions based on your input, making sure that the results align with your objectives. This adaptability makes it an ideal solution for managing complex and evolving workflows. Browser Extension for Seamless Integration Nano Browser is available as a Chrome extension, making sure compatibility with popular browsers like Chrome and Edge. This format allows for seamless integration into your existing workflows without the need for additional software installations. The extension also simplifies updates and maintenance, making sure that your automation tools remain current and efficient. By offering this level of convenience, Nano Browser makes it easy for users to incorporate advanced automation into their daily routines. Prioritizing Privacy and Local Data Security Privacy is a central focus of Nano Browser's design. By operating locally within your browser, it ensures that your data remains secure and private. Unlike cloud-based solutions, Nano Browser does not require users to upload sensitive information to external servers. This approach provides peace of mind, particularly for professionals handling confidential or sensitive data. By prioritizing local data security, Nano Browser offers a reliable and trustworthy solution for automation. Applications Across Diverse Industries The adaptability of Nano Browser makes it suitable for a wide range of applications. Professionals can use it to automate tasks such as web research, data extraction, social media management, and multi-agent workflows. Its compatibility with advanced AI models further expands its potential, making it a valuable tool for industries ranging from marketing and research to software development and beyond. By offering a flexible and powerful platform, Nano Browser enables users to achieve their goals with greater efficiency and precision. Media Credit: WorldofAI Filed Under: AI, Guides Latest Geeky Gadgets Deals Disclosure: Some of our articles include affiliate links. If you buy something through one of these links, Geeky Gadgets may earn an affiliate commission. Learn about our Disclosure Policy.

Google's AI Chatbot Panics When Playing Video Game Meant For Children

NDTV

13 hours ago

NDTV

Google's AI Chatbot Panics When Playing Video Game Meant For Children

Artificial intelligence (AI) chatbots might be smart, but they still sweat bullets while playing video games that seemingly young kids are able to ace. A new Google DeepMind report has found that its Gemini 2.5 Pro resorts to panic when playing Pokemon, especially when one of the fictional characters is close to death, causing the AI's performance to experience qualitative degradation in the model's reasoning capability. Google highlighted a case study from a Twitch channel named Gemini_Plays_Pokemon, where Joel Zhang, an engineer unaffiliated with the tech company, plays Pokemon Blue using Gemini. During the two playthroughs, the Gemini team at DeepMind observed an interesting phenomenon they describe as 'Agent Panic'. "Over the course of the playthrough, Gemini 2.5 Pro gets into various situations which cause the model to simulate "panic". For example, when the Pokemon in the party's health or power points are low, the model's thoughts repeatedly reiterate the need to heal the party immediately or escape the current dungeon," the report highlighted. "This behavior has occurred in enough separate instances that the members of the Twitch chat have actively noticed when it is occurring," the report says. While AI models are trained on copious amounts of data and do not think or experience emotions like humans, their actions mimic the way in which a person might make poor, hasty decisions when under stress. In the first playthrough, the AI agent took 813 hours to finish the game. After some tweaking by Mr Zhang, the AG agent shaved some hundreds of hours and finished the game in 406.5 hours. While the progress was impressive, the AI agent was still not good at playing Pokémon. It took Gemini hundreds of hours to reason through a game that a child could complete in significantly less time. The chatbot displayed erratic behaviour despite Gemini 2.5 Pro being Google's most intelligent thinking model that exhibits strong reasoning and codebase-level understanding, whilst producing interactive web applications. Social media reacts Reacting to Gemini's panicky nature, social media users said such games could be the benchmark for the real thinking skills of the AI tools. "If you read its thoughts when reasoning it seems to panic just about any time you word something slightly off," said one user, while another added: "LLANXIETY." A third commented: "I'm starting to think the 'Pokemon index' might be one of our best indicators of AGI. Our best AIs still struggling with a child's game is one of the best indicators we have of how far we still have yet to go. And how far we've come." Earlier this month, Apple released a new study, claiming that most reasoning models do not reason at all, albeit they simply memorise patterns really well. However, when questions are altered or the complexity is increased, they collapse altogether.

Google has Supercharged Productivity Alongside Gemini: The Entry of Gemini 2.5 Marks the Beginning of a New Era of Efficiency.

Economic Times

2 days ago

Business
Economic Times

Google has Supercharged Productivity Alongside Gemini: The Entry of Gemini 2.5 Marks the Beginning of a New Era of Efficiency.

iStock At the Google I/O 2025 keynote, CEO Sunder Pichai excitedly spoke of the rapid model progress of Gemini since their first-generation Gemini Pro model. Since then, Gemini 2.5 Pro today delivers 10 times the performance of the previous generation. What makes this stand out more is their infrastructure strength enables them to deliver dramatically faster models, even as the model prices climb down further highlights the numbers wherein, this time last year, Google processed 9.7 trillion tokens a month across products and APIs, whilst today, they are processing 50 times more than that number, over 480 trillion. More than 7 million developers today use Gemini, which is five times more than what was garnered last year. The app Gemini now has over 400 million monthly active users, with a significant growth in engagement with the 2.5 series of models. These numbers are a clear indicator of Gemini 2.5's strength, deeming it as one of the most powerful multimodal models ever released by Google. This brand new version enhances performance across the board, possesses faster reasoning, deeper document understanding, and dramatically improved code generation. Google stands proud on its claims that Gemini 2.5 is capable of working through complex workflows that include summarizing multi-document reports and debugging intricate software projects with immaculacy. This model's rollout is closely associated with Google Workspace, bringing in AI-driven 'Co-pilot for your digital life' rather than a mere AI-powered assistant, with its capabilities extending to Gmail, Docs, Sheets, and Slides. Via the 'Help Me Write' and 'Help Me Organize' features driven by Gemini 2.5, users can now curate detailed project briefs, rewrite proposals, extract spreadsheet insights, and much more on the list, making collaborations more efficient than ever. Gemini is no longer a mere Workspace upgrade; rather, it now exists in the very core of Android and Chrome OS. It can interpret on-screen content, provide contextual suggestions, and even automate device actions. Whether it's summarizing PDFs or drafting replies on chatting apps, Gemini is readily available to assist at the right moment, for its intelligence is now entangled in the depths of the Android experience. For businesses to use AI for internal processes, proprietary data, and domain-specific tasks, Google has also introduced Gemini for Workspace Enterprise and Gemini for Cloud AI. Organizations shall be able to incorporate Gemini into their workflow with optimum-level data privacy and governance controls; this move seemed like an insinuation of a challenge to Microsoft's Copilot in this new-age race of AI-enabled AI grows and develops further into the global workplace, Google's bold push signifies not just a technological advancement but rather a strategic reshaping of how humans and machines shall join hands in the near future. Disclaimer Statement: This content is authored by a 3rd party. The views expressed here are that of the respective authors/ entities and do not represent the views of Economic Times (ET). ET does not guarantee, vouch for or endorse any of its contents nor is responsible for them in any manner whatsoever. Please take all steps necessary to ascertain that any information and content provided is correct, updated, and verified. ET hereby disclaims any and all warranties, express or implied, relating to the report and any content therein.

Google's Gemini chatbot may have a Pokemon game 'problem'

Time of India

2 days ago

Entertainment
Time of India

Google's Gemini chatbot may have a Pokemon game 'problem'

Google's Gemini and other AI chatbots may have a "problem". A new research indicates that these AI models can exhibit irregular behaviour, like "panic," when confronted with challenges in Pokemon games. Tired of too many ads? go ad free now According to a report by DeepMind, Gemini 2.5 Pro experiences "qualitatively observable degradation in the model's reasoning capability" when its Pokemon are close to defeat. This observation comes as AI companies like Google and Anthropic are studying how their latest AI models navigate early Pokemon games. Researchers believe that observing AI models playing video games can provide useful insights into their capabilities. How Google's Gemini and other chatbots reacted to older Pokemon games In recent months, independent developers have launched Twitch streams like 'Gemini Plays Pokemon' and 'Claude Plays Pokemon,' showcasing AI models playing the classic game in real time, the report mentions. This offers an alternative, more contextual way to benchmark AI performance beyond traditional testing methods. Each stream reveals how the AI reasons through problems, offering insight into its decision-making process. While these models have advanced quickly, they still struggle with tasks like playing Pokemon efficiently, often taking hundreds of hours to finish. The real intrigue lies in observing the AI's behaviour and choices during gameplay, rather than its speed. 'Over the course of the playthrough, Gemini 2.5 Pro gets into various situations which cause the model to simulate 'panic,'' the report noted. This means, during gameplay, the AI can enter a state of 'panic,' where its performance declines and it stops using some available tools. Tired of too many ads? go ad free now While the AI doesn't experience emotions, this behaviour resembles how humans may make poor decisions under stress, making it both intriguing and unsettling. 'This behaviour has occurred in enough separate instances that the members of the Twitch chat have actively noticed when it is occurring,' the report added. Apart from Gemini, Claude has also shown unusual behaviour during the gameplay. At one point, it wrongly assumed that fainting all its Pokemon would transport it forward in the game, leading it to intentionally lose battles, which is a strategy that backfired, as it was sent to the last used Pokemon Center instead. Despite such errors, the AI has excelled at solving in-game puzzles. With some human help, it used task-specific AI tools to navigate boulder puzzles and plan efficient routes. 'With only a prompt describing boulder physics and a description of how to verify a valid path, Gemini 2.5 Pro is able to one-shot some of these complex boulder puzzles, which are required to progress through Victory Road,' t he report highlighted. The report also suggests that since Gemini 2.5 Pro independently created many of its tools, upcoming versions may be able to do so without human help, potentially even developing a 'don't panic' module on its own.

Latest news with #Gemini2.5Pro

Google's Gemini AI panics while playing Pokémon, takes 800 hours to finish game

Automate Your Browser with Gemini 2.5 Pro for Online Workflows

Google's AI Chatbot Panics When Playing Video Game Meant For Children

Google has Supercharged Productivity Alongside Gemini: The Entry of Gemini 2.5 Marks the Beginning of a New Era of Efficiency.

Google's Gemini chatbot may have a Pokemon game 'problem'

Get Started Now: Download the App