Daily 8 News | Latest Breaking News & Updates

Soon, you can upload video on Gemini app and ask questions about it: Report

Business Standard

5 days ago

Business
Business Standard

Soon, you can upload video on Gemini app and ask questions about it: Report

Google's Gemini app is reportedly getting a new capability that will let users upload videos in the chat and get an analysis done on them. So far, users have been able to upload pictures and documents for the artificial intelligence chatbot to analyse or scan. However, soon they will also be able to upload videos in the prompt. According to a report by 9To5Google, Gemini will analyse the video and let users ask questions about it. 9To5Google tested the feature by sharing a video and asking the AI bot to describe the video, which it did pretty accurately. It is to be noted that Gemini video upload feature has not yet been rolled out widely. The availability of the feature varies depending on accounts/devices that 9To5Google checked. However, this feature will reportedly be made available to both free and paid users across Android (Google app 16.23 beta) and iOS, as well as 2.5 Flash and 2.5 Pro. The feature is not live on the web interface yet. Video in Gemini: How to use Open the plus (+) menu to upload a file. Select Gallery or Files from the options. If video upload is available for your account, you'll be able to select video files. If not, video files will appear grayed out and cannot be uploaded. In other related news, Google officially rolled out its Gemini 2.5 series of AI models on Tuesday, making them widely accessible. As part of the launch, users can now interact with the stable releases of both Gemini 2.5 Pro and Gemini 2.5 Flash. The tech giant has also extended access to the Pro model for users on the free tier of the Gemini platform. Alongside these, Google introduced Gemini 2.5 Flash-Lite — touted as the company's fastest and most cost-effective AI model to date.

Google launches its most cost-efficient and fastest Gemini 2.5 model yet

Time of India

6 days ago

Business
Time of India

Google launches its most cost-efficient and fastest Gemini 2.5 model yet

Google has expanded its family of Gemini 2.5 of hybrid reasoning AI models . The company said that its Gemini 2.5 Pro and Gemini 2.5 Flash models are now generally available. Further, it released a preview of the new 2.5 Flash-Lite model which it claims is its most cost-efficient and fastest model yet. "We designed Gemini 2.5 to be a family of hybrid reasoning models that provide amazing performance, while also being at the Pareto Frontier of cost and speed," Google stated in its announcement. General availability of Gemini 2.5 Pro and Gemini 2.5 Flash models The generally available versions of Gemini 2.5 Flash and 2.5 Pro are now ready for production applications, a move Google attributes to valuable developer feedback gathered over recent weeks. Adding to the lineup, Google has introduced a preview of Gemini 2.5 Flash-Lite, touted as its most cost-efficient and fastest 2.5 model to date. by Taboola by Taboola Sponsored Links Sponsored Links Promoted Links Promoted Links You May Like Is it better to shower in the morning or at night? Here's what a microbiologist says CNA Read More Undo "Gemini 2.5 Pro + 2.5 Flash are now stable and generally available. Plus, get a preview of Gemini 2.5 Flash-Lite, our fastest + most cost-efficient 2.5 model yet," Google CEO Sundar Pichai said in a post on X. "Exciting steps as we expand our 2.5 series of hybrid reasoning models that deliver amazing performance at the Pareto frontier of cost and speed," he added. Google says that this new version is designed to excel in high-volume, latency-sensitive tasks like translation and classification, offering lower latency than its predecessors, 2.0 Flash-Lite and 2.0 Flash, across a wide range of prompts. Despite its enhanced efficiency, 2.5 Flash-Lite retains the core capabilities that define the Gemini 2.5 family. These include the ability to adjust computational "thinking" based on budget, integrate with tools such as Google Search and code execution, support multimodal input (processing various data types), and offer a substantial 1-million-token context length, the company says. According to Google, the model also demonstrates "all-around higher quality" than 2.0 Flash-Lite across benchmarks in coding, math, science, reasoning, and multimodal tasks. Developers can access the preview of Gemini 2.5 Flash-Lite through Google AI Studio and Vertex AI, alongside the newly stable versions of 2.5 Flash and Pro. Both 2.5 Flash and Pro are also now accessible directly within the Gemini app. Furthermore, custom versions of 2.5 Flash-Lite and Flash have been integrated into Google Search.

Google unveils Gemini 2.5 upgrades for reasoning & security

Techday NZ

22-05-2025

Business
Techday NZ

Google unveils Gemini 2.5 upgrades for reasoning & security

Google has provided a series of updates to its Gemini 2.5 model series, with enhancements spanning advanced reasoning, developer capabilities and security safeguards. The company reported that Gemini 2.5 Pro is now the leading model on the WebDev Arena coding leaderboard, holding an ELO score of 1415. It also leads across all leaderboards in LMArena, a platform that measures human preferences in multiple dimensions. Additionally, Gemini 2.5 Pro's 1 million-token context window was highlighted as supporting strong long context and video understanding performance. Integration with LearnLM, a family of models developed with educational experts, resulted in Gemini 2.5 Pro apparently becoming the foremost model for learning. According to Google, in direct comparisons focusing on pedagogy and effectiveness, Gemini 2.5 Pro was favoured by educators and experts over other models in a wide range of scenarios. The model outperformed others based on the five principles of learning science used in AI system design for education. Gemini 2.5 Pro introduced an experimental capability called Deep Think, which is being tested to enable enhanced reasoning by allowing the model to consider multiple hypotheses before responding. The company said, "2.5 Pro Deep Think gets an impressive score on 2025 USAMO, currently one of the hardest math benchmarks. It also leads on LiveCodeBench, a difficult benchmark for competition-level coding, and scores 84.0% on MMMU, which tests multimodal reasoning." Safety and evaluation measures are being emphasised with Deep Think. "Because we're defining the frontier with 2.5 Pro DeepThink, we're taking extra time to conduct more frontier safety evaluations and get further input from safety experts. As part of that, we're going to make it available to trusted testers via the Gemini API to get their feedback before making it widely available," the company reported. Google announced improvements to 2.5 Flash, describing it as the most efficient in the series, tailored for speed and cost efficiency. This version now reportedly uses 20-30% fewer tokens in evaluations and delivers improved performance across benchmarks for reasoning, multimodality, code, and long-context tasks. The updated 2.5 Flash is now available for preview in Google AI Studio, Vertex AI, and the Gemini app. New features have also been added to the Gemini 2.5 series. The Live API now offers a preview version supporting audio-visual input and native audio output. This is designed to create more natural and expressive conversational experiences. According to Google, "It also allows the user to steer its tone, accent and style of speaking. For example, you can tell the model to use a dramatic voice when telling a story. And it supports tool use, to be able to search on your behalf." Early features in this update include Affective Dialogue, where the model can detect and respond to emotions in a user's voice; Proactive Audio, which enables the model to ignore background conversations and determine when to respond; and enhanced reasoning in live API use. Multi-speaker support has also been introduced for text-to-speech capabilities, allowing audio generation with two distinct voices and support for over 24 languages, including seamless transitions between them. Project Mariner's computer use capabilities are being integrated into the Gemini API and Vertex AI, with multiple enterprises testing the tool. Google stated, "Companies like Automation Anywhere, UiPath, Browserbase, Autotab, The Interaction Company and Cartwheel are exploring its potential, and we're excited to roll it out more broadly for developers to experiment with this summer." On the security front, Gemini 2.5 includes advanced safeguards against indirect prompt injections, which involve malicious instructions embedded into retrieved data. According to disclosures, "Our new security approach helped significantly increase Gemini's protection rate against indirect prompt injection attacks during tool use, making Gemini 2.5 our most secure model family to date." Google is introducing new developer tools with thought summaries in the Gemini API and Vertex AI. These summaries convert the model's raw processing into structured formats with headers and action notes. Google stated, "We hope that with a more structured, streamlined format on the model's thinking process, developers and users will find the interactions with Gemini models easier to understand and debug." Additional features include thinking budgets for 2.5 Pro, allowing developers to control the model's computation resources to balance quality and speed. This can also completely disable the model's advanced reasoning capability if desired. Model Context Protocol (MCP) support has been added for SDK integration, aiming to enable easier development of agentic applications using both open-source and hosted tools. Google affirmed its intention to sustain research and development efforts as the Gemini 2.5 series evolves, stating, "We're always innovating on new approaches to improve our models and our developer experience, including making them more efficient and performant, and continuing to respond to developer feedback, so please keep it coming! We also continue to double down on the breadth and depth of our fundamental research — pushing the frontiers of Gemini's capabilities. More to come soon."

Google introduces the Deep Think reasoning model for Gemini 2.5 Pro and a better 2.5 Flash

Yahoo

20-05-2025

Business
Yahoo

Google introduces the Deep Think reasoning model for Gemini 2.5 Pro and a better 2.5 Flash

Google has started testing a reasoning model called Deep Think for Gemini 2.5 Pro, the company has revealed at its I/O developer conference. According to DeepMind CEO Demis Hassabis, Gemini's Deep Think uses "the latest cutting-edge research" that gives the model the capability to consider multiple hypotheses before responding to queries. Google says it got an "impressive score" when evaluated using questions from the 2025 United States of America Mathematical Olympiad competition. However, Google wants to take more time to conduct safety evaluations and get further input from safety experts before releasing it widely. That's why it's making Deep Think initially available to trusted testers via the Gemini API first in order to get their feedback first. The company has also introduced a better Gemini 2.5 Flash model, which is optimized for speed and efficiency. It's now more efficient than before, uses fewer tokens and has scored higher in benchmarks for reasoning, multimodality, code and long context than its predecessor. It will be generally available in early June. For now, the improved Gemini 2.5 Flash is available as a preview via Google AI Studio for developers, via Vertex AI for enterprise customers and via the Gemini app for other users. While most of the efficiency gains covered on the I/O stage were focused on 2.5 Flash, Google did announce that it's bringing the 2.5 Flash concept of "Thinking Budgets" to its more advanced 2.5 Pro model. This feature will let you balance tokens spent vs. accuracy and speed of output. Separately, Google is bringing Project Mariner into the Gemini API and Vertex AI, as well. Project Mariner is Google's Gemini-powered AI agents that can navigate pages on the web browser to complete tasks for users. The company will roll the agents out more broadly this summer so that developers can experiment with them. In addition, the company is releasing new previews for text-to-speech on both 2.5 Pro and 2.5 Flash models via the Gemini API, with support for two voices in 24 languages.

Engadget

20-05-2025

Business
Engadget

Google introduces the Deep Think reasoning model for Gemini 2.5 Pro and a better 2.5 Flash

Google has started testing a reasoning model called Deep Think for Gemini 2.5 Pro, the company has revealed at its I/O developer conference. According to DeepMind CEO Demis Hassabis, Gemini's Deep Think uses "the latest cutting-edge research" that gives the model the capability to consider multiple hypotheses before responding to queries. Google says it got an "impressive score" when evaluated using questions from the 2025 United States of America Mathematical Olympiad competition. However, Google wants to take more time to conduct safety evaluations and get further input from safety experts before releasing it widely. That's why it's making Deep Think initially available to trusted testers via the Gemini API first in order to get their feedback first. The company has also introduced a better Gemini 2.5 Flash model, which is optimized for speed and efficiency. It's now more efficient than before, uses fewer tokens and has scored higher in benchmarks for reasoning, multimodality, code and long context than its predecessor. It will be generally available in early June. For now, the improved Gemini 2.5 Flash is available as a preview via Google AI Studio for developers, via Vertex AI for enterprise customers and via the Gemini app for other users. While most of the efficiency gains covered on the I/O stage were focused on 2.5 Flash, Google did announce that it's bringing the 2.5 Flash concept of "Thinking Budgets" to its more advanced 2.5 Pro model. This feature will let you balance tokens spent vs. accuracy and speed of output. Separately, Google is bringing Project Mariner into the Gemini API and Vertex AI, as well. Project Mariner is Google's Gemini-powered AI agents that can navigate pages on the web browser to complete tasks for users. The company will roll the agents out more broadly this summer so that developers can experiment with them. In addition, the company is releasing new previews for text-to-speech on both 2.5 Pro and 2.5 Flash models via the Gemini API, with support for two voices in 24 languages.

Latest news with #2.5Flash

Soon, you can upload video on Gemini app and ask questions about it: Report

Google launches its most cost-efficient and fastest Gemini 2.5 model yet

Google unveils Gemini 2.5 upgrades for reasoning & security

Google introduces the Deep Think reasoning model for Gemini 2.5 Pro and a better 2.5 Flash

Google introduces the Deep Think reasoning model for Gemini 2.5 Pro and a better 2.5 Flash

Get Started Now: Download the App