Latest news with #Gemini-powered


Tom's Guide
14-06-2025
- Tom's Guide
Gemini can turn text into audio overviews — here's how to do it
You may already be familiar with NotebookLM, Google's Gemini-powered research companion. If you're new to it, it's worth exploring its standout feature called Audio Overview, which takes uploaded information and turns it into a podcast hosted by two AI presenters. This feature is, in many ways, one of the main reasons for using NotebookLM, allowing you to more easily digest even the most complex of information. But the feature has been airing elsewhere. Indeed, you'll now find it in Google's AI assistant Gemini either on Android or iOS. It works in the exact same way but it plays the audio in your browser. Let's check it out. Launch the Gemini app and, in the Ask Gemini box, tap the Plus button. Next, tap Files and select a document you'd like Gemini to work with. You will be able to look through files in Google Drive and lots of file types are supported including DOC, DOCX PDF, RFT and TXT. Once the file has been uploaded, tap Generate Audio Overview. You will need to wait a few minutes while your file is being processed but eventually you will get a result. You don't actually have to stay in the chat — you will be notified when it's ready. You can now tap the Audio Overview in order to listen to it — you may need to tap play. You can find the Audio Overview at any time if you select the Chats and Gems icon in the top-left of the screen. You can share the Audio Overview from this playback screen. Just tap the three-dot icon in the top-right of the screen and tap the Download icon (a downward-pointing icon at the top of the menu). You can also tap Share and select a method such as Messages, email or social media. And there you go. You now know how to generate audio overviews in Gemini, but there's so much else you can do. You can learn how to use Google Gemini to summarize a YouTube video or figure out how to use Gemini AI to create the perfect workout music playlist. It's even possible to discover how to find the best haircut for your face shape. Get instant access to breaking news, the hottest reviews, great deals and helpful tips.
Yahoo
13-06-2025
- Entertainment
- Yahoo
Google Just Kicked In Hollywood's Trailer Door
What a difference a year makes… Not long ago, AI's best attempt at video generation resulted in that cursed clip of Will Smith shoveling spaghetti into his mouth with his four-fingered hands. But now the world has Google's Veo 3 at its fingertips – the tech titan's latest AI video generation tool. And the results we're seeing are nothing short of astonishing. InvestorPlace - Stock Market News, Stock Advice & Trading Tips This shiny new model can generate ultra-realistic, 1080p, synchronized audio-visual content based on a simple text prompt… 'A woman, classical violinist with intense focus, plays a complex, rapid passage from a Vivaldi concerto in an ornate, sunlit baroque hall during a rehearsal. Her bow dances across the strings with virtuosic speed and precision. Audio: Bright, virtuosic violin playing, resonant acoustics of the hall, distant footsteps of crew, conductor's occasional soft count-in (muffled), rustling sheet music.' And within seconds, there she is, in video so realistic, you can even see individual hairs on her head highlighted by the sun. She's almost tangible. The music is swelling. And no human lifted a single camera. What we're witnessing with the launch of Google DeepMind's Veo 3 isn't some gimmicky tech demo or mere novelty for nerds on X. This seems more like the starting pistol for the next great creative-industrial upheaval – and if you're in the business of making or investing in content, it's time to get serious. Yes, Veo 3 may be limited to eight seconds today. But that's not a wall; it's a runway. And if you've been paying any attention to the exponential trajectory of AI development, you know where this might go next. Longer clips, then full scenes, entire episodes… and eventually, complete seasons. Perhaps one day, personalized stories crafted in real-time based on what you like to watch. It's coming – . This could be the beginning of the end of Hollywood as we know it… And the start of a new era of AI stock dominance in the content world. Obviously, this isn't the industry's first attempt at AI-generated video. Runway's Gen-2 was a cool prototype. OpenAI's Sora looked great in a lab. But Veo 3 is different. It's the first model with: 4K visual quality fully integrated audio cinematic camera movement deep prompt adherence and, crucially, a launch partner with billions of users and a roadmap to global rollout In our view, Google has aimed a shotgun full of GPU clusters directly at Hollywood's business model. And Veo 3 is just the tip of the spear. Behind it are entire pipelines – Gemini-powered plot generators, scriptwriting agents, motion planners, and real-time editors. Google is compressing the entire TV and film production supply chain into a single generative stack. Do you know what happens when you take a years-long, $100-million content pipeline and squeeze it down into a GPU-powered prompt that costs pennies? You break the game… If you work in video production – or the hundreds of satellite roles orbiting it – AI just kicked in your trailer door with Veo 3. Think about it. With this quantum leap in AI's video generation capabilities, actors could soon be replaced by photorealistic avatars and voice clones. No need for makeup artists; glam will be digitally rendered in post. Goodbye, set designers; hello, infinite virtual stages. Cinematographers? AI models now handle camera movement with humanlike precision. Now, writers, you're still needed… but you'd better learn to prompt. This might feel like sci-fi, but it's more so basic economics. Studios are always hunting for ways to reduce cost and time. And AI doesn't sleep, unionize, forget lines, or demand a four-figure payday. That's why we expect that over the next five to 10 years, AI will eat the technical backend of filmmaking the way Amazon ate retail – and with the same ruthless cost-efficiency. The same kinds of players always win when the tech curve steepens: Those who ride the exponential wave instead of trying to fight it. Take Netflix (NFLX) – Blockbuster killer; once DVD-dealer, now data king in entertainment. It knows what you watch, when you watch it, what you love, and what you hate. Imagine what an AI script engine could do with all that data. You're a fan of fictional period romance stories? Netflix's AI could create 10 different versions of the next Bridgerton, testing which hooks you harder – then instantly generate the winner in full. Or how about Alphabet (GOOGL)? It runs YouTube and Veo 3 – the delivery pipelines and creative infrastructure. Combine Veo with Gemini and YouTube Studio, and you've got a vertically integrated AI content machine with billions of monetizable eyeballs. And then there's Meta (META). It's got LLaMA, Emu, and a raging addiction to immersive content. Just picture Veo-level video generation tailored to your social graph, optimized for infinite scroll, and seamlessly injected into Instagram, Threads, and the Metaverse. Engagement meets hallucination. And the rest of Hollywood? Well… Legacy studios, crew-heavy productions, anyone betting their future on union-only sets and hundred-million-dollar shoots… it seems you are on notice. The economics just changed – permanently. When it comes to AI-native studios that can churn out hyper-targeted content at 1/100th the cost and 100x the speed, there's no competition. And it's not likely that audiences will resist. Pundits said the same thing about CGI, YouTube, reality TV, TikTok. People don't care how it's made. They care how it feels. And if AI gives them a hit of dopamine, they'll hit 'Next Episode' without a second thought. This latest AI breakthrough feels a lot like the early 2000s, when Amazon used the internet to undercut brick-and-mortar retail. Lower costs, faster delivery, wider selection. Incumbents laughed… until they went bankrupt. Remember Sears, JCPenney, K-Mart? Same script, different industry. AI is the internet. Veo 3 is Netflix is Jeff Bezos, sitting atop its throne with a popcorn bucket in hand. And once one company starts passing cost savings to consumers with cheaper subscriptions, faster content cycles, and more personalization, others have to follow. That's how you get a full-blown economic reset. Currently, Veo 3 is available to select creators via waitlist — but given Google's track record with rapid deployment, widespread rollout to YouTube creators and enterprise partners could come quickly. Here's what we think could be next: Custom AI-generated series and movies tailored to individual users Interactive stories where the plot evolves based on viewer engagement Fan-generated shows that rival studio hits Ad-supported, AI-produced content that costs nothing to stream Veo 3's launch proved that the AI Content Economy is just around the corner. We are years – not decades – away from this becoming a widespread reality. So, if you're an investor, go long AI. This breakthrough tech is eating the whole global economy. Hollywood is just one entree in a seven-course meal. Buy the platforms, AI chipmakers, infrastructure enablers, and appliers – Alphabet, Meta, Nvidia (NVDA) – and yes, Netflix. These are the architects of the new media world. Learn to prompt like a boss; curate, direct, and remix. AI is the orchestra, but someone still has to conduct. And if you're in denial, you might want to check the mirror – and ask Blockbuster how things shook out after ignoring the curve. AI's industrialization of content creation isn't a theory anymore: it's a living, accelerating disruption. Veo 3 marks the moment when generating Hollywood-quality video no longer requires Hollywood-scale budgets. And we're just at the starting gate. Just as streaming upended cable and smartphones reshaped the internet, generative video is about to redefine content itself – who created it, how fast it's made, and who profits. The big studios? Maybe. But more likely, it'll be the AI-native platforms, the chipmakers, and the investors who saw it coming. And yet, Veo 3 is just one front in a much broader AI revolution. While the world watches digital actors take center stage… Another trillion-dollar transformation is forming in the wings. Humanoid robots – what we're calling '' According to Morgan Stanley (MS), this market could be worth as much as $30 trillion in the coming decades. That's bigger than today's global e-commerce and cloud computing markets combined. Why? Because humanoid robots won't just generate videos or write code. They'll do the jobs. Real, physical tasks in factories, on farms; in homes, hospitals, and warehouses. Every job the global economy depends on could be automated, accelerated, and made profitable at scale. And it's all happening faster than most expect. . The post Google Just Kicked In Hollywoodâs Trailer Door appeared first on InvestorPlace.


Hindustan Times
04-06-2025
- Business
- Hindustan Times
Google Photos brings text search feature to find images- Know how it works
Google introduced a new advanced feature to the Photos app that lets users search images using text in quotes. It is a Gemini-powered feature that uses natural language queries to search for relevant images. This text search feature can make image search smarter and more seamless for users who are looking for older images, or even a specific person or place. Photos will also let users put specific words within the image to locate it in the app. Therefore, if you are someone who struggles to find images on the Photos app, then this new text search feature could come in handy. Know more about the new search feature on the Photos app and how it works. Also read: Google to let users test Android 16 desktop mode on phones with external display support, here's how On Tuesday, Google made a small announcement about the latest Photos app features, which allow users to search images using text. Google said, 'You can now use quotes to find words in images' across Android, iOS, and the web. Within the Google Photos app, users can use text matches to find filenames, camera models, captions, or text within photos. However, the query or search prompt should be mentioned in quotation marks. Doing so will enable the app to find exact text and visual matches. Google also provide examples of queries such as 'Alice and me laughing,' colourful sunsets in Mexico, 'Emma at the playground.' This feature can be useful in finding screenshots and documents using specific keywords. Also read: Uber users can now book Delhi Metro tickets within the app: Here's how to do it Step 1: Open Google Photos app on your smartphone Step 2: Click on the Search tab placed at the bottom Step 3: Type your search queries with quotes. Step 4: Based on the text, the app will provide users with related text in photos or visual matches. Mobile Finder: Google Pixel 10 Pro LATEST specs, features, and price This feature is currently available on Android, iOS, and the web version of Google Photos as well. However, make sure to use natural language queries in English for the best search results. With such AI-powered features, Google is bringing greater AI integration into its apps and services, making it easier for users to manage tasks.


Forbes
27-05-2025
- Business
- Forbes
Your Google Smart Home Is About To Get Smarter
Google natural AI language alerts from Nest Cams Google is taking a big step toward making your smart home feel, well, smart, by baking in Gemini AI into the Home APIs that developers use to build their own apps and devices. Announced during Google I/O 2025 last week, the move means third-party developers will be able to bring Gemini-powered features such as AI security camera summaries, natural-language routines, and proactive automation suggestions into their own smart home apps. It means that, even if you never open the Google Home app, you could still benefit from all that AI brainpower behind the scenes. Brands such as Wyze, Ring and Roborock and Govee have introduced their own AI features in the past few months, but the move from Google means their devs could potentially offload this sort of stuff to Gemini within the Home APIs. At I/O last year, Google first launched the Home APIs and gave a few early partners access to its software and we've already seen the likes of ADT, Yale, Cync, iRobot and Motorola take advantage. Since then the ecosystem's grown from 600 million to over 750 million supported devices, with Matter giving that number a boost as it's grown in popularity. One of the more headline-grabbing features from the new AI smarts is the rollout of Gemini-powered capabilities for Nest Cams; the same ones Google started testing in the Home app last fall. Developers can now bring live view, event history, two-way talk, and camera settings into their own apps, but also take things further with AI-generated summaries and searchable video history. So you could ask something like:'Did anyone open the back door last night?' and get a relevant clip in return. That same functionality could land in apps like ADT's or even other home security platforms using the Home APIs, without requiring Google's own UI. Google is also trying to fix one of the smart home's biggest pain points of complex routine setup. With the new Gemini-powered features in the Automations API, developers can offer natural language-based automation tools that let users just say what they want: 'Turn on the garage light when I walk in there,' and let the system figure out the rest. It's similar to the natural language prompts that have arrived with Amazon's Alexa+. Gemini will also proactively suggest useful routines based on what devices are in the home too, potentially surfacing ideas for automations that users wouldn't have thought of on their own. Google says that the new Gemini-powered APIs will roll out to a small group of early access partners later this year. You'll need to sign up the Developer Newsletter if you want be among the partners. Gemini-powered experiences are already showing up in public preview on Nest Audio and Nest Mini speakers for users with a Nest Aware subscription, offering deeper context and more natural back-and-forth control. With a revamped Siri set to launch later this year - potentially with new Apple smart home hardware - and the AI supercharged Alexa+ already rolling out, it's going to be fascinating to see how the evolution of the smart home revolution plays out.


Tom's Guide
22-05-2025
- Tom's Guide
You can generate AI images in Google Docs with Gemini — here's how
Google Docs has just quietly rolled out one of its most useful Gemini-powered features: built-in image generation. This integration allows you to create custom visuals directly within your documents without switching between apps or navigating to external AI tools. The technology behind this feature is Google's Gemini, which can transform text descriptions into detailed images. While similar to standalone AI image generators, having this capability built right into Google Docs makes it easier to create all kinds of content. The image quality is solid (no surprise there) but the real win is being able to whip up visuals right inside the doc without breaking your flow. Here's how to generate images in Google Docs. Open your Google Doc (new or existing) and position your cursor where you want the image. Click Insert in the top menu, select Image, then choose Generate image from the dropdown options. This opens the Gemini image generation panel. Type a detailed description in the prompt box. Be specific about subject matter, colors, lighting, style, and any other elements you want to see. The clearer your prompt, the better your results will be. For example, instead of "beach scene", try something like "a sunny tropical beach with palm trees and turquoise water at sunset." This step is optional. Before generating, you can customize your image settings using the options below the prompt box. Click the drop-down arrow next to Add a style to select artistic styles like watercolor, photorealistic, or cartoon. You can also choose between square, wide (landscape), or tall (portrait) formats based on what you're after. Click Create to generate your image. Gemini will produce four different variations based on your description. Take a moment to review all options before making your selection. If none match your vision, click View more to generate additional variations. When you find an image you like, simply click on it. The selected image will be automatically inserted into your Google Doc at your cursor position. You can then resize or reposition it as needed to fit your document layout. Now you've learned how to generate your own images in Google Docs, why not take a look at our other useful guides? Check out I used Gemini to find the best haircut for my face shape — here's how and 5 smart ways to use Gemini Live with your phone right now. And if you want to stop your browser draining your laptop battery, we've got you covered. Get instant access to breaking news, the hottest reviews, great deals and helpful tips.