logo
#

Latest news with #NotebookLM

Google's Audio Overview can turn those boring documents into engaging podcasts
Google's Audio Overview can turn those boring documents into engaging podcasts

Mint

timea day ago

  • Mint

Google's Audio Overview can turn those boring documents into engaging podcasts

The New Normal: The world is at an inflexion point. Artificial Intelligence is set to be as massive a revolution as the Internet has been. The option to just stay away from AI will not be available to most people, as all the tech we use takes the AI route. This column series introduces AI to the non-techie in an easy and relatable way, aiming to demystify and help a user to actually put the technology to good use in everyday life. The first time I heard an article I had written being discussed, I sat up and listened in utter surprise. Two people I had never come across before were deep in conversation about what I'd written. This man and woman team went through everything, making up a slick podcast. These were AI voices that sounded totally natural and pleasant. This kind of conversation is generated by a feature called Audio Overview. To experience it immediately, download the Gemini app on your phone. Tap the plus sign at the bottom and navigate to one of your documents. Once uploaded, see the tab on top of it, click - and go make yourself a cup of coffee. By the time you get back with your streaming cup, the Audio Overview should be ready. Click, as indicated, and sit back to listen. The two AI hosts will now talk about your content. And they do so with impressive clarity and skill. It's no gimmick or party trick. Also read: Why India is so far behind in the fight for AI supremacy Listening to content can be a great way of absorbing it. Anyone can get tired of reading, since we have to do so much of it each day. As long as you have content that is in a Word file, plain text, a PDF, or Google doc, you can feed it to Gemini to turn it into an Audio Overview. I was putting off going through an 83-page document, when I figured I could quickly get the general gist of it with an Audio Overview. At work this can really help productivity. It's also great for just giving your eyes a rest. If you happen to have a visual impairment, the feature is a relief as you can get so much more done. NotebookLLM: podcasts from anything Audio Overview can be even more magical in its original home, Google's NotebookLM. To find that, go to your browser on any device and type NotebookLM in the search bar. Sign in with your Google account and you're in. Add up to 50 items of content including articles, notes, YouTube videos, presentations and more, to make up a notebook. All of these will be combined into an Audio Overview or a more full-fledged Deep Dive conversation through the Chat and Studio tabs. This does take a few minutes, so find something else to do for a bit. Once the conversation is ready you can listen in the browser, or download for later. Or even share it. This amazing audio feature gives you more control in NotebookLM than it does in Gemini. NotebookLM does have an app, but that doesn't seem to have all the features. You can select the playback speed, the length of the conversation, and incredibly even the language the AI hosts should speak. And yes, Hindi is on the list, making it possible to reach a wider audience with that content. It's easy enough to imagine the feature being used for training and education, making it so much more widely useful. Also read: AI didn't take the job. It changed what the job is. As if all this weren't impressive enough already, here's another way you can control the conversation. In NotebookLM you'll also find a Customise tab for the Deep Dive audio. Here, you can actually describe what you want the hosts to focus on. Request a focus on some selected aspect of the content, or ask to keep the language simple or technical. You have the option of deleting the conversation and re-generating it with fresh instructions. You can easily create a conversation in multiple languages for use with different audiences, or change the difficulty level. If you visit aistudio via the browser, you'll see that Google is experimenting with users being able to change the accent or style of speaking in a feature called Native Speech Generation. There's no announcement to the effect but one can easily see how this could be added to Audio Overview sometime. It works very well and is fascinating to try out. Join the conversation Another impressive but experimental feature lets you actually 'join' the podcast, by tapping a button. Interrupt the hosts and ask a question or make them change focus or ask for a comment on your opinion on the subject. This is a little slow and you'll be left wondering if the hosts heard you at all, but I fully expect it to become more fluid in the future as Google adds new features quite frequently. Also read | Mary Meeker's AI report: Decoding what it signals for India's tech future Audio Overview isn't flawless, but chances of getting things wrong are minimised because it's you giving the content. The feature has worked well enough for Google to have brought it to Search, where it will give you AI Overview in audio form – being tried out in the US first. Mala Bhargava is most often described as a 'veteran' writer who has contributed to several publications in India since 1995. Her domain is personal tech and she writes to simplify and demystify technology for a non-techie audience.

Is Google secretly testing new AI voices in NotebookLM?
Is Google secretly testing new AI voices in NotebookLM?

Android Authority

time2 days ago

  • Android Authority

Is Google secretly testing new AI voices in NotebookLM?

Andy Walker / Android Authority TL;DR NotebookLM users are hearing a mysterious third male voice during longer Audio Overviews. Google has confirmed that additional voices and dialects are coming, alongside APIs and video overviews, but it's unclear if this third voice is a feature or a bug. AI features can be quite a hit and miss, but Google's NotebookLM is definitely one of the hits. NotebookLM is a personal AI research assistant that lets users control the AI's data sources, enabling them to tap into AI to do their bidding without hallucinations. NotebookLM's Audio Overviews feature can transform your notes into a podcast, complete with two AI hosts discussing your data. The feature works great, but if you want some variety, a third AI host seems to be on the way. Reddit user Life_Machine_9694 (h/t XDA) observed a third male voice sprinkled in their Audio Overviews conversation, complementing the existing male and female voices currently running the AI podcast. Other users have also observed the same, especially when they attempt to create a longer-form podcast. One user notes this is a glitch; they have noticed more glitchy voices lately in Audio Overviews. Glitch or not, Google has promised that new voices and dialects are coming soon to NotebookLM, along with new source types, APIs, and even video overviews. It's fair to presume that Google is secretly testing out the new voices, but I reckon the company would do it openly if that were the case instead of giving users a fleeting sneak peek. If new voices are coming, it also makes sense that users would get the option to choose which ones they want in the Audio Overviews — some deep dive topics are best handled by two hosts, after all. We've asked Google to clarify whether this is a bug or a feature. We'll keep you updated when we learn more. Got a tip? Talk to us! Email our staff at Email our staff at news@ . You can stay anonymous or get credit for the info, it's your choice.

Build a Powerful Custom AI Research Stack : AI Tools Redefining Research Workflows
Build a Powerful Custom AI Research Stack : AI Tools Redefining Research Workflows

Geeky Gadgets

time3 days ago

  • Business
  • Geeky Gadgets

Build a Powerful Custom AI Research Stack : AI Tools Redefining Research Workflows

What if the future of research wasn't just faster, but fundamentally smarter? Imagine a world where the most complex questions—like the economic ripple effects of automation—could be unraveled with precision and clarity in a fraction of the time. This isn't a distant dream; it's the reality being shaped by an overpowered AI research stack. Tools like NotebookLM, Grok, and Gemini are redefining how we approach data, analysis, and collaboration. These aren't just tools—they're co-researchers, capable of synthesizing vast datasets, generating actionable insights, and even challenging traditional workflows. But with such fantastic power comes a pressing question: are we ready to embrace the full potential of AI in research, or will we be held back by our own limitations? In this breakdown, David Shapiro shares his research stack and explores how these innovative platforms are transforming research workflows, particularly in fields like post-labor economics. You'll discover how NotebookLM turns sprawling datasets into intuitive visualizations, how Grok and Gemini keep research grounded in real-time discourse, and how the integration of tools like GPT-4 (03 Pro) streamlines everything from hypothesis testing to publishing. Whether you're a seasoned researcher or simply curious about the future of knowledge creation, this journey through the AI research stack will challenge your assumptions and spark new ideas. After all, when technology evolves faster than our questions, the real challenge is learning how to ask better ones. AI Tools Transform Research AI-Driven Research Workflow AI tools are at the core of modern research workflows, offering unprecedented capabilities to generate expert-level outputs. GPT-4 (03 Pro) serves as a cornerstone, using advanced natural language processing to refine research questions, test hypotheses, and draft detailed reports. Complementing this, tools like NotebookLM, Grok, and Gemini bring specialized functionalities that enhance data management and analysis. NotebookLM: This tool excels in organizing and exploring large datasets. It enables the creation of mind maps and supports context-based queries, making complex data more accessible and easier to interpret. This tool excels in organizing and exploring large datasets. It enables the creation of mind maps and supports context-based queries, making complex data more accessible and easier to interpret. Grok and Gemini: These platforms provide real-time insights from academic and social discourse, making sure that research remains relevant and grounded in current developments. These platforms provide real-time insights from academic and social discourse, making sure that research remains relevant and grounded in current developments. GitHub Pages: By exporting research outputs to GitHub Pages, findings can be hosted in a version-controlled, publicly accessible format, promoting transparency and collaboration. Together, these tools create a seamless and efficient workflow. They simplify the management of extensive datasets, ensure high-quality outputs, and foster open collaboration among researchers and stakeholders. Exploring Post-Labor Economics The shift toward a post-labor economy, driven by advancements in AI and robotics, represents one of the most significant challenges of our time. This AI-powered research workflow is particularly well-suited to examining the economic implications of automation, including its impact on labor markets, the rise of capital ownership models, and the need for policy-driven transitions. AI tools enable researchers to conduct comprehensive literature reviews and synthesize data from diverse perspectives. Key themes explored in this field include: The structural challenges automation poses to traditional wage labor systems. The advocacy for innovative distribution mechanisms, such as universal basic income (UBI). The critical role of active policy interventions in guiding economic transitions effectively. These insights are compiled into purpose-built research papers and shared in open-access repositories under Creative Commons licensing. This approach ensures that findings are accessible to a broad audience, fostering collaboration and encouraging informed discussions on the future of work and economic systems. Powerful AI Research Stack Watch this video on YouTube. Expand your understanding of AI-powered research with additional resources from our extensive library of articles. Maximizing the Potential of AI Tools Each AI tool in this workflow plays a distinct role in enhancing research efficiency and depth. Their combined application allows researchers to tackle complex topics with greater precision and speed. Here's how these tools contribute: GPT-4 (03 Pro): Refines research questions, assists hypothesis testing, and generates comprehensive reports with expert-level detail. Refines research questions, assists hypothesis testing, and generates comprehensive reports with expert-level detail. NotebookLM: Organizes and visualizes complex datasets, supports mind mapping, and enables contextual querying for deeper insights. Organizes and visualizes complex datasets, supports mind mapping, and enables contextual querying for deeper insights. Grok and Gemini: Offer real-time feedback and insights from academic and social contexts, making sure research remains relevant and well-informed. Offer real-time feedback and insights from academic and social contexts, making sure research remains relevant and well-informed. GitHub Pages: Provides a platform for hosting research outputs in a transparent, version-controlled format, encouraging public engagement and collaboration. By integrating these tools into their workflows, researchers can streamline processes, improve the quality of their outputs, and maintain a transparent and collaborative research environment. Key Findings and Research Outputs The application of this AI-powered workflow has already resulted in the production of over 50 research papers addressing various aspects of post-labor economics. These papers highlight several critical findings: Automation poses a significant structural threat to traditional wage labor systems. Broad-based capital ownership and innovative distribution mechanisms are essential to mitigating economic inequality. Active policy interventions are necessary to manage economic transitions effectively and equitably. Ongoing debates persist over solutions, such as universal basic income versus alternative approaches. By making these findings publicly available, the workflow not only promotes collaboration but also deepens the understanding of the challenges and opportunities presented by automation and economic transformation. Addressing Challenges and Limitations While AI tools offer numerous advantages, they also present challenges that require careful consideration. Managing large datasets and making sure the quality of outputs demand significant oversight. Researchers must strike a balance between AI-generated insights and human interpretation to avoid over-reliance on automation. Additionally, addressing gaps in public understanding and effectively communicating findings are essential to making sure that research outcomes are both accessible and actionable. Future Directions in AI-Driven Research The continued refinement of AI tools and methodologies promises to expand their applicability to a broader range of complex topics. In the context of post-labor economics, this workflow aims to culminate in the publication of a comprehensive book synthesizing insights gained from AI-driven research. By using these tools, researchers can explore new frontiers, contribute to global discussions, and shape policies that address the challenges of automation and economic transformation. Media Credit: David Shapiro Filed Under: AI, Top News Latest Geeky Gadgets Deals Disclosure: Some of our articles include affiliate links. If you buy something through one of these links, Geeky Gadgets may earn an affiliate commission. Learn about our Disclosure Policy.

Google Adds Button to Generate Error-Laden AI Podcast About Your Search Results Instead of Just Reading Them Like a Functioning Member of Society
Google Adds Button to Generate Error-Laden AI Podcast About Your Search Results Instead of Just Reading Them Like a Functioning Member of Society

Yahoo

time4 days ago

  • Yahoo

Google Adds Button to Generate Error-Laden AI Podcast About Your Search Results Instead of Just Reading Them Like a Functioning Member of Society

Google has released a baffling new AI feature that turns your web search into a podcast. Why anybody would want to enable the feature is unclear. Why be plagued by misleading and hallucinated AI Overviews search results when you can have a robotic voice read them out loud instead? Have we really lost the ability as a species to parse written information, nevermind original sources? The opt-in feature — which currently lives inside Google's experimental "Labs" section and has to be manually turned on — harnesses the power of the company's Gemini AI model to turn a search query into "quick, conversational audio overviews." According to the tech giant, an "audio overview can help you get a lay of the land, offering a convenient, hands-free way to absorb information whether you're multitasking or simply prefer an audio experience." But is this anything anybody really asked for? Having two fake podcast hosts rant about a subject you're researching — likely with a smattering of hallucinations — sounds like an incredibly counterintuitive and needlessly obtuse way to get quick access to information. The feature first surfaced last year as part of Google's NotebookLM, a note-taking tool that uses AI to help users organize their thoughts and summarize notes. An "Audio Overviews" feature can then take your notes and turn them into AI-generated podcasts, with often unintentionally hilarious results. While AI researchers have gushed over the feature, using it to turn Wikipedia pages into hours-long podcast episodes they allegedly listen to, we still can't shake the feeling that Google may be barking up the wrong tree. Particularly when it comes to search results, where speed has conventionally trumped anything else, turning AI summaries into rambling audio snippets sounds pretty exhausting. Besides, if Google's AI Overviews are anything to go by, the tech's propensity to make up facts is still enormous. The feature has been plaguing users with outright wrong and misleading information for quite some time now, with users desperately reaching out to Reddit to find ways to disable it. It's a sign of the times, with tech companies desperately looking for ways to shoehorn AI into every aspect of our digital lives to justify their enormous investments in the space. Soon we won't just be inundated with AI slop in text and image format; a fake podcast host could one day be talking your head off while you're simply trying to figure out the winner of the Pedro Pascal lookalike contest in Brooklyn. More on Google AI: Google's AI Is Actively Destroying the News Media

Here's why you should be excited about Audio Overviews coming to Google Search
Here's why you should be excited about Audio Overviews coming to Google Search

Yahoo

time4 days ago

  • Yahoo

Here's why you should be excited about Audio Overviews coming to Google Search

When you buy through links on our articles, Future and its syndication partners may earn a commission. Google is testing the NotebookLM feature Audio Overviews in Search The feature will offer short, AI-generated audio summaries for certain queries. The feature uses Gemini models to deliver podcast-style explanations with clickable links. I've been a fan of the Audio Overviews feature in Google's NotebookLM since I first experimented with it last year. Now, it's coming to Google Search, currently only as a test in the Labs, but it brings a more bite-sized version of the AI-generated "podcasts" that I like in NotebookLM. Once you've opted in through Labs, you'll start seeing a little prompt on some search results pages saying, 'Generate Audio Overview.' Tap that, wait about 30 to 40 seconds, and out comes a compact audio clip of around five minutes, sometimes less, that explains what you looked up in the form of two AI-generated voices having a discussion. Not too deep, but not one-sentence shallow either. Think of a middle ground between 'Wikipedia rabbit hole' and 'I read the headline only." While you listen, the audio player stays docked in your results page, showing clickable links to the sources the AI pulled from. You can keep browsing, tap into related articles, or just listen and absorb. If you like what you hear, you can give it a thumbs up. If it's egregiously wrong, the thumbs down is there too. Though similar to what NotebookLM does with its Audio Overviews, the Search version has one major difference. NotebookLM only uses documents you upload, YouTube videos, and websites you specifically link to. Google Search's version pulls from public web content. That can be good or bad, depending on what you look up. Something straightforward and scientific might be fine, but a discussion about the best movie ever might get a different audio track every time you look. Here's an example I recorded a clip from. It's hardly perfect, and while the voices are good, they are still AI voices. You also might notice it parroting phrases straight from someone's Reddit post. But it is listenable and, as Google points out, hands-free, with the option to adjust the speed of the speakers and the links there to provide more context. You can speed it up or slow it down, skip around, or follow the links as you go. It's AI-enhanced Search, not a new audiobook. For now, not every search will offer to create an Audio Overview. You also have to be in the U.S. and sign up for Labs right now. But, I'd expect it to have a general release pretty soon. Then you can ask how lithium-ion batteries work or why Roman concrete is still standing, and get a nice mini discussion from digital characters. Think of it as how video summaries and image carousels brought new dimensions to how we take in information online. Audio Overviews are another aspect of that and a win for auditory learners or people with visual impairments, With OpenAI and Perplexity and a dozen AI search engines nipping at its heels, Google needs whatever tricks it can muster to stand out and an AI podcast as the answer to a serach is definitely one way to be unique, at least for now. One of my favorite AI tools is getting an iPhone app, and here's why you should install it I used NoteBookLM to help with productivity - here's 5 top tips to get the most from Google's AI audio tool Google will turn those long documents into your next favorite podcast I fed NotebookLM a 218-page research paper on string theory and the podcast results were mind-blowing

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store