327 points by saikatsg 7 days ago | 23 comments
  • ljoshua7 days ago
    NotebookLM audio overviews/podcasts have been an absolute boon for my homeschooled kids. They devour audiobooks and podcasts, and they love learning by listening to these first. Then when we come together for class, we discuss what was covered, and can spend time diving into specifics or doing activities based on the content. It’s super nice to have another option for a learning medium here.

    To generate them, we scanned the physical book pages, then used a simple Python script to feed the images into GCP’s Document AI to extract the text en masse, and concatenated the results into a text-only version of the chapter. Give that text to NotebookLM and run with it.
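
    A minimal sketch of what such a script might look like, not the actual one described above. It assumes the `google-cloud-documentai` client library and an existing OCR processor; the project, location, and processor IDs are hypothetical placeholders, as are the function names. The OCR step is passed in as a callable so the concatenation logic can be tried without any cloud setup.

```python
from pathlib import Path
from typing import Callable, Iterable


def ocr_with_document_ai(image_bytes: bytes, mime_type: str = "image/jpeg") -> str:
    """OCR one page image with Document AI (needs google-cloud-documentai)."""
    # Imported lazily so the rest of the script works without the library.
    from google.cloud import documentai

    # Hypothetical placeholders: substitute your own project and processor.
    project_id, location, processor_id = "my-project", "us", "my-processor-id"

    client = documentai.DocumentProcessorServiceClient()
    name = client.processor_path(project_id, location, processor_id)
    request = documentai.ProcessRequest(
        name=name,
        raw_document=documentai.RawDocument(content=image_bytes, mime_type=mime_type),
    )
    return client.process_document(request=request).document.text


def build_chapter_text(pages: Iterable[Path], ocr: Callable[[bytes], str]) -> str:
    """OCR each page in filename order and join the non-empty results."""
    texts = [ocr(page.read_bytes()).strip() for page in sorted(pages)]
    return "\n\n".join(text for text in texts if text)
```

    Usage would be something like `build_chapter_text(Path("scans/ch01").glob("*.jpg"), ocr_with_document_ai)`, assuming the scans are named so that they sort in page order (for example `page_001.jpg`, `page_002.jpg`).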

    • SecretDreams7 days ago
      I've used them. They're very nifty. Google did good here.

      One thing I'll note is they only cover the "high level" aspects. No depth. I'd recommend them for someone who is either already very knowledgeable or for someone not at all knowledgeable who is looking for an overview before they plan to do deeper learning/studying through reading.

      • bbatsell7 days ago
        > or for someone not at all knowledgeable who is looking for an overview before they plan to do deeper learning/studying through reading

        Yep. This is what I have used them (sparingly) for — a scaffold to build the deeper learning onto. My brain struggles to retain information when it doesn’t have a high-level understanding of how/why a system works and how individual parts connect and interact, even if it is all eventually revealed later.

    • rosquillas7 days ago
      Why not simply upload the PDF version of the scanned book or document? Extracting the text out of a scanned document via the GCP Document AI API sounds like an unnecessary use of resources.
      • ljoshua6 days ago
        I was running into context window issues doing this. I could have gone in and split up the scanned book into chapters or something to get around this, and did that for a couple of subjects. But it wasn't too much work (and literally cost me pennies, like six of them) to get the pure text extract, and it's pretty easy to work with now. (Besides, which random dev doesn't love a little side challenge to explore new APIs at home every now and then? ;) )
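
        For the splitting route mentioned above, a rough sketch of cutting extracted text into smaller parts at paragraph breaks. The function name and `max_chars` are hypothetical; the thread does not state what size limit applies, so the value would have to be chosen by trial.

```python
def split_into_parts(text: str, max_chars: int) -> list[str]:
    """Split text at blank-line paragraph breaks into parts of at most max_chars.

    A single paragraph longer than max_chars is kept whole as its own part.
    """
    parts: list[str] = []
    current = ""
    for para in (p.strip() for p in text.split("\n\n")):
        if not para:
            continue
        candidate = f"{current}\n\n{para}" if current else para
        if current and len(candidate) > max_chars:
            # Adding this paragraph would overflow: close the current part.
            parts.append(current)
            current = para
        else:
            current = candidate
    if current:
        parts.append(current)
    return parts
```

        Each returned part could then be saved as its own text file and uploaded as a separate source.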
    • suddenlybananas7 days ago
      I hope you encourage your kids to actually read as well.
      • ljoshua7 days ago
        Oh don’t worry, they make excellent use of their library cards. :)
    • mleonhard7 days ago
      > for my homeschooled kids

      Learning requires making mistakes. Kids need to learn social skills in low-stakes environments. School is the best environment for this. When a person misses this part of their childhood education, they may struggle to learn these skills later in life.

      • ljoshua6 days ago
        It sounds like you may be speaking from experience, and if so, I respect that.

        My kids have done both public schooling and now homeschooling. For a variety of personal reasons, public schooling was not going to be an option for a couple of them, so we're trying this out now and it has been successful. We are tightly integrated into a very active church group, and they have lots of social interactions on a regular basis there, as well as opportunities with other homeschooled kids around town.

        It's definitely a balance, and there's no one silver bullet on either side of the fence, but the best any of us can do is actively strive for giving each child the best and most appropriate experiences for them.

        • mleonhard6 days ago
          The ability to recognize sociopaths and manipulation is an important life skill which may not be obtainable at your church activities or with trusted families. People without these skills may be manipulated in the workplace and suffer avoidable career setbacks, stress, and attendant health problems.
  • mikeocool7 days ago
    NotebookLM podcasts are like a caricature of a real podcast. Every little verbal technique or narrative style that might be used by a normal podcaster in a subtle way is taken to an extreme.

    In the last one I listened to, one host would repeat a keyword or phrase the other host had just said for emphasis, except they did it incessantly, with multiple words in every sentence for many sentences in a row.

    • gervwyk7 days ago
      Although I 100% agree, there is still a place for it. We place generated conversations with our case studies, and have received positive feedback so far, especially from the non-technical crowd. See example: https://resonancy.io/case-studies/flava-process-digitization

      Of course, one can invest more in authenticity, but for what it is, I believe it is good bang for the effort.

      Also, if you listen to it for a while, and get over the initial cringe, it becomes enjoyable, at least for me. Some visitors even asked if it was AI-generated. lol

      I'm excited and frightened about a future where it's more real. This was a cool comparison I came across recently [2].

      Interestingly, I saw today that Descript's avatars are made to sound and look non-realistic on purpose, I guess to avoid all kinds of issues, but they claim they want to leave something authentic on the table for real content, which I think is a good move.

      [1] - https://resonancy.io/case-studies/flava-process-digitization
      [2] - https://yummy-fir-7a4.notion.site/dia

      • hammock6 days ago
        Using them on case studies is a really interesting use case
      • Jolter7 days ago
        I really enjoyed the “fire!” example. Very naturalistic!
    • meta_ai_x7 days ago
      The Google NotebookLM team should take this comment as a badge of honor that they must be doing something right.

      HN is the worst place to get product feedback (and I'm sure the NotebookLM team has internal metrics that validate their approach).

    • sakopov7 days ago
      Yeah it was incredible in the beginning because it was so novel. Now it's just annoying. Half of the dialogue is repeated and it takes forever to get a point across. Never used NLM, but I wonder if that's something that can be tuned out?
      • BakeInBeens7 days ago
        You can always use interactive mode and ask the podcaster for exactly what you want.
    • Spooky237 days ago
      It sounds like an NPR podcast; those have been parodying themselves for a long time.
    • michelb6 days ago
      I have to say it's a caricature of a lot of primarily American podcasts.
    • latentsea7 days ago
      > NotebookLM podcasts are like a caricature of a real podcast. Every little verbal technique or narrative style that might be used by a normal podcaster in a subtle way is taken to an extreme.

      So true.

    • mensetmanusman7 days ago
      That sounds like a good comedy sketch!
    • retinaros7 days ago
      It is slop in ways that even ghibli OAI is not. I never understood why it ever got good press
  • tkgally7 days ago
    I tried it with Japanese, and it sounded about as good as in English. Only at one point did it sound unnatural. Japanese two-person conversation uses a lot of backchannelling (aizuchi), that is, semilinguistic sounds made by the listener to indicate attention and emotional reaction. At one point, the female voice said very distinctly "fumu fumu," which is how such aizuchi might be written in a script or manga. In actual speech, though, it would be a continuous sound without syllables and with a rising and/or falling intonation.

    That brief TTS-like moment was the only time I was reminded that the voices were not human.

    • latentsea7 days ago
      Sometimes you actually say fumu fumu out loud in a conversation for comedic effect.
      • tkgally7 days ago
        Yes. In fact, I laughed when I heard the NotebookLM voice say that. It was comically out of place in the context.
        • okdood647 days ago
          Have an audio link and timestamp to this?
          • tkgally7 days ago
            The link is here:

            https://notebooklm.google.com/notebook/c36ea335-6686-474d-bf...

            The fumu fumu is at 01:50.

            The podcast is about the impact of AI on higher education in Japan. I prompted NotebookLM briefly in Japanese about the topic, and it collected ten sources in Japanese and English that it used as the basis for the audio overview.

            • latentsea3 days ago
              Hahahaha. Oh wow that is comedically out of place in a way that no human tries to use it comedically. That's... gorgeously funny. It's like fumu fumu but so deadpan in the delivery. I think I might just have to try and insert some of these completely out of place deadpan fumu fumu into my everyday speech. Too good.
  • ipsum27 days ago
    Do people find NotebookLM useful? For my use case of converting papers into podcasts, the explanations are too general (which misses the important parts of the paper) and contain too much fluff.

    I suspect that changing the underlying model to Gemini 2.5 Pro would produce better transcripts, but right now there's no way of determining what model is being used.

    • alphabetting7 days ago
      I found this prompt online and tweaking it for audio overviews works extremely well for me.

      https://open.substack.com/pub/lawsen/p/notebooklm-podcasts-b...

      Generate a deep technical briefing, not a light podcast overview. Focus on technical accuracy, comprehensive analysis, and extended duration, tailored for an expert listener. The listener has a technical background comparable to a research scientist on an AGI safety team at a leading AI lab. Use precise terminology found in the source materials. Aim for significant length and depth. Aspire to the comprehensiveness and duration of podcasts like 80,000 Hours, running for 2 hours or more.

      • smusamashah7 days ago
        Where do you put this prompt?
        • sumedh7 days ago
          In the Audio Overview, click on Customize and enter the prompt then generate the podcast.
          • venusenvy476 days ago
            Do I need to be on a paid version or Pro? I don't see Customize in Audio Overview in the notebook that I just tested.

            Edit: I actually see Customize on a notebook where I hadn't already created a podcast. But on a notebook where I had already created one, I can't find a way to Customize. I guess I just need to create a new notebook with the same source material.

            • yuzhun6 days ago
              Delete the generated audio, then generate a new one using a customized prompt.
    • dobladov7 days ago
      I find NotebookLM really useful as a book reading companion, by simply uploading the same book I want to read and asking questions about it, like:

      - List the characters in chapter [x] and add a small description about each one.
      - What's [x] device used for?
      - What happened in chapter [x]?

      It works very well, without hallucinations, and it cites the source for every answer.

    • da_chicken7 days ago
      I've found it useful for processing the documentation for our data system. The vendor provides the doc in something around 60 PDF files, and a lot of the information is poorly organized within the PDFs.

      I can say, "Hey, NotebookLM, explain the difference between feature X and feature Y to me," or, "How do I configure Z to work the way we want?" And while the answers still kinda suck because the documentation is pretty shitty, it's way faster than digging through the PDFs. And it cites the PDFs so I can (with some trouble) find the actual documentation in the PDF if I need it.

      The worst part of it is that it only accepts 50 PDFs at once.

      Honestly, though, the best use for it I've seen was when my GM added the PDF rulebooks for our TTRPG to NotebookLM. We were then able to ask NotebookLM rules questions, and it would answer us pretty well. That's what it's really great for.

      I don't care about the audio features at all. The first thing I do is close the audio pane.
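
      One possible way around the 50-PDF limit mentioned above is to merge some of the files before uploading. The sketch below only plans which files to combine, keeping them in order; the function name and file names are hypothetical, and the merging itself would need a PDF library (pypdf is one option, though that is an assumption and not something used in this thread).

```python
def plan_merges(files: list[str], max_sources: int = 50) -> list[list[str]]:
    """Group files, in order, into at most max_sources contiguous batches."""
    if max_sources < 1:
        raise ValueError("max_sources must be at least 1")
    if not files:
        return []
    n_groups = min(max_sources, len(files))
    # Spread the files as evenly as possible: the first `extra` groups get one more.
    base, extra = divmod(len(files), n_groups)
    groups: list[list[str]] = []
    start = 0
    for i in range(n_groups):
        size = base + (1 if i < extra else 0)
        groups.append(files[start:start + size])
        start += size
    return groups
```

      With roughly 60 files and a limit of 50, this would pair up the first ten pairs of files and leave the rest as single-file sources.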

    • harryf7 days ago
      It’s useful for getting summaries of long YouTube videos - I’ve found it semi helpful for improving my DaVinci Resolve skills.

      That said Google is screwing the pooch as usual by trying to make it another walled garden. Slap an API on NotebookLM already! The market research has already been done - there’s even an unofficial API https://www.reddit.com/r/notebooklm/comments/1eti9iz/api_for...

      • energy1237 days ago
        For YouTube videos, it's hard to beat (1) copying the transcript to the clipboard (e.g., from tactiq) and (2) pasting it into an LLM chat and asking for a summary.
        • Rebelgecko7 days ago
          Full disclosure: I work for Google; opinions are my own, etc.

          The LLM built into YouTube is one of the few LLM chatbots bolted onto existing apps that I actually find useful. Not just for summaries but questions like "what is the timestamp in this 2 hour video where they talk about _____".

          • jjwiseman7 days ago
            "LLM built into YouTube…" The what now? This is the first I've heard of this.
            • Rebelgecko7 days ago
              I thought it was for everyone, my bad. Turns out that, except for some educational videos, it's just for premium subscribers with certain location/language combos (you can probably guess which...)

              https://support.google.com/youtube/answer/14110396?hl=en

            • mvdtnz7 days ago
              I suspect he doesn't know he's talking about some internal tool that Google hasn't released to the public.
          • hu37 days ago
            > "what is the timestamp in this 2 hour video where they talk about _____"

            wow I gave up searching for specific timestamps in long videos before. Never again.

            Thank you!

          • aryehof7 days ago
            Thanks, but it seems that because I am outside the USA I’m not quite “premium” enough.
          • trees1017 days ago
            how do we access this?
        • skeptrune7 days ago
          It's hard for every AI product to beat that workflow lol. It works well for basically everything.
        • dieortin7 days ago
          Or just paste the video URL into Gemini and ask for a summary, no need to search for any transcript.
    • HanClinto7 days ago
      I've found it very useful for providing accessible introductions to technical papers that are otherwise difficult for me to get started with understanding.

      If I encounter a paper that is too difficult for me to digest just by reading, then I take a step back, feed it into NotebookLM, and listen to that summary. I've only done this a few times, but so far it hasn't failed to give me the overview and momentum that I need to take another stab and successfully dig into the paper and digest it on my own.

      As others have noted, it can gloss over certain details and miss important points from time to time, but overall it does a fantastic job of giving me an introduction to a complex topic and making it far less intimidating / overwhelming.

    • jsnell7 days ago
      You can enter a prompt from the "customize" dialog. Have you tried asking it to give more specifics, assume the audience is an expert on the subject, and cut down on the fluff?
    • jszymborski7 days ago
      I've run them on my own papers and, while sometimes they are accurate, they are sometimes very very wrong and misrepresent things. And I don't mean in nuanced or unimportant ways.

      The TTS is amazing, but the audio overviews are frankly useless for me.

    • Spooky237 days ago
      I used it for a bootcamp class to study for an exam. I recorded about 50 hours of lecture and Q&A, and was able to generate good Anki cards from it. What was awesome was that I could ask “make a list of all of the topics the instructor thought would be questions on the exam” and it did a great job at that.

      The podcast thing is more a novelty to me.

    • pottertheotter7 days ago
      What’s interesting is that the create podcast thing is just a feature of NotebookLM. But everyone thinks that’s what NotebookLM is
      • bongodongobob7 days ago
        It seems to be the only unique feature though. Any LLM can summarize things for me or make bullet points.
        • twoWhlsGud7 days ago
          If you have a corpus of documents you are working with (say thousands of pages of related standards docs), Notebook can be handy for doing targeted summaries of aspects of the docs with pointers back into the actual docs to the relevant source material. That's something I end up needing a lot (I've never used the podcast feature) and so it feels very differentiated to me...
        • alphabetting7 days ago
          The one other unique thing I use from them is the interactive mind maps. Like a table of contents on steroids
    • primax7 days ago
      I use it for loading up source materials and notes for a DnD campaign I run. Then I ask it questions when I need off the cuff answers, instead of researching.

      It's also good for when I can't think of anything (like a background NPC's name and backstory).

    • jcims7 days ago
      I haven't really found it interesting for technical content but do think it's somewhat useful for hashing out more subjective and/or personal things like goals, difficulties, conflict, etc.
    • bradly7 days ago
      At Shopify I worked as an engineer in financial services, and certain changes required approval by our banking partners. I was able to upload our credit policies to NotebookLM and easily ask questions without having to ping our legal team in Slack. I'm about as bearish as they come as far as AI tools go, and NotebookLM was one of the few tools that felt useful to me straight away.
  • Almondsetat7 days ago
    I really don't understand why they went with this podcast style. Sure, it makes an impression the first few times, great for a showcase or an announcement. The problem though is that it soon becomes pretty annoying, especially because the hosts go back and forth between knowing nothing and knowing everything about the topic. They should at least choose randomly which one does the explaining to whom.
    • razster7 days ago
      Absolutely agree with you, we ran into the same issue. Our company actually tried using it for our software documentation and user onboarding, hoping it would be a helpful and engaging format. But the podcast-style delivery just didn’t fit our needs. It’s fine for a quick showcase or intro, but for ongoing support or business-oriented material, the format became distracting. If only they offered alternative styles—something more structured and professional—we might have stuck with it.
    • moribunda7 days ago
      You should check new features - like asking questions as a listener.

      I don't use it a lot, but it's useful when you want to have an engaging audio interface to long (50p+) reports, which you wouldn't normally read because it's not your area of expertise or you don't have time, but you can listen while doing some cardio or chores.

  • TekMol7 days ago
    I find the podcast style audio it produces super annoying.

    Is there an easy way to simply have text read to me unaltered?

    • sega_sai7 days ago
      Absolutely the same complaint. I wanted to see if it could summarize papers well, but I just could not handle all the conversation and attempts to make it 'exciting'. Especially in areas where I already know the background.
    • threeducks7 days ago
      Over seven years ago, this was foretold exactly by the show Silicon Valley:

      https://www.youtube.com/watch?v=K3pYZwol6Dc&t=73s

      Transcript of the fridge scene:

          Fridge (after a bar code was scanned): "Ah, there we go."
          Gilfoyle: "It's bad enough that it has to talk. Does it need fake vocal tics like 'uh'?"
          Dinesh: "Well it just makes it sound more human."
          Gilfoyle: "Humans are shit. This thing is addressing problems that don't exist. It's solutionism at its worst. We are dumbing down machines that are inherently superior."
      
      I would like to have a Gilfoyle mode for NotebookLM where the machine answers only with cold precision instead of endless "Mmmhmm", "Yeah!", "Amazing!", "That's so cool!".
    • shreezus7 days ago
      It's one of those things that's impressive initially, but after generating a couple it feels quite formulaic.
    • crawsome7 days ago
      It really is an obnoxious level of over-enthusiasm.
      • kleiba7 days ago
        Give German a try - trust me, you don't have to speak the language but anyone can tell that it's quite different in tone. No valley girls in Deutschland!
  • chupchap7 days ago
    I used NotebookLM for holiday planning. I put in a dozen links with touristy things to do at the destinations and five-odd YouTube videos. I then asked it to craft an itinerary as a travel agent planning a holiday for a couple without kids. I also included the types of things I would and would not like to do. The result was pretty good. The podcast it generated was fun as well.
  • lenwood7 days ago
    I like the NotebookLM podcasting feature, have used it a few times to come up to speed. There's one quirk of the dialogue that I find annoying though, the two speakers finish one another's sentences. At first I thought that was a nice touch, but it happens often enough that it became distracting. I should experiment with the prompt to limit how often it happens.
  • hu37 days ago
    I like to feed Hacker News comments to generate a podcast.

    It's good to get the big picture about the discussion with 300+ comments.

  • tsurba6 days ago
    The best thing to feed the podcast is a dump of all one-liner macros we’ve added to an IRC bot over 15 years (for fetching weather, stocks, and 99% stupid jokes) without any context. Cannot stop laughing listening to it trying to figure it out and bringing up the weirdest ones.
  • davidg7077 days ago
    I created a NotebookLM podcast based on a blog post I wrote and played it for my parents. They got very excited thinking that I 'made it' because other people were talking about my work. Then I told them what it really was and they were a little bit disappointed and a little bit amazed.
  • qwertox7 days ago
    I uploaded a Python script I wrote last week (a system backup script, with the extension changed to .py.txt), and the podcast it created was pretty well suited to giving non-tech people, who might be asking what you've been doing, an idea of it.
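
    The renaming step can be scripted if there are several files to upload. A small sketch, with a hypothetical function name and paths; it copies the file so the original script is left untouched.

```python
import shutil
from pathlib import Path


def copy_as_text(src: Path, dest_dir: Path) -> Path:
    """Copy src into dest_dir with '.txt' appended to its name (x.py -> x.py.txt)."""
    dest_dir.mkdir(parents=True, exist_ok=True)
    dest = dest_dir / (src.name + ".txt")
    shutil.copyfile(src, dest)
    return dest
```

    For example, `copy_as_text(Path("backup.py"), Path("upload"))` would write `upload/backup.py.txt`.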
  • ahmedfromtunis7 days ago
    The best feature is by far the ability to interact with the "hosts" to ask for clarifications or to guide them into focusing on a particular aspect; even for things that weren't covered in the source material.
  • leopoldj6 days ago
    Generated a Bangla (Bengali) podcast from a complicated property lease and price trend analysis document. I'm floored. It's impossible to tell that the podcasters are not real. I'm sure over the long term it will sound monotonous and disengaging. But what we have here is simply a breakthrough.
  • ksynwa7 days ago
    Tangential: Does anyone know a free/cheap service that can turn English text articles into an audio file narrating them? Can NotebookLM do this? I don't want to turn them into podcasts or conversations.
    • tmoravec7 days ago
      Eleven Reader works well enough for me on iPhone. Free tier.
  • xnx5 days ago
    I kind of want the opposite of NotebookLM: take verbose conversational information and distill it down to concentrated content
  • ccbikai7 days ago
    Its Chinese voice quality is not as good as Minimax's.

    You can use Hacker Podcast to compare:

    https://hacker-podcast.agi.li/

  • tinyhouse7 days ago
    They don't have an app? strange.
  • anyfactor7 days ago
    https://support.google.com/notebooklm/answer/15731776

      - Afrikaans
      - Albanian
      - Arabic
      - Armenian
      - Azerbaijani
      - Basque
      - Bengali
      - Bulgarian
      - Burmese (Myanmar)
      - Catalan
      - Cebuano
      - Chinese (Simplified)
      - Chinese (Traditional)
      - Croatian
      - Czech
      - Danish
      - Dutch
      - English
      - Estonian
      - Filipino
      - Finnish
      - French (Canada)
      - French (European)
      - Galician
      - Georgian
      - German
      - Greek
      - Gujarati
      - Haitian Creole
      - Hebrew
      - Hindi
      - Hungarian
      - Icelandic
      - Indonesian
      - Italian
      - Japanese
      - Javanese
      - Kannada
      - Konkani
      - Korean
      - Latin
      - Latvian
      - Lithuanian
      - Macedonian
      - Maithili
      - Malay
      - Malayalam
      - Marathi
      - Nepali
      - Norwegian (Bokmål)
      - Norwegian (Nynorsk)
      - Oriya
      - Pashto
      - Persian
      - Polish
      - Portuguese (Brazil)
      - Portuguese (Portugal)
      - Punjabi
      - Romanian
      - Russian
      - Serbian (Cyrillic)
      - Sindhi
      - Sinhala
      - Slovak
      - Slovenian
      - Spanish (European)
      - Spanish (Latin America)
      - Spanish (Mexico)
      - Swahili
      - Swedish
      - Tamil
      - Telugu
      - Thai
      - Turkish
      - Ukrainian
      - Urdu
      - Vietnamese
    • behnamoh7 days ago
      > Persian

      I'm glad the name of my native language is written correctly. In many cases, people say "Farsi", which is offensive to many Iranians because it's the Arabic version of the word "Parsi" (unlike Persian, Arabic doesn't have "p", "g", "ch", "zh").

      It's like someone calling English "Anglaise" because that's how the French say it.

      PS: Contrary to common belief, Persian and Arabic are totally different languages, though they have borrowed words from one another (think English and French). Persian is an Indo-European language whereas Arabic is Aramaic (same roots as Hebrew).

      • crazygringo7 days ago
        > It's like someone calling English "Anglaise" because that's how the French say it.

        That is the case for some other languages, though. We call the language German rather than Deutsch because Germani was the Latin name for tribes in the area, for example.

        Or native names get modified too -- in English we don't call it Espanish, just Spanish, even though it comes from español.

        The names of languages in other languages tend to get modified in tons of different and random ways for lots of reasons. Is there really a reason to take offense at it?

        It doesn't bother me that Italians call me an americano instead of an American. It's just a letter change. So why is it so bothersome that it's called Farsi rather than Parsi? Can't the change from "p" to "f" be seen as an interesting historical quirk, due to the fascinating effect of Arabic on European languages in the Middle Ages? At the same time that we got Arabic words like "algebra" and "alcohol"?

      • FlyingSnake7 days ago
        Interesting. This is the first time I’m hearing that Farsi is offensive to Iranians. None of my Irani friends have objected so I’m curious if I’m missing something.

        Wikipedia says Farsi should be avoided in Western languages, but what about others? Persian is called Farsi in Indian subcontinent due to the deep historical connections we share. We have proverbs saying Farsi is the sign of a learned person etc.

      • myth_drannon7 days ago
        Small nitpick: Arabic is from a different branch of Semitic languages than Aramaic or Hebrew (which are very similar).

        And TIL that Aramaic replaced Hebrew in Judea because the Persian Empire maintained Aramaic as the official administrative language, and Jews brought it back with them when they returned from the Babylonian captivity.

      • omneity7 days ago
        Arabic is not Aramaic. Please correct your sources.

        I’m also quite curious about the sounds of “ch” and “zh” which exist in Arabic as ش and ج, or did you mean something else?

        • behnamoh7 days ago
          "ch" is written as "چ" in Persian (sounds like channel).

          "zh" is written as "ژ" in Persian (sounds like bourgeoisie in French).

          • omneity7 days ago
            Looking at the Persian IPA table[0] for the letters you wrote, we get `/ʒ/` for `ژ` and `/tʃʰ/` for `چ`

            In Arabic[1], there are two close phonemes: `/dʒ/` for `ج` and `/ʃ/` for `ش`

            The difference within each pair of phonemes is minimal, and they are practically affricates[2] of each other (where `d` or `t` can precede a `ʒ` or a `ʃ`), so it seems these sounds are present in both Arabic and Persian.

            These variations are also within the dialectal distribution of either language. For example `ج` is pronounced `/dʒ/` in Algeria and `/ʒ/` in Morocco.

            0: https://en.wikipedia.org/wiki/Help:IPA/Persian

            1: https://en.wikipedia.org/wiki/Help:IPA/Arabic

            2: https://en.wikipedia.org/wiki/Affricate

    • riffic7 days ago
      cool, thanks for the wall of text.
      • jszymborski7 days ago
        There's a collapse feature (the [-] link at the top of the post)
        • crazygringo7 days ago
          It's still far too much for a HN comment.

          You have to scroll down a couple pages' worth before you even realize this might be SO long you need to collapse it. So then you've got to scroll back UP a couple pages, find the teensy [-] link...

          It's enough to just post the link to the list of languages. The list itself doesn't belong in a comment here, when it's that long.

          • riffic7 days ago
            not to mention most of us know how to click to a source to view the authoritative list. why repeat it here?