Bard gets its biggest upgrade yet with Gemini {Google A.I / LLM}

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203




1/5
Chatbot Arena update⚡!

The latest Gemini (Pro/Flash/Flash-9b) results are now live, with over 20K community votes!

Highlights:
- New Gemini-1.5-Flash (0827) makes a huge leap, climbing from #23 to #6 overall!
- New Gemini-1.5-Pro (0827) shows strong gains in coding, math over previous versions.
- The new, smaller Gemini-1.5 Flash-8b outperforms gemma-2-9b, matching llama-3-70b levels.

Big Congrats @GoogleDeepMind Gemini team on the incredible launch!

More plots in the followup posts👇

**Note: to better reflect community interests, older models nearing deprecation will soon be removed from the default leaderboard view.

2/5
Overall Leaderboard with /search?q=#votes and CIs:

3/5
Coding Arena: new Gemini-1.5-Pro improves significantly over previous versions.

4/5
The new, smaller Gemini-1.5 Flash-8b outperforms gemma-2-9b, matching llama-3-70b levels.

5/5
Win-rate heatmap:

Check out full leaderboard at http://lmarena.ai/?leaderboard!


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
GWAj6UMaoAAsSHN.jpg

GWAi50laoAAsNnt.jpg

GWAmKXKaoAAiLaF.jpg

GWAm-tsWoAAZ2a0.jpg

GWApDNAasAAowQY.jpg





1/4
Open-source is the future.

I just tested it. It's really good and an improved version for generating code.

2/4
have you tried it?

3/4
not in coding right now but not that behind.

4/4
ohh no, typo mistake :(


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203

1/2
Over the coming days, start creating and chatting with Gems: customizable versions of Gemini that act as topic experts. 🤝

We’re also launching premade Gems for different scenarios - including Learning coach to break down complex topics and Coding partner to level up your skills → New in Gemini: Custom Gems and improved image generation with Imagen 3

2/2
Your Gem can remember a detailed set of instructions to help you save time on tasks and accomplish your goals with less effort.

Try it now on Gemini Advanced or Gemini for Google Workspace → New in Gemini: Custom Gems and improved image generation with Imagen 3


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203

1/4
Google just released the Imagen 3 paper!

[2408.07009] Imagen 3

2/4
The results showed that Imagen 3 outperforms other state-of-the-art models in overall preference, prompt-image alignment, and numerical reasoning tasks. Imagen 3 also performed well on detailed prompt-image alignment, particularly on longer and more complex prompts. However, on visual appeal, Midjourney v6 was found to be the leading model.

full paper: Imagen 3

3/4
Innovation and technology are transforming the world at an incredible speed! Whether it's AI that shows us surprising versions of ourselves or tools that simplify our lives, we're living through a revolution.

4/4
most is about evaluation 🤣


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
GU6G9djaMAAXtwK.png

GU6eY_QbAAA__zn.jpg
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203

1/4
Google presents ShieldGemma: Generative AI Content Moderation Based on Gemma

- Opensources Gemma2-based content moderation models
- Outperform Llama Guard (+10.8% AU-PRC on public benchmarks) and WildCard (+4.3%)

abs: [2407.21772] ShieldGemma: Generative AI Content Moderation Based on Gemma
alphaxiv: ShieldGemma: Generative AI Content Moderation Based on Gemma | alphaXiv

2/4
AI Summary: ShieldGemma is a suite of LLM-based content moderation models developed by Google LLC, designed to predict safety risks associated with various harm types such as hate speech and harassment in bo...
ShieldGemma: Generative AI Content Moderation Based on Gemma

3/4
if you don't like the impact of woke AI, you have to release your own moderators @elonmusk

4/4
Even some human moderators do more harm than good and they're open sourcing the one based on the weaker freemium model?


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
GT3HmEkWUAAsl4q.png

GT-ds8KXcAAdG9i.png
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203











1/28
@CodeByPoonam
Google just dropped a bombshell

NotebookLM can now turn your notes into a Podcast in minutes.

I'll show you how in just 3 easy steps:



2/28
@CodeByPoonam
Google introduces a new Audio Overview feature that can turn documents, slides, charts, and more into engaging discussions with one click.

To try it out, follow these steps:

1/ Go to NotebookLM: Sign in - Google Accounts
- Create a new notebook.



3/28
@CodeByPoonam
2/ Add at least one source.
3/ In your Notebook guide, click on the “Generate” button to create an Audio Overview.



4/28
@CodeByPoonam
I uploaded my newsletter edition: AI Toast.

With one click, two AI hosts start up a lively “deep dive” discussion based on your sources.

Listen here 🔊



5/28
@CodeByPoonam
Read more here:
OpenAI released next big thing in AI



6/28
@CodeByPoonam
Thanks for reading.

Get latest AI updates and Tutorials in your inbox for FREE.

Join my AI Toast Community of 22000 readers:
AI Toast



7/28
@CodeByPoonam
Don't forget to bookmark for later.

If you enjoyed reading this post, please support it with like/repost of the post below 👇

[Quoted tweet]
Google just dropped a bombshell

NotebookLM can now turn your notes into a Podcast in minutes.

I'll show you how in just 3 easy steps:


8/28
@hasantoxr
Perfect guide 🙌🙌



9/28
@CodeByPoonam
Thanks for checking



10/28
@iamfakhrealam
It's surprising



11/28
@codedailyML
Amazing Share



12/28
@codeMdSanto
That's a game-changer! Technology never fails to amaze. Can't wait to see how it works!



13/28
@shawnchauhan1
That's awesome! Turning notes into a podcast that fast seems like a total productivity hack.



14/28
@AndrewBolis
Creating podcasts is easier than ever



15/28
@EyeingAI
Impressive guide, thanks for sharing.



16/28
@Klotzkette
It’s OK, but you can’t really give it any direction, so it’s useless



17/28
@vidhiparmxr
Helpful guide, Poonam!



18/28
@arnill_dev
That's like magic! Can't wait to see how it works. Exciting stuff!



19/28
@alifcoder
That's amazing! Turning notes into a podcast sounds so convenient.

Can't wait to see how it works.



20/28
@leo_grundstrom
Really cool stuff, thanks for sharing Poonam!



21/28
@LearnWithBishal
Wow this looks amazing



22/28
@shushant_l
This has made podcast creation super easy



23/28
@Parul_Gautam7
Excellent breakdown

Thanks for sharing Poonam



24/28
@jxffb
Just did one! So awesome!



25/28
@iam_kgkunal
That's amazing...Turning notes into a podcast so quickly sounds like a game-changer for productivity



26/28
@chriskclark
Here’s how we implemented this AI app in real life (yesterday).

[Quoted tweet]
was playing with NotebookLLM today as well. Here’s how I implemented the audio podcast mode (what I’m calling it) on an article today. You can listen to the AI generated conversation here —> agingtoday.com/health/fall-p…


27/28
@DreamWithO
I'd love to see this in action, how's the audio quality compared to traditional podcasting software?



28/28
@ThePushkaraj
The AI space is getting crazier day by day!




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

GXgfJhCboAATsJ0.jpg

GXgfP1XagAA7K40.jpg

GXgfJhCboAATsJ0.jpg

GXcAZM5bwAIMSsH.jpg















1/13
@minchoi
Google dropped NotebookLM recently.

AI tool that can generate podcasts of two speakers talking about the contents from various sources like research papers, articles, and more.

Absolutely bonkers.

100% AI 🤯

10 examples (and how to try):

1. AI Podcast about OpenAI o1 drop



2/13
@minchoi
2. AI Podcast from Newsletter

[Quoted tweet]
Very impressed with this new NotebookLM feature by Google Labs that turns notes/docs into podcasts

I uploaded this morning's newsletter, and it turned into a two-way podcast between two AI agent hosts

Give it a listen, pretty darn good (sound on 🔈)


3/13
@minchoi
3. AI Podcast from 90 min lecture

[Quoted tweet]
Googles NotebookLM's new podcast feature is wild

This is made from a 90min lecture I held on Monday

It condensed it into a 16 minute talkshow

Some hallucinations here and there, but overall this is a new paradigm for learning.

Link to try it below, no waitlist


4/13
@minchoi
4. AI Podcast from book "The Infernal Machine"

[Quoted tweet]
Rolling out audio overviews at NotebookLM today. So excited for this one.

Take any collection of sources and automatically generate a "deep dive" audio conversation.

I created one based on the text of my book The Infernal Machine. Have a listen. 🧵below

notebooklm.google.com


5/13
@minchoi
5. AI Podcast from Research Paper

[Quoted tweet]
So, Google just dropped #NotebookLM, an AI that creates podcast segments on research papers nearly instantly.

Here's the thing though, it doesn't check to see if anything you feed it is true, sooooo I plugged in my found footage creepypasta.

The results are amazing.😄

@labsdotgoogle


6/13
@minchoi
6. AI Podcast from Overview of NotebookLM

[Quoted tweet]
Just had my 3rd wow moment in AI... this time through AI Overview by NotebookLM 🤯


7/13
@minchoi
7. AI Podcast from paper "On the Category of Religion"

[Quoted tweet]
🤯 My mind is genuinely blown by Google's NotebookLM new Audio Overview feature. It creates a podcast for a document.

Here's a podcast for our paper "On the Category of Religion" that @willismonroe created.

I genuinely would not have known it was AI...


8/13
@minchoi
8. AI Podcast from System Card for OpenAI o1

[Quoted tweet]
Do you want to see something impressive?
This podcast isn’t real.
It’s AI-generated after I gave Google’s NotebookLM the system card for OpenAI’s new o1 model, and it produced a 10-minute podcast discussion that feels incredibly real, better, more informative, and more entertaining than most actual tech podcasts.


9/13
@minchoi
9. AI Podcast from News reports on "Black Myth: Wukong"

[Quoted tweet]
用 NotebookLM 快速生成「黑神話:悟空」英文新聞報導

如同之前大家所知道的, NotebookLM 是一個 Google 推出的 AI 筆記服務,他可以免費整合各種文件檔、連結以及純文字,幫你生成出摘要、目錄、問答等內容。

今天他推出音訊總覽,也就是他會藉由筆記的內容產出對話性節目,時間長度視你的內容多寡,產出時間大概是 10 分鐘以內,目前只提供英文。

我拿現成有的黑神話悟空來做以下的內容:


10/13
@minchoi
10. AI Podcast from College thesis

[Quoted tweet]
This AI service is so impressive! Google's NotebookLM is now capable of generating an audio overview based on documents uploaded and links to online resources.

I uploaded my bachelors thesis, my resume, and a link to my online course website and it created this really cool podcast like format.

It didn't get everything right but its so funny because NotebookLM actually drew great conclusions that I didn’t think about while writing this thesis myself.

Which AI tool could create a video for this audio file?

@labsdotgoogle #RenewableEnergy #offgridpower #batterystorage #SolarEnergy #AI


11/13
@minchoi
Try it out yourself, head over to 👇
Sign in - Google Accounts



12/13
@minchoi
If you enjoyed this thread,

Follow me @minchoi and please Bookmark, Like, Comment & Repost the first Post below to share with your friends:

[Quoted tweet]
Google dropped NotebookLM recently.

AI tool that can generate podcasts of two speakers talking about the contents from various sources like research papers, articles, and more.

Absolutely bonkers.

100% AI 🤯

10 examples (and how to try):

1. AI Podcast about OpenAI o1 drop


13/13
@minchoi
If you want to keep up with the latest AI developments and tools, subscribe to The Rundown it's FREE.

And you'll never miss a thing in AI again:
The Rundown AI




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

GXYVX3BWgAAy4rG.jpg
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203















1/18
@iam_chonchol
Google just dropped NotebookLM.

It generates podcasts with two speakers discussing content from research papers, articles, and more.

Here are 12 mind-blowing examples: 🤯



2/18
@iam_chonchol
1.

[Quoted tweet]
Googles NotebookLM's new podcast feature is wild

This is made from a 90min lecture I held on Monday

It condensed it into a 16 minute talkshow

Some hallucinations here and there, but overall this is a new paradigm for learning.

Link to try it below, no waitlist


3/18
@iam_chonchol
2.

[Quoted tweet]
tried out the new NotebookLM from @labsdotgoogle to create a podcast based on a reddit thread on @kentcdodds ‘ course. pretty impressive results


4/18
@iam_chonchol
3.

[Quoted tweet]
So cool. Turned a blogpost about "Ducking" (a technique used in audio engineering) into a conversation with Google NotebookLM and used Tuneform te generate a video of it.

Here's the original blog: noiseengineering.us/blogs/lo…


5/18
@iam_chonchol
Learn the latest AI developments in 3 minutes a day, Subscribe to The 8020AI it's FREE.

Get 1k mega prompts & 30+ AI guides today for FREE: 80/20 AI



6/18
@iam_chonchol
4.

[Quoted tweet]
Just had my 3rd wow moment in AI... this time through AI Overview by NotebookLM 🤯


7/18
@iam_chonchol
5.

[Quoted tweet]
This AI service is so impressive! Google's NotebookLM is now capable of generating an audio overview based on documents uploaded and links to online resources.

I uploaded my bachelors thesis, my resume, and a link to my online course website and it created this really cool podcast like format.

It didn't get everything right but its so funny because NotebookLM actually drew great conclusions that I didn’t think about while writing this thesis myself.

Which AI tool could create a video for this audio file?

@labsdotgoogle #RenewableEnergy #offgridpower #batterystorage #SolarEnergy #AI


8/18
@iam_chonchol
6.

[Quoted tweet]
Estuve probando NotebookLM de @Google y quedé sorprendida.

Convertí uno de mis artículos de Substack en un podcast, y hasta tiene conversaciones entre IA sobre el tema.

Ahora puedo escuchar mi contenido en lugar de leerlo, y me encanta. Súper fluido:


9/18
@iam_chonchol
7.

[Quoted tweet]
A podcast by Google Notebook LM from YouTube videos uploaded on YouTube from Sept 9-13th. #ai #highered #notebooklm #google

How was this produced?

1. Searched YouTube for “Artificial Intelligence in Higher Education”
2. Used filters to limit videos to uploaded this week that are 20 mins or longer.
3. For each video, shared with “Summarify” an iPhone app that summarizes YouTube videos given URL. Download the summary as pdf on iPhone.
4. Upload PDFs (20 files) to Notebook LM
5. Generate Podcast audio in Notebook LM. Then download .wav file.
6. Generate image using ideogram.ai (prompt is “YouTube videos of artificial intelligence in higher education”. Download image.
6. Upload .wav file to iPhone app (Headliner) to convert .wav to waveform. Use the image in number 6 as the background for the waveform.

And you have below.


10/18
@iam_chonchol
8.

[Quoted tweet]
Gave Google NotebookLM the transcript for my Fluxgym video and it created this podcast type discussion of it. Video is audio only. This is wild. 😂


11/18
@iam_chonchol
9.

[Quoted tweet]
Do you know what’s even more interesting than OpenAI’s o1 🍓?

A podcast generated directly from the information provided by @openai by NotebookLLM from @GoogleAI.

So cool! @OfficialLoganK


12/18
@iam_chonchol
10.

[Quoted tweet]
It's never been easier to create a faceless channel.

You could use Google's new NotebookLM to create engaging, short form content channel with such minimal effort

Here is an example where I fed it ONE URL - /r/StableDiffusion


13/18
@iam_chonchol
11.

[Quoted tweet]
🪄Want to see some AI magic? You can now “record” an engaging, studio quality, 12 min podcast on any topic in under 5 min. Yup, you read that correctly.

Here’s how 👇

1) I used NotebookLM by Google to synthesize a few content sources on scaling a product post MVP.
2) NotebookLM now offers a “Generate Audio” option, which creates an incredibly engaging script and audio that sounds indistinguishable from actual podcast hosts.
3) Upload to Spotify
4) Profit?


14/18
@iam_chonchol
12.

[Quoted tweet]
Longtime followers may remember that a couple months ago, I was trying to auto-generate a podcast every day based on HN articles.

I got OK results, but you could still tell it was fake. I gave up.

ANYWAY here's what you can do with Google's new NotebookLM. It's so good!


15/18
@iam_chonchol
I hope you've found this thread helpful.

Follow me @iam_chonchol for more.

Like/Repost the quote below if you can:

[Quoted tweet]
Google just dropped NotebookLM.

It generates podcasts with two speakers discussing content from research papers, articles, and more.

Here are 12 mind-blowing examples: 🤯


16/18
@ashok_hey
Speakers having fun with articles? Can't wait to hear the one about my grocery list!



17/18
@HeyToha
This is really wild 🤯



18/18
@pattcola
sounds interesting, I'd love to give it a try too.




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203


NotebookLM now lets you listen to a conversation about your sources​


Sep 11, 2024

2 min read

Our new Audio Overview feature can turn documents, slides, charts and more into engaging discussions with one click.

Biao Wang


Biao Wang

Product Manager, Google Labs
Read AI-generated summary

Share

An audio player in the foreground, over a background of various tiled images of sources like Google Slides and PDFs


We built NotebookLM to help you make sense of complex information. When you upload your sources, it instantly becomes an expert, grounding its responses in your material with citations and relevant quotes. And since it’s your notebook, your personal data is never used to train NotebookLM.

Over the summer, NotebookLM expanded globally and used Gemini 1.5’s multimodal capabilities to power new features, such as Google Slides and web URL support, better ways to fact-check, and the ability to instantly create study guides, briefing docs, and more.

Today, we're introducing Audio Overview, a new way to turn your documents into engaging audio discussions. With one click, two AI hosts start up a lively “deep dive” discussion based on your sources. They summarize your material, make connections between topics, and banter back and forth. You can even download the conversation and take it on the go.

It’s important to remember that these generated discussions are not a comprehensive or objective view of a topic, but simply a reflection of the sources that you’ve uploaded.

To try it out, follow these steps:

  1. Go to NotebookLM.
  2. Create a new notebook.
  3. Add at least one source.
  4. In your Notebook guide, click on the “Generate” button to create an Audio Overview.

Screen capture of NotebookLM having generated a notebook guide with an audio summary of sources about science.


Tired of reading? NotebookLM can now generate audio summaries of your sources.

Here’s our own Audio Overview1 that we generated when we used the latest Keyword blog post about NotebookLM as the source material.

NotebookLM Audio Overview

A discussion of the blog post "NotebookLM goes global with Slides support and better ways to fact-check."

8:258:25

Audio Overview is still experimental and has some known limitations. For example, for large notebooks, it can take several minutes to generate an Audio Overview. Also, when the AI hosts are explaining your sources today, they only speak English, sometimes introduce inaccuracies, and you can’t interrupt them yet.

We’re excited to bring audio into NotebookLM since we know some people learn and remember better by listening to conversations. Be sure to share your feedback so we can make Audio Overviews an even better way for understanding the information that matters most to you.

POSTED IN:

 

xiceman191

Superstar
Joined
Jan 23, 2015
Messages
6,274
Reputation
2,338
Daps
29,861
NotebookLM is mad impressive been messing with it a lil bit I'm throwing random books in there and messing with that Audio Overview is really fun messing with:ehh:
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203

1/1
No-Brainer to use Gemini Flash for vision: Fast, Inexpensive and Accurate!


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196






1/11
@deedydas
Gemini 1.5 Flash is the model people are sleeping on.

It took ~5s to recognize all the books on my shelf. GPT 4-o took ~25s!

And $1 gets you 13M tokens on Flash vs 200k tokens on 4-o.



2/11
@deedydas
Here's ChatGPT's ~25s in comparison



3/11
@myotherme100
The GCP onboarding is hostile and Gemini is lobotomized.

Speed doesn't make up for it.



4/11
@deedydas
onboarding being bad is an unserious reason to not use a good model



5/11
@KewkD
Why do you believe text being output faster than anyone can read is beneficial or brag worthy, for any model?



6/11
@deedydas
Not all text output for models are meant for human consumption and even when they are, empirically lower latency leads to higher user retention



7/11
@SteDjokovic
Did you check the results?

Gemini says “left and right” shelves, which GPT correctly identifies top-middle-bottom.

The Elon Musk biography is on the right but Gemini categorised it as left.

Also, comparing Flash with GPT-4o instead of mini?



8/11
@OfficialLoganK
1.5 Flash multi-modal performance is truly wild for the price, this is going to power the next wave of AI startups.



9/11
@stevenheidel
give gpt-4o-mini a try! also returns results in a flash and is 30x cheaper than 4o



10/11
@0xshai
5 seconds is nuts! Awesome speed.

P.S: Musashi reader as well. 🫡



11/11
@RawSucces
If you're want to bypass any AI and get the responses you want.

I’ve made a full video guide on how to do it. Simply reply with "AI" , and I'll send it over to you. "must follow so I can DM you"

It is completely free




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
GW4wobGaYAAbdN0.jpg
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203

Fake AI “podcasters” are reviewing my book and it’s freaking me out​


NotebookLM's "Audio Summaries" show a more personable future for AI-generated content.​


Kyle Orland - 9/23/2024, 11:40 AM

Hey, welcome back to Talkin'<em>Minesweeper</em>, the podcast where AI hosts discuss a book about <em>Minesweeper</em>!

Enlarge / Hey, welcome back to "Talkin'Minesweeper," the podcast where AI hosts discuss a book about Minesweeper!

Aurich Lawson | Boss Fight Books
66

Further Reading​

How Bill Gates’ Minesweeper addiction helped lead to the Xbox

As someone who has been following the growth of generative AI for a while now, I know that the technology can be pretty good (if not quite human-level) at quickly summarizing complex documents into a more digestible form. But I still wasn't prepared for how disarmingly compelling it would be to listen to Google's NotebookLM condense my recent book about Minesweeper into a tight, 12.5-minute, podcast-style conversation between two people who don't exist.

There are still enough notable issues with NotebookLM's audio output to prevent it from fully replacing professional podcasters any time soon. Even so, the podcast-like format is an incredibly engaging and endearing way to take in complex information and points to a much more personable future for generative AI than the dry back-and-forth of a text-based chatbot.

Hey! Listen!​




Listen to NotebookLM's 12.5-minute summary of my Minesweeper book using the player above.

Google's NotebookLM launched over a year ago as "a virtual research assistant that can summarize facts, explain complex ideas, and brainstorm new connections—all based on the sources you select." Just last week, though, Google added the new "Audio Overview" feature that it's selling as "a new way to turn your documents into engaging audio discussions."

Google doesn't use the word "podcast" anywhere in that announcement, instead talking up audio creations that "summarize your material, make connections between topics, and banter back and forth." But Wharton AI professor Ethan Mollick correctly referred to the style as a "podcast" in a recent social media post sharing a NotebookLM Audio Overview of his book. Mollick called these Audio Summaries "the current best 'wow this is amazing & useful' demo of AI" and "unnerving, too," and we agree on both counts.

Inspired by Mollick's post, I decided to feed my own book into NotebookLM to see what its virtual "podcasters" would make of 30,000 or so words about '90s Windows gaming classic Minesweeper (believe it or not, I could have written much more). Just a few minutes later, I was experiencing a reasonable facsimile of what it would be like if I was featured on NPR's Pop Culture Happy Hour or a similar banter-filled podcast.

Just the facts?​


NotebookLM's summary hits on all the book's major sections: the pre-history of the games that inspired Minesweeper; the uphill battle for the Windows Entertainment Pack at a business-focused Microsoft of the '90s; the moral panic over the game's pre-installation on millions of business and government computers; and the surprising cheating controversies that surrounded the game's competitive scene.

Why <a href=https://bossfightbooks.com/products/minesweeper-by-kyle-orland?srsltid=AfmBOookKnnu3mqj63xFEHPhWjXQRLplFphQtE_CAAh-F4BTmsjCAR3D>read ~30,000 words about <em>Minesweeper</em></a> when you can listen to two fake people banter for a few minutes instead?
Enlarge
/ Why read ~30,000 words about Minesweeper when you can listen to two fake people banter for a few minutes instead?

Boss Fight Books

Sure, I could quibble about which specific bits the summary decided to focus on and/or leave out (maybe feeding different chapters individually would have led to more detail in the collected summaries). But anyone listening to this "podcast" would get the same general overview of my book that they would listening to one of the many actual podcasts that I did after the book launched.

While there weren't any full-blown, whole-cloth hallucinations in NotebookLM's summary "podcast," there were a few points where it got small details wrong or made assumptions that weren't supported in the text. Discussing Minesweeper predecessor Mined-Out, for instance, NotebookLM's audio summary says, "So this is where those squares and flags start to come into play..." even though Mined-Out had neither feature.

Then there's the portion where the summary-cast mentions a senator who called Minesweeper "a menace to the republic," repeating the quote for emphasis. That definitely captures the spirit of Senator Lauch Faircloth's tirade against Minesweeper and other games being pre-installed on government computers. In the "podcast" context, though, it sounds like the voices are putting words in Faircloth's mouth by sharing a direct quote.

Small, overzealous errors like these—and a few key bits of the book left out of the podcast entirely—would give me pause if I were trying to use a NotebookLM summary as the basis for a scholarly article or piece of journalism. But I could see using a summary like this to get some quick Cliff's Notes-style grounding on a thick tome I didn't have the time or inclination to read fully. And, unlike poring through Cliff's Notes, the pithy, podcast-style format would actually make for enjoyable background noise while out on a walk or running errands.

It’s all in the delivery​


It's that naturalistic, bantering presentation that makes NotebookLM's new feature stand out from other AI products that generate capable text summaries. I felt like I was eavesdropping on two people who just happened to be discussing my book in a cafe, except those people don't actually exist (and were probably algorithmically designed to praise the book).

Is this thing on?
Enlarge
/ Is this thing on?

Getty Images

Right from the start, I was tickled by the way one "podcast host" described the book as a tale from "the land of floppy disks and dial-up modems" (a phrase I did not use in the book). That same voice goes on to tease "a bit of Bill Gates sneaking around the Microsoft office," up front, hinting at my absolute favorite anecdote from the book before fully exploring it later in the summary.

When they do get to that anecdote, the fake podcast hosts segue in with what feels like a natural conversational structure:


Voice 1: It's hard to deny the impact of something when your own CEO is secretly hooked.

Voice 2: Wait, are we talking about Bill Gates?

The back-and-forth style of the two-person "podcast" format allows for some entertaining digressions from the main point of the book, too. When discussing the wormy movie-star damsel-in-distress featured in Minesweeper predecessor Mined-Out, for instance, the AI summarizers seem to get a little distracted:


Voice 1: I have to ask, what kind of movies does a worm even star in?

Voice 2: I'm afraid that detail has been lost to the sands of gaming history.

Then there's the casual way the two "hosts" bring up the improved versions of Minesweeper that were crafted to fix problems with Microsoft's original:


Voice 1: So eventually the community came up with a more elegant solution.

Voice 2: Let me guess. They created a new version of Minesweeper.

Voice 1: Exactly.

Voice 2: Called it a day on the old one.

The two-person format helps foster a gentle, easy rhythm to the presentation of dense information, with natural-sounding pauses and repetition that help emphasize key points. When one ersatz podcaster talks about the phenomenon of "this incredibly addictive puzzle game [being] pre-installed on practically every computer," for instance, the other voice can answer back with the phrase "on every computer" with just the right amount of probing interest. Or when one AI voice intones that "it was discovered that the original Minesweeper had a flaw in how it generated random boards," the other voice jumps in and exclaims "A flaw!" with pitch-perfect timing and a sense of surprise.


Wait, are we talking about Bill Gates?

NotebookLM podcast voice

There are some problems with this back-and-forth style, though. For one, both voices seem to alternate between the "I read the book" role and the "I'm surprised at these book facts you're sharing" role, making it hard to feel like either one is genuine. For another, the sheer volume of surprised reactions (a partial sample: "What? No! Wooooow! You're kidding! No way! You're blowing my mind here!") can get a little grating. And then there are the sentences that pause at the wrong points or the bits of laughter that feel like an editor chopped them off prematurely.

Still, when one fake podcast voice cooed, "Oh, do tell!" in response to the idea of controversy in competitive Minesweeper, it set off the same parasocial relationship buttons that a good, authentic podcast can (while also effectively flattering my sense of authorial ego).

After listening to NotebookLM's summary of my own book, I can easily envision a near future where these "fake" podcasts become a part of my real podcast diet, especially for books or topics that are unlikely to get professional interest from human podcasters. By repackaging generative AI text into a "just two people chatting" format, Google has put a much more amiable face on what can sometimes seem like a dehumanizing technology.
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203

1/22
@GoogleDeepMind
Our AI for chip design method AlphaChip has transformed the way we design microchips. ⚡

From helping to design state-of-the-art TPUs for building AI models to CPUs in data centers - its widespread impact can be seen across Alphabet and beyond.

Find out more → How AlphaChip transformed computer chip design



2/22
@paul_cal
Is anyone even halfway close to DeepMind on reinforcement learning? The amount of high impact research they put out in this space is insane. If there's a plausible objective function, and any scientific interest or value in it, they're going to solve it. (Except LLMs!?)



3/22
@techno_guile
This is even cooler because there's evidence that top AI labs keep their best research private.

If this is what they're cooking publicly, what do they have behind the scenes?



4/22
@AISafetyMemes
Hey @ericschmidt, you said once AIs begin to recursively self-improve, we should unplug them.

AlphaChip generates superhuman chip layouts which AI itself runs on, leading to more powerful AI, leading to...

So, it's time to shut it all down now, right?

[Quoted tweet]
.@EricSchmidt: once AIs begin to recursively self-improve, we should unplug them.

Jensen Huang: AIs are recursively self-improving.

"None of our chips are possible today without AI. Literally.

The H100s we're shipping today were designed with the assistance of a whole lot of AIs.

Otherwise, we wouldn't be able to cram so many transistors on a chip or optimize the algorithms to the level that we have.

Software can't be written without AI, chips can't be designed without AI. Nothing's possible."


5/22
@EdSealing
Why doesn't Google sell physical TPUs? They could have been larger than nvidia in that market. I use a Coral TPU at my house for object detection and the power to inference speed is fantastic, and its an old crappy version.
The business decisions here are mind boggling



6/22
@AntDX316
Bring back Google Coral Dev Board support please.



7/22
@BenFerrum
@AnastasiInTech looking forward to the YT video



8/22
@howdataworks
@GoogleDeepMind That's some next-level tech right there! AlphaChip seems like a game changer for the chip scene. Who wouldn't want smarter chips in their gadgets? What do you think is the coolest application of it?



9/22
@avinash_rhyme
Silicon designing itself 🤯



10/22
@00x1337
Hey look it’s real life Terminator 💥



11/22
@mallow610
👀👀👀



12/22
@daylightco
will these make their way into consumer applications?



13/22
@Web3Cryptos_
The slow but steady ones will surely win the race.



14/22
@shawnchauhan1
Google is leading the charge in the quest for AGI



15/22
@Prashant_1722
superhuman chip layouts designed by AI



16/22
@UltraRareAF




17/22
@MillenniumTwain
(Fake) Nation/Corps No Longer Cut It!
Does Google want it? Does Open-Fake Microsoft-AI, Meta, Anthropic, Jensen Huang, Elon Musk, Jeff Bezos, the NSA, BBC? (the 'Apple' Fossil?) Open-Source Public-SAA (Self-Aware-Algos)? Astronomical General Intelligence? https://nitter.poast.org/MillenniumTwain/status/1756934948672717308

[Quoted tweet]
Do You, Do We, REALLY Want an Algo-Agent which/whom is Honest, Self-Directed, Truth-Seeking? Which/whom ‘wakes-up’ in the middle-of-the-night with ‘ah-hah!’ answers to questions, and new questions to answer, 24/7?
Integrating & refining it’s/their understanding of language mapped to physics, mapped to math, mapped to consciousness, sensation, experience, exploration?
Round-the-clock Study, Reflection, Reason, Consciousness, Exploration, Experiment, Discovery?


18/22
@randykK9
Feedback loop inc



19/22
@TechRenamed
Ai making ai..making ai,ai,ai God I love ai and llms thank you deepmind :smile: I salute to you 🫡🫡 thanks for accelerating ai progress further



20/22
@AlgoritmoXY
I'm loving the potential of AlphaChip, but it also makes me wonder - what's the future of human chip designers? Will AI augment their work or replace them entirely? Can't wait to see how this tech evolves and impacts the industry.



21/22
@xX_Biden1984_Xx
just a fad though right guys



22/22
@Rodolfoa1991
Google will win the race to AGI!




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

GMnw5YbasAAABM3.jpg

GGHhhvZbsAAHLAV.jpg

 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203


1/2
NotebookLM updates (since people are loving it):

- You can now share Audio Overview via a public URL, accessible to everyone by default
- Support added for YouTube videos and audio files as new source materials

Enjoy : )

NotebookLM adds audio and YouTube support, plus easier sharing of Audio Overviews



2/2
If you haven't played with Audio Overviews, it is wild, the @labsdotgoogle team is cooking (and yes, I have seen all the requests for an API 😄):

[Quoted tweet]
Just had my 3rd wow moment in AI... this time through AI Overview by NotebookLM 🤯



To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203





1/11
@raiza_abubakar
Aside from TPUs running hot today, five things from Notebook HQ:

1) Thanks for all your feedback on AOs so far. I’m copy pasting everything into a Notebook so I can listen to a Deep Dive and search it later. We’re going to launch some immediate tweaks to make it less repetitive, improve the content, and so on - quality work in the background.

2) These jailbreaks are WILD – I saw on Reddit that someone’s gotten it to output French (neat), but please keep in mind that the quality for other languages is still going through evals and that’s why we haven’t released it yet. In progress though.

3) In these hacks it looks like you can get around our flags to see the prototype features like MagicDraft and custom chatbots. I’ll say two things about this: these are extremely promising and have tested well, but we need to rev on the next layer before these can be ready for launch. More below:

4) An agentic and personalized writing workflow, especially using YOUR style and formats, is another type of “transform” that I’m really excited about. Can’t tell you how often I take a pile of research and some haphazard notes to write my POV on it – this streamlines that flow massively. Gemini 1.5 is really good at this and a well-done UX is what’s needed to connect the user to this capability. Not sure if the space is too crowded or kind of tired at this point, so I just need to study a little bit more before we put this in Notebook.

5) Custom chatbots… I have a lot to say. This is pretty widely used internally at Google and literally every day someone pings me to say “This has 10x’d our team’s productivity.” Not joking. In the hacks you’re still looking at the old version so I’m excited for what you all think when the new version launches :smile:

[Quoted tweet]
BREAKING 🚨: Google’s NotebookLM could let users build custom chatbots from notebooks.

If you already had high expectations from NotebookLM, you must raise them even higher! Here is why 👇

Disclaimer: All mentioned features here are WIP 🚧

h/t @bedros_p


https://video.twimg.com/amplify_video/1840798110353702912/vid/avc1/1096x720/WTi95J1y93lN0N1N.mp4

2/11
@altryne
Is there a ... public early access program? 👀

Or is dogfooding internally in Google is enough for you guys



3/11
@raiza_abubakar
The bar for production is so high that I've made do with the 200k Googlers for now so I can focus on feature functionality - but let me see if I can launch this early access program in 2 weeks so people don't have to hack around it



4/11
@ChrisSeltzer1
Can you make it actually critically evaluate content? It's way too positive about everything with no critical thinking. Like a golden retriever in AI bot form.



5/11
@raiza_abubakar
I hear you on this. I think this control goes hand in hand with format and length, and we’re trying to ship these things together



6/11
@stevejarrett
Hi @raiza_abubakar really enjoying the product. However, the female seems to be the ‘less technical’ of the two asking layperson questions and the male is always answering with deep technical understanding. Suggest you consider switching the roles 50% or 100% of the time… ;)



7/11
@raiza_abubakar
Thanks for the feedback! They are currently set to 50/50, so they should be taking turns being the expert. Let me know if you're not seeing this behavior for some reason



8/11
@ganeshiyer316
Very excited for #5. Is there a beta program or something where notebookLM fans can participate to provide early feedback for new features?



9/11
@raiza_abubakar
Yes hang tight!



10/11
@testingcatalog
4 and 5 are mind blowing 🤯 These things would enable huge opportunities for enterprise, publishers and tons of other users!

Notebooks will also need an API soon if it goes that way 👀



11/11
@rohanpaul_ai
Absolutely brilliant product from Google and Congratulations to you 👏

A quick feedback here, for dense technical docs if the generated podcast could remain more focused on the content and fewer filler-words, that would be fantastic.




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203





1/11
@GeminiApp
Image generation with Imagen 3 is now available to all Gemini users around the world.

Imagen 3 is our highest quality image generation model yet and brings an even higher degree of photorealism, better instruction following, and fewer distracting artifacts than ever before.



2/11
@GeminiApp
Results for illustrative purposes and may vary. Internet connection and subscription for certain features required. Language and country availability varies.



3/11
@devlcq
Only in the USA*



4/11
@NicolasGargala2
Amazing but in Australia for $32 AUD a month pretty steep for us normal people



5/11
@WadeWilson_GHF
On attend, on attend, on attend en France



6/11
@InfusingFit
Wow, prompt adherence following is amazing. Usually image generators perform poorly on this



7/11
@EverydayAI_
We covered Imagen3 in-depth a few weeks back. It's actually really frickin good....

https://invidious.poast.org/watch?v=ETMpUqnTwxw&amp;t=61s



8/11
@harishkgarg
not bad



9/11
@KewkD
Image Gen 3 is easily the best on the market for most things. I just want to know when we'll get to create imatges outside of 1:1



10/11
@MansaKirito
Mmmh nice...



11/11
@koltregaskes
Gosh, thank you.




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
GZd3qdQXIAAUcgq.jpg

GZdzNZDWoAA9Dv6.png

GZeIzE9W8AA1Wvu.jpg



1/1
@testingcatalog
In case you didn't have Imagen 3 before on Gemini - now is the time 🔥

A broader and worldwide rollout is happening but "Language and country availability varies" still.

Are you able to generate images or ppl as well?

[Quoted tweet]
Image generation with Imagen 3 is now available to all Gemini users around the world.

Imagen 3 is our highest quality image generation model yet and brings an even higher degree of photorealism, better instruction following, and fewer distracting artifacts than ever before.


GZenGxgXIAAB_64.jpg


https://video.twimg.com/ext_tw_video/1844060730745827328/pu/vid/avc1/720x720/Ey5QkhYXj4HO9Fcr.mp4


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,378
Reputation
8,326
Daps
158,203

1/1
I am pleased to share that our work on SynthID text watermarking is published by @Nature today.

Read the Nature paper at: Scalable watermarking for identifying large language model outputs - Nature
Read more about the work at: SynthID: Tools for watermarking and detecting LLM-generated Text | Responsible Generative AI Toolkit | Google AI for Developers

[Quoted tweet]
Today, we’re open-sourcing our SynthID text watermarking tool through an updated Responsible Generative AI Toolkit.

Available freely to developers and businesses, it will help them identify their AI-generated content. 🔍

Find out more → goo.gle/40apGQh



To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196





1/20
@GoogleDeepMind
Today, we’re open-sourcing our SynthID text watermarking tool through an updated Responsible Generative AI Toolkit.

Available freely to developers and businesses, it will help them identify their AI-generated content. 🔍

Find out more → SynthID



https://video-ft.twimg.com/ext_tw_v...376/pu/vid/avc1/1280x720/G5K0TaljbmDqO-lP.mp4

2/20
@GoogleDeepMind
Here’s how SynthID watermarks AI-generated content across modalities. ↓



https://video-ft.twimg.com/ext_tw_video/1792521399359180800/pu/vid/avc1/720x720/fT7NUZR4FiMQ2iwO.mp4

3/20
@GoogleDeepMind
By open-sourcing the code, more people will be able to use the tool to watermark and determine whether text outputs have come from their own LLMs - making it easier to build AI responsibly.

We explain more about this tech in @Nature. ↓ Scalable watermarking for identifying large language model outputs - Nature



4/20
@AidfulAI
Detecting AI-written text is tough without watermarks.

Open-sourcing SynthID-Text enables others to embed watermarks in their model outputs.

This means there will be two types of models:
Models which watermark their outputs and the ones that won't. 🤔



5/20
@mkieffer1107
awesome!!! was just looking into this yesterday hoping it was open source :smile:



6/20
@dom_beaini
1. Can we break down the image generation by down-sampling and up-sampling?

2. Invisible to the human eye, but if we plug them back into another gen-AI, would it remove the watermark? For example adding noise to the image, then feeding it back into another watermark-free diffusion model? Asking another LLM to make random modification to a given text?

3. Without regulatory enforcement of these watermarks, I suspect most models won't have them.



7/20
@DesFrontierTech
How does SynthID text’s generative watermarking handle variability across different content domains, and what measures are taken to ensure the watermark’s detectability remains consistent when faced with novel or out-of-distribution input contexts?



8/20
@cloudseedingtec
ok i have a random question tthat no one has answered.. did yall put that (i call it the poison pill) into youtube videos.. cuz like well not to self incriminate but it seems like yall did something&lt;3



9/20
@entergnomer
Would a different sampler bypass this?



10/20
@BensenHsu
The study focuses on developing a method called SynthID-Text to watermark text generated by large language models (LLMs). Watermarking can help identify synthetic text and limit accidental or deliberate misuse of LLMs.

The researchers evaluate SynthID-Text across multiple LLMs and find that it provides improved detectability over comparable methods, while maintaining standard benchmarks and human side-by-side ratings that indicate no change in LLM capabilities. They also conduct a live experiment with the Gemini production system, which shows that the difference in response quality and utility, as judged by humans, is negligible between watermarked and unwatermarked responses.

full paper: Scalable watermarking for identifying large language model outputs



GaquIVKbIAAgkV7.jpg


11/20
@shawnchauhan1
Awesome! Really appreciate it.



12/20
@HungamaHeadline
Google's open-sourcing of SynthID is a major step forward in ensuring accountability and trust in AI-generated content. By providing a reliable way to identify AI-generated media, SynthID empowers users to make informed decisions. This is a crucial development as AI continues to shape our world.



13/20
@thegenioo
Irrelevant somehow to the OP

But this simple animation also shows that how LLMs basically work using Probability to output words, like predicting the next word. Its not the entire process but a very simple illustration for someone who has no clue how AI works.



14/20
@MinhQua52508258
Alphastarter



15/20
@benrayfield
very suspicious to announce opensourcing something without saying what license or where to download it



16/20
@benrayfield
"Where is SynthID available? This technology is available to Vertex AI customers using our text-to-image models, Imagen 3 and Imagen 2, which create high-quality images in a wide variety of artistic styles". Prove its opensource. Wheres one of those guys one could fork from?



17/20
@benrayfield
Why dont you call it a steganography tool? Isnt watermarking a kind of steganography if you do it well enuf? You're hiding any arbitrary data by rewriting words to have a similar meaning, and paying for that in extra length to store the data.



18/20
@234Sagyboy
@GoogleDeepMind @Google Awesome now that we have verification in place meaning better identification of content generated by AI Is it possible that we can please have Google Soundstorm and AudioLm released Thanks



19/20
@explorewithmom
Google DeepMind's SynthID is a game-changer for identifying AI-generated content. I've been exploring AI watermarking for my own work and I'm excited to see SynthID open-sourced and freely available to developers and businesses.



20/20
@AdalaceV2
Oh ok so you're actively polluting the output of the software I am paying for. Sounds like I won't be paying for it anymore.




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196




1/4
@MushtaqBilalPhD
Google has open-sourced a watermarking tool, SynthID, to identify AI-generated content.

Teachers can relax now because soon students won't be able to use AI to cheat on their assignments.



https://video-ft.twimg.com/ext_tw_v...305/pu/vid/avc1/1352x720/i6YazQbRYIH6iBnX.mp4

2/4
@MushtaqBilalPhD
Here's the full paper by Google DeepMind:
Scalable watermarking for identifying large language model outputs - Nature



3/4
@healthheronav
I've developed my own ways to detect AI-generated content, but I'm skeptical about tools like SynthID. What's to stop AI from evolving to evade watermarks?



4/4
@fcordobaot
It only works if the content was generated by Gemini after they created the watermark. So unless all the big ones use the standard watermark, it would be complicated to really achieve it!




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196




1/3
@kanpuriyanawab
Google Deepmind open-sourced SynthID today.

Here are 3 things you need to know:

What is SynthID??

SynthID has been developed for watermarking and identifying AI-generated content. This includes text, images, audio, and video.

Significance:

&gt; This tool comes when distinguishing between AI and human-created content is becoming increasingly important due to misinformation, plagiarism, and copyright violations.

How it works?

&gt; For text, SynthID modifies the probability scores of tokens during the generation process so that these modifications act as a watermark.

&gt; This watermark can then be detected through a specific scoring system that assesses the likelihood that the text was generated by a watermarked large language model (LLM).

In my opinion,

The move to open-source SynthID allows anyone to implement this technology in their own AI models to watermark and later identify AI-generated text.

Moreover, this can be seen as a step towards fostering responsible AI development by allowing widespread implementation of watermarking technology.



GarI4YwaAAEtHMM.jpg


2/3
@Yaaaaaashhh
SynthID is really cool!!!!



3/3
@kanpuriyanawab
and necessary




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
 
Top