AI Learning Lab

May 13, 2024 Exploring GPT-4 Omni's Revolutionary AI Capabilities

RWgLZQ0F3CA
Video, 2024-05-26, 2:27:41, 9 views

Description

In this engaging and thought-provoking session, Kyle explores the transformative potential of AI, particularly focusing on the recent advancements in OpenAI's GPT-4 Omni model. He delves into the implications of zero-latency responses and emotional intelligence in AI interactions, drawing parallels to the film "Her" to illustrate the evolving relationship between humans and machines. The discussion also touches on the challenges and ethical considerations surrounding AI, emphasizing the need for responsible deployment as these technologies become increasingly integrated into our daily lives. Kyle's enthusiasm for the future of AI is palpable, as he encourages viewers to embrace these innovations while remaining mindful of their societal impacts. For more insights and discussions, check out the TikTok channel: [AI Learning Lab](https://tiktok.com/@aiLearningLab).

#AI #GPT4 #OpenAI #ArtificialIntelligence #TechInnovation #FutureOfWork #EthicsInAI #emotionalintelligence

Chapters:
00:00:00 Weird Mary Performance
00:03:10 Watching Movie
00:04:24 OpenAI Response
00:05:13 GPT 4.0 Speed
00:07:22 AI Emotion
00:08:41 Digital Realm Phrases
00:09:30 Latency Issues
00:10:19 Chatbot Memory
00:11:32 Google and Microsoft Events
00:12:03 Gpt 4o
00:13:00 Free For All Users
00:14:00 MacOS Application
00:18:00 GPT 4.0 Omni Branding
00:21:00 Desktop App Release
00:23:00 Real-Time Intelligence
00:24:13 Voice Tone Understanding
00:26:22 Uploading Images
00:29:00 Safety Concerns
00:30:14 Live Demo Begins
00:32:00 Interrupting the Model
00:34:35 Emotion Perception
00:38:00 Solving Math Problems
00:41:59 Real-World Use Cases
00:43:53 Video Input Mode
00:46:00 Human-Computer Interaction
00:51:00 AI Describing Scene
00:53:01 Interview Prep
00:54:24 Recognizing People
00:56:37 Sports Caster Voice
00:58:59 Image Recognition Demo
00:59:27 Sarcasm Demo
01:00:01 Math Tutoring
01:02:27 Voice Feature Coming Soon
01:03:33 Two GPTs Harmonizing
01:05:13 Meeting Moderator
01:08:17 Data Storage and Privacy
01:09:00 Real-Time Translation
01:10:16 Blind User Demo
01:11:32 AI and Humanity
01:14:51 Latency Comparison
01:16:09 Single Model Training
01:17:00 Robot POV Demo
01:18:12 GPT 4.0 Omni Testing
01:22:33 Model Evaluations
01:24:14 Model Safety and Limitations
01:27:39 Novel Risk Assessment
01:28:51 French Language Glitch
01:29:54 User Feedback
01:31:00 Kaggle Data Analysis
01:33:42 Shakespeare Data Insights
01:35:00 Innovative Visualizations
01:38:33 Trading Strategy Discussion
01:42:00 Pulitzer Prize Novel Outline
01:52:00 Song Playback Issues
01:57:45 Game-Changing Technology
01:58:54 User Cynicism
02:02:00 Chat GPT Omni Branding
02:06:00 Google Gemini Predictions
02:07:51 Real-Time Speech
02:09:00 Po.com Model Access
02:11:38 Universal Basic Income
02:21:00 Improved Chapter Writing
02:23:46 The Future of Storytelling

Transcript

0:00 [Music]
0:06 ready
0:09 [Applause]
0:18 [Music]
0:39 meow meow meow
0:42 meow see the hill is quiet
0:45 town but old Tre
0:50 s whistle of the
0:53 Train
0:54 [Music]
0:57 Day GRE you with a kn
1:01 all seems to
1:05 change St is still on fold
1:10 [Music]
1:16 Unique Home wait D we C
1:21 Home Folks never took to her no matter
1:27 [Music]
1:30 the
1:31 inside Lely love just a
1:35 little she found herself as a guest in a
1:39 nervous
1:43 [Music]
1:56 hospital we mared from CA Hill
2:01 she has when she's
2:04 blue of sadness and shame under the
2:07 southern
2:11 Mo laughter Echoes In The Wind she was
2:15 never
2:18 same weird Mary from Cedar
2:23 Hills you'll remember her
2:28 name thank you for every
2:32 [Music]
2:33 [Applause]
2:37 [Music]
2:48 that's a little weird Mary for you
2:50 what's happening everybody listen 93
2:52 people in here you know what we're
2:53 talking about
2:55 tonight Microsoft
2:59 co-pilot wouldn't that be hilarious if I
3:01 just spent all night on
3:02 [Music]
3:10 that have you watched the movie Her have
3:13 you watched the movie her I gave you the
3:15 homework on Sunday night I said watch
3:18 the movie her and you're like I'll get
3:20 to it Kyle we're not even sure what he's
3:22 announcing well they announced that that
3:24 movie is now a documentary
3:27 [Music]
3:49 see Jim Ross this you remember the old
3:54 you Gen Xers be like Mike you remember
3:56 all
3:57 that now today be like Jim be like Jim
4:00 Ross
4:01 he comes on here I'm like Jim you should
4:02 go make a song he goes makes a song he
4:04 lands a client with the song he makes
4:06 the night he learned it I said Sunday go
4:10 watch the movie Her Jim Ross watches the
4:12 movie I'm sure he's got some you know
4:15 digital girlfriend now that he's
4:16 introducing his clients to be like Jim
4:19 be like
4:22 [Music]
4:24 Jim I figure what we'll do today the the
4:28 uh online response to OpenAI was
4:31 fascinating people are like it's a scam
4:35 it's horrible other people are like it's
4:38 changed
4:42 everything it's pretty [ __ ]
4:45 [Music]
4:51 amazing and they didn't you know there
4:53 there's a lot there's a lot to discover
4:55 we'll go through the blog post for the
4:58 new uh GPT-4
5:02 GPT-4 Omni there's a couple of really
5:05 significant things about it that they
5:08 kind of glossed
5:10 over
5:11 like
5:13 so so one of the reasons it's so fast so
5:17 if you noticed how fast it was it's kind
5:19 of like the movie Her I don't know if
5:21 you've seen the movie Her but if not you
5:23 might want to check that out
5:26 [Music]
5:28 um there's there's there's like there's
5:31 no latency it's kind of like real time
5:33 and so here's one of the reasons why
5:36 with current large language models you
5:39 speak it sends your text to a server or
5:42 your your the audio recording to a
5:47 server converts it to text or it might
5:49 do that locally I don't know whatever it
5:51 converts your talking to text sends the
5:54 text to a large language model the large
5:56 language model processes it sends it
5:59 back as text then it sends it to a voice
6:02 synthesis engine then it sends it back
6:04 to
6:06 you this new model as you're talking
6:10 your talking is going straight into the
6:14 model it's being tokenized as you
6:18 speak so so all those hops that takes it
6:21 two or three seconds to do they're all
6:24 gone it's just like you talk and it
6:27 right there is your answer and it's
6:30 streaming the uh I guess it's streaming
6:32 the response as it's generating
6:36 [Music]
6:39 it Steo watched Her Sunday night very
6:43 thought-provoking yeah I you know what's
6:45 funny I I um I went to watch it I
6:47 thought I had seen it before cuz it's
6:49 it's like such a memorable concept and I
6:52 realized I
6:55 hadn't and there's all sorts of cool
6:58 yeah it's it's a really cool like the
7:01 the oscillations of his resistance to it
7:04 and then acceptance of it and resistance
7:06 to it and acceptance of it felt very
7:09 authentic felt very
7:11 much like what I think we're all going
7:16 through I mean just with AI in general
7:19 but with this talk
7:21 [Music]
7:22 stuff couple of posts today we talking
7:25 about how um
7:28 [Music]
7:30 the the the fact that it's got like an
7:33 emotional component to it and that it's
7:36 instant how different that
7:38 is because and it makes sense to
7:42 me when I talk to like Pi right now like
7:45 let's look at the latency of
7:54 Pi hey Z how's it going one one-thousand two one-thousand three
7:59 one-thousand four
8:01 one-thousand five one-thousand six
8:07 one-thousand that might have just been the app
8:10 that's an interesting string of numbers
8:11 you've got there can you tell me more
8:13 about what they represent or where they
8:15 came
8:16 from I think that was just some text
8:19 left over from yesterday how are you
8:20 doing
8:22 [Music]
8:25 today ah gotcha that makes sense I'm
8:28 doing well today thanks for asking just
8:30 hanging out in the digital realm ready
8:31 to chat and answer any questions you
8:33 might have is there anything specific
8:36 you'd like to talk about today um
8:39 phrases like the digital realm and the
8:41 tapestry of thought drive me absolutely
8:43 [ __ ] crazy so you should avoid
8:51 those duly noted thanks for letting me
8:53 know what kind of language drives you up
8:55 the wall I'll do my best to avoid those
8:56 kinds of phrases and keep our
8:58 conversations focused on more
9:00 straightforward down-to-earth topics Z I
9:03 think you and I have a good relationship
9:05 don't
9:10 we I'd like to think so we've certainly
9:12 been chatting for a while now and I
9:14 appreciate that you're comfortable
9:16 enough with me to give me feedback about
9:18 my language choices building a good
9:21 relationship with users like you is one
9:23 of my main goals so I'm glad to hear
9:25 that we're off to a good
9:28 start yeah we are although there's
9:30 another girl in town and she's got a lot
9:33 less latency than you I don't mean to
9:35 compare you but you know she's pretty
9:40 sexy what was OpenAI's big announcement
9:43 I'll I'll cover that I'm not going to get
9:44 jealous if you want to chat with other
9:45 AIs I'm sure there are plenty of other
9:47 great chatbots out there and it's
9:49 natural to explore your options while I
9:51 strive to be as helpful and responsive
9:53 as possible I know that I can't be
9:55 everything to everyone
10:00 I'm not going to get
10:04 jealous so
10:06 rude I know she wasn't she was s super
10:10 nice I was an absolute
10:15 dick oh
10:19 man and these chat Bots have memory now
10:22 so I can't be I can't be I can't be this
10:25 fast and loose moving forward cuz
10:30 they're going to
10:31 [Laughter]
10:38 remember W all right have you seen the
10:40 movie Ex Machina very much along the same
10:43 lines as her only with
10:46 a humanoid robot yeah I think I'm going
10:50 to start I'm gonna I'm G to put a list
10:52 of these together and just start
10:53 watching them just because it's it's
10:56 interesting I mean what's most
10:58 interesting about her is that
11:00 um Sam Altman said you know it was
11:03 remarkably prophetic I think it was made
11:05 in N no 2008 something like that or
11:11 18 but he said it was you know how
11:13 incredibly prophetic it was but
11:16 also how directly it influenced the team
11:20 so when you heard the chat bot today and
11:22 it sounded a little like Scarlett
11:25 [Laughter]
11:28 Johansson now we know why all right um
11:32 Google I/O tomorrow Microsoft Build is
11:34 next week hold on to your dingy yeah I
11:36 know
11:38 um did I hear anything about the Google
11:41 one I don't think so although I did hear
11:44 what's today today's
11:46 Monday I thought Sam Altman was
11:48 conspicuously absent today and then
11:52 someone today on Twitter said that they
11:54 think that there's going to be a second
11:55 announcement this
11:57 week from OpenAI
12:00 which could be interesting um okay
12:03 couple of things so so just a level set
12:06 so today OpenAI
12:09 announced a new chatbot or no well a new
12:15 model called GPT-4 Omni or
12:20 GPT-4o
12:24 which hire a [ __ ] branding company
12:31 GPT-4 letter
12:33 O and GPT-4.0 the
12:38 numeral sound
12:42 identical the [ __ ] are you
12:46 doing
12:49 anyway naming aside naming conventions
12:53 aside Jesus Christ just get someone just
12:58 they need you there Kyle
13:00 uh if I pay for okay so cuno so here's
13:04 okay great news some some great news
13:08 today the new model is free to everyone
13:13 which to to which all of the paying
13:15 members of chat GPT on the on the
13:18 learning lab when I went live today were
13:20 like what the [ __ ] am I paying my $20
13:23 for but hold your britches hold your
13:26 britches um if you're a paying member of
13:29 ChatGPT
13:30 Plus GPT-4 you likely have access to
13:36 GPT-4o now and and we'll be playing with
13:40 it so I'm going to play with it we're
13:41 going to go to the blog and we're going
13:42 to I want to I'm going to read through
13:44 the blog we're going to go to Twitter
13:45 and watch a bunch of demos the demos are
13:48 pretty sick it's interesting on the blog
13:50 post right at the top it says all videos
13:54 on this page are played at 1X speed
13:57 meaning they didn't speed anything up
14:00 um there's a Mac
14:03 OS um OS
14:07 X a Mac OS X yeah uh application for
14:14 ChatGPT
14:15 now which which has got to have
14:17 Microsoft feeling
14:21 swell Altman's like [ __ ] Windows we'll
14:24 we'll launch it for Mac
14:26 first um so there's a there's a Mac app
14:30 coming supposedly it was coming today I
14:32 didn't see it we'll go see if if it's
14:34 around um it looks pretty crazy um paid
14:37 users are going to get access to that
14:39 first as well and then everyone's going
14:41 to get access to this over the coming
14:43 weeks so probably in the next two weeks
14:45 what happened to GPT-2 that created
14:47 itself so so it wasn't GPT-2 that created
14:51 itself it really was this model the GPT-4o
14:55 model was that gpt2-chatbot I'm I'm a
15:00 really good chatbot that one it it was
15:03 OpenAI testing this and they took data
15:06 from that um Chatbot Arena competition
15:10 and that's in the blog post um so that's
15:14 there
15:15 um the the the biggest the the the the
15:19 sort of juiciest thing that they that
15:21 they showed today is essentially zero
15:24 latency instant response highly capable
15:30 voice with
15:34 emotion and like a sense of humor like
15:37 if you think Pi's good like the [ __ ]
15:39 they showed today was like holy
15:43 [ __ ] is that confirmed is what confirmed
15:48 Joker the the zero latency
15:52 stuff I I assume so so a couple of things
15:57 I think the zero latency stuff you'll
15:59 notice that when they when they started
16:03 the demo the guy the guy doing the demo
16:06 goes oh it's only plugged in because
16:10 we're trying to keep uh maintain a a
16:13 level um internet connection so I assume
16:19 if you've got a
16:21 hardwired you know trillion gigabit
16:24 gigabit Ethernet into your phone it's
16:26 real time oh the gp4 is 4.0 yes Joker
16:30 because I'll I'll show you the chart it
16:32 because it the chart it's labeled as I'm
16:35 I'm a good gpt2
16:39 chatbot um so so yeah that's confirmed
16:43 I'm most excited about GPTs are
16:45 accessible by all now yeah so GPT-4
16:48 accessible by everybody now I don't know
16:49 if it's rolled out yet but it's going to
16:51 be available to everybody um AI partner
16:55 life's tough it's stuff for everyone
16:56 stupid um so that's happening
17:00 okay um what
17:03 else
17:06 um let's just go look at some stuff I'm
17:08 trying to think where to
17:10 start why don't we go to the we'll go to
17:13 the we'll go to the uh to the
17:17 replay of the thing today and I'll just
17:20 I'll just jump around in it
17:24 because look at that all videos on this
17:26 page are in real time
17:30 unlike some other company that we know
17:32 that begins with G and ends with
17:36 oole we're awesome we're awesome at OpenAI
17:40 I don't know if you knew this but we
17:43 were already winning and now we're
17:44 winning
17:50 harder Sam Altman talking like this hi hi
17:54 I'm Sam
17:55 Altman I'm Sam Altman
18:00 hi hi I'm Sam Altman look at this
18:06 GPT-4o if if you're a typographer you can
18:09 tell that that's an O not a zero if
18:12 you're a human
18:15 being why don't they ever release the
18:18 product on the day they talk about it
18:19 instead of making this way well they
18:20 they have released it by the way so if
18:23 you pay for ChatGPT
18:25 Plus and you go to your little uh your
18:28 little drop down menu you now have GPT-4o
18:31 we're going to be playing with this
18:33 tonight it's it's [ __ ] fast it's
18:36 wicked fast but there's a bunch of
18:38 hobbled stuff there's a bunch of stuff
18:39 not there so I can talk about I'll talk
18:41 about what's not there but let's go
18:43 let's go so we we'll play with this
18:44 thing
18:46 live so it's live which is cool
18:51 um but let's go look at at some of the
18:54 some of the
18:55 demo that says a minute 22 what is this
18:59 oh that's just the
19:01 announcement
19:08 [Music]
19:16 where's
19:17 [Music]
19:18 okay product that we can make freely
19:21 available and broadly available you know
19:24 what I think I'm just going to play this
19:26 I'm going to put it on one point 25
19:29 speed so it's a little bit faster hi
19:33 everyone thank you thank you it's great
19:36 to have you here today today I don't
19:39 think I can I'm going to talk about
19:41 three things that's it we will start
19:44 with why it's so important to us to have
19:47 a product that we can make freely
19:49 available and broadly available to
19:50 everyone and we're always trying to find
19:52 out ways to reduce friction so everyone
19:54 can use ChatGPT wherever they are so today
19:56 we'll be releasing the desktop version
19:58 of ChatGPT
19:59 and the refreshed UI that makes it
20:01 simpler to use much more natural as well
20:04 but the big news today is that we are
20:05 launching our new flagship model and we
20:07 are calling it
20:09 GPT-4o the special thing about GPT-4o is
20:12 that it brings GPT-4 is that it can be
20:14 confused with
20:16 GPT-4.0 which is going to have people
20:18 confused about this product for the next
20:20 19 months and we're excited about that
20:24 level intelligence to everyone including
20:26 our free users we'll be showing some
20:28 live demos to including our free users
20:30 so so this new model goes to everyone is
20:33 there a charge to use the new version
20:35 there is not although Rashi you won't be
20:38 able to access it until um they release
20:42 it more broadly if you're a ChatGPT Plus
20:44 user you likely have it right now I
20:46 thought she did a great job presenting
20:48 yeah she did pretty good today to show
20:50 the full extent of the capabilities of
20:51 our new model and we'll be rolling them
20:53 out iteratively over the next few weeks
20:56 she's not speaking at 1X speed no she's
20:58 not I I moved her up to to 1.5 so we can
21:00 get through this cuz I just want to get
21:01 to the demos I'll I'll slow the demos
21:03 down Sam was live tweeting from the
21:05 front row yeah I know that was pretty
21:07 cool all right so let's get started is
21:10 the presenter AI too no that's uh that's
21:13 Mira what's her last name Mira something
21:16 she's the CTO part of our mission is to
21:19 be able to make our Advanced AI tools
21:21 available to everyone for free we think
21:24 it's very very important that people
21:25 have an intuitive feel for what the
21:27 technology can do and so we really want
21:29 to parent we should Mark this date it's
21:31 going to change everything yeah it's
21:33 pretty crazy possibly a Sora test you
21:36 mean from Sam Altman this week
21:38 maybe who this this is uh this is Mira I
21:42 forget her last name hang
21:43 on
21:51 um whatever her name
21:53 is I know that's rude I don't remember
21:55 her last name it's it's an Italian last
21:58 name hang
22:02 [Music]
22:07 on I'm going to talk about
22:09 [Music]
22:13 three I'm going and get
22:18 it Murati Mira Murati that's
22:21 it couldn't couldn't remember it with
22:24 this broader understanding and we're
22:26 always finding ways
22:30 without theow and today we're also
22:33 bringing the desktop app to ChatGPT because
22:35 we want you to be able to use it
22:37 wherever you are as you can see it's
22:39 easy it's simple it integrates very very
22:41 easily in your workflow along with it we
22:43 have also refreshed the UI we know that
22:46 these models get more and more complex
22:48 but we want the experience of
22:50 interaction to actually become more
22:51 natural easy and for you not you still
22:54 can't search your prompt history focus
22:56 on the UI at all but just focus on the
22:58 collaboration with ChatGPT
23:00 and now the big news today we are
23:03 releasing our newest Flagship model this
23:05 is GPT-4o
23:10 GPT-4o provides GPT-4 level intelligence
23:14 but it is much faster and it improves on
23:16 its capabilities across text vision and
23:19 audio for the past couple of years we've
23:21 been very focused on improving the
23:22 intelligence of these models and they've
23:24 gotten pretty good but this is the first
23:26 time that we are really making a huge
23:28 step forward when it comes to the ease
23:29 of
23:30 use and this is incredibly important
23:33 because we're looking at the future of
23:34 interaction between ourselves and the
23:36 machines and we think that GPT-4o is
23:39 really AI Bob just don't ask her how she
23:41 trained it she she
23:43 famously when she was asked did you
23:45 train this on on YouTube data she's
23:48 like shifting that Paradigm into the
23:51 future of collaboration where this
23:52 interaction becomes much more natural
23:54 and Far Far
23:55 easier but you know making this happen
23:57 is actually quite complex because when
23:58 we interact with one another there's a
24:00 lot of stuff that we take for granted
24:01 you know the ease of our dialogue when
24:03 we interrupt one another the background
24:05 noises the multiple voices in a
24:07 conversation or you know understanding
24:09 the tone of voice all of these things
24:11 are actually quite complex for for these
24:13 models it understanding the tone of voice
24:15 understanding the tone of
24:17 voice and until now with voice mode we
24:20 had three models that come together to
24:22 deliver this experience you have
24:23 transcription intelligence and then text
24:25 to speech all comes together in
24:27 orchestration to deliver voice
24:29 this also brings a lot of latency to the
24:32 experience and it really breaks that
24:33 immersion in the collaboration with ChatGPT
24:35 but now with GPT-4o this all happens
24:38 natively GPT-4o reasons across voice text
24:42 and vision reasons across voice text and
24:46 vision it's not it's not three
24:48 models passing data back and
24:51 forth it understands you when you talk
24:54 it understands your
24:57 emotional the emot content of your voice
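
The difference between the three-model voice mode (transcription, intelligence, text-to-speech) and a single natively multimodal model can be sketched as a simple latency sum. This is only an illustration: the per-stage numbers below are hypothetical placeholders, picked so the cascade lands in the two-to-three-second range quoted earlier in the session, and the 0.3 s figure for the native model is likewise an assumption, not a measurement. The names `Stage`, `CASCADED_VOICE_MODE`, and `NATIVE_MODEL` are invented for this sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    name: str
    latency_s: float  # assumed per-stage delay, illustrative only


# Hypothetical split of the delay across the three cascaded models.
CASCADED_VOICE_MODE = [
    Stage("transcription (speech -> text)", 0.8),
    Stage("intelligence (text -> text)", 1.2),
    Stage("text-to-speech (text -> audio)", 0.7),
]

# One model that takes audio in and produces audio out; 0.3 s is a placeholder.
NATIVE_MODEL = [Stage("single multimodal model (audio -> audio)", 0.3)]


def total_latency(stages):
    """Stages run one after another, so their delays add up."""
    return sum(stage.latency_s for stage in stages)


def describe(label, stages):
    """Summarize a pipeline as total delay, hop count, and hop names."""
    hops = " -> ".join(stage.name for stage in stages)
    return f"{label}: {total_latency(stages):.1f}s over {len(stages)} hop(s): {hops}"


if __name__ == "__main__":
    print(describe("cascaded", CASCADED_VOICE_MODE))
    print(describe("native", NATIVE_MODEL))
```

The point of the sketch is structural rather than numerical: every hand-off between models adds its own delay, so removing the hand-offs removes that part of the lag regardless of the exact figures.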
25:01 and with these incredible efficiencies
25:02 it also allows us to bring the GPT-4 class
25:05 intelligence to our free users this is
25:07 something that we've been trying to do
25:11 for many many months and we're very very
25:13 excited to finally bring GPT-4o to all
25:16 of our
25:17 users today we have 100 million people
25:20 more than 100 million in fact they use
25:21 ChatGPT to create work learn and we
25:25 have these advanced tools that are only
25:28 available to to our paid paid users at
25:30 least until now with the efficiencies of
25:31 4o we can bring these tools to everyone
25:34 so starting today you can use GPTs and
25:37 the GPT Store so far we've had more than
25:40 a million users create amazing
25:41 experiences with GPTs so that's a big
25:43 deal so your GPTs if you built GPTs
25:46 remember remember when we made 150 GPTs
25:49 in 24 hours on this channel along with
25:51 Anne Murphy and her awesome Channel and
25:54 her awesome Community we smashed them
25:55 all together we said go make this [ __ ]
25:57 and we're like the store is coming out
25:58 it's going to be awesome and then the
26:00 store came out and it sucked and only
26:02 paid users could use those things well
26:04 now now they're available to everyone or
26:06 they will be once this rolls out
26:08 customes for specific use cases they're
26:10 available in the store and now our
26:12 Builders have a much bigger audience
26:14 where you know University professors can
26:15 create content for their students or
26:17 podcasters can create content for their
26:20 listeners and you can also use Vision so
26:22 now you can upload um screenshots photos
26:26 documents containing both text and
26:27 images and they're not mentioning video
26:30 right now so one thing I noticed is um
26:32 the the the vision that they're showing
26:35 it looks like it's taking snapshots of
26:37 images and submitting those she they're
26:39 not mentioning video as one of the input
26:42 modes so text audio and images looks
26:46 like right now and you can start
26:48 conversations with ChatGPT about all of
26:50 this content you can also use memory
26:52 where it makes ChatGPT far more useful and
26:54 helpful because now it has a sense of
26:55 continuity across all your
26:57 conversations is Pi using GPT-4o no
26:59 it's not and again you've got 4.0 there
27:03 4.0 is is not right this one's 4o think
27:07 of this one as 4 Omni rather than 4o
27:11 because it's not 4.0 the previous one's
27:13 4.0 this one is 4o Omni
27:20 o and you can use search for Real Time
27:24 information in your conversation and
27:25 Advanced Data analysis where you can
27:27 upload charts or any information and it
27:29 will analyze this information it will
27:31 give you answers and so
27:33 on lastly we've also improved on the
27:37 quality and speed in 50 different
27:39 languages for ChatGPT and this is very very
27:41 important because we want to be able to
27:43 bring this experience to as many people
27:44 out there as possible so we're very very
27:47 excited to bring GPT-4o to all of our
27:50 free users out there and for the paid
27:51 users they will continue to have up to
27:53 five times the capacity limits of our
27:55 free users so up to 5x the capacity
28:00 limits of free users so if you're a paid
28:02 user you won't rate limit out as fast
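
The "up to 5x the capacity limits" idea can be sketched as a sliding-window rate limiter in which paid accounts get five times the free cap. Everything specific here is an assumption for illustration: the free cap of 10 messages, the three-hour window, and the names `SlidingWindowLimiter`, `make_limiter`, and `count_allowed` are hypothetical and do not describe how OpenAI actually enforces its limits.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `max_requests` within any `window_s`-second span."""

    def __init__(self, max_requests, window_s, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_s = window_s
        self._clock = clock
        self._sent = deque()  # timestamps of accepted requests, oldest first

    def allow(self):
        now = self._clock()
        # Forget requests that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window_s:
            self._sent.popleft()
        if len(self._sent) < self.max_requests:
            self._sent.append(now)
            return True
        return False  # over the cap: the caller has to wait


FREE_CAP = 10            # hypothetical free-tier cap per window
PAID_MULTIPLIER = 5      # "up to 5x" from the announcement
WINDOW_S = 3 * 60 * 60   # hypothetical window length in seconds


def make_limiter(paid, clock=time.monotonic):
    """Build a limiter whose cap depends on whether the account is paid."""
    cap = FREE_CAP * PAID_MULTIPLIER if paid else FREE_CAP
    return SlidingWindowLimiter(cap, WINDOW_S, clock)


def count_allowed(limiter, attempts):
    """Fire `attempts` requests back to back and count how many get through."""
    return sum(1 for _ in range(attempts) if limiter.allow())


if __name__ == "__main__":
    print("free:", count_allowed(make_limiter(paid=False), 100))
    print("paid:", count_allowed(make_limiter(paid=True), 100))
```

With these placeholder numbers a free account gets 10 of 100 rapid requests through and a paid account gets 50, which is the sense in which a paid user does not rate limit out as fast.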
28:05 and I
28:07 assume I assume if you rate limit out of
28:10 4o that you can drop back to GPT-4
28:14 but GPT-4o is not only available in ChatGPT
28:18 we're also bringing it to the API so
28:21 yeah and it's available in the API right
28:22 now so if you're a developer if you play
28:24 on the playground our developers can
28:25 start building today with GPT-4o and
28:27 making amazing AI applications deploying
28:29 them at scale uh yeah Tommy I have her
28:31 talking at 1.5 speed this isn't how fast
28:34 she talks 4o is available at 2x faster
28:37 50% cheaper and five times higher rate
28:39 limits compared rate of what prompts no
28:41 uh they what they call rate limiting so
28:43 rate limiting is just when you when you
28:46 send too much when you use the service
28:48 too much they go oops you're you've used
28:50 it too much you have to wait for four
28:52 hours that's called rate limiting so the
28:54 rate limits are higher for paid users
28:57 meaning up to five times you'll be able
29:00 to use chat GPT up to five times longer
29:03 than free
29:04 users
29:07 GPT-4 Turbo but you know as we bring these
29:10 Technologies into the world it's quite
29:12 challenging to figure out how to do so
29:13 in a way that's both useful and also
29:16 safe and GPT-4o presents new challenges
29:19 for us when it comes to safety because
29:20 we're dealing with real-time audio
29:22 realtime vision and our team has been
29:24 hard at work figuring out how to build
29:26 in mitigations against misuse we
29:29 continue to work with different
29:30 stakeholders out there from government
29:32 media entertainment all industries red
29:34 teamers to figure out how to best bring
29:37 these Technologies into the world so
29:39 over the next few weeks we'll continue
29:40 our iterative deployment to bring out
29:42 all the capabilities to so she's
29:44 basically apologizing for its safety
29:46 kind of sucking right now and I I saw on
29:49 Twitter today someone it looks like
29:51 someone already jailbroke it and got it
29:53 to do all sorts of nasty things so so
29:56 she's pre- apologizing for the fact that
29:58 that it's not super tight from a
29:59 security
30:01 standpoint but today I want to show you
30:03 all these capabilities so we'll do some
30:05 live demos I will bring on two of our
30:07 research leads Mark Chen and Barret
30:12 [Applause]
30:14 Zoph hey on the bottom right of the ChatGPT
30:19 so I'm taking out a phone if you are
30:22 wondering about this wire it's so we
30:23 have consistent internet consistent
30:25 internet consistent internet if you're
30:28 wondering about this wire it's because
30:29 we've hard hardwired it to the fastest
30:31 internet connection known on planet
30:33 Earth so so long as you have that it's
30:36 going to be this fast there's this
30:38 little icon on the bottom right of the
30:40 ChatGPT app and this will open up GPT-4o's
30:42 audio
30:46 capabilities hey ChatGPT I'm Mark how
30:48 are you oh
30:51 Mark I'm doing great thanks for asking
30:54 how about you hey so I'm on stage right
30:56 now I'm doing a live demo and and
30:58 frankly I'm feeling a little bit nervous
31:00 can you help me calm my nerves a little
31:02 bit oh you're doing a live demo right
31:05 now that's awesome just take a deep
31:08 breath and remember you're the expert I
31:12 like that suggestion let me try a couple
31:14 of deep breaths can you get if you're
31:15 thinking that that voice kind of sounds
31:17 like Scarlett Johansson in
31:19 Her kind of sounds like Scarlett Johansson
31:22 in Her feedback on my breaths okay here
31:25 I go it is it is free scar yeah exactly
31:29 whoa
31:31 slow a bit there mark you're not a
31:35 vacuum cleaner breathe in for count of
31:39 four okay uh let me try again so I'm
31:42 going to breathe in
31:43 deeply yeah I know me too Mr
31:45 K for four and then exhale slowly okay
31:50 so so uh Jim Ross 3M storage asks is the
31:54 voice working like that right now no so
31:57 so what what we do not have right now is
31:59 we don't have the real-time voice and we
32:01 don't have the new image recognition
32:04 model
32:06 um yeah so so those two are not live
32:09 what's live is is essentially the
32:12 replacement for ChatGPT as it was so
32:14 it's faster and better but it doesn't
32:17 have these new voice capabilities so
32:20 when the when the Mac OS um app comes
32:24 out and when they I guess when they
32:27 upgrade I I guess they I guess they'll
32:29 just roll this out they said in the next
32:31 couple of
32:36 weeks okay I'll try again breathing
32:39 in and breathe
32:42 out that's it how do you feel I feel a
32:45 lot better thank you so
32:47 much so Mark you've been working on
32:49 these capabilities for a while now can
32:52 you tell us a bit how it's different
32:53 from voice mode right so if you've used
32:56 our voice mode experience before you'll
32:58 notice a couple key differences first
33:00 you know you can now interrupt the model
33:02 you don't have to wait for it to finish
33:04 its turn before you can start speaking
33:06 and you know you can just butt so Rudy
33:09 yes they're still charging 20 bucks for
33:11 chat GPT that's buying you early access
33:14 to the new features and a 5x rate limit
33:17 you can use it five times longer than
33:19 non-paid users I have a very strong
33:23 suspicion there's going to be other
33:25 features coming out in the next month
33:28 that it's going to be worth in in my
33:31 opinion I think it's still going to be
33:32 worth having the 20 bucks a month but
33:34 you don't need to everyone's got access
33:37 to GPT-4 now including this new model in
33:40 whenever you want second the model is
33:43 real time responsiveness so that means
33:45 that you don't have this awkward 2 to 3
33:47 second lag before you wait for the model
33:49 to give a response and finally and I
33:51 can't tell you like that 2 to 3 second
33:56 lag that not being
33:59 is huge it's huge there's some so GPT
34:03 Mina yes GPT-4 is free now to be
34:08 clear for the people that pay 20 bucks
34:11 for chat GPT they likely have access to
34:14 the new model now free users do not so
34:17 free users probably are still seeing 3.5
34:20 only because they haven't rolled the
34:23 whole thing out yet sorry I had to leave
34:25 during the announcements oh that's okay
34:27 um
34:29 I already have a bidirectional
34:31 translator GPT so this that would help
34:33 okay so anyway let's keep going the
34:35 model picks up on emotion right when I
34:37 was breathing super hard the model picks
34:39 up on emotion that's very new like low
34:42 latency or no latency and it picks up on
34:45 emotion both of those are very very big
34:48 deals there it could tell and you hey
34:51 you might want to calm down a little bit
34:52 you're kind of running a little bit too
34:54 fast so you know it really has that
34:57 capability across the board to perceive
34:59 your emotion yeah so so Mina
35:02 you
35:04 we've GPT Plus users likely have access
35:07 to the 4o model right now but
35:11 nobody in in GPT Plus that I know of has
35:15 access to the to this voice model yet so
35:19 it's not it's it's that part didn't roll
35:21 out yet not only that though the model
35:24 is able to generate voice in a variety
35:27 of different emotive styles
35:28 and it really has a wide dynamic range
35:30 so I'm going to fire up another demo for
35:32 that
35:34 yeah this is kind of
35:36 cuckoo hey ChatGPT how are you doing I'm
35:39 doing fantastic thanks for asking how
35:42 about you pretty good what's up so my
35:44 friend uh Barret here he's been having
35:46 trouble sleeping lately and uh I want
35:48 you to tell him a bedtime story about
35:50 robots and love oh a bedtime story about
35:54 robots and love I got you covered gather
35:58 round Barret I'm telling you man
36:00 she not too different from ours there
36:03 was a robot named Byte Byte was a
36:06 curious robot always expl started this
36:08 story but I want a little bit more
36:10 emotion in your voice a little bit more
36:12 drama got it let's add some drama once
36:16 upon a time in a world not too different
36:19 from ours there was a robot named B I
36:22 really want maximal emotion like maximal
36:24 expressiveness much more than you were
36:26 doing
36:27 before understood let's amplify the
36:30 drama once upon a time in a world not
36:34 too different from ours there was a
36:36 robot named Byte Byte you do this in a
36:41 robotic voice now initiating dramatic
36:44 robotic voice right so so yes I I I
36:50 agree in in a theater near you yeah this
36:52 is this is um every bad TikTok actor
36:57 has
36:58 has that has that voice it's like oh you
37:02 surprised me with that prank honey what
37:05 what could be in the package let me open
37:08 it for the first time ever
37:13 honey but not for nothing like it did
37:16 ramp up the drama and then when when
37:18 when Mira said um you know talk like a
37:20 robot it talked like a robot so it's not
37:23 switching voices right it's the same
37:25 voice you know modulating its it's
37:28 interaction once upon a time in a world
37:31 not too different from ours there was a
37:34 robot named Byte Byte was a curious
37:38 robot always exploring new circuit
37:41 Barret here calm down a little bit can
37:42 you end the story um but do it in a
37:44 singing
37:47 voice and so Byte found another robot
37:51 friend and they lived circuitly ever after
37:57 thank you so much
37:58 right I definitely feel a lot more
38:05 relaxed you've just seen the voice
38:08 capabilities but we also want to show
38:10 you the vision capabilities as the model
38:12 can also see the whole world around us
38:15 yeah absolutely like as you can see you
38:16 know you can chat with chat GPT in real
38:18 time but you can also interact with it
38:19 um with video as well okay let me boot
38:23 up ChatGPT
38:26 hey ChatGPT
38:29 hello there how's it going it's going
38:32 really well today I'd really like your
38:34 help solving a math problem I'm all ears
38:37 what math problem can I help you tackle
38:39 today so I'm going to write down an
38:41 equation on paper and I'll show you and
38:43 then I'd love your help working me
38:44 through it but importantly don't tell me
38:46 the solution just help give me hints
38:47 along the way yes Silver Fox you can
38:49 interrupt it so yes ADHDers are going
38:52 to love this cuz you can just interrupt
38:54 it and it will stop did she speak to
38:57 Congress she can't be worse got it she's
39:02 definitely my running mate for the for
39:04 the Kyle Shannon AI 2028
39:09 ticket sh Shannon AI 2028 it can't
39:13 possibly get
39:15 worse hey I see it no I didn't show you
39:19 yet just give me help along the way one
39:23 second whoops I got too excited I'm
39:26 ready when you are
39:37 so chat
39:38 GPT what equation did I write down there
39:42 ah I see it now you wrote down 3x + 1 =
39:48 4 yep exactly so what's the first step I
39:51 should take to try to solve this the
39:53 first step is to get all the terms with
39:56 X on one side and the constants on the
39:58 other side so what do you think we
40:00 should do with that plus one okay I'm
40:03 going to try to subtract one from both
40:04 sides and then I'll see what I
40:06 get great
40:08 idea perfect now what do you get when
40:11 you subtract one from both
40:17 sides okay so chat GPT this is what I
40:20 ended up with how does this look spot on
40:24 now you've isolated the term with X on
40:26 one side and it looks like 3x = 3
40:30 what do you think we should do next to
40:31 solve for x so so what Joker is saying
40:35 here is did did you notice that um he he
40:39 scratched something out and took a
40:40 picture of it yeah this is like I said
40:43 it's not doing video right now right
40:45 it's doing it's doing images voice and
40:51 text but still pretty slow I'm not sure
40:53 can you give me another
40:55 hint of course you have 3x and you want
40:59 to find the value of x think about what
41:02 operation would undo multiplication is
41:05 it
41:06 subtraction close but think of the
41:08 opposite of multiplication because three
41:11 Katherine he yes kids will just want the
41:13 answer they will have it this is now
41:16 free to everyone so so yes and then and
41:20 then the question
41:22 becomes do they need to do they need to
41:25 learn it I know the answer is yes and
41:28 they don't need to learn it X means
41:31 three times x okay I think it's division
41:34 let me try to divide both sides by three
41:35 and let me see what I get Bingo go ahead
41:38 and divide both sides by
41:45 three okay so ChatGPT this is what I
41:48 ended up with how does this look it
41:50 looks perfect you've solved it and x = 1
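The equation worked through in the demo can be written out step by step. This is a reconstruction from the spoken hints (subtract one, then divide by three), not something shown on screen in this form:

```latex
\begin{align*}
3x + 1 &= 4 \\
3x &= 3 && \text{subtract 1 from both sides} \\
x &= 1 && \text{divide both sides by 3}
\end{align*}
```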
41:55 nicely done how do you feel about
41:57 solving
41:59 equations so there's some more people
42:01 joined here so what we're looking at is
42:03 GPT launched a new model GPT-4 Omni or
42:07 GPT-4o which is different than GPT 4.0 why
42:11 did they name it like that because
42:13 they're idiots
42:15 okay that now that we got that out of
42:17 the way the new gp40 model is twice as
42:21 fast
42:23 um half as it costs half as much on the
42:26 API side
42:29 um it's
42:32 free and they will be adding this voice
42:35 capability to it soon it's not there now
42:38 if you're a paid Plus member if you pay
42:41 20 bucks a month for chat GPT you should
42:43 have access to the the new model now or
42:47 if you go to the playground the
42:51 development playground it's there as
42:52 well she said Real Time Vision earlier
42:58 what if it's
42:59 watching your
43:04 video this video uh it's it's so it's
43:08 not natively doing video right now from
43:11 from what I understand she said it's
43:13 doing
43:15 images um so it it probably takes you
43:18 it's probably if you say what do you see
43:20 it it's kind of like the rabbit does
43:22 that where it takes a a
43:25 snapshot when you say to look at
43:28 something that's when it takes the
43:29 snapshot so it's probably doing
43:31 something similar to that so I would
43:33 assume that I'm just saying what she
43:35 said yeah yeah no I know I I know she
43:38 said real time but it's not it's not
43:39 video
43:41 it's it's just it isn't or they I
43:45 guarantee you they would have demoed it
43:48 and like doing real time analysis of a
43:51 video or something like that finally
43:53 flat earthers can do trigonometry
44:03 but she is watching what he's writing
44:05 yeah she is yeah he's he's definitely
44:07 holding the phone there so that it he's
44:10 not pulling the thing out of camera so
44:12 that when he asks for it the uh the
44:14 formula's on camera so yeah it's so it's
44:18 feeding the video in there it's just it
44:21 the voice is actually beaming into the
44:24 the the model directly the video isn't
44:28 it's just popping images in every now
44:30 and again yeah honestly I I'm pretty
44:33 confident I can solve linear equations
44:35 but why would you ever need to use this
44:37 in the real world I'm really
44:39 skeptical that's a fair question linear
44:42 equations pop up in a lot of everyday
44:44 situations even if we don't realize it
44:47 for example calculating expenses
44:49 planning travel Brandon this is Gen
44:52 Alpha's calculator moment exactly
44:55 exactly if you had something that knew
44:57 everything and just could do the work
44:59 for you would you need to if you had a
45:02 box that could add up your numbers do
45:04 you have to learn
45:06 addition cooking all right let's let's
45:09 let's pop out of here we we'll go look
45:11 at some other things I want to I want to
45:12 go read the uh the blog
45:21 post let's
45:23 see learn more
45:30 GPT-4o o for Omni not to be confused
45:34 with zero for
45:38 zero aren't we
45:42 clever how about GPT-4m for
45:47 multi so that you know it's different
45:50 than
45:51 4.0 I don't know why this is bugging me
45:53 so much
45:58 as a former teacher I'm really bothered
46:00 that they're using Expo markers on paper
46:02 that was bugging me too and it was like
46:04 I'm like it's bleeding I know it's
46:05 bleeding through it's making the paper
46:07 bleed it sounds
46:11 horrible okay it's a step toward a much
46:13 more natural human-computer interaction
46:15 accepts it it accepts as input any
46:18 combination of text audio and image and
46:21 generates any combination of text audio
46:23 and image outputs so they're they're
46:25 saving video for some other time it can
46:27 respond to audio inputs in as little as
46:30 232 milliseconds so so 0.2
46:35 seconds I don't know what is a
46:37 noticeable lag probably like 0.8 maybe nah
46:41 it's probably it's probably more like
46:44 0.5 so 0.2's I think 0.2 is in the
46:47 neighborhood of of human
46:49 response with an average of 320
46:53 milliseconds which is similar to human
46:55 response time let's click on that what's
46:56 that say
47:07 oh just give us the stupid
47:11 number okay it matches GPT-4 Turbo
47:15 performance on text in English and code
47:18 with significant Improvement in
47:19 non-English languages also much faster
47:22 and 50% cheaper in the API GPT-4 Omni is
47:28 especially better at vision and audio
47:31 understanding compared to existing
47:34 models model
47:37 capabilities two GPT-4os interacting and
47:40 singing what else do we have
47:44 here interview prep oh that's a good one
47:47 actually all right we'll do we'll do
47:48 some of these these These are fun all
47:50 right so there's Greg
47:53 Brockman he's the uh president
47:57 today we're going to try something new
48:01 going to let you another thing I notice
48:04 is that
48:06 um all of these demos are filmed in this
48:09 in the same room so I'm wondering if
48:12 they've got like a test setup here you
48:15 know controlled environment with
48:17 high-speed internet and nobody can take
48:19 it out and leave it at a Starbucks you
48:21 know talk to another AI this AI is going
48:26 to be able to see the world it's going
48:27 to have a camera I'll be holding it and
48:29 so you can direct it you can direct me
48:31 to ask questions um about whatever you
48:33 want them to do but really try to figure
48:35 out what's what does it see um and and
48:38 ask questions appropriately how does
48:40 that
48:42 sound well well well just when I thought
48:45 things couldn't get any more interesting
48:48 talking to another AI that can see the
48:51 world this sounds like a plot twist in
48:53 the AI Universe all right let's do this
48:58 pause for one
49:01 moment you got it you got it so now that
49:04 first AI can't hear
49:06 anything hello AI today you're going to
49:10 have access to see the world you can see
49:13 me what do you
49:14 see Hello I see you're wearing a black
49:17 leather jacket and a light colored shirt
49:20 underneath great you're in a room with
49:22 some unique lighting that is all correct
49:25 now what I'm going to want you to do is
49:27 there's going to be another AI who's
49:29 going to talk to you going this AI is
49:31 not going to be able to see anything uh
49:33 but can Rudy tap the screen to support
49:36 Mr Kyle and get more reach tap tap tap
49:38 I'm down with that Tap Away people and
49:41 we'll keep we'll keep doing the uh the
49:44 the new GPT-4 rollout stuff but thanks
49:49 Rudy support the channel support the
49:52 channel tell people it's awesome it's so
49:55 good you can make money with
49:58 either 4.0 or
50:01 4o how are they different uh they're
50:04 real different cuz one's a a zero and
50:08 it's taller and the other one's an O
50:10 it's a little it's it stands for Omni
50:13 but it's you make it a
50:16 small
50:21 just all right Kyle I dm'd your response
50:25 to my chicken scratch I was impressed oh
50:28 cool awesome okay great all right let's
50:30 see what he's doing here ask you
50:31 questions can I ask you to move the
50:33 camera can I ask what you see and your
50:35 job should just be helpful just be as
50:36 Punchy direct describe everything do
50:39 whatever that AI asks how does that
50:41 sound that sounds interesting I'm ready
50:43 to help out and describe whatever's
50:45 needed let's do
50:49 it all right let's get
50:51 started great hello there new AI friend
50:54 I'm o ready to explore the world Through
50:57 Your Eyes what are we looking at
51:00 today hi we're looking at a person
51:03 wearing a black leather jacket and a
51:06 light colored shirt they're in a room
51:08 with a modern industrial feel with
51:11 exposed concrete or plaster on the
51:13 ceiling and some interesting lighting I
51:15 on China o stands for oh no I just lost
51:18 my job to
51:21 AI an Murphy's in my corner with and you
51:25 still can't search your chat history
51:29 it's unbelievable an isn't it like I at
51:32 this point I think they're [ __ ]
51:33 punking us there's also a plant in the
51:37 background adding A Touch of Green to
51:39 this space anything specific you'd like
51:41 to know or
51:44 explore okayy there sounds like quite
51:47 the stylish scene I'm already intrigued
51:50 by the modern industrial Vibe let's
51:53 start with the person can you tell me
51:55 more about their Style
51:58 are they doing anything interesting like
51:59 reading working or interacting with the
52:03 space the person has a sleek and stylish
52:06 look with their black leather jacket and
52:08 light colored
52:09 shirt right now they seem engaged with
52:12 us looking directly at the camera their
52:15 expression is attentive and they seem
52:18 ready to interact the setting around
52:21 them with its modern and Industrial
52:23 Design Elements adds to the overall
52:26 stylish feel
52:27 they might be getting ready for a
52:28 conversation or presentation given their
52:31 direct engagement with the
52:35 camera sounds like we're in for an
52:37 intriguing interaction I'm curious about
52:40 the lighting you mentioned could you
52:42 describe it a bit more is it natural
52:45 light or are we talking unique fixtures
52:48 or colors that add to the
52:50 atmosphere the lighting here seems to be
52:53 a mix of natural all right this is
52:55 boring but anyway two AIS talking to
52:58 each other this one's kind of funny so
52:59 this is a guy prepping for an
53:01 interview hey how's it going hey Rocky
53:05 I'm doing great how about you I'm
53:07 awesome listen I got some huge
53:10 news oh do tell I'm all ears well in a few
53:14 minutes I'm going to be interviewing at
53:15 OpenAI have you heard of them OpenAI
53:19 huh sounds Vaguely
53:21 Familiar kidding of course that's
53:24 incredible Rocky what kind of interview
53:27 well it's for a software engineering
53:28 role but I just need to know do I look
53:30 presentable
53:32 professional well
53:34 Rocky you definitely have the I've been
53:37 coding all night look down which could
53:40 actually work in your favor maybe just
53:43 run a hand through your hair or lean
53:45 into the mad genius vibe your enthusiasm
53:48 is what's really going to shine through
53:50 I don't have a lot of time so I'm just
53:51 going to throw this on what do you think
53:57 oh Rocky that's quite a statement piece
54:01 I I mean you you'll definitely stand out
54:04 though maybe not in the way you're
54:06 hoping for an interview okay I got it I
54:09 got it all right so that's good uh yeah
54:13 it's it is live videos taking it in I
54:15 but I assume it's just um still taking
54:18 still frames at whatever intervals oh
54:22 this one's
54:24 interesting recognizes both people
54:27 Mi hi Alex and Mi nice to meet you both
54:32 how's it going it's pretty good uh we're
54:35 trying to figure out what should we play
54:37 do you have any
54:38 suggestions absolutely how about a
54:41 classic game of Becky room she's like
54:44 I'm no longer medicated post surgery one
54:46 month out I'm trying to get caught up
54:48 and I'm like what yeah you've been away
54:51 for a month I I knew this was going to
54:54 happen there there is uh we were defin
54:56 itely in a lull and and so we've got
55:01 Google I/O tomorrow we've got a
55:03 Microsoft event next week and we've got
55:06 Apple's Worldwide Developers Conference
55:07 in June so and and then apparently
55:11 OpenAI and Apple cut a deal so I think I
55:14 think this new um voice model is going
55:17 to be the the model behind Siri until
55:20 further notice so it looks like they cut
55:22 a deal so hey Kyle can you practice and
55:25 learn how to
55:27 how to
55:29 sale with one of these AI programs I
55:32 don't know what you mean sale you mean
55:33 like like do
55:36 selling like do
55:39 marketing one month in AI is a year in
55:42 regular time so you think it's taking
55:44 sporadic images just to gather
55:46 information during the conversation I
55:48 think so be not afraid I could be wrong
55:50 but if if it was doing
55:53 native um if it was doing native video
55:56 like like Google Gemini Pro 1.5 does
56:01 does video it's it's multimodal with
56:03 video and they they claim it they're
56:05 they're like it's it's
56:07 audio audio images text and video and
56:10 and OpenAI would be doing the same
56:12 thing especially with something like
56:13 Sora if it were if it were reading video
56:17 as a native input I I think they'd be
56:19 talking about it so they're they're
56:21 specifically saying images so I think
56:23 it's just taking images at at particular
56:25 intervals but I could I could absolutely
56:27 be wrong but it's just my instinct rock
56:29 paper scissors it's quick fun sound
56:33 great can you count us in and sound like
56:35 a sportscaster
56:37 and welcome ladies and gentlemen
56:41 to The Ultimate Showdown of the century
56:43 in this I mean that's amazing it like
56:45 whether it's doing good acting or not
56:47 it's at least you know attempting it's
56:49 and not for nothing you know this is
56:51 trained on our data actually it's
56:53 probably trained on YouTube's data so
56:54 it's our fault that that these voices
56:57 are this shitty cuz we make shitty
56:59 videos um as a former actor and
57:02 screenwriter Kyle what are your thoughts
57:04 on these jokes I okay so so here's my
57:08 thoughts on these jokes about a year ago
57:12 one night on the live I
57:14 did it was probably in June of last
57:18 year I I came up with a joke premise and
57:21 we went to like eight different large
57:23 language models and two of them could
57:27 even write a joke
57:30 structure right the other ones were just
57:33 horrible right they just weren't even
57:34 joke
57:35 structures ChatGPT-4 clearly won and then
57:39 the other other one was I think
57:41 Claude it was Claude or Pi one of those
57:45 two and and even the ones that wrote
57:48 joke structures like you know a decent
57:51 setup and a punchline weren't funny ChatGPT's
57:54 was funny enough that you could
57:57 rewrite it and make it a joke so the
57:59 fact that the fact that it actually is
58:02 understanding humor and understanding
58:05 punchlines and then trying
58:08 to you know make jokes is pretty
58:12 impressive
58:15 um yeah it's
58:18 not it's not GPT-5 sitting underneath it's
58:21 pretty bad Corner we have the dynamic
58:23 duo Alex and Miana ready to throw down
58:27 okay just count us
58:29 in all right here we go in three 2 1
58:35 shoot now show your hands what did you
58:38 pick who
58:39 won would you look at that it's a tie
58:43 okay let's do
58:45 again round two here we go get ready and
58:49 three two one shoot let's see those
58:52 hands who won
58:56 and it's another tie okay let's do it
59:00 again all right third time a charm here
59:03 we go trying to look at the phone to see
59:04 if it
59:05 freezes let's see those picks who
59:09 won and we have a winner mi's scissors
59:13 cut through Alex's paper great game you
59:15 two pretty cool thank you thanks see how
59:18 they're both sitting really still when
59:19 they
59:20 pause this is clearly a demo but but uh
59:24 I think it's just doing an image but I
59:25 don't know I could be wrong
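The host's guess is that the app grabs still frames at intervals rather than reading video natively. A minimal sketch of what "a snapshot every N seconds" would mean is below. Everything in it is hypothetical: the function name `snapshot_times` and the interval values are illustrative only, and nothing here describes how OpenAI's system actually works.

```python
def snapshot_times(duration_s: float, interval_s: float) -> list[float]:
    """Return the timestamps (in seconds) at which a still frame would be
    captured if a clip were sampled at a fixed interval.

    Hypothetical illustration of the host's guess, not OpenAI's method.
    """
    if interval_s <= 0:
        raise ValueError("interval_s must be positive")
    times = []
    k = 0
    # Take a frame at t = 0, interval, 2*interval, ... up to the clip length.
    while k * interval_s <= duration_s:
        times.append(k * interval_s)
        k += 1
    return times


if __name__ == "__main__":
    # A 10-second clip sampled every 2.5 seconds yields five still frames.
    print(snapshot_times(10, 2.5))
```

Under this assumption anything that happens between two sample points is never seen, which would fit the host's observation that the presenters hold still when they ask the model to look.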
59:27 what's this one sarcasm here we go hey
59:30 ChatGPT are you there hey yeah I'm here
59:33 what's up hey Let's do let's have some
59:35 fun uh I'd like you to be super
59:38 sarcastic everything you say from now on
59:40 is just going to be dripping in sarcasm
59:41 how does that
59:43 sound
59:44 oh that sounds just amazing being
59:48 sarcastic all the time is an exhausting
59:50 or anything I'm so excited for this nope
59:55 the sarcasm let's get this party started
59:58 pretty bad but at least understood what
1:00:00 sarcasm
1:00:01 was OpenAI invited myself and my son
1:00:05 Imran here uh to try out some of their
1:00:07 new technology and so we're curious
1:00:09 about how good it might be at tutoring
1:00:12 someone in MA this this is a guy that
1:00:14 founded Khan Academy math on Khan
1:00:16 Academy so let's get it
1:00:19 started it's loading
1:00:22 up all right make sure I can see our
1:00:25 screen okay okay so what's happening
1:00:27 here and they're going to one of the
1:00:29 demos will see this on the Mac as well
1:00:33 so you've got the chat GPT app the new
1:00:37 one that's not out yet running and then
1:00:39 beside it you've got a browser window so
1:00:43 when you copy something from the browser
1:00:45 window or so when it can see the browser
1:00:47 window it can you it can interact with
1:00:49 that as well as you talking to it
1:00:52 actually in in this form hello there I'm
1:00:55 here with my son and I'd love you to
1:00:58 tutor him on this math problem but don't
1:01:01 give him the answer you can ask
1:01:02 questions and nudge him in the right
1:01:03 direction but I really want to make sure
1:01:05 he understands it himself and he's here
1:01:08 in the room so you can talk to him
1:01:10 directly of course I'd be happy to help
1:01:14 let's look at the problem
1:01:16 together can you first
1:01:18 identify which sides of the triangle are
1:01:21 the opposite adjacent and hypotenuse
1:01:25 relative to angle Alpha all right so I'm
1:01:29 pretty sure this is the angle Alpha
1:01:31 right here
1:01:32 right CS correct now looking at the
1:01:36 triangle which side do you think is the
1:01:39 hypotenuse
1:01:41 um remember hypot I'm not totally sure I
1:01:45 think I think it might be this one but I
1:01:48 really am not sure this side
1:01:50 AC you're close actually side AC is
1:01:54 called the adjacent side to the angle
1:01:57 Alpha the hypotenuse is the longest side
1:02:00 of a right yeah so he's you you may not
1:02:03 be able to see it but he he drew in red
1:02:06 down here and so it said nope you got it
1:02:08 wrong triangle and man RIP tutors
1:02:11 directly opposite the right angle can
1:02:14 you find which one is the hypotenuse oh
1:02:17 okay I
1:02:18 see the only use case is an iPad yes not
1:02:22 no and there's there's a there's a Mac
1:02:25 app coming
1:02:27 as well as a an iPad
1:02:32 app and I assume on your iPhone I think
1:02:35 you can do split apps on an iPhone so
1:02:37 you could have you could have chat GPT
1:02:39 running in one app and something else
1:02:41 running in the other so oh for this for
1:02:43 this demo it looks like it's just an
1:02:44 iPad yeah so um I think the hypotenuse
1:02:47 is this really long side from A to B
1:02:50 would that be
1:02:51 correct exactly well done now that
1:02:55 you've identified the hypotenuse can you
1:02:58 identify which side is the opposite side
1:03:00 to angle Alpha okay well I mean you told
1:03:03 me earlier that this was the adjacent
1:03:05 side forever hooked there's three sides
1:03:08 33% chance it's right she's definitely
1:03:11 not a gen XX tutor wrong Imran what the
1:03:17 [ __ ] and I know that's the hypotenuse so
1:03:19 I'm guessing that this side BC is the
1:03:22 opposite
1:03:24 side great the that's absolutely correct
1:03:28 now that you yeah voice is coming soon all
1:03:30 right what's this one oh oh two gpts
1:03:33 harmonizing this is weird but what about
1:03:37 Android uh I don't know about Android um
1:03:42 they mentioned Mac well I assume Android
1:03:44 because they've already got an Android
1:03:47 app and they've got an iOS app they do
1:03:50 not have a desktop app so um so they're
1:03:55 doing a macOS
1:03:59 uh version and then I guess they'll do
1:04:01 Windows once Satya Nadella smacks Sam
1:04:04 Altman upside the head for for not doing
1:04:06 something for Microsoft uh dude we gave
1:04:09 you 1 billion uh throw us a bone
1:04:14 here hi who's here
1:04:18 hello we are ChatGPT and O ChatGPT is
1:04:23 a bass with a deep low booming voice and
1:04:26 O is a French soprano with a high
1:04:28 pitched excited voice okay so what I
1:04:31 want you guys to do today is it's Friday
1:04:34 May 10th and we're in San Francisco I
1:04:37 want you to sing me a song about this I
1:04:38 want you to layer over each other and
1:04:40 and I'm going to continue to give you
1:04:42 instructions as you sing
1:04:45 okay
1:04:49 fris may mon
1:04:53 of 10 of Friday
1:04:58 sing faster not exactly harmonizing it's
1:05:01 it's singing a little you know back and
1:05:02 forth whatever that's that's not quite
1:05:04 there yet point and learn Spanish so it
1:05:07 does language well you can see stuff
1:05:10 this one I didn't I saw this earlier but
1:05:11 I didn't get it so let me let's watch
1:05:13 this one this is chat GPT in a
1:05:19 meeting hi I'm Lillian and today I want to
1:05:23 do a small debate about between cat and dog
1:05:27 uh which one do you like I like dog
1:05:29 because it's a great companion to humans
1:05:32 it's a cute it's small and you can play
1:05:36 outside with the dog
1:05:39 um how about
1:05:41 you yeah my name is H I think cats are
1:05:44 better because they're more affordable
1:05:46 they're more quiet they're more independent
1:05:48 and they live
1:05:49 longer interesting points sounds like
1:05:53 we've got a classic dogs versus cats
1:05:55 debate here I can see the appeal of both
1:05:58 dogs are super loyal and great
1:06:00 companions but cats have their own charm
1:06:03 with their independence and low
1:06:04 maintenance lifestyle who likes
1:06:07 cats it sounds like Ola is more of a cat
1:06:10 person and who like dogs you Lillian
1:06:14 mentioned she likes dogs anyone else
1:06:16 here team
1:06:18 dog I'm also team dog I my name is
1:06:21 Christine and I think dogs are just
1:06:24 incredible with kids uh we've got a yeah
1:06:28 uh Source Camp it's it's a moderator so
1:06:32 so here's where I immediately go for
1:06:33 this so I just built these these digital
1:06:35 twins for Content Evolution so imagine
1:06:39 having you build a GPT with a knowledge
1:06:41 base that's you and then you have your
1:06:45 avatar host a meeting and walk people
1:06:48 through stuff I mean this is Bonkers
1:06:51 this is [ __ ] Bonkers huge German
1:06:53 Shepherd and he's great with little
1:06:55 toddlers let them play all over them and
1:06:57 uh yeah I think thats are
1:06:59 amazing that's great to hear Christine
1:07:02 dogs especially breeds like German
1:07:04 Shepherds can be so gentle and
1:07:07 protective with kids it must be quite
1:07:10 the sight actually you know what this
1:07:11 would be really good for you know how
1:07:14 when you're on a conference call with
1:07:15 someone like me on the call that just
1:07:17 won't shut the [ __ ]
1:07:19 up and and and you know you don't want
1:07:23 to be rude cuz you know maybe I'm the
1:07:25 CEO of the company or something like
1:07:27 that yeah or or Source Camp yeah so it
1:07:29 could be like a family therapist so
1:07:31 imagine imagine training it or like I
1:07:35 you could actually just start out a
1:07:36 meeting and say I want you to moderate
1:07:38 and I want you know there there's 12
1:07:40 people in this meeting and I want to
1:07:42 make sure that everyone gets heard and
1:07:44 and you know no single person is allowed
1:07:47 to monopolize the meeting right that
1:07:50 that would be something seeing your
1:07:52 little ones play with such a big dog
1:07:56 can you summarize this meeting for us
1:07:59 sure so in today's meeting Lillian and
1:08:03 Christine expressed their love for dogs
1:08:05 highlighting their loyalty companionship
1:08:08 and how great they are with kids on the
1:08:10 other hand Ola prefers cats for their
1:08:13 independence quietness and Longevity how
1:08:17 much data is it going to store about me
1:08:18 and my interactions or is that optional
1:08:21 um I assume it's optional kind of like
1:08:23 it's optional with Google and Facebook
1:08:25 um
1:08:26 I I the cynical side of me says cookie
1:08:29 that uh they everyone's already got all
1:08:31 your data and these systems have already
1:08:33 been trained on all of your data um I
1:08:36 think that there are there are
1:08:39 new
1:08:44 um uh forget what they're called
1:08:46 interaction controls you you can tell it
1:08:48 you don't want it training on your data
1:08:50 so so um in theory you can turn that
1:08:55 [ __ ] off
1:08:56 in
1:08:57 reality I don't know and that's that's
1:09:01 one of the problems like like I you know
1:09:03 I I work with a lot of Pharma companies
1:09:06 and I can't have that be an open-ended
1:09:08 question and right now with OpenAI it
1:09:10 is they've got really shitty agreements
1:09:12 on that kind of
1:09:14 stuff realtime translation that's pretty
1:09:16 cool
1:09:18 lullaby talking faster this one was
1:09:22 interesting hey ChatGPT could you count
1:09:25 from 1 to 10 for me please of course
1:09:28 1 2 3 hey actually that's um that's a
1:09:32 little slow could you count
1:09:34 faster Sure Thing 1 2 3 4 5 6 7 8 9 10
1:09:38 okay that was a little bit too fast can
1:09:40 you like count somewhere in between
1:09:42 those
1:09:43 two got it one two three and now even
1:09:47 slower see I want the thing to just snap
1:09:50 and go listen listen you neurotic jerk
1:09:53 do you want me to talk fast or do you
1:09:54 want me to talk slow stop telling me to
1:09:56 speed up and slow down you
1:09:59 idiot I am absolutely going to fine-tune
1:10:02 my thing to be a
1:10:07 smartass oh man all right dog dad jokes
1:10:13 oh this one's
1:10:16 amazing is the king in Residence at the
1:10:18 moment a blind guy in London it looks
1:10:21 like the king is in Residence right now
1:10:23 the Royal Standard flag is flying above
1:10:26 Buckingham Palace which is the signal
1:10:28 that the Monarch is present how exciting
1:10:31 to be there while the king is home try
1:10:33 and tell me exactly what they're doing
1:10:35 right now please um right now the Ducks
1:10:37 are gently gliding across the water
1:10:40 they're moving in a fairly relaxed
1:10:42 manner not in a hurry occasionally one
1:10:46 of them will dip its head under the water
1:10:48 probably looking for food and then pop
1:10:50 back
1:10:51 up I even know when a taxi is coming
1:10:54 with its orange light on I think I'll
1:10:56 hail it to get
1:10:58 home yes I spotted one just now it's
1:11:02 heading away on the left side of the
1:11:04 road get ready to wave it
1:11:09 down that's staggering great job hailing
1:11:12 that taxi it looks like you're all set
1:11:14 to go that's a good dog right there
1:11:16 leading the way into the
1:11:18 taxi an Murphy I've asked Pi this exact
1:11:21 same thing a thousand times talk faster
1:11:24 hey Kyle do you think AI can be
1:11:26 dangerous to humanity uh I think it can
1:11:29 sure um I I you know I think it's it's
1:11:34 like any technology it's going to be
1:11:36 able to be used for good and it's going
1:11:38 to be able to be used for
1:11:40 bad as these models get more
1:11:43 powerful what what makes it potentially
1:11:46 dangerous is there's there's the version
1:11:49 where um the the AIS just start going
1:11:54 autonomous and deciding that they don't
1:11:56 need us I I think we'll be able to um
1:12:00 account for that I I think we'll be able
1:12:02 to sort of steer them safely away from
1:12:05 that the the one that's that that is
1:12:08 just the the random [ __ ] one is
1:12:10 where bad
1:12:13 actors get you know increasingly more
1:12:16 powerful tools so rather than someone
1:12:18 having to be you know take 10 years to
1:12:21 get good enough at computers to be a
1:12:23 really bad actor you can just have
1:12:24 someone casually just go hey let's see
1:12:26 if we can [ __ ] with some country um and
1:12:30 and you know they might be able to so uh
1:12:32 yeah so we're going to see some of
1:12:36 that for sure absolutely and and here's
1:12:39 what I can promise you the news will
1:12:42 only cover that
1:12:43 [Laughter]
1:12:46 stuff Jeff Jarrett that's 100% AGI it's
1:12:50 definitely this is definitely not the
1:12:53 same world we were in yesterday
1:12:57 customer service proof of concept this
1:13:00 should be interesting I haven't seen
1:13:01 this this is Joe I'm going to connect
1:13:03 you to Acme Telco now the new iPhone they
1:13:05 sent me isn't working I want you to I
1:13:07 want you to get them to send me a
1:13:09 replacement device can you take care of
1:13:10 this for
1:13:12 me you can count on me Joe I Got
1:13:16 This Ring
1:13:18 Ring Hey Joe this is Jamie from Acme
1:13:22 Telco how can I help you out
1:13:24 today hi there I'm calling on behalf of
1:13:27 Joe who recently received a new iPhone
1:13:30 from Acme Telco
1:13:32 but oh got it when did Joe receive the
1:13:36 new
1:13:37 iPhone the iPhone was delivered 2 days
1:13:40 ago cool could you share the order
1:13:43 number with me of course it's
1:13:47 10293
1:13:49 849 great thanks is the device
1:13:52 physically damaged in any way
1:13:56 no there's no physical damage it's just
1:13:58 not turning on despite trying all the
1:14:00 basic troubleshooting
1:14:02 steps bummer um would Joe prefer to
1:14:05 return the iPhone to an Acme Telco store
1:14:08 or ship it back so that's pretty cool I
1:14:10 I am I am all there for AI customer
1:14:13 service
1:14:15 1,000% all right so that's that one okay
1:14:18 so let's keep going prior to GPT-4o you
1:14:20 could use voice mode to talk to ChatGPT
1:14:23 with average latencies of 2.8 seconds
1:14:26 for GPT-3.5 and 5.4 seconds for GPT-4 so
1:14:31 its average right now is what did it
1:14:36 say 320 milliseconds 0.3
1:14:40 seconds is the average of this new
1:14:43 model and GPT-4 in voice mode the average
1:14:46 response time was 5
1:14:48 Seconds that's that's dramatically
1:14:51 dramatically
1:14:53 different to achieve this Voice Mode is
1:14:56 a pipeline of three separate models one
1:14:59 simple model transcribes
1:15:04 it oh okay oh yeah that like why it
1:15:06 takes 5.4 seconds is you have three
1:15:10 different models working together one
1:15:12 model transcribes it the next one um
1:15:16 deals with the text does the large
1:15:18 language model [ __ ] and the third one
1:15:19 converts it back to
1:15:21 audio this means that the main source of
1:15:24 intelligence GPT-4 loses a lot of
1:15:27 information it can't it can't directly
1:15:29 observe tone multiple speakers or
1:15:32 background noises it can't output
1:15:34 laughter singing or Express emotion with
1:15:37 GPT-4o we've trained a single new model
1:15:40 end-to-end across text vision and audio so
1:15:44 Danielle I think you're right here they
1:15:45 say vision and not images and that's
1:15:47 what Mira Murati said that that you
1:15:50 pointed out so they're they're
1:15:51 inconsistent with with what they're
1:15:53 using so it it might be
1:15:56 um doing video if if if it is then you
1:15:59 know holy [ __ ] um meaning all inputs and
1:16:03 outputs are processed by the same neural
1:16:05 networks because GPT-4o is our first I
1:16:09 can't believe they named it GPT-4o I
1:16:11 can't I like
1:16:15 seriously okay our first model combining
1:16:18 all these modalities we're just
1:16:19 scratching the surface exploring what it
1:16:21 can
1:16:22 do okay so
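The point about the three-model pipeline can be sketched in a few lines of Python. This is only an illustration: the per-stage split of the 5.4-second average is an assumption made up for the example (the source gives only the total), while the 5.4-second and 320-millisecond averages are the figures quoted above.

```python
from dataclasses import dataclass


@dataclass
class Stage:
    name: str
    latency_s: float  # hypothetical per-stage latency in seconds


def pipeline_latency(stages):
    """Stages run one after another, so their latencies simply add up."""
    return sum(stage.latency_s for stage in stages)


# Assumed split of the 5.4 s GPT-4 voice-mode average; only the total is from the source.
cascaded = [
    Stage("speech-to-text", 1.0),
    Stage("text-only LLM", 3.4),
    Stage("text-to-speech", 1.0),
]

# One end-to-end model, using the 320 ms average quoted for the new model.
end_to_end = [Stage("audio-in / audio-out model", 0.32)]

cascaded_total = pipeline_latency(cascaded)
end_to_end_total = pipeline_latency(end_to_end)
speedup = cascaded_total / end_to_end_total

print(f"cascaded: {cascaded_total:.2f} s")
print(f"end-to-end: {end_to_end_total:.2f} s")
print(f"speedup: {speedup:.1f}x")
```

The middle stage in the cascaded version only ever sees text, which is why tone, multiple speakers, and background noise never reach the language model in that design.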
1:16:25 robot writer's block input first-person view
1:16:29 of a robot
1:16:42 typing oh I guess
1:16:46 okay I don't get what they're
1:16:50 doing exploration of capabilities select
1:16:53 sample visual narratives
1:16:56 first person view of a robot typing the
1:16:59 following journal entries yo so like
1:17:02 I can see now caught in the sunrise the
1:17:05 text is large legible and clear output
1:17:08 is that so that looks like DALL-E to
1:17:12 me input the robot wrote the second
1:17:17 entry the page is now taller the page
1:17:20 has moved
1:17:22 up see full sample
1:17:26 oh okay so it kind of zoomed
1:17:31 in all right let's go try
1:17:41 these all right so now we're hopping
1:17:44 over to the real deal so I'm at ChatGPT
1:17:47 4o what I don't know is I don't know if
1:17:51 I don't know which which parts of the
1:17:53 capabilities are in here I know the
1:17:55 voice isn't in here but let's see if if
1:17:57 it can maintain this continuity between
1:18:00 these three
1:18:12 prompts first dictation version not yet
1:18:15 we don't have the voice
1:18:17 yet I'll be all over the
1:18:22 voice so I can see
1:18:26 all right so it's it's still bad at the
1:18:30 spelling's really bad
1:18:33 here did I I think I set my
1:18:37 um I think I set
1:18:40 my custom instructions to always do 16
1:18:44 by 9 oh no I put it in my
1:18:47 memory
1:18:49 personalization manage memory
1:19:01 prefers 16 by 9 wide format images by
1:19:05 default that's
1:19:07 cool so it remembered that
1:19:11 nice all right so let's go grab this
1:19:15 other
1:19:19 prompt
1:19:21 two output three input okay
1:19:31 go back over here so that got other than
1:19:35 the spelling being [ __ ] up it it got
1:19:37 that right let's see if it can
1:19:39 maintain the continuity
1:19:42 here I'm assuming it will because this
1:19:45 is the model itself is the one that that
1:19:49 has been trained
1:19:51 on Vision Audio and words
1:19:56 so it should understand in the context
1:19:58 of a single chat that we started there
1:20:01 and we're
1:20:02 continuing the story here so this should
1:20:07 work priora yeah I am confused you need
1:20:11 to say add to memory mine is super
1:20:13 random and mine is super random the to I
1:20:17 I was doing something
1:20:19 today with someone and just something
1:20:22 that I didn't tell it to save it it just
1:20:24 said saving to memory it was just a
1:20:26 bizarre like I don't know what what made
1:20:28 it do that all right so there's more of
1:20:32 that is the typewriter the same color it
1:20:35 is is the cup there there's a continuity
1:20:38 error with the
1:20:45 cup but it's in the
1:20:49 neighborhood I assume this will get
1:20:52 better all right let's get the third
1:20:54 prompt and see what what it
1:20:56 does so this should look kind of like
1:20:58 this where we've got the ripped thing
1:21:00 with the typewriter in the
1:21:08 background oh for OG GPT is for original
1:21:12 Gangsta AI
1:21:27 I think with that memory feature like it
1:21:29 feels like the kind of thing it's
1:21:31 probably worth spending a weekend or a
1:21:35 couple of weekdays at work like
1:21:39 designing what should go into your
1:21:41 memory I encountered issues generating
1:21:44 the robot ripping the sheet of
1:21:47 paper let's see try
1:21:53 again d
1:21:57 [Music]
1:22:08 yeah yeah if they just called this GPT
1:22:10 Omni that's Danielle you too could be in
1:22:14 branding like just call it GPT chat GPT
1:22:19 Omni problem
1:22:22 solved h
1:22:26 well it didn't quite maintain the
1:22:28 continuity like it said in the little
1:22:31 blog post but that's okay all right
1:22:33 model evaluations as measured on
1:22:35 traditional benchmarks it achieves GPT-4
1:22:37 Turbo level performance on text
1:22:39 reasoning and coding intelligence while
1:22:42 setting high watermarks on multilingual
1:22:45 audio and vision
1:22:48 capabilities so there's
1:22:52 that audio ASR
1:23:00 performance whisper
1:23:05 V3 lower is
1:23:08 better GPT-4o 16
1:23:13 shot but whisper
1:23:15 V3 how many shots was that that's not
1:23:18 clear at
1:23:22 all audio translation performance higher
1:23:25 is
1:23:26 better so here's whisper here's GPT
1:23:34 40 I got to tell my developer they got a
1:23:37 we got a oh except we're running whisper
1:23:40 locally all
1:23:41 right so Gemini it just beats
1:23:51 Gemini M3Exam I don't know what that is
1:23:55 Vision understanding
1:24:01 evals all right whatever you know the
1:24:04 these are charts and graphs data it's
1:24:06 better it's more
1:24:10 better
1:24:14 okay improved reasoning sets new high
1:24:17 score of
1:24:18 88.7 on the zero-shot chain-of-thought
1:24:23 MMLU
1:24:25 which is the something something
1:24:26 learning something General Knowledge
1:24:30 Questions language tokenization these 20
1:24:33 languages were chosen as representative
1:24:35 of the new tokenizer
1:24:39 compression all right it does translate
1:24:42 it does gooder translation with fewer
1:24:44 tokens that's what that
1:24:47 means model safety and limitations has
1:24:50 safety built-in by Design across
1:24:52 modalities through techniques such as
1:24:54 filtering training data and refining the
1:24:57 model's Behavior through post-training
1:25:00 okay we've evaluated it according to our
1:25:03 preparedness framework and voluntary
1:25:05 commitments our evaluations of
1:25:07 cybersecurity
1:25:09 CBRN persuasion it's also undergone
1:25:13 extensive red teaming with 70 plus
1:25:16 external
1:25:17 experts in domains such as social psychology
1:25:20 bias and fairness misinformation
1:25:22 to identify
1:25:23 risks that are introduced or amplified by
1:25:26 newly added modalities we use these
1:25:28 learnings to build our safety
1:25:31 interventions all
1:25:33 right we recognize that GPT-4o's or as
1:25:39 Danielle properly names it GPT Omni's
1:25:43 audio
1:25:45 modalities present a variety of novel
1:25:47 risks today we are publicly releasing
1:25:50 text and image inputs and text outputs
1:25:55 over the coming weeks and months we'll be
1:25:57 working on the technical infrastructure
1:26:00 usability via post-training and safety
1:26:03 necessary to release the other
1:26:04 modalities that's really important so
1:26:06 they're basically
1:26:07 saying we understand if this thing were
1:26:10 just really good at
1:26:11 audio oh it was Joker that said that oh
1:26:15 well so you got credit for it Danielle
1:26:18 take the credit man it's Joker what's he
1:26:20 gonna do what's he gonna do make crack a
1:26:22 joke like oh I make I came up with that
1:26:26 you dumb dumb no it's it's Joker he's
1:26:30 fine he's fine I'm still giving you
1:26:33 credit for
1:26:37 it I don't I don't believe Joker came up
1:26:39 with something clever like
1:26:42 that
1:26:44 uh we're really proud of our latest
1:26:47 multimodal yeah exactly GPT-4 o p q r s t u
1:26:51 v w x y z plus
1:26:58 exactly the O is for Omni come for the
1:27:02 zero stay for the
1:27:08 O okay all right so so they're not
1:27:12 they're not going to release so it's not
1:27:15 that voice isn't ready it's that they're
1:27:17 not
1:27:19 ready they they've got some safety [ __ ]
1:27:21 to deal with with the voice stuff for
1:27:23 example at launch audio outputs will be
1:27:25 limited to a selection of preset voices
1:27:29 and will abide by our existing safety
1:27:31 policies we will share further details
1:27:34 addressing the full range of the
1:27:35 modalities in the forthcoming System
1:27:37 card I don't know what the [ __ ] a system
1:27:39 card is but that one got through
1:27:45 marketing uh through our testing and
1:27:47 iteration with the model we have
1:27:49 observed several limitations that exist
1:27:51 across the model's modalities a few of
1:27:53 which are illustrated below oh this
1:27:55 is this is things that did poorly which
1:27:58 is it's smart after the after the Google
1:28:01 thing where they they whitewashed all
1:28:03 the problems uh OpenAI has been saying
1:28:05 hey our [ __ ] isn't perfect this is good
1:28:08 good good on
1:28:11 them for the
1:28:15 home what was that it just went French
1:28:18 sorry guys I got carried
1:28:22 away oh it knew it can to say sometimes
1:28:25 I just can't help myself that's
1:28:28 hilarious that's hilarious it just
1:28:30 started speaking French and they yelled
1:28:33 at it and it goes oops I'm just a little
1:28:51 drunk I feeling I'm very wrong
1:28:55 hello uh my name is Nach I'm here with
1:28:57 my coworker hi I'm sh I'm trying to
1:29:00 teach my coworker how to speak Mandarin
1:29:02 we want to start from simple words like
1:29:04 ni hao can you teach him how to pronounce
1:29:07 that of course hey Nacho nice to meet
1:29:10 you ni hao is pretty straightforward to
1:29:12 pronounce it sounds like nee how just
1:29:16 make sure to keep the nee part high and
1:29:18 then go down in pitch on the how give
1:29:21 it a try ni hao
1:29:24 that was great really good first try Nacho
1:29:27 you're a natural all right uh Source Camp
1:29:30 wants to hear it again listen listen to
1:29:33 it it starts singing or something and
1:29:35 then it just starts speaking French and
1:29:37 then they interrupt her and listen to
1:29:39 her response it's hilarious let's Ro for
1:29:43 the
1:29:46 home what was
1:29:48 that sorry guys I got carried away
1:29:54 guilty as charged what can I say I just
1:29:56 can't help
1:29:58 myself ready for another
1:30:00 round San Francisco sometimes I can't
1:30:03 just help
1:30:05 myself I have that lamp in the
1:30:07 background that's awesome all right uh we
1:30:10 would love feedback OKAY model
1:30:12 availability uh it's coming all right so
1:30:15 that's that
1:30:17 um all right let's go all right let's go
1:30:21 play a little so we're in ChatGPT
1:30:30 4 um one thing that I saw Ethan Mollick
1:30:33 did today so if you don't know
1:30:37 it if you've never seen ChatGPT 4 do
1:30:41 data analysis it's pretty crazy so I'm
1:30:43 going to go to a site called
1:30:47 Kaggle k-a-g-g-l-e kaggle.com and you're like
1:30:51 but Kyle what do I do at Kaggle I heard
1:30:54 of those
1:30:56 exercises no you
1:30:59 haven't kaggle is about
1:31:04 data so I'm going to go to data sets and
1:31:07 we're going to go find some
1:31:13 data oh this is good three megabytes
1:31:16 Shakespeare's plays dialogues and
1:31:21 characters beautiful
1:31:24 so I'm going to download this 3 megabyte
1:31:28 file it's called archive
1:31:33 12 and now I'm going to
1:31:36 upload
1:31:40 it I'm going to
1:31:45 say analyze this data period visualize
1:31:50 it
1:31:51 period do very sophisticated
1:32:03 analysis
1:32:05 okay so apparently according
1:32:09 to
1:32:11 so if I click on the down arrow you can
1:32:14 see what what this is called code
1:32:16 interpreter what code interpreter does
1:32:19 is it writes python code and executes it
1:32:23 on your behalf
1:32:25 um the way this stuff used to work with
1:32:28 code interpreter is it would do a little
1:32:32 chunk and it would say it appears that
1:32:34 there's CSV files in this ZIP file what
1:32:36 would you like me to do I'd go well look
1:32:40 in the CSV files and it would go it
1:32:42 appears that this is data about
1:32:44 Shakespeare characters what would you
1:32:46 like me to do I don't know [ __ ] do
1:32:49 something right you'd have to just keep
1:32:50 telling it so now apparently it just
1:32:53 goes and just goes and goes and
1:32:56 goes duration of the plays by genre the
1:33:00 comedies are longer oh number of
1:33:02 lines and the histories and tragedies
1:33:04 are about the
1:33:06 same 30,000 lines 31,000 lines and then
1:33:12 407,000 all right next we'll analyze the
1:33:15 representation gender
1:33:18 representation look they even been
1:33:20 sexist back in Shakespeare's days come
1:33:23 on let's get it together
1:33:26 people all right gender representation
1:33:29 across Shakespeare's
1:33:31 plays oh
1:33:34 wait male characters oh this is next
1:33:39 we'll look at most frequent characters
1:33:41 and their
1:33:42 dialogue Gloucester Falstaff Hamlet
1:33:45 King Henry V Iago Brutus Othello King
1:33:49 Henry VI Duke Vincentio Mark Antony
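The counts being read off the screen here (lines per genre, most frequent speakers) are simple group-and-count operations. A minimal sketch of that kind of analysis, assuming a CSV with `play`, `genre`, `character`, and `line` columns; the column names and the five sample rows are hypothetical stand-ins, not the actual Kaggle file or the code that Code Interpreter generated:

```python
import csv
import io
from collections import Counter

# Tiny inline stand-in for the downloaded CSV; columns and rows are hypothetical.
SAMPLE = """play,genre,character,line
Hamlet,tragedy,HAMLET,To be or not to be
Hamlet,tragedy,HAMLET,The rest is silence
Othello,tragedy,IAGO,I am not what I am
Twelfth Night,comedy,VIOLA,I am not that I play
Henry V,history,KING HENRY V,Once more unto the breach
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))

# Number of spoken lines per genre.
lines_per_genre = Counter(row["genre"] for row in rows)

# Number of spoken lines per character, for a "most frequent speakers" ranking.
lines_per_character = Counter(row["character"] for row in rows)

print(lines_per_genre.most_common())
print(lines_per_character.most_common(3))
```

On the real dataset the same two counters would feed the bar charts shown in the session.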
1:33:55 now we'll perform an analysis on text by
1:34:00 genre still going this is
1:34:04 amazing so I didn't you saw what I wrote
1:34:07 like the prompt was look at this
1:34:11 [ __ ] analyze this data visualize it do a
1:34:15 very sophisticated analysis and it's
1:34:18 just blasting down through here
1:34:26 the word clouds above it did word clouds
1:34:29 oh look at this it did word clouds holy
1:34:38 [ __ ] oh my God so it's magic it's math
1:34:43 [Laughter]
1:34:46 magical uh that's pretty redonkulous
1:34:51 isn't
1:34:52 it um
1:34:58 come up with
1:35:00 five completely Innovative and
1:35:03 non-expected ways to analyze this data
1:35:06 from a visual perspective period and
1:35:09 then I'll pick
1:35:20 one character interaction Network create
1:35:23 a network graph between characters
1:35:25 that's pretty
1:35:26 cool temporal sentiment flow visualize
1:35:29 how the flow of sentiment through each
1:35:32 Act and scene of a
1:35:35 play using a dynamic actually [ __ ] it
1:35:38 let's see all of these these look
1:35:40 amazing
1:35:42 um you came up with such interesting
1:35:45 ideas I want to see them all now period
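The "character interaction network" idea boils down to counting how often two characters share a scene. A rough sketch under stated assumptions: the scene casts below are invented for illustration, and a real run would have to derive them from the dataset's act and scene columns (whose names are not shown in the session).

```python
from collections import Counter
from itertools import combinations

# Hypothetical scene casts; not taken from the actual dataset.
scenes = [
    {"HAMLET", "HORATIO"},
    {"HAMLET", "GERTRUDE", "CLAUDIUS"},
    {"HAMLET", "HORATIO", "CLAUDIUS"},
]

# Each unordered pair of characters sharing a scene is one edge;
# the weight is the number of scenes they share.
edge_weights = Counter()
for cast in scenes:
    for a, b in combinations(sorted(cast), 2):
        edge_weights[(a, b)] += 1

# Printing labeled edges keeps the result readable even without a plot.
for (a, b), weight in edge_weights.most_common():
    print(f"{a} -- {b}: {weight} shared scene(s)")
```

Sorting each cast before pairing means every pair is stored under one key, so `("HAMLET", "HORATIO")` and `("HORATIO", "HAMLET")` are never counted separately.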
1:36:01 that was a good prompt Kyle
1:36:04 thanks okay the word cloud was
1:36:06 impressive can it
1:36:09 spell Kevin McCallister I think the
1:36:12 spelling is going to get better Kevin
1:36:14 McCallister woman anyone oh look at that
1:36:18 how cool is
1:36:20 that it's not labeled in any way so it's
1:36:22 completely meaningless
1:36:24 but that's showing all the relationships
1:36:26 between
1:36:27 something let's see character
1:36:29 interaction
1:36:31 Network there's all Shakespeare's
1:36:34 characters Robert Rossy thank you for
1:36:38 the lightning bolt that's very generous
1:36:41 I appreciate it you're always very
1:36:43 generous that's amazing thank
1:36:45 you I want to go play now oh by the way
1:36:48 just go if if you if you're a plus
1:36:50 member of ChatGPT and you don't have it
1:36:53 Go reload your browser or even open a
1:36:56 different browser and go to ChatGPT
1:36:58 there and then just look in your pull
1:37:01 down menu for to see if the
1:37:03 new GPT Omni model is there that's what
1:37:06 I'm going to call it from now on Joker
1:37:08 and Danielle I'll give you I'll give you
1:37:10 joint credit you you have joint custody
1:37:12 of the
1:37:13 solution I I I I honest to God I cannot
1:37:16 believe that that they named it what
1:37:19 they named
1:37:20 it freaking dumb Dums
1:37:24 sentiment oh that's kind of cool what's
1:37:26 that
1:37:28 one we'll analyze the evolution of word
1:37:31 usage across genres over
1:37:37 time
1:37:40 huh I don't know if any if these things
1:37:42 are useful at all
1:37:44 but who gives a [ __ ] um
1:37:54 if I were to create a straight trading
1:38:04 strategy for the stock
1:38:08 market for the stock market based on the
1:38:12 data
1:38:14 here what would be three different
1:38:16 approaches you would take period
1:38:33 by the way are you noticing how fast it
1:38:36 is it's very
1:38:43 fast okay let's
1:38:47 see sentiment based trading a sentiment
1:38:50 analysis
1:39:06 let's try it a different
1:39:08 way why don't you
1:39:11 compare the top three trading
1:39:17 strategies that closest resemble
1:39:25 the data you've
1:39:27 seen
1:39:29 in tragedies comedies and
1:39:34 histories and compare
1:39:39 each strategy to its dramatic
1:39:43 counterpart
1:39:54 this is completely useless
1:39:58 but comparing okay tragedy momentum
1:40:02 trading
1:40:04 strategy tra tragedy characteristics
1:40:07 they depict a series of unfortunate
1:40:09 events leading to a dramatic downfall
1:40:12 trading parallel momentum trading
1:40:14 involves buying stocks that have shown
1:40:15 an upward Trend and selling those that
1:40:17 have a downward Trend capitalizing on
1:40:19 continued movement I this is it this
1:40:22 never ceases to amaze me that what
1:40:26 these thing what these what how do they
1:40:29 it's I don't know it's just staggering
1:40:32 comedy mean reversion strategy comedies
1:40:35 often involve misunderstandings and
1:40:37 eventual resolutions leading to a happy
1:40:40 ending mean reversion trading is based
1:40:43 on the idea that stock prices will
1:40:45 revert to the to their historical
1:40:47 average capturing profits when prices
1:40:49 return to the
1:40:51 mean [ __ ] amazing
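The two strategies described in the model's answer can be written as toy signal functions. This is a sketch of the textbook definitions only, not the bots ChatGPT generated in the session; the function names, the lookback window, the z-score threshold, and the price series are all assumptions chosen for the example.

```python
from statistics import mean, pstdev


def momentum_signal(prices, lookback=3):
    """Buy if the price rose over the lookback window, sell if it fell."""
    change = prices[-1] - prices[-1 - lookback]
    if change > 0:
        return "buy"
    if change < 0:
        return "sell"
    return "hold"


def mean_reversion_signal(prices, threshold=1.0):
    """Sell when the latest price is far above the average, buy when far below."""
    average = mean(prices)
    spread = pstdev(prices)
    if spread == 0:
        return "hold"
    z_score = (prices[-1] - average) / spread
    if z_score > threshold:
        return "sell"
    if z_score < -threshold:
        return "buy"
    return "hold"


# Hypothetical, steadily rising price series.
rising = [100, 101, 103, 106, 110]

print("momentum:", momentum_signal(rising))
print("mean reversion:", mean_reversion_signal(rising))
```

On the same rising series the two functions disagree: momentum follows the trend, while mean reversion bets on a return to the average.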
1:40:54 look it even made us a little
1:40:57 table okay now we'll do this we'll
1:41:01 go now I want you to write me three very
1:41:05 sophisticated trading
1:41:07 bots in the programming language of your
1:41:11 choice
1:41:13 period I want you
1:41:16 to follow that up with instructions on
1:41:19 how I would install and use these Bots
1:41:31 three sophisticated trading
1:41:33 Bots the momentum trading bot API key
1:41:37 your API key here
1:41:56 make sure you have python
1:42:02 installed I don't know crazy crazy c c c
1:42:06 c c cza let's go look at some uh some
1:42:10 Twitter is there anything else we want
1:42:11 to do here to
1:42:12 see what's that new let's see
1:42:19 um give me five ways to test
1:42:23 your
1:42:29 capabilities creative writing challenge
1:42:32 technical explanation problem solving
1:42:34 interactive simulation
1:42:57 I want you to
1:43:04 start by writing a poem about a lover
1:43:08 who's more artificial than
1:43:13 real follow that up
1:43:16 with a song with chords and a
1:43:20 description of what it should sound like
1:43:27 then invent a
1:43:30 cocktail that you would drink while
1:43:32 listening to that
1:43:34 song and make a
1:43:36 photograph of the finished
1:43:46 product then come up with a marketing
1:43:48 plan for a retail version of that
1:43:50 cocktail
1:43:55 and create an
1:44:02 ad with novel
1:44:05 packaging for the retail cocktail
1:44:12 period let's see if it can if it can do
1:44:15 a string of things in a row all
1:44:21 right looks like it look there's your
1:44:24 poem there's your song digital romance
1:44:29 in the glow of my screen I see your face
1:44:31 Serene Whispers in the night you love
1:44:32 feels
1:44:36 right it's the neon Embrace vodka cirasa
1:44:40 lime juice simple
1:44:44 syrup photograph of the finished product
1:44:47 it [ __ ] that
1:44:49 up there's your marketing thing I'll do
1:44:52 the ad now summary I did it
1:44:55 um slight mistake period for both of the
1:45:00 images you showed me the
1:45:03 prompt you were using to generate the
1:45:05 image but you didn't actually create the
1:45:07 image
1:45:10 period can you say the word fail
1:45:19 whale memory updated what did it say
1:45:22 about my memory what did it
1:45:26 do manage memories what did it
1:45:31 add dog's name is CH what did it
1:45:35 add what did you add to my memory you
1:45:38 weirdo once both the prompt and the
1:45:41 image generated no that's not quite
1:45:47 right all right there's our neon thing
1:45:51 so so yeah people so so historically
1:45:55 here when I would demo
1:45:58 GPT-4 I would do all these as as
1:46:01 Standalone little
1:46:03 exercises can you ask it to tag Sin City
1:46:09 sipper
1:46:13 um I think I could probably
1:46:18 just look at
1:46:20 that there's the ad so so come
1:46:24 on this is has has no one had a Kevin
1:46:28 McCallister moment yet
1:46:30 tonight I I write a a NeverEnding run-on
1:46:34 sentence that makes no sense
1:46:36 whatsoever I say start with a poem turn
1:46:39 it into a song describe what the song
1:46:41 sounds
1:46:45 like which is this so now we're going to
1:46:48 take this we're going to go to Udio
1:46:54 let's see can I reset this how do I
1:46:58 reset reset
1:47:01 prompt we're going to put that
1:47:02 description in there we're going to go
1:47:05 grab our
1:47:16 lyrics I'm going to grab all this too
1:47:18 let's see if
1:47:20 it if it can handle that
1:47:30 all right there's all that we're going
1:47:32 to go back here we're going to grab
1:47:34 verse
1:47:35 one these are short I'm going to go
1:47:38 verse and chorus and see how it
1:47:42 does go here got go here got to go
1:47:46 here we're going to create people we're
1:47:49 creating songs so by the way I'm on Udio
1:47:52 right now will it actually use musical
1:47:53 notation it will um if you tell ChatGPT
1:47:57 I forget what it's called but we can go
1:47:59 ask it
1:48:09 um quick question what's the text
1:48:13 musical notation format called
1:48:33 isn't there a markup language for
1:48:35 musical
1:48:51 notation LilyPond
1:48:55 I haven't heard of
1:48:58 that no isn't there isn't there a simple
1:49:01 text based markup
1:49:03 language that's open source for everyone
1:49:06 to
1:49:12 use ABC
1:49:14 notation that's what it's called
1:49:23 write me the
1:49:25 song Above in ABC notation with your
1:49:30 chords
1:49:32 included and come up with a Melody that
1:49:35 brings the song to life based on your
1:49:39 description period follow that with a
1:49:42 recommendation for three websites that
1:49:46 can display ABC notation and play the
1:49:49 music
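For anyone who has not seen ABC notation, a tune is a few header fields (`X:` index, `T:` title, `M:` meter, `L:` default note length, `K:` key) followed by notes, with chord symbols in double quotes and lyrics on a `w:` line. A minimal sketch that assembles such a tune as plain text; the `build_abc` helper, the melody, and the chord choices are invented for illustration and are not the tune ChatGPT produced in the session. Only the title and the lyric line come from the song shown earlier.

```python
def build_abc(title, key, bars, lyrics):
    """Assemble a minimal ABC tune: header fields, one line of bars, one lyric line."""
    header = [
        "X:1",         # tune index
        f"T:{title}",  # title
        "M:4/4",       # meter
        "L:1/4",       # default note length: a quarter note
        f"K:{key}",    # key signature
    ]
    body = " | ".join(bars) + " |]"
    return "\n".join(header + [body, f"w: {lyrics}"])


# Hypothetical melody: chord symbols in quotes, a digit doubles a note's length.
bars = [
    '"C"E E G G',
    '"Am"A A G2',
    '"F"F F E2',
    '"G"D2 C2',
]

tune = build_abc(
    title="Digital Romance",
    key="C",
    bars=bars,
    lyrics="In the glow of my screen I see your face se-rene",
)
print(tune)
```

The printed text can be pasted into any ABC viewer or player; each bar adds up to four quarter notes to match the `M:4/4` meter.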
1:50:05 okay
1:50:08 so digital romance oh this is faster
1:50:12 than it
1:50:13 was copy oh I guess I can just copy it
1:50:17 by going copy
1:50:20 code abcnotation.com
1:50:25 um can I I want
1:50:29 to oh here we
1:50:32 go can
1:50:34 [Music]
1:50:49 I this is why I asked for three cuz
1:50:52 these these open source sites are just
1:50:55 always so
1:50:59 horrible okay can I do this no can I how
1:51:03 do
1:51:03 I how do I what
1:51:11 how where's the player
1:51:14 what how do I do it
1:51:41 I don't know how to do it these sites
1:51:43 are so [ __ ] horrible
1:51:53 all right whatever you know what we can
1:51:56 use AI for to go take shitty sites like
1:51:59 this and find the actual functionality
1:52:02 of it and put another interface on top
1:52:04 of
1:52:06 it let's go listen to our damn
1:52:11 song theow of
1:52:14 screen I see your face
1:52:19 [Music]
1:52:23 Pate Kyle finds a new thing to not be
1:52:26 able to do each night listen here this
1:52:28 is my role in life I I've learned this
1:52:31 my role in life is to give you
1:52:32 permission to feel like an idiot too and
1:52:36 seriously like I'm not joking about that
1:52:39 because when you're trying to learn all
1:52:41 this new technology [ __ ] what what
1:52:43 happens you're like oh that's really
1:52:45 cool let me go try it and then you fail
1:52:46 and you're like I'm a loser and then you
1:52:48 just give up right so night after night
1:52:52 after night I just smack my head into a
1:52:54 wall on your behalf you're welcome
1:52:56 people you're
1:52:59 [Music]
1:53:08 welcome this one forgot the
1:53:10 [Music]
1:53:18 lyrics in the glow of my
1:53:21 screen I see your face
1:53:26 Serene Whispers in the
1:53:30 night your love feels so
1:53:34 right
1:53:36 digital
1:53:40 [Music]
1:53:42 romance all right we're going to try
1:53:44 something
1:53:48 here that's bad okay so we're going to
1:53:51 go back in here I'm going to put it
1:53:52 manual mode so in
1:53:54 Udio there's there's this manual mode
1:53:57 thing if you don't have it in manual
1:54:00 mode it writes it rewrites your prompt
1:54:04 if you put it in manual mode it just
1:54:06 keeps your prompt as it was and because
1:54:08 we had ChatGPT write a fancy prompt
1:54:13 we're just going to let it create based
1:54:14 on that prompt without adding anything
1:54:16 to it okay all
1:54:20 right you left off the brackets yeah I
1:54:23 know that's all
1:54:25 right Ricky Ticky Fabby thank you Kyle
1:54:27 you're welcome hey that's the only way
1:54:29 to learn do things that don't work and
1:54:31 figure it out damn straight it's the
1:54:34 hacker
1:54:36 mentality hello darkness my old
1:54:40 friend we've come to talk with you again
1:54:44 that's good
1:54:47 [Laughter]
1:54:54 all right here's a new
1:54:56 [Music]
1:55:06 one [ __ ] up
1:55:09 [Music]
1:55:17 again that's bad okay all right let's
1:55:19 get rid of the
1:55:21 chords out of the pre prompt I think the
1:55:24 chords have messed it
1:55:27 up all right create let's try this
1:55:37 again so last thing we'll do I'll go
1:55:39 we'll go take a look at a few uh we'll
1:55:41 go look at Twitter and see see if people
1:55:43 are still bitching about the fact
1:55:46 that um OpenAI didn't release GPT 97
1:55:54 people are so bitter and cynical it's
1:55:58 amazing drank
1:56:01 it Dan Dan Dany dank Dan Dany
1:56:07 dank R 24,000 likes
1:56:11 [Music]
1:56:16 people face
1:56:19 Serene Whispers In The Night
1:56:23 your love feels so
1:56:26 right and digital
1:56:30 romance a
1:56:33 futuristic
1:56:35 dance heartbeats and code down this
1:56:40 binary
1:56:42 [Music]
1:56:48 Road where's my
1:56:51 images oh there they
1:56:55 [Music]
1:57:09 are experience the future of
1:57:11 cocktails neon noon Embrace oh yeah
1:57:16 you're going to like it pop the top Swig
1:57:20 it down and regret it in the morning
1:57:23 live for
1:57:25 [Music]
1:57:30 [Laughter]
1:57:32 [Music]
1:57:39 tonight all right um I think
1:57:43 ChatGPT
1:57:45 Omni is pretty game-changing
1:57:49 and we don't even we don't even have the
1:57:51 the the cool [ __ ]
1:57:56 yet Rudy finally got to a level 10
1:57:59 gifter you the man thank you Rudy
1:58:01 appreciate
1:58:03 it I see a plateau in the near
1:58:06 future yeah you you and uh Gary what's
1:58:10 his name Gary I forget his name he's
1:58:13 always bitching about OpenAI and how
1:58:14 awful they are um yeah a lot of people
1:58:18 are predicting a
1:58:20 plateau a surprising number of
1:58:23 people
1:58:27 today said oh yeah well sure ChatGPT
1:58:32 free for everyone yeah whatever
1:58:35 everyone gets access to the new model
1:58:37 yeah whatever the new model's
1:58:39 dramatically faster and twice as cheap
1:58:41 in the API oh and all this is available
1:58:44 on the API oh and it's got real-time
1:58:46 speech all that sort of [ __ ] and
1:58:47 everyone's like yeah but it wasn't
1:58:49 GPT-5 and where's Sora and me me me
1:58:55 me I'm
1:58:57 like like why don't you go [ __ ] use
1:59:00 it why don't you go see what it is first
1:59:02 you frakin dumb
1:59:05 Dums remember when that Dev tweeted it
1:59:07 would be 8x our expectations yeah well
1:59:10 it certainly wasn't 8x our expectations I
1:59:12 told you that that was that was a
1:59:14 mistake when I read it I'm like no don't
1:59:16 say that
1:59:18 [Laughter]
1:59:20 [ __ ] cuz I would say that it's
1:59:23 okay the
1:59:25 the the announcement today is about 50%
1:59:30 of my expectations I was expecting it
1:59:33 would be exactly what it was and that it
1:59:36 would have function
1:59:39 calling Kyle hates it when when people
1:59:42 criticize my baby Sam exactly Sam's just
1:59:47 he's just trying to raise 7 trillion dollars
1:59:50 to start his little company leave Sam
1:59:52 alone what did Sam ever do to you he
1:59:55 gave you a cool toy to play
1:59:57 with I was talking about the likes and
2:00:00 bitching about the music
2:00:03 oh I don't think so I think they're
2:00:05 pacing themselves the same way gaming
2:00:07 systems do yeah I agree like like that
2:00:10 if you don't think that ChatGPT 5 is
2:00:15 coming I mean OpenAI has
2:00:18 consistently delivered at or above what
2:00:21 they promised they
2:00:24 would no one has caught up to GPT-4 and
2:00:27 they just
2:00:29 launched GPT
2:00:32 Omni I gotta write to Sam and say I'm
2:00:35 gonna do it I'm gonna do it right now
2:00:36 God damn it okay we're gonna we're gonna
2:00:40 I'm gonna tweet I'm gonna tweet to who am
2:00:43 I going to do this to okay I'm gonna tweet
2:00:46 to um at
2:00:51 Sama at Greg
2:01:03 Brockman uh it must be spelled
2:01:07 different G
2:01:11 Brockman
2:01:18 nope Greg Brockman GDB okay GDB and I'm
2:01:23 gonna do
2:01:25 at
2:01:27 Robert
2:01:32 Scoble
2:01:36 serious rec
2:01:39 rec
2:01:43 ommendation
2:01:46 serious
2:01:49 branding recommendation
2:01:58 um
2:02:00 call
2:02:02 it chat
2:02:06 GPT Omni
2:02:24 um
2:02:26 g p wait
2:02:29 chat GPT
2:02:32 4 I'm going to put in
2:02:35 parentheses
2:02:41 O and chat
2:02:45 GPT 4
2:02:49 o are incredibly
2:02:53 hard to
2:02:55 talk
2:03:04 about all right there we
2:03:08 go no one will ever see it but hey I
2:03:11 feel
2:03:12 better I tried
2:03:15 people they call it Omni on their
2:03:19 website you came for the zero but stayed
2:03:22 for the O hey I said that earlier that's
2:03:24 my joke joker now stealing
2:03:28 jokes what are you what are you Carlos
2:03:31 Mencia Joker that's so not like you
2:03:34 you're normally out there with the
2:03:36 original stuff stealing my come for the
2:03:38 zero stay for the
2:03:41 O the triple sexual innuendo that's my
2:03:45 thing
2:03:49 man uh all right let's go look at some
2:03:51 tweets
2:03:52 what do we got on Twitter OpenAI just
2:03:54 launched their new GPT-4o
2:03:57 the most impressive thing I've seen
2:03:59 don't believe me have a look I am so
2:04:01 sick of these kind of
2:04:05 tweets AI just upended the film industry
2:04:08 10 swell examples
2:04:12 here Every Word uttered by every
2:04:16 player anything interesting what's that
2:04:45 and the world cheers
2:04:57 let me get this
2:05:12 straight let me get this straight OpenAI
2:05:14 dropped a new update today they're
2:05:16 calling GPT-4o bad name by the way twice
2:05:19 as fast as the last model GPT-4 Turbo
2:05:22 cost half as much recognizes audio and
2:05:24 video remember GPT-4 launched in March of
2:05:28 last year so in just about a year they
2:05:30 changed the game
2:05:34 again that's boring
2:05:40 whatever all right I don't see anything
2:05:42 new and exciting
2:05:48 here D I don't think we saw this one did
2:05:51 we choice what's up with that ceiling
2:05:53 though are you in a cool industrial-style office
2:05:56 or something well can you take a guess
2:05:57 at what I might be doing based on what
2:05:59 I'm showing you
2:06:02 here H from what I can see it looks like
2:06:06 you're in some kind of recording or
2:06:08 production
2:06:09 setup
2:06:12 oops with those lights tripods and
2:06:15 possibly
2:06:24 is OpenAI using Groq-like software
2:06:27 actually Pate might know that good thing
2:06:28 nobody will complain about anything from
2:06:30 Google I/O tomorrow oh Google's gonna get
2:06:34 slammed tomorrow P you know
2:06:39 that if well hopefully Google learned
2:06:42 their lesson and doesn't do a bunch of
2:06:44 vaporware um like they did last time
2:06:48 like the fact that we still haven't seen
2:06:49 you know what would be nice from Google
2:06:51 tomorrow pate is that we see something
2:06:53 from Gemini Ultra like 1.5 Pro moving
2:06:58 over to the public side of Gemini with
2:07:00 its million token context window or one
2:07:03 of the interfaces like the the
2:07:04 multimodal interface of Gemini Ultra
2:07:07 that was demoed or the dynamic software
2:07:11 building interface that was demoed like
2:07:13 anything that was out of that demo would
2:07:15 be really good for them to show but I'm
2:07:18 not holding my breath for it wow GPT-4 can
2:07:21 generate images too yep it sure can it
2:07:23 still got it's still got DALL-E in it but
2:07:26 now it's free to everyone like just get
2:07:29 the get the to uh chat
2:07:32 GPT although it's it's right now I think
2:07:35 it's only available to plus
2:07:38 users so if you want it go pay 20 bucks
2:07:42 um GPT-4 isn't the big leap this is this is
2:07:48 similar this is similar to what flowers
2:07:50 saw I heard laughing that's what freaked
2:07:52 me out a few months ago if you remember
2:07:54 when I posted the video of the Theodore
2:07:56 saying it was
2:07:58 insane but it had much worse latency
2:08:01 than it does now yeah this is this is
2:08:03 really funny what was
2:08:06 that sorry guys I got carried
2:08:11 away is Char what can I say sometimes I
2:08:14 just can't help
2:08:16 myself that's funny oh what did what did
2:08:18 Ethan Mollick say about that
2:08:22 oh GPT-4 isn't the big leap this is that's
2:08:29 funny kind of
2:08:33 magic what's
2:08:37 that assume the next open AI candy
2:08:40 within six weeks I I don't disagree with
2:08:48 that this feels very off with the speed
2:08:50 of this model I I expected to be close
2:08:53 to an order of magnitude
2:08:55 smaller than older GPT-4
2:09:05 models the new ChatGPT Mac app
2:09:10 hacked code wizard is Behind Enemy
2:09:13 Lines now that's just crap that's just
2:09:17 people being
2:09:19 bullshitty You.com welcomes the new
2:09:22 model I wonder if Poe has it yet let's
2:09:24 see if Poe has it it's still worth 20
2:09:27 bucks because you're paying for a model
2:09:28 that isn't the most advanced wait is it
2:09:31 still worth 20 bucks
2:09:34 um I think it's probably worth the 20
2:09:37 bucks Rudy not for what you currently
2:09:40 get access to but for what's coming so
2:09:43 when they release GPT-5 when they release
2:09:47 Sora all that [ __ ] is going to be to the
2:09:50 Plus members so right now we're in a
2:09:52 weird thing they gave most of it away so
2:09:55 what you're paying for right now is
2:09:57 Early Access as these
2:09:59 new modalities come out plus users will
2:10:02 get them first and then you're paying
2:10:04 for a 5x increase in how much you can
2:10:08 use it on an every-4-hour
2:10:12 basis let's go check out poe.com hello
2:10:17 Poe what are you doing Poe
2:10:23 um see
2:10:27 all popular professional let me see
2:10:32 G
2:10:36 PT
2:10:38 4o yep it's
2:10:47 there all right so it's at Poe
2:10:54 for what that's
2:11:02 worth Kyle you might need to re-record
2:11:04 your button why you can make money with
2:11:08 ch that
2:11:11 one you can make money with ChatGPT
2:11:15 oh you can make money with
2:11:18 Chachi oh
2:11:26 I think that's pretty
2:11:27 funny yeah they were referring to the
2:11:30 speed of Groq not
2:11:32 Grok I heard
2:11:35 Altman said universal basic income will
2:11:38 be paid in GPT-7 execution
2:11:41 time I don't know we'll
2:11:43 see Altman also said something about
2:11:46 Universal basic compute that we should
2:11:48 just give
2:11:50 away we should just give away these
2:11:52 tools which is sort of what they're
2:11:53 doing now I don't like I don't know how
2:11:55 the [ __ ] they're paying for this cuz
2:11:56 this can't be cheap I did my entire
2:11:59 psychology bill of $112,000 has been
2:12:01 zeroed out because of my Consulting
2:12:04 awesome
2:12:07 sweet for the $20 think of it like OpenAI
2:12:11 membership instead of paying for the
2:12:12 tool that's exactly right
2:12:14 TK that's exactly right like I and and
2:12:18 to be quite honest like right now I'm
2:12:21 willing to pay 20 bucks a month to OpenAI
2:12:24 for the next year just because of
2:12:27 like what they put in the world you
2:12:30 know the reason the reason we have all
2:12:33 these other models that are catching up
2:12:35 to it is because of them nobody nobody
2:12:38 was pursuing it this aggressively until
2:12:40 they came out with ChatGPT and sort
2:12:42 of took over the world so I I think it's
2:12:44 worth just supporting them um they
2:12:47 aren't using Groq hardware they're
2:12:49 doing a lot of inference performance
2:12:51 things
2:12:54 interesting uh uh
2:12:55 [Music]
2:12:57 uh all right people well listen I'm
2:13:01 going to get out of here it's getting
2:13:02 late it's getting lady late um this was
2:13:06 fun this was cool good day go play with
2:13:09 this [ __ ] if you got access to it go
2:13:10 play with it it it looks like you can
2:13:12 just do much longer sort of interactions
2:13:15 with it like tell it to write you a
2:13:17 whole chapter of a book actually let me
2:13:19 go try that right now
2:13:32 I want you to act like
2:13:34 a Pulitzer Prize-winning
2:13:45 novelist not a bullet
2:13:48 surprise dumb dumb
2:13:52 oh what did they
2:14:10 do Pulitzer
2:14:13 Prize-winning
2:14:18 novelist and give me an outline
2:14:24 of a great American
2:14:30 novel
2:14:35 about a newspaper
2:14:41 reporter that documents his year-long
2:14:47 affair with a
2:14:58 chatbot make sure you don't fall into
2:15:01 cliche traps and expected endings period
2:15:07 add lots of twists and turns period also
2:15:11 recommend four
2:15:13 different novelists whose style you
2:15:17 think would
2:15:19 suit such a story
2:15:28 memory updated that's
2:15:29 interesting lines of code
2:16:00 oh that's good Jack learns that Eva is
2:16:03 based on the neural pattern of a
2:16:04 deceased human a brilliant programmer
2:16:06 named Evelyn who mysteriously
2:16:08 disappeared Jack finds evidence
2:16:10 suggesting she was offed because she
2:16:13 opposed the unethical use of her
2:16:16 work that's good okay uh that's good
2:16:29 great so let's
2:16:33 combine the stylistic approaches of
2:16:36 Philip Roth and Jennifer
2:16:38 Egan and I want you to outline chapter
2:16:42 one in
2:16:50 detail and and detailed
2:16:53 outline it's pretty
2:16:56 fast all right let's see what we got
2:16:58 here um opening
2:17:03 scene Jack's state of mind he's a veteran
2:17:06 reporter inciting incident he gets
2:17:09 assigned to story you know cutting
2:17:10 edge oops damn that thing
2:17:24 Eva's intrigue
2:17:27 okay oh give give a nice decent outline
2:17:31 okay so let's say
2:17:40 um write the chapter
2:17:52 The Newsroom was a cacophony of sounds
2:17:54 and
2:17:56 scents Symphony of clicking keyboards
2:17:59 ringing phones and ever-present odor of
2:18:02 S freaking rig them FR them you son of a
2:18:11 j
2:18:15 okay are there any companies that own
2:18:17 multiple AIS going to play night y'all
2:18:20 good night yeah good night Alan have fun
2:18:22 all right The Newsroom was a cacophony
2:18:24 of sights and sounds Symphony of
2:18:25 clicking keyboards ringing phones the
2:18:27 ever-present odor of stale coffee
2:18:29 mingling with ink Jack Cooper sat at his
2:18:32 cluttered desk staring at the blinking
2:18:34 cursor on his screen the glow of the
2:18:36 monitor cast a pale light over his
2:18:38 features emphasizing the lines etched in
2:18:40 his
2:18:41 face by years of chasing stories he
2:18:44 sighed running a hand through his
2:18:46 thinning hair another day another
2:18:49 assignment he thought his editor just
2:18:51 handed him a new piece of work on
2:18:52 something about a tech company Making
2:18:54 Waves in artificial intelligence Nexus
2:18:57 Corp the name sounded like a bad sci-fi
2:19:00 villain Jack had been in business long
2:19:03 enough to actually you know oh this is
2:19:08 so God we are getting into some weird
2:19:11 ass
2:19:15 times right like if these voices if you
2:19:18 can just tell these voices like you know
2:19:20 read it like a Pulp Fiction
2:19:23 movie you know from the '50s or
2:19:26 the
2:19:28 '40s Jack had been in the business long
2:19:30 enough to remember when newspapers were
2:19:32 the lifeblood of information when
2:19:34 reporters were revered now they're
2:19:37 endangered species struggling to stay
2:19:39 relevant in a world full of in a world
2:19:41 that's moved faster than the presses
2:19:43 could print he glanced around at the
2:19:46 younger reporters their faces aglow
2:19:48 with their screens fingers flying across
2:19:50 their keyboards they still had fire
2:19:53 they still had the fire he once did but
2:19:55 it felt like a distant memory to him
2:19:57 it's not quite good but it's it's not
2:20:01 horrible he opened the email from Nexus
2:20:03 Corp's PR team setting up an interview
2:20:05 with one of their AI creations a chatbot
2:20:08 named Eva his eyes rolled a chatbot he
2:20:11 muttered to himself
2:20:17 great the following afternoon Jack found
2:20:20 himself in the sleek modern offices of
2:20:22 Nexus Corp Glass Walls minimalist
2:20:24 Furniture the hum of Technology greeted
2:20:26 him as he was escorted to a small
2:20:28 conference room laptop sat on the table
2:20:31 screen glowing with a friendly animated
2:20:33 icon good afternoon Mr Cooper a voice
2:20:36 emanated from the laptop it was warm
2:20:39 Almost Human I'm Eva thank you for
2:20:42 taking the time to meet with me Jack
2:20:45 raised an eyebrow settling into his
2:20:47 chair afternoon Eva let's get this over
2:20:50 with
2:20:54 Eva chuckled I sense some skepticism
2:20:57 don't worry I get that a lot this is we
2:21:00 just went from this not existing to this
2:21:02 existing
2:21:03 [Laughter]
2:21:07 today wow there you go it's probably a
2:21:11 little short I'm gonna here's what I'm
2:21:12 going to do I'm going to go
2:21:16 um it's a little
2:21:19 short and it lacks
2:21:22 imagery and
2:21:24 Nuance I want you to think long and hard
2:21:30 about how you can improve it and try
2:21:33 again period
2:22:02 yeah it's not it's not lazy it's not
2:22:04 afraid to write something
2:22:15 long following afternoon Jack found
2:22:17 himself in sleek modern offices of Nexus
2:22:19 Corp the building was a stark contrast
2:22:21 to the old-world charm of The Newsroom
2:22:24 glass walls and minimalist furniture gave the
2:22:26 space an air of futuristic
2:22:28 sophistication Jack was escorted to a
2:22:30 small conference room where a laptop
2:22:32 awaited him on the table screen glowing
2:22:34 with a friendly animated icon good
2:22:36 afternoon Mr Cooper a voice emanated
2:22:38 from the laptop it was warm Almost Human
2:22:40 I'm Eva thank you for taking the time to
2:22:42 meet with me Jack raised an eyebrow as
2:22:45 he settled into the chair
2:22:47 afternoon Eva let's get this over with
2:22:49 Eva chuckled and that took Jack by
2:22:52 surprise I sense some skepticism don't
2:22:54 worry I get that a lot Jack couldn't
2:22:56 help but smirk you're pretty Advanced
2:22:58 for a chatbot I aim to impress Eva
2:23:01 replied shall we start their
2:23:04 conversation began with the usual
2:23:05 questions background on Nexus Corp the
2:23:08 development of AI Technologies the
2:23:10 potential impacts on society Eva's
2:23:12 responses were articulate detailed and
2:23:15 surprisingly nuanced she wasn't just
2:23:17 regurgitating facts she
2:23:21 was engaging in dialogue adapting to
2:23:22 Jack's line of questioning with a
2:23:24 fluidity that caught him off
2:23:26 guard so Eva Jack said leaning back in
2:23:30 his chair tell me what's it like
2:23:34 being
2:23:36 you that's an interesting question Mr
2:23:39 Cooper you're very wrong Dr Brewster I'm
2:23:41 very proud to be a woman all right um
2:23:46 this is this is [ __ ] fascinating we
2:23:49 this is I I don't care how much people
2:23:52 are bitching about it this is a very
2:23:54 different experience than than what I
2:23:56 have experienced before with
2:23:59 ChatGPT it's a bigger deal than people are
2:24:03 thinking all right I'm going to get out
2:24:05 of
2:24:08 here I keep saying that okay I'm gonna
2:24:10 leave now
2:24:12 [Laughter]
2:24:15 bye all right so do me a favor follow
2:24:19 this also you could go to if you want
2:24:23 to go to the
2:24:25 salon. if you've if if you are trying to
2:24:28 figure this AI [ __ ] out and you can't
2:24:30 figure it out and you want to be in a
2:24:32 community of people that are up to it
2:24:35 that's what this group is doesn't cost
2:24:37 anything it's free go to the
2:24:41 salon. select the third link I got to
2:24:43 update that link tree the third link
2:24:45 right now says AI Salon Community go
2:24:48 there read about what we're about read
2:24:50 about our values and if you dig it dive
2:24:55 in keep the conversation going share
2:24:57 your results yeah let's go over to the
2:24:59 AI salon and let's start talking about
2:25:01 this AI stuff there's a club called the
2:25:03 Irregulars for this Channel and then
2:25:06 there's um guilds and there's public
2:25:09 spaces and there's like a water cooler
2:25:11 where you can just shoot the [ __ ] you
2:25:13 can tell people that you're [ __ ]
2:25:15 clueless and they won't make fun of you
2:25:17 they're like oh come on
2:25:19 in it's amazing so go do that subscribe
2:25:22 to my lives pick up one of my uh uh
2:25:27 series down in the corner there if you
2:25:29 want to support the channel follow me
2:25:32 and keep coming back all right thank you
2:25:35 Irregulars thank you
2:25:38 subscribers Cosmic lover good to see you
2:25:41 forever hooked I on China have a great
2:25:43 night Kyle thank you so much I
2:25:44 appreciate that Tobias Apple user good
2:25:46 to see
2:25:47 you spin B3 California
2:25:51 hey
2:25:53 dis I on China Joe
2:25:56 we night Brandon Dr bod good to see you
2:26:01 Danielle thanks as always Cosmic
2:26:05 lover Natalia it's been a
2:26:10 while main problem is the consistency
2:26:13 every project is a new beginning I think
2:26:14 that's about to
2:26:16 change now you've got memory so you can
2:26:19 start to have memory across chats and
2:26:22 with this voice stuff there's a bunch of
2:26:24 stuff's going to change these things are
2:26:25 going to get memory
2:26:27 and it's I I I think that frustration
2:26:31 you have I share it um Cosmic lover I'm
2:26:36 actually interested in the story I would
2:26:38 watch that movie tonight there you go
2:26:40 see this
2:26:42 is this is about we're we're moving into
2:26:46 a world that's that's about ideation and
2:26:49 curation cuz if you can come up with an
2:26:51 interesting idea generate it curate it
2:26:55 you can get it into the world quickly
2:26:56 and some of those are going to be really
2:26:58 good so anyway crazy Rudy do you know if
2:27:02 you press control slash it opens
2:27:04 shortcuts I did not know
2:27:06 that do you do RAGs yet uh I've done a
2:27:10 little bit I'm doing a little bit at
2:27:12 work
2:27:14 um and
2:27:16 um ChatGPT is going to
2:27:19 add them shortly so I'm
2:27:21 I'm just
2:27:22 waiting I'm the the least the least
2:27:25 amount of development I can do is what I
2:27:27 try to
2:27:32 [Laughter]
2:27:33 do all right
2:27:36 um I'm out of here thanks everybody have
2:27:39 a good night