Google DeepMind's Logan Kilpatrick: Why the Model Eats the Harness
The entire startup ecosystem is racing to build agent harnesses. Logan Kilpatrick, who leads Google AI Studio and the Gemini API, argues that scramble has a roughly 12-month shelf life. Models will absorb the scaffolding and run it natively, so the edge moves elsewhere. Google's own bet runs in parallel: a single agent harness, born from the Windsurf team and now called Antigravity, has become the connective tissue across search, the Gemini app, Cloud, and AI Studio — the role Gemini-the-model used to play. Logan makes the case that coding already feels like narrow superintelligence, and that "jagged" vertical superintelligence (in math, finance, and science) will arrive well before AGI. He argues Google's real goal is maximizing outcomes for users, not eyeball time. He unpacks Omni, the single model built to replace multiple separate systems Google once trained for text, audio, music, image, and video. His throughline: AI is an accelerant for human ambition, not a substitute for it. Hosted by Sonya Huang, Sequoia Capital
- Published
- Published Jun 11, 2026
- Uploaded
- Uploaded Jun 11, 2026
- File type
- Podcast
- Queried
- 00
Full transcript
Showing the full transcript for this episode.
AI-generated transcript with timestamped sections.
[00:00] - So we could edit this set so it looks like we're-- - We should, yes. - Okay? - Yeah, I want this where we were talking off camera, like we should do that for the intro, because I think it just like makes all this stuff more capable. I've seen these examples of like, [00:12] such subtle nuance that like make me appreciate that it's like the world understanding playing out I was I was giving a talk. [00:20] and was on stage with my friend Tulsi, who leads the model team. I had mentioned to someone in the crowd to edit the video, and they literally took the picture, edited it with Omni in real time, and this dog came on the stage. In the edited version, the other guests sort of look down, see the dog. They chuckle a little bit. This is while I'm opining about whatever AI nonsense. Maybe they're laughing at your jokes. Yeah, it was not my jokes. They laugh at the dog coming up. It jumps onto my lap. I sort of acknowledge the dog. I keep talking. [00:50] and just like there's like so much subtle subtlety in getting that right and the model crushed it and it's just it's very interesting and like still trying to like absorb and digest like what that means for you know the way we make content and all these other things that's so interesting [01:20] . [01:24] - I'm delighted to have Logan on the show. Logan runs Google AI Studio and the Gemini API. You spend a lot of your time thinking and building for the next generation of builders.
[01:33] So I'm excited to talk to you about everything from agentic AI to AI coding, world models and more today, and right off the heels of Google I.O. So what better timing? Yeah, I'm super excited. Thank you for having me. [01:45] Wonderful. Let's start with agentic AI. So Sundar opens I/O by calling this the agentic Gemini era. [01:53] What does agentic AI mean for Google? Yeah. Yeah. [01:56] It's a good question. I think, and we were... [01:59] we sort of, if you followed closely, we did sort of mention some of these things back with like Gemini 2.0, which I think was like a little bit early. And so I think... [02:06] this era, this like Gemini 3.5 era feels like it's actually becoming true now. And we're in the era of agentic coding or agentic products and everything agents, as far as Gemini goes. [02:18] I think for us, this agentic layer, and I think we announced this actually at IO, sort of being powered by the anti-gravity agent harness, is this like additional through line for Google that sort of connects all of our products that they're sort of like based on now. And so historically, like prior to Gemini, there actually like wasn't a through line for the, you know, probably... [02:39] sub hundred number of Google products that we have, the 50 Google products we have, [02:43] There wasn't a through line. We had Gemini. It became this through line. Everything is now sort of using Gemini in some way. [02:49] That's now becoming true for anti-gravity as sort of all of the products rebase products [02:54] to become sort of like agentic native products and like actually taking action on behalf of users and helping them get things done. You see this like new through line emerging, which I think is actually really, really interesting. And sorry, help me with anti-gravity is the...
[03:08] The IDE, right? Or the non-IDE? Yeah, anti-gravity is a lot of things, which I think is sort of, again, is an opportunity for us. You have sort of a core IDE. You have sort of like the agent first experience if you want it on the web. You have a CLI. You have an SDK. So I actually think, and I don't know how much we've framed it this way, but it really is an ecosystem of stuff that we built. And it's designed to sort of like meet developers wherever they are. [03:38] for. And then the most interesting bit is like, it's not just the ecosystem of anti-gravity stuff, it's also powering, like literally it's the same harness is actually powering all the other Google products. So anti-gravity will be powering a bunch of agent stuff in search in the Gemini app across like cloud and in AI studio, which is really exciting. I see. So it used to be the Gemini API. So like the language model was the through line in terms of how AI gets, [04:03] baked into every Google product. And now it's not only the API, it's the coding harness. [04:09] Exactly. That's being used in each of these products and therefore it's the coding agent itself that's driving more agentic properties. [04:16] Yeah. Fair description. Fair description. I think more generically, too, it's just like it is the agent harness. I think like coding as sort of like a specialized use case of the agent harness, I think, is obviously powerful. But it is like coding has proved to be the general purpose agent harness in addition to also working really well for coding. [04:33] Are Agent Harness Encoding... [04:35] Harness synonymous or not? [04:37] There's definitely nuance. I think there's like optimization that you can squeeze out of like specializing. And actually you see this where like the, you know, technically the agent harness that gets used for the way that AI studio uses it is like a little bit specialized for, you know, the vibe coding use case. And the way that the Gemini app is using the agent harness is a little bit specialized for the sort of consumer always on 24 by seven agent. So I think you have that base harness that like probably has like 80% of the same stuff. And then you specialize for coding or for whatever the use case is.
[05:07] How do you think about the cannibalization of the existing business, especially now that you are going much more aggressively into agentic properties? Because I could see, for example, if all you're doing is search or summarization, there's not as much of a cannibalization fear. Whereas if you're actually... [05:27] going through my emails, replying to them for me. Am I even going through my email anymore? And so I could imagine that there's actually just... [05:34] fewer human eyeball hours on your products as a result of having more agentic capabilities. Is that fair or how do you think about the cannibalization? Yeah, it's interesting. [05:45] One sort of observation I have is that like at the beginning, and I think Sundar has done a great job of sort of talking through this, is at the beginning of the sort of current AI era, like everyone assumed that... [05:58] AI being able to answer questions for you was going to be like, [06:01] negative sum for search. And actually what's ended up happening is an incredibly positive sum for search. Like people are searching more, people are doing more. [06:10] And so I think agents are searching, too. Yeah. And agent. Actually, again, there's like this whole market that spawned at the same time that agents are doing more at the same time that humans are also searching more. And so I think it will be obviously there's a finite amount of like human time in the world. [06:25] But from my early feelings of how a lot of this is playing out, it does feel like it's very positive. So I'm from an ecosystem value creation, like how the human behavior aspect of it turns out, I think is somewhat clear in the next one to two years, much less clear three to five years from now when the technology is improved and the products probably look a little bit different than the way that they do. But ultimately, that is the success of product. I think we have a bunch of conversations with Demis all the time and it's like,
[06:53] the point of building... [06:55] the technology is so that it can go and do stuff for you. Like success for Google probably doesn't look like maximizing eyeball time in front of our products. It's like maximizing outcome for customers to do the thing that they want to do so that they can go and live their life and do what they want. And so I feel like... [07:12] you'll probably see us go down the route of maximizing outcomes for customers and not maximizing eyeballs. Yeah. I have this term stuck in my head, agent-led growth. It seems to me, so I'm using coding agents a lot in my personal time, and... [07:26] I just let the agent make all the infrastructure choices for me. I'm like, I don't care which database, you tell me. And the reason I ask is... [07:35] you know, it's true in coding today. I would imagine it's maybe going to be [07:38] generally true for a lot of things, let's say shopping, [07:42] down the line. How do you think that's going to change how advertising works, how value capture works for the aggregators? It feels like it's a very similar trend. This isn't perfectly true, but a lot of these things are just like proxies of each other. Like the way that SEO works, I think, is directly correlated with the way that, I forgot what the term now, it's like GEO is like the [08:12] lot of correlation between the things. My guess is it looks like much less of a radical shift than I think maybe what we assume right now, just because these things compound on top of each other.
[08:24] If you were to [08:26] you know. [08:27] grade the scale of agenticness in terms of crawl, walk, run. [08:30] Where are we in terms of how I don't take the Google suite of products is? Yeah, that's a that's a great question. It's definitely like crawl right now. And I think some of this is. [08:39] like all of the inherent product tension for Google is like, you have what, 13 billion plus user products. And so like, I actually think we have some, [08:48] more like labs like experiences where you're probably closer to running or walking. But I think like most of the product experience today is definitely closer to crawling. And I think that's just like the stewardship responsibility we have sort of building a product that's being used by lots of people. Like I don't think the long tail of customers are like ready to have AI running and just doing all the things like they probably they want to be in the driver's seat. They're cautiously taking the first step. And I think the Google team and like search is maybe [09:18] of responsibility to actually do that in a way that it brings people along and doesn't just like change everything of how they interact with the internet and the way they associate with products and stuff like that. So... [09:28] Yeah. Which products do you think are closest to the walk? That's a good question. [09:32] I think Gemini app is definitely closest to walk. And so for Spark, I think having a 24-7, always-on agent, like literally going and potentially doing a bunch of actions on your behalf is definitely like one of the frontier use cases. And I think you'll see, I think like anti-gravity is another one where it's like you could have autonomous coding agents, you know, rebuilding operating systems and doing, you know, billions of tokens and spending thousands of dollars on your behalf. And I think those are...
[09:58] Again, like more and actually like they're in GDM as well as another angle of this. So I think like GDM is taking like very much like a frontier look at this where I think like the rest of Google's products, I think, are like more incrementally getting there, which again makes makes reasonable sense to me. [10:13] Do you think that Google ends up with... [10:16] One, two, three product surfaces for using AI or thousands? It's tough. [10:22] I think a lot of this is actually baked in just like how humans consume products. And my sense is that [10:29] There's something nice about like having this like compartmentalization and this like specialization of products where like it becomes if you end up with a product that is like doing everything for you, inherently there's more work involved in using that version of the product. I think I think would be like the default state. I think maybe somebody will spin together like the truly magic experience that doesn't make that true. [10:59] versus like there's something nice about, I click my calendar app, it just shows me my calendar. Like I don't need to worry and deal with anything else. [11:06] This is my hot take for why slide decks have existed for so long of just like... [11:10] You know, the thing, the piece of information, you want it to be exactly in the same place. And I think we as humans are just actually very used to that as opposed to... [11:19] The idea of a gender-to-face sounds so cool to me, but it's like, do our brains really... [11:23] Isn't that just more cognitive overhead for us? - It definitely is in certain cases. And I think somebody needs to, again, there's a lot of incredibly smart people in the world. And so maybe somebody will find the experience that like makes it feel more natural. But to me right now,
[11:37] I'm, I'm maybe not 10,000 is the extreme version. I'm guessing it looks more like more products going after sort of like different, uh, or, and maybe the other answer is like, I don't know what it looks like for Google for the ecosystem. It looks like a lot more products, I think like, and that's, that's really exciting. I think like how Google will end up strategically deciding, like, do our customers want to deal with us having 10,000 products or would it be better to only have three? Um, we'll come down to like a strategic decision for us. [12:03] That totally makes sense. When I talk to companies in the enterprise, they say, you know, everyone's talking about agentic AI, but the only place they've seen agents really working is coding agents. Hmm. [12:15] Do you agree or disagree with that take? [12:17] Yeah, I think it depends what your bar for working is, which I think is a lot of the nuance of this. I think if you're truly trying to offload... [12:25] very complicated tasks for domains in which like it's the models haven't actually crossed the threshold of quality, then like I think that's definitely true. Like the it's not going to solve the problem, but this is something that I want. I wish we could like measure a good example is like Open Router, for example, is like measuring, you know, the total token consumption that's happening. And so you can sort of like see these trends play out over time of like how much more intelligence is in the world. [12:50] you know, now versus a year ago. In parallel, the thing that I'm actually really interested to measure is like, how long is the average like thing [12:59] the average like agent run or the average task actually taking place. And I don't think it's something that they publish, but I feel like they probably have interesting data. I'm sure there's others because, because I, I do think you're like seeing these like new model capability lands or new model drop and, and it's like spiking up and, and maybe the, the curve is still like very low right now, but like, you're seeing those like early signs of it spiking up or to like long running tasks and all the model labs are talking about,
[13:29] of autonomous work or whatever it is. That's the extreme. But I think in practice, [13:35] you're seeing that like trickling up like pretty, pretty quickly, which is really interesting. So even if the enterprises haven't felt it outside of coding, like they are going to like this year, um, as, as sort of a bunch of those other use cases get, get much better as well. From like a, you know, from the deep mind perspective, um, [13:51] Do you think long horizon agents is like a KPI that matters? Is it they, is it the KPI that matters? [13:57] It definitely, it definitely matters. I think for DeepMind, like we're doing lots of things, which we can talk more about later. Like there's, you know, a huge portfolio of different bets that are taking place. Long-running agents obviously matters a lot. And I think also like specifically coding agents and that matters a lot. Like it, [14:13] clearly is an accelerant of like every other part of your business if you have a great coding model. And so making sure we have that, I think it's super top of mind. Got it. I'd love to shift gears a little bit and talk about coding. Yeah. Okay, I'm going to ask a hard question. A lot of my developer friends were using Claude for a long time. [14:33] OpenAI saw that declared code red. Codex is now really good. I would say my friends are maybe split 50-50 now in using Claude. [14:41] and using codex. I don't hear a ton of them using Gemini, which has always kind of puzzled me. [14:47] um, [14:48] What's going on with that? [14:49] Yeah, it's a great question. I think there's one part of the story that I'll add, which makes it even more interesting, which is – [14:58] December, [14:59] the narrative was that Google had won. And when we landed Gemini 3, I think it was like such a profound improvement from a model capability perspective. I think a lot of the narrative was like Google has taken a huge leap forward and made that happen. And I think what was interesting to see sort of as an ecosystem participant is like how not how quickly that narrative shifted, but just like,
[15:22] the next wind of the narrative obviously was like all the agent encoding stuff that happened over, over the holidays and then into January and beyond. Um, and that was, that was not that long ago. [15:32] Um, and so it's a, it is a, we've been in warp speed ever since. Yeah, for sure. And it, but it is, it's a meta reminder of like just how fast things can, can change. Um, yeah. [15:41] I think the observation is not unreasonable. I do think what's happening behind the scenes for us is trying to push the frontier as fast as possible on coding. And so I think anti-gravity actually is an important part of that. I think one of the takeaways is that it's actually really hard to make a great coding model for this, like, [16:00] Um, [16:01] for this developer use case of like really long running SWE work if you don't actually have a product that does that. And so I think like Google realized that that's why the sort of like WindSurf deal happened. It's why those folks came over and then ultimately built anti-gravity and sort of we've been using internally actually and Sunar showed this at IO, just like the graph of growth of token consumption inside of Google. So you sort of like you need that engine to spin and sort of [16:31] in order to like actually make model progress. But I'm super confident. I think the, the folks, the group of folks who we have working on code is like, I describe it as like the Avengers of AI internally. And so like, it really is like the, some of the best people inside of Google trying to push the rock up the hill on this stuff. And it, [16:49] taking it super seriously and trying to push. And I think three flash, um,
[16:54] And notwithstanding some of the conversation about the price and stuff like that is sort of a step towards actually starting to bring a lot of these capabilities and the fruits of that labor paying off. It's a flash model that's better than any pro model we've ever released from a coding standpoint. And the pro models were really good before. So there's another thread of this. [17:15] Also, which is like everyone forgets that there's like pre-training windows. And I wonder, like somebody should like track this online, which would be interesting to see. Meaning like the big run, like what clusters have been available. Exactly. The big runs are like are an interesting thread of this. And so it like it might look from an external perspective that like, oh, you're you're super behind in some way. And like, actually, you miss all the context of like where the big runs are and where the large pre-training runs are. [17:45] That also, like, obviously there's pre-training has historically been like a massive strength for DeepMind. Like we have some of the best people in the world. And so excited to see sort of the fruits of that labor and everything else that's happened. [17:57] Flash was like all post-training gains, which is really cool. So a huge, huge testament to the team, the work that that team did to actually make. [18:05] the level of gains and like surpass the previous pro model, um, literally just with post-training, which is awesome. How religious are you all about dog fooding internally? [18:15] Like our, for example, our deep mind folks still allowed to use other models or is it like you guys are using the Gemini harness now and we have to make this really, really good? Yeah, there's I mean, I think people it's so healthy to be using other models just because like it's sometimes hard to like actually grok what's happening in the ecosystem if you're not to like I use all the models. I use all the products. I think like, you know, folks across the rest of deep mind are doing the same thing. You definitely have to use the Gemini models, though. It's just like great from from a feedback flywheel perspective.
[18:45] and it's part of how they get better is like DeepMind has and Google more broadly has like [18:49] a hundred thousand plus incredible engineers who are using the models and giving feedback. And like, it should be a competitive advantage for Google because we have that scale of sort of engineering resources and like the depth of the talent and can run, you know, AB tests and live experiments and all that stuff. So, um, [19:05] I think you have to use all the models, but I think for the majority of folks, it's like Gemini as the daily driver, which is great. Do you believe in this narrative around like a soft takeoff of like once you have a good enough agentic coding model, then it accelerates the pace of research progress and like it's a self-reinforcing cycle? [19:23] It seems obvious that that's true, but maybe I'm too, I'm drinking too much Kool-Aid that that's the case. Are you seeing the signs of it yet? Yeah. I mean, you definitely see some signs of this. I think the signs that are like still early is doing this from a model perspective. And I think part of the context of that is like the resource allocation for some of these like larger training runs is just like significant. And so like you definitely still have like a human in the driver's seat of making those decisions because like you're not going to. [19:52] accidentally, you know, take 10,000 TPUs to go kick off some job that like actually doesn't make that much sense. But from a product perspective, you for sure see it. Like, I think we're seeing this on our team, like we've built mobile apps using anti-gravity and like, we'll launch them to the world like faster than I think any team at Google has ever built a mobile app. Josh's team did this with the Gemini Mac OS app and sort of like end to end delivered an app sort of faster than any team had ever delivered a Mac app at Google. And it's because of,
[20:22] because of identity coding. And so it's great from a product perspective. [20:25] I think you've said in the past that if you could have a system that could build anything with [20:29] code [20:30] Humans can't compete on the same level, and that's... [20:32] narrow super intelligence [20:34] Do you think we've reached that point? [20:36] It is interesting. I think... [20:39] this like narrow super intelligence example, um, [20:43] is interesting to see how, obviously it kind of feels that way for coding right now, where like coding is like just so... [20:50] good, that it does kind of feel like narrow superintelligence. I don't know... [20:55] It depends how you actually end up the details of quantifying this. But I think the important thing is like, to your point earlier, it works incredibly well for code. [21:04] it would be great if it did a bunch of other things, but it's actually just like so impactful that it can be great at code. And so I spent a lot of time just like, [21:13] letting that [21:14] that fact sort of just like wash over me because I think it's like, obviously building AGI is super important and very interesting, but like building AGI, if it's sort of like takes away from the story of like the current... [21:26] present capability of the technology, I think is actually like kind of a bad, [21:30] a bad sort of like trade-off. And so I'm trying to like always hold these two things in my head equal at the same time, which is we need to build general purpose technology, but obviously it's so impactful to have this thing. [21:41] And it feels like it hasn't taken away sort of – it's been one of the best – [21:46] positive outcomes is that... [21:48] I feel like it hasn't taken away from human developers. It really does feel...
[21:54] like an accelerant of what human development, like I, as a human developer feel like I have more agency in the world. I feel like I can tackle my personal experience. I feel like I can tackle more ambitious problems. I feel like I used to. [22:06] kick around ideas and they were like, [22:09] slightly out of reach. Um, and I would just be like, ah, wouldn't it be nice? Um, and now I have the opposite problem, which is I'm, I'm kicking around an idea and I'm like, [22:18] I could probably make this even more ambitious. And sort of it does, it adds a different layer of sort of, um, [22:24] responsibility or like some a different layer of burden, actually, because I'm like, oh, I can't just like do the sort of MVP of this. Like, I actually need to like go 10 steps further because the technology enables me and like resetting my level of ambition, I think, is something that I've also spent a bunch of time thinking about. But I think that will happen in other areas. [22:44] these like vertical super intelligence domains. Um, [22:48] which will be interesting. And it feels like we're going to get a bunch of those before we've like solved the, [22:53] Like it's almost like jagged, like jagged super intelligence, I think is what we'll end up with. What verticals do you think we'll get super intelligence at next? [23:01] That's a great question. I do spend a lot of my time, too much time probably thinking about coding these days. So I'll think for a second of like the other... [23:09] the other domains. [23:11] I think part of this is things that have better verifiability, obviously, are the ones where you'll see the gains happen more quickly. So things with math and finance. Actually, science could be a really interesting one. It would be fascinating to see some of these domains where...
[23:29] there's some level of verifiability, like actually like really start to take off, um, [23:34] which would be cool. And I also think like an important thing in this like broader narrative about just like what, what impact AI is having on the world. Like you almost like want that to be the case in the sequencing of like things that work. You want a lot of these like really, really good, impactful, positive things for the world to happen as early on as humanly possible. So that like folks understand what the potential positive impact of the technology is. So I think science could be a really interesting one. Yeah. Obviously there's all the stuff happening right [24:04] that, which I'm not a mathematician, so it's somewhat over my head. I saw a great tweet the other day. Why did Airdos have so many problems? Exactly. That's a good one. I like that. That's a good t-shirt. So funny. Okay. Speaking of Twitter, I went through your Twitter before this, so I'm going to read back another tweet at you. The good thing about Twitter is there's a public record of all your predictions. I need to turn on that auto-tweet deleting feature or whatever it is. Last October, you tweeted, everyone is going to be able to vibe code video games by the end [24:34] of 2025. Yeah. Did that end up being true? It feels close. And I think there's, I mean, [24:40] Obviously not AAA games, like you're not building the next Call of Duty or GTA yet. [24:46] But I think it feels... [24:48] closer than it's ever been. Um... [24:52] And [24:53] I think actually a lot of the interesting bit about video games is you actually need to end up building a lot of this other stuff. Like models, and we were talking off camera before this, like 3JS is a great example of this. Like 3JS makes a lot of things possible that weren't before. But there's still all these rough edges that just a coding agent doesn't solve. And so you need sprite generation. And the models aren't very good at doing that natively. And so you need some orchestration layer and tooling in order to make that happen.
[25:23] are. [25:24] core to like the gaming video game experience. [25:27] that need to have a high degree of reliability that I think it feels like it's within reach, but actually requires a lot of product scaffolding work in order to create experiences that are reusable and replayable and have the level of depth and requires a little bit of taste in there. Do you see people making a lot of video games inside AI Studio and the other developer surfaces that you have? Yeah. And so this was actually based on us looking at the early data. And there [25:57] folks were making were actually games. Like people were trying to build games. Is that the most popular category? It's not the most popular category anymore. Um, just cause I think like the, the ecosystem has shifted and like the user base has shifted, but it's a lot of, a lot of games. Um, what is the most popular category? I think it was like, it's like 20% like finance related stuff. 20% like people like counting their money that much people like, I think it's, it's something around crypto actually, I think is what people are doing a lot of stuff with, uh, with finance. Um, [26:26] a lot of like personal productivity things and a lot of gen media stuff actually, because obviously the Google suite of gen media stuff is amazing. Yeah. It's done a great job. Um, [26:36] But I also think GDM has sort of like a, obviously Demis cares a ton about games and sort of like started his career and doing AI stuff because of games. And so I think we'll have some interesting swings at this. And our team actually in Kaggle, which is sort of a bunch of the AI benchmarking stuff we do in GDM, sort of works with GDM to build this game arena, which is sort of our way of sort of like testing progress towards AGI, like using games as a proxy, which, again, is like very deeply rooted in GDM.
[27:06] history. [27:07] How close do you think we are to, you know, rando off the street with a good idea, Kim Vibe Code? Yeah. [27:13] a really fun, playable game. I want to say this year... Actually, I think the model capability makes it possible. I think this is where I've gotten excited on the product side. And again, we were also talking off camera about the startups in this ecosystem because... [27:27] It feels like it's possible. It doesn't feel like there's a gap in model quality. It feels like there's a gap in like you someone who knows what it takes to build a great game actually like putting the scaffolding together in the right way to make that possible. I think there are folks who are doing this right now. And so some of it is like a discoverability and awareness thing that like people just don't even know that they can do that. And some of it is just like. [27:50] maybe certain categories of model capabilities are just like slightly off and we're like, you know, [27:55] weeks or months away from like that chasm being crossed. And then it just like working for most people. And so this is a good segue into, I want to ask you about role models next, but do you think vibe coded video games is more likely to, [28:08] um, [28:09] going to be game engine plus coding agents based, or do you think it's more likely to be world model based? [28:17] Yeah, I think what will end up happening is the definition of world models will blur, which we should talk about with Omni. [28:26] And... [28:27] it will still... [28:29] I think the coding agent will look like some sort of world model type system, but you actually do need...
[28:38] To make world models useful for like real things, you need like scaffolding. And so I think there's, again, there's actually a bunch of interesting startups like doing work, like figuring out what is the scaffolding for world models so that you can take them from these like very open-ended, inherent design of world models, very open-ended spaces and like do it in a tangible way so that it's like grounded in a use case that like you could use in a reoccurring way. That could be somebody maybe will figure out the scaffolding for world models to make games possible. [29:08] inherent nature of world models right now, I think, make it so that it's like actually not well suited for [29:14] like games in the current form but [29:16] The progress has been crazy. So who knows, maybe in like two years, the versions will be able to, but at least in the short term, it's like coding agent plus some sort of game engine, I think is like where you'll see way more alpha from a game's perspective. That makes sense. Okay. So you said the definitions of world models are blurry. Can we unpack that? [29:46] as a world model because of just like the level of understanding that it has of the world. I think that like technically looks different than, and I'm not an architecture expert on like the way that we've done world models before, but it is different from an architectural standpoint than what's happened in the past. Yeah. [30:02] which I think is positive because it's getting closer to like some of the ways in which it might actually be more scalable. Um, and historically, like it's been like super not scalable. It's like very, very expensive to run traditional, like online world models. Yeah. Like Jeannie being like, yeah. Okay. So if you think of traditional world models as being like an action conditions, video model, then like,
[30:26] Right now, when we say world model, what we actually mean is... [30:30] a model that has some understanding of the world as opposed to being strictly technically a action condition video model. Yeah. And so the interesting thing though, is like, it has understanding of the world, but then it also has that like really great. And that's where like the line is blurry to me where it's like, it can do a lot of those same use cases. It's not real time right now, but like it can do a lot of those same use cases that you would describe or like visually could create with that same exact world model, which I think is [31:00] I feel like this like world model video model thing is going to, is going to, [31:06] change and play out in a different way than was obvious before. And how does it work under the hood? Like whatever you're able to share, like, is it Gemini plus video models? Is it [31:15] Something different entirely? It is a single model, which I think is the important part. Like, this was actually part of the original desire was, like, [31:24] you [31:26] we're training like eight different models to do all of those things. Historically, it's like you have a text model with the baseline Gemini model. You have audio, you have music models with Lyria, you have nano banana, you have VO video models. You have a, we have a whole suite of audio models and like, it would be great for us, our customers, um, [31:44] If you just had a single model to do all those things. So it is like a, a new, [31:49] setup that sort of makes that possible. It's not like routing to a bunch of different models, which like we [31:55] you could have imagined we could have done something like that actually before and done like a Gemini Omni model, but this is like a true Omni model. Um, and it's starting with like the, the use case that works the best right now, which is why it's the one that's available. Um,
[32:08] is this like video editing capability. Um, [32:12] The technically it's like functional with the other things. It's just like the quality isn't isn't like perfect and is not state of the art. So we haven't rolled that out yet. It's also just like the first. [32:24] crank of the model turn on Omni. It's the Omni Flash model, the first iteration. And so we'll have like much, much more capable, powerful versions, which will be exciting to see. Hmm. [32:35] So we could edit this set so it looks like we're... We should, yes. Yeah, I want this. Again, we were talking off camera. We should do that for the intro because I think it just makes all this stuff more capable. And I've seen these examples of... [32:48] such subtle nuance that like make me appreciate that it's like the world understanding playing out I was I was giving a talk. [32:56] and was on stage with my friend Tulsi, who leads the model team, who I don't know if you've ever had on before, but she's amazing. I love Tulsi. [33:04] And I had mentioned to someone in the crowd to, like, edit the video, and they literally, like, took the picture, edited it with Omni in real time, and this, like, dog came on the stage. And, like, the other – in the edited version, the other guests sort of, like, look down and see the dog. They, like, chuckle a little bit. This is while I'm, like, opining about whatever AI nonsense. Maybe they're laughing about your jokes. Yeah, it was not my jokes. [33:26] They laugh at the dog coming up. It jumps onto my lap. I sort of like acknowledge the dog. I keep talking. I'm like petting it or whatever. And just like there's like so much subtle subtlety in getting that right. And the model crushed it. And it's just it.
[33:41] very interesting and like still trying to like absorb and digest like what that means for you know the way we make content and all these other things that's so interesting yeah i'm i'm the biggest bull on generative media and what it means and [33:55] I mean, one of the things we've thought about for our podcast is the visuals matter as much as the content. For sure. That's how you catch people's attention in the first place, right? Yeah. And so, okay, I'm excited to play with Omni. I'm excited too. And I think you probably feel this way as somebody who makes content. But I've historically been very, for myself personally, I don't use AI to make any content that I produce. It's all my words. It's always my voice. It's always my image and picture showing up. [34:25] And so like I would much rather it be me than some AI version of me. [34:30] What I like so much about Omni is that it's like not changing me. It is like changing a bunch of these other bits which are not me. Like I didn't choose any of the like set around us or the coffee table. It's like so our words can stay the same and like you can change these bits that are like not personal and do something more interesting with them, which I think is really, really cool and feels. [34:54] It feels like the version of what I want sort of like Gen Media to be, which is like not a bunch of like AI avatars. No Fruit Island videos. Exactly. Truly. Like it really is like it's the original content. It's the person. It's like the personhood is there. It's just... [35:10] different and amplified.
[35:12] Super interesting. Okay, I'm excited to play with it. Yeah, we should... [35:15] send some prompts right after this and try. I don't mind the fruit video. So I'm, I'm, I'm, I'm happy for a world of both. Um, on the coding side, you launched the ability in AI studio for people to vibe code Android apps. Yeah. Yeah. Um, I'd love to, you know, hear how that's going so far and, and where are you going to take that? Yeah, it's super exciting. I think one of the strategic things for AI studio, and actually this is based on like a lot of the feedback from the ecosystem and actually from developers from others, it's like so many Google products. Um, [35:45] ways in which you like touch Google through all these different journeys of building a startup or bringing idea to your life. And so we have this like first class principle of like, how do we bring. [35:56] things into AI Studio that make it so that you [36:00] are exposed to other parts of the Google ecosystem without having to go through nine different UIs across Google. And so Androids are a great example, not only of that, but also of [36:11] enabling people who wouldn't have otherwise built an Android app. And so I literally built my first Android app in AI studio. Um, very cool to see it's, uh, what is it? Yeah. I just did like a plan, not a crypto app, just a plant one. I was planting trees in my backyard. Yeah. And so it was just like playing around with the gardening app as I, as I was kicking the tires. Um, I haven't had my like breakthrough idea yet of what I want for a mobile app, but I'm going to, I'm going to come up with something and see, go compete on the app store. Have you seen anything vibe coded, [36:41] in the app store yet? That's a good question. It'd actually be interesting to like see some analysis. I don't know. I'm sure it's like accelerating a lot of things on the app store, but I don't know how much, like I don't know anyone like personally who's done that. Yeah.
[36:54] It is interesting, and I was going to make the observation, too, that I think the last time I checked the numbers, we were viewing it this morning, it was like 350,000 Android apps built in AI Studio since last week, which is crazy. And, like, excitingly, it's like 350,000 apps that, like, [37:10] probably no one was going to build before. [37:12] a lot of these are personal too. And so this is where I think this like, maybe GenUI is like farther out there, but I think like the idea of you building software, [37:22] to solve your personal problem is like very real right now. And like people are doing that. It's like one of the most common use cases of a lot of these products. [37:29] And being able to like unlock a bunch of the native capabilities of the phone, I think is also really interesting because you just have so much context that's like in different places. [37:40] So I'm getting very excited about sort of that opportunity. And Android feels like it's becoming the... [37:47] the platform for builders. - Does it matter that something is an app versus just like the web is so powerful now? - Yeah, that is, it's also very interesting to see that play out. Web is definitely powerful. There are certain things that the operating systems have that like you just can't unlock, like lots of like native richness that actually like [38:05] make experiences feel... [38:08] so much richer. I think about this for like text messaging, actually, that like the text messaging experience in all of the [38:14] and all the main operating systems feel way richer to me than like any AI chat app that I've ever used. Like if I could just talk to AI and whatever texting app I use, like I would be way happier than having to go to some other app. Um,
[38:27] Because I think we're also just like conditioned on like the operating systems. So yeah, makes sense. [38:32] Okay, I want to ask about the muddle eats the harness or the muddle eats the scaffolding. [38:37] What are your thoughts? Yeah, I think it's true. And I think part of this is like... [38:41] What we... [38:43] have historically thought of as the model... [38:45] is not the model anymore. I think two years ago when LLMs were popular, the model was actually just a set of weights. It was a set of weights and it was really, how can you as simple as possible, send tokens in and get tokens out? And I think we've just progressively, step by step by step, we still call it the model, we still call it Gemini 3.5, we still call it GPT whatever and Claude whatever, but [39:11] it's actually not just the weights anymore. It's like an entire expanding sprawling system that's built around the weights that sort of like enable a lot of these like next generation experiences from agentic tool calling to tool, you know, like all these hosted tools, search, code execution, et cetera. You know, the models are now being spun up in containers and sort of have an agent harness and all that stuff. So the scaffolding is like oftentimes a couple of steps ahead of like [39:41] into the model. And then what ends up happening is like the model eats that scaffolding and it becomes part of like the native model system. And there's still value in having sort of, [39:50] the external scaffolding in certain cases. And like search maybe is an example of this. Like there's lots of folks who use different search providers and there's different like use cases that you want. And so like, sure, maybe the model can natively use search, but you also want something else. Code execution, another example of that. Um,
[40:06] but it does feel like [40:08] like maybe the agent harness is like the quintessential example of this right now, where like everyone's like, ah, we got to go build a harness and like the harness is where the alpha is. And like, like, [40:17] I think that... [40:18] Perhaps won't be true, at least in the way that we think of the harness today in 12 months. I think the models will have sort of just like digested a bunch of that. It'll be upstreamed into the model and the Alpha will be somewhere else now. It won't be in sort of trying to spin your own harness because the model just like does it natively. [40:35] But I thought that part of the reason why people are building their own harnesses is because if you use a harness from any given model provider, you're locked in, right? So a lot of the application companies want flexibility, which is why they're building their own harnesses. Yeah, and I think that's part of the scaffolding story is that starts out perhaps true, but then as the model capability improves, it becomes less true over time, actually. I think the model, you don't have... [40:57] a generalized [40:58] model if it can't use another harness. And so it is important. And I mentioned this in another conversation with someone a few weeks ago, but we need something like harness bench, which is like actually measuring like, [41:09] how good are all these different models at adapting to all the different harnesses? I feel like that seems like a reasonable thing we should, we should measure as an ecosystem. And I'd be curious to see like what models are actually best, but, [41:21] I think over time you expect they'd be able to use every harness, unless you're like completely out of distribution, which in that case, like... [41:31] you're still going to be completely out of distribution, even if you're using your own harness. So not sure it matters much. Fair enough. What about the application layer? How do you think about where independent companies can have a hope of surviving when the model eats the harness and eats the stuff around it? Yeah. It's an interesting story that both of these things feel true. Both on one hand, everywhere I look, I'm like, there's never been more opportunity to go and build something. At the same time, obviously the models are doing
[42:01] they've ever done before. I think there's like, you know, there's that thread of capability overhang, which I think there's a huge amount of alpha in. There's the thread of the model companies are like going after these like very general problems. And there's just like so much value in these like verticalized domains. If you have expertise in that domain, you sort of like know the customers, you know, the ecosystem, like it's just, you can really like, [42:24] run laps around even the best model labs, because like focus is the like superpower of startups. Like if you can focus, you can do anything. And if you look at all of the companies that are big or doing lots of stuff, like there's just not a lot of focus. And for some, for some reasons, like rightfully so, because maybe I'm, I'm overly justifying, you know, Google strategy, but like, we just have a lot of products. We have a lot of users. We have a lot of different things going on. And so like, we actually can't focus in one domain. We have an obligation to do a bunch [42:54] company. [42:55] I think that's not true for startups. And so I think like 24 months ago, we were all asking ourselves like, oh, wow, it seems like the opportunity space is shifting. And maybe it's possible one of the outcomes is there's less opportunity for startups in the future. That feels like so far in a way, not what has ended up playing out, which is really positive. If anything, it feels like there's just even more opportunity than there was. Like now coding has helped you like close the gap on like larger companies that have like established code bases [43:25] because you can just like run way faster and write software quicker. [43:30] the agentic like primitive is like a new category that you can sort of build products around that. Like actually in a lot of cases to the conversation about like,
[43:39] the risks involved with building, like there's risk involved. And so like, what's your, like the risk appetite of different companies is different. And so if you're willing to take more risk in some domains, like you can win a user cohort who's like interested in also taking risk. There's so much opportunity. [43:53] Awesome. I'd love to talk about Google DeepMind's culture, and I'm curious... [43:58] What does it feel like to be inside... [44:00] GDM right now. You know, we had Demis at AI Sense. He was so inspiring. I've heard Sergei's back. You guys have Noam Shazir back. Like, [44:09] Walk me through what it's like to be at GDM right now. It's incredible. I do try to take it all in because it's like a moment. I try to reflect as much as possible in the chaos of all the things that are happening just because there's so much cool stuff going on. GDM's culture is interesting and maybe three observations. One, back to this thread of focus. [44:33] we're doing a lot of things. And so I think you see sort of, I think about this a lot, like from a portfolio perspective, I think we have like one of the strongest portfolios, which is really exciting. But you do see these moments where like another lab or another company, whatever it is, will like pull ahead in a certain area where like we underinvested, just like hadn't been focused enough in that domain. And it's cool to see... [44:56] like the the way we go about trying to like close that gap. I very much I very much appreciate it. I think I've I've watched the Demis thinking game documentary a few times and like you see sort of like a lot of like details of that, like original culture and just like the way that strikes work and all this stuff, which is actually really similar today is like you just get a bunch of smart people together and like go solve the problem. And I love that. And it's like very cool.
[45:24] to be a part of. Another one is this [45:28] I think you see the culture permeate from who the leaders are. [45:35] this isn't like a perfect characterization of the ecosystem, but like Demis is... [45:41] Nobel Prize scientist and the sort of OG of a lot of this stuff. And you feel that in the DeepMind culture. I think Sam is maybe one of the world's best businessmen ever. And you sort of see that in the open AI culture and the way that they go about the world. I don't have a strong sense of who Dario is, but I think Anthropic is a very interesting place. And you sort of, at least as an external observer, he seems like an interesting guy and so somewhat esoteric. And so it seems like [46:11] dna and the culture of the company you know the other labs are interesting um [46:16] But I like this very scientific approach to the world and the way that Demis looks at... [46:23] the reason he's doing this and the reason they started this mission was like literally to like solve disease and all these things. And it's like so easy to get. And again, I'm always trying to pull myself out of the moment, but like it's so easy to get lost in this like competitive race of who's. [46:39] pushing a number higher on Sweebench or whatever it is, it's very easy to lose sight of like, the reason we're doing that is so that like we're, [46:47] can solve problems that humans actually have. And there's a... [46:51] My favorite quote from all of Silicon Valley is something like, you know, we can't let other people make the world a better place more than we can, which is like what this moment feels like. The Gavin Belson quote. The Gavin Belson quote. And I think about that all the time. And it's like we're all fighting over.
[47:05] who can make the world better more than the other person, which just like, when you frame it like that, it seems really goofy to me. And so, [47:13] It's very much not zero-sum. [47:16] And I think that's like a... [47:18] a way of looking at the world. I think the last thing about DeepMind's culture is like, we're very, we're sort of the engine room of Google, which I think is like literally the Twitter bio now of the DeepMind Twitter account, which I love. You man the DeepMind Twitter account? I don't. I don't want any responsibility manning other people's accounts online too much, too much responsibility to do that. But it does feel like that too. So it's like, on one hand, you have sort of like the deep rooted lab culture. On the other hand, you have sort of like all of [47:48] collaborating with everybody from Android that we talked about earlier to Google Cloud, to Gmail, to Workspace, et cetera, et cetera. And so [47:56] it's an interesting blend of like, [47:58] I think there's lots of research work happening, but like there's tons of applied work that's happening to like actually like work with some of the like the forefront customers, like deploying Gemini to billion user products is a problem that like only two companies in the world have. And we have 13 of those products. And like we, you know, Google goes through this all the time now. And it's such an interesting place to like see that happen and see the innovation that takes place in order to make that actually possible. And I feel like it's, [48:28] which is really cool. [48:30] Beautifully said. [48:31] Did they give him a lot of heartburn when you joined and were tweeting a lot?
[48:35] That's a good question. Do you have to get sign-off from comms? [48:43] the silver linings to my Google experience has been just like how great that group of like folks across marketing comms are to, to work with. And I think like, you know, their job is protect Google, make sure we tell the right story and make sure a bunch of bad things don't happen. And so I have a ton of appreciation and partnership with them, but it's been an incredible experience to like, be able to go try to tell the story that, [49:06] that resonates with developers in a way that feels authentic and not – [49:11] have a huge amount of [49:13] you know, [49:14] I don't have to get my tweets approved all the time and all this stuff. It's a very, very positive culture. And I think hopefully I'm always trying to walk the line of not burning the trust and goodwill that I've accumulated with those folks. But it's been super positive because ultimately I think it's really hard to... [49:32] for Google to tell [49:33] this like authentic story. So there's just like, [49:36] It's a big company. There's a lot of people. There's a lot of opinions. And so you take the magic of Google and you water it down through a lot of people and a lot of process. And you actually, you miss the beautiful story, which is Google's doing the most interesting technology in the world and helping our users with some of the hardest problems in the world. And it feels, it's a privilege to get to help tell that story. So it's a lot of fun. I enjoy it. I love what you're doing. I love what Josh is doing. I think you guys have put a really
[50:06] as you put it, the most important problem of our time. Thank you. [50:10] Well, wonderful, Logan. Thank you so much for joining me today. This is a very far-ranging conversation, everything from agents and coding to world models and harnesses and GDM culture and lots of nuggets here. Thank you for joining me today. This is a ton of fun. Thank you for having me. And I'm excited to see what the folks cook up of where we've been sitting this whole time, maybe in front of us. Maybe there'll be a dog. A dog something. You can make my dog dreams come through. I love it. Awesome. Thanks, Logan. Of course. [50:38] Music.
Want to learn more?