Google Called I/O 2026 the Start of the Agentic Era. Here Is What That Framing Is Hiding.

Written byBen Rogers

PublishedMay 21, 2026

UpdatedJun 22, 2026

13 min read

Google held I/O 2026 on May 20 and announced, with characteristic sweep, that the agentic Gemini era has begun. The keynote delivered Gemini 3.5 Flash as the new default model across Search’s AI Mode, the Gemini app, and the Gemini API; Gemini Spark, a persistent AI agent running continuously on dedicated virtual machines within Google Cloud infrastructure; Managed Agents in the Gemini API, which abstracts away agent infrastructure setup; and Antigravity 2.0, Google’s agent development platform, now with the ability to orchestrate subagents across complex multi-step workflows.

These are substantive announcements. Gemini 3.5 Flash’s positioning — “frontier-level intelligence with the speed and price profile of a flash model” — directly addresses the cost and latency concerns that have limited Gemini adoption relative to OpenAI’s GPT-4o and Anthropic’s Claude Sonnet. Managed Agents genuinely lowers the operational burden for developers building agent systems. Gemini Spark, if it delivers on its persistent execution promise, represents a meaningful capability leap over stateless query-response AI.

What the announcements are hiding — or more precisely, what the agentic era framing is designed to obscure — is that Google is still catching up on agent infrastructure rather than defining it. The question for operators making AI platform decisions is not whether Google’s I/O announcements are real. They are. The question is what the competitive dynamics of this race mean for the platform commitments that operators are making today.

What Gemini 3.5 Flash Actually Represents

Gemini 3.5 Flash is a model that positions on cost and speed rather than raw capability. Google’s own framing — “frontier-level intelligence with Flash speed and pricing” — is a carefully constructed claim. “Frontier-level intelligence” does not mean the best model; it means a model that is competitive at the frontier without being the frontier leader. The careful reader hears “competitive” where the marketing says “frontier.”

The competitive context matters. Claude Sonnet 4 and GPT-4o are the primary benchmarks against which Gemini 3.5 Flash is positioned. Both have established developer mindshare and production deployment records that Gemini’s various model iterations have not matched at scale. The pattern across Google’s model releases since Gemini 1.0 has been: announce impressive benchmarks, observe slower-than-expected developer adoption, revise and re-release. Whether Gemini 3.5 Flash breaks that pattern depends on production performance in diverse workloads, not benchmark scores announced at a developer conference.

The Flash designation is meaningful, however, on the specific dimension of inference cost. If Google is genuinely delivering frontier-competitive reasoning at significantly lower inference cost than GPT-4o, that is a real commercial advantage for cost-sensitive workloads — particularly agentic workloads where a single user request may trigger dozens or hundreds of model calls across a multi-step agent workflow. The economics of agentic AI make inference cost a more important variable than it was for single-query applications. A model that is 80% as capable at 40% of the cost may be the correct platform choice for most production agent deployments.

Gemini Spark and the Persistent Agent Question

Gemini Spark — a persistent AI agent that runs continuously on dedicated virtual machines within Google Cloud — is the I/O announcement that deserves the most scrutiny, because it makes a bold architectural claim and the details matter enormously.

A truly persistent agent — one that maintains continuous context, executes long-horizon tasks without session boundaries, and learns from its operational history — would represent a genuine architectural advance over the stateless session model that has characterised most current AI deployments. “Runs continuously on dedicated virtual machines” sounds like persistent execution. But the key variable is context management: does Gemini Spark maintain a genuinely continuous context window across tasks and time, or does it use external memory systems to simulate continuity across what are effectively new sessions with retrieved context?

Google has not been transparent about this distinction in its I/O announcements, and the distinction is commercially significant. Simulated continuity through retrieved memory is useful but it is not the same as genuine persistent context — it introduces retrieval latency, retrieval errors, and context compression artifacts that affect agent behaviour in ways that true persistence does not. Developers who build on Gemini Spark need to understand which architecture they are building on before committing production workloads to it.

This is not scepticism for its own sake. It is the kind of technical question that determines whether a platform delivers on its architectural promise or creates a dependency on a capability that does not fully exist. The governance of the AI agent infrastructure layer matters for operators precisely because these architectural differences compound over time as workloads are built on top of them.

Managed Agents and What Google Is Actually Competing For

Managed Agents in the Gemini API — which provides a fully provisioned agent environment via a single API call — is Google’s direct response to Anthropic’s Claude Agent SDK and OpenAI’s Assistants API. The product removes infrastructure friction: instead of provisioning compute, managing state, handling tool integration, and building the scaffolding around a model to make it behave as an agent, developers call an API endpoint and receive a functional agent environment.

The competition Google is entering here is not primarily about which model is better. It is about which agent infrastructure platform captures developer workflows and the organisational dependencies that follow. Agent infrastructure is stickier than model APIs: when your workflows, tool integrations, memory systems, and evaluation frameworks are built on a specific agent platform, switching platforms requires rebuilding those components. The switching cost is real and grows over time as the deployment matures.

This is the strategic logic of Google’s I/O positioning. By announcing Managed Agents, Gemini Spark, and Antigravity 2.0 simultaneously, Google is attempting to present a complete agent infrastructure stack — not just a model, but a development environment, an execution layer, and a persistence layer — that developers can commit to as a platform rather than assembling from components.

OpenAI and Anthropic have been building these same components for longer. AWS’s Bedrock Agents and Amazon’s Strands framework are in production at enterprise scale. The question is not whether Google can compete — it clearly can — but whether the I/O announcements represent a closing of the gap or a reframing of a gap that remains. Operators who are currently building on OpenAI or Anthropic agent infrastructure have limited reason to migrate on the basis of I/O announcements; operators who are yet to commit to an agent platform have genuine reason to evaluate Google’s stack seriously alongside the alternatives.

The Microsoft Context Google Is Not Mentioning

Any assessment of Google’s I/O 2026 agent announcements needs to account for the competitive context that Google’s keynote did not acknowledge: Microsoft’s existing position in enterprise AI deployment. Microsoft’s Copilot ecosystem, built on OpenAI’s models and integrated across the Microsoft 365 product suite, already has the largest enterprise AI deployment footprint of any vendor. GitHub Copilot has more than 1.8 million paying subscribers. Azure OpenAI Service is the preferred enterprise API layer for most large organisations that have standardised on Azure infrastructure.

Google Workspace does not have equivalent AI adoption numbers in enterprise. Google’s response to Microsoft’s enterprise AI position has been Gemini in Workspace, which has rolled out across Google’s productivity suite — but adoption evidence suggests it has not disrupted Microsoft’s lead in the enterprise segment. The Microsoft platform incumbency in enterprise is the headwind that Google’s agentic era announcements need to overcome, and no I/O keynote changes that dynamic. What changes it is developer adoption over time, enterprise sales cycles, and whether Gemini’s production performance justifies switching costs — none of which are visible on May 20.

What Operators Should Do With the I/O Announcements

For operators making AI platform decisions in response to I/O 2026, the honest framework is straightforward.

If you are currently using Google Cloud and Google Workspace as primary infrastructure, the I/O announcements represent genuine capability additions that are worth evaluating on their technical merits. Gemini 3.5 Flash’s cost profile is worth testing against your current inference costs. Managed Agents is worth assessing against the infrastructure overhead you are currently managing. Gemini Spark is worth tracking closely — but defer production commitments until the architectural details are public and you have assessed whether “persistent” means what it implies.

If you are currently building on OpenAI, Anthropic, or AWS agent infrastructure, the I/O announcements do not provide compelling reason to migrate. They provide reason to benchmark Gemini 3.5 Flash on your specific workloads, which is worth doing if inference cost is a material operating expense. Migrating agent infrastructure mid-deployment carries real switching costs and risk that are not justified by the current gap between Google’s announced capabilities and its production track record.

If you are making a greenfield platform decision for agent infrastructure, Google’s stack is now a credible option alongside OpenAI, Anthropic, and AWS. The right selection criterion is production reliability on your specific workload type, total cost of ownership at your expected usage scale, and the quality of the developer tooling and support ecosystem. The “agentic era” framing is marketing; the evaluation criteria are technical and operational.

FAQ

What did Google announce at I/O 2026? Gemini 3.5 Flash (new default model, positioned on cost and speed); Gemini Spark (persistent agent on dedicated VMs within Google Cloud); Managed Agents in the Gemini API (single-call fully provisioned agent environment); and Antigravity 2.0 with subagent orchestration and improved developer tooling.

What is Gemini 3.5 Flash’s competitive position? Google positions it as “frontier-level intelligence” at Flash speed and pricing — meaning competitive with GPT-4o and Claude Sonnet on capability, at lower inference cost and latency. Whether this holds in diverse production workloads rather than benchmark conditions requires independent testing.

Is Gemini Spark genuinely persistent? Google has not been fully transparent about whether Gemini Spark uses true continuous context or simulated persistence through retrieved memory. The distinction matters architecturally and operationally. Defer production commitments until the architecture is clarified.

Should I migrate from OpenAI or Anthropic to Google’s agent stack? Not on the basis of I/O announcements alone. Migration carries real switching costs that are not justified by the gap between announced capabilities and Google’s production track record. Benchmark Gemini 3.5 Flash on your workloads for cost optimisation; defer agent infrastructure migration until production evidence accumulates.

What is Google’s biggest challenge in enterprise AI adoption? Microsoft’s existing enterprise AI deployment footprint — through Copilot in Microsoft 365, GitHub Copilot, and Azure OpenAI Service — represents a strong incumbent position that Google Workspace has not displaced. Enterprise AI adoption follows existing infrastructure relationships, and most large organisations’ primary infrastructure is Azure rather than Google Cloud.

Sources

The Growth-Loop Diagnosis On What “Agentic Era” Actually Means For Operators

The agentic-era framing is a useful narrative and a misleading planning tool. Useful because it captures the directional shift in how software gets used. Misleading because it suggests a single transition between two states — pre-agentic and post-agentic — when the actual transition is a long sequence of partial integrations, each of which produces a different growth-loop dynamic for the operators trying to build on top of the shift.

The growth-loop question worth asking is which side of each integration the loop accrues to. When an agent acts on behalf of a user inside an existing product, the loop is often captured by the agent platform, not by the product. The product becomes a tool the agent calls. The agent platform owns the user relationship, the retention, the cross-sell. The product is reduced to an API on someone else’s distribution. That dynamic is not new — it played out during the mobile-app-store transition and the search-engine transition before that — but the agentic transition compresses it into a shorter window and sharpens the consequences. Operators who assume their product will keep its current growth loop in an agentic-mediated world are usually assuming wrong.

The operator move that protects against this is to build a direct relationship with the user that the agent layer cannot intermediate away. Direct identity, direct billing, direct usage data, direct support. None of these are exciting features. All of them are the foundation that keeps the growth loop accruing to the operator rather than to whichever agent platform happens to be in front of the user today. The operators who treat this as a 2026 priority will retain compounding. The operators who treat the agentic shift as someone else’s problem will discover that their CAC has tripled by 2028 because the user relationship they assumed they owned was actually owned by a layer above them that started charging for the privilege.

Seven Powers in the Agentic Platform Transition: What Google Is Actually Building

Hamilton Helmer’s Seven Powers framework identifies the specific business characteristics that allow a company to sustain competitive advantage — not temporarily, but durably, across competitive responses. The agentic AI announcement from Google I/O 2026 is worth reading through this framework because the announcements that matter are the ones that are building genuine power, and the announcements that don’t are the ones that are building capability without the structural moat that prevents competitors from replicating it. Google has competitive threats on multiple fronts simultaneously, and reading which announcements build power versus which build capability helps operators understand which Google investments to treat as durable and which to treat as temporary leads that will be competed away.

Scale economies: Google’s Gemini Flash 3.5 and the inference cost reductions announced at I/O represent genuine scale economies — the cost per token at Google’s inference volume is structurally lower than any competitor’s cost at smaller scale, which means Google can price below its marginal cost for enterprise customers who generate volume and still be profitable at the aggregate level. This is a real power source, but it is time-limited: as the model capability frontier continues to advance, the inference cost curves for all major providers continue to decline, and the absolute cost advantage of scale compresses toward commodity infrastructure costs. Scale economies in AI inference are a 2026 advantage, not a 2030 advantage at today’s trajectory.

Network effects: The agentic platform announcement is where Helmer’s network effects analysis is most relevant. Google’s stated strategy — agents that maintain context, connect to personal data sources, and build persistent user relationships — is designed to create a specific type of network effect: the user whose agent has built context about their preferences, calendar, communication patterns, and ongoing tasks creates switching costs that are fundamentally different from the switching costs created by learning a new interface. If the agent builds genuine context and that context is not portable, the user who has invested in building the agent’s knowledge base has created a real switching cost. Enterprise AI’s 3.3% penetration problem is precisely the failure to build this type of network effect: Copilot seat deployment has not translated to the context accumulation that would make the seats structurally valuable. Google’s agent architecture announcement is an attempt to solve this problem differently than Microsoft’s bundling approach.

Switching costs and cornered resources: Google’s position in search data, Android behavioral data, and Gmail communication patterns represents a cornered resource that is difficult to replicate — not because of the raw data volume but because of the temporal depth of the data. A user’s ten-year Gmail history, their decade of search behavior, their multi-year Android location and app-usage patterns — these are cornered resources in Helmer’s sense because the relevant alternative (a competitor starting to collect the same data today) would require a decade to replicate the temporal depth that determines the agent’s context quality. Berachain’s proof-of-liquidity protocol is attempting a similar cornered-resource logic at the blockchain layer: the liquidity data that accumulates in the BGT validator ecosystem becomes a resource that competing protocols cannot replicate quickly because it requires the temporal accumulation of validator behavior under the specific incentive structure.

Branding and process power: The announcements at I/O that build the least durable power are the ones that involve capability improvements without structural moat — model capability benchmarks, raw inference speed improvements, new modality support. These build competitive position in the short term but are replicated within 6–12 months at the current pace of advancement. The announcements that build the most durable power are the ones that build toward the agent-as-context-accumulator architecture, because that architecture creates the switching cost moat that makes the platform durable. On-chain private credit protocol power has the same structure: the institutional relationships and credit track record that the protocol accumulates over time are process power that competitors cannot buy — they can only build through the same accumulation process, which takes time and operational execution. Chinese open-source AI’s challenge to Google’s agentic platform is specifically at the capability layer — DeepSeek and Qwen can replicate the model capability but cannot replicate Google’s cornered data resource or the switching cost architecture of a Google account user’s decade of accumulated context. Prediction markets on Google’s enterprise AI market share through 2027 are pricing the cornered-resource and network-effect powers at a meaningful premium over competitors without those structural advantages.

Ben Rogers

Ben Rogers is Head of Growth at VaaSBlock and regular contributor, recognised for building real companies with real revenue in markets full of noise. His work sits at the intersection of growth, credibility, and emerging technology, where clear thinking and disciplined execution matter more than hype. Across his career, Ben has become known as one of the most effective growth operators working in frontier markets today.

He has scaled technology companies across continents, cultures, and time zones, from Thailand to Korea and Singapore. His leadership has helped transform early-stage products into global growth engines, including taking Travala from 200K to 8M monthly revenue and elevating Flipster into a top-tier derivatives exchange. These results were not the product of viral luck. They came from structured experimentation, high-leverage storytelling, and the ability to translate market psychology into repeatable growth systems.

As VaaSBlock’s Head of Growth, Ben leads the company’s market strategy, credibility frameworks, and research direction. He co-designed the RMA, a trust and governance standard that evaluates blockchain and emerging-tech organisations. His work bridges operational reality with strategic insight, helping teams navigate sectors where the narrative moves faster than the numbers. Ben writes about market cycles, behavioural incentives, and structural risk, offering a deeper view of how AI, SaaS, and crypto will evolve as capital becomes more disciplined.

Ben’s approach is shaped by a belief that businesses succeed when they combine clear thinking with practical execution. He works closely with founders, regulators, and institutional teams, advising on go-to-market strategy, credibility building, and sustainable growth models. His writing and research are widely read by operators looking to understand how emerging technology matures.

Originally from Australia and based in APAC, Ben is part of a global community of builders who want to see technology deliver genuine value. His work continues to shape how companies in emerging markets think about trust, growth, and long-term resilience.