Google Agent Platform (Gemini Enterprise Agent Platform)

tags: [AI, Agent, 平台, Google, 治理, 架构, 行业格局]
created: 2026-04-28
updated: 2026-05-08
sources: [raw/notes/google-agent-platform-research-2026-04-28.md, raw/notes/google-memory-bank-research-2026-04-28.md, raw/notes/chijianqiang-idc-maas-token-economy-2026-05-08.md, https://cloud.google.com/blog/products/ai-machine-learning/introducing-gemini-enterprise-agent-platform, https://docs.cloud.google.com/gemini-enterprise-agent-platform/overview, https://docs.cloud.google.com/gemini-enterprise-agent-platform/govern/gateways/agent-gateway-overview, https://cloud.google.com/blog/topics/google-cloud-next/google-cloud-next-2026-wrap-up, https://docs.cloud.google.com/iam/docs/agent-identity-overview, https://blog.google/innovation-and-ai/infrastructure-and-cloud/google-cloud/eighth-generation-tpu-agentic-era/]

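Definition

An end-to-end platform for building, running, governing, and optimizing agents, announced at Google Cloud Next (Las Vegas) on 2026-04-22. It takes over from Vertex AI, and the change is more than a rebrand: Vertex AI ends as a standalone product. In Google Cloud's own words: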

all Vertex AI services and roadmap evolutions will be delivered exclusively through the Agent Platform, rather than as a standalone service.

Together, 4 pillars × 18 components (Build / Scale / Govern / Optimize), cross-cutting components (Model Garden with 200+ models / Workspace Studio / the A2A protocol / Managed MCP), and supporting infrastructure (TPU 8t/8i / Cross-Cloud Infrastructure / Agentic Data Cloud) form a complete agent stack.

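Why this deserves its own page

google-memory-bank is a deep dive into one component of this platform (Memory Bank, in the Scale pillar), but treating the part as the whole loses several key judgments:

The platform as a whole is the strongest single piece of evidence for platform-layer-collapse: 18 agent tools are absorbed into one platform, squeezing the room left for third-party agent infra companies
The Govern pillar (Identity / Registry / Gateway) is the next "must-have" for enterprise customers; Sentino currently lacks the entire pillar
ADK's move to graph-based sub-agent networks reflects the industry shift from "single agent + tool-calling" to "orchestrated multi-agent", forming a three-way fork with orchestration-free-agents and amphiflow-pattern
Model Garden's "open model choice" points the same way as Sentino's "no LLM vendor lock-in" stance, but for a different motive
TPU 8t/8i continues the training-inference split documented in google-tpu-v8: Google has not walked back its dual-SKU route and has refined it further

This entry gathers these cross-concept links at the parent node, so that component pages such as google-memory-bank only need to cover the specific capability.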

Timeline

Date	Event
2025-07-08	Memory Bank enters public preview under Vertex AI Agent Engine (it later becomes a sub-component of the Scale pillar)
~2025-12	Memory Bank and Sessions reach GA together
2026-01-28	Memory Bank and Sessions begin formal billing at $0.25 per 1K events
2026-04-21	google-adk released on PyPI (includes the graph-based subagent upgrade)
2026-04-22	Google Cloud Next 26 (Las Vegas): Vertex AI as a whole is renamed Gemini Enterprise Agent Platform; the 4 pillars × 18 components are formally announced
2H 2026	TPU 8t/8i preview
late 2027	TPU 8t/8i GA; Anthropic's 3.5 GW of compute comes online
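The billing rate in the timeline lends itself to a quick cost estimate. The sketch below is only arithmetic on the $0.25 per 1K events figure; it assumes strictly linear pricing with no free tier or volume discount, and the workload size is hypothetical. What counts as a billable "event" is not defined in this page.

```python
from decimal import Decimal

# Rate taken from the timeline above: $0.25 per 1,000 events.
RATE_PER_1K_EVENTS = Decimal("0.25")


def estimate_cost(events: int) -> Decimal:
    """Estimated charge in USD, assuming purely linear pricing (an assumption)."""
    if events < 0:
        raise ValueError("events must be non-negative")
    return Decimal(events) / Decimal(1000) * RATE_PER_1K_EVENTS


# Hypothetical workload: 2,000,000 events in one month.
monthly_cost = estimate_cost(2_000_000)
print(f"estimated monthly cost: ${monthly_cost}")
```

Using Decimal rather than float keeps currency arithmetic exact, which matters once estimates are summed across many tenants.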

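4 Pillars × 18 Components: Skeleton

Build

Component	Role
Agent Studio	Low-code visual building (drag-and-drop)
ADK (Agent Development Kit)	Code-first; major 2026 upgrade to graph-based sub-agent networks (previously linear, now an explicit DAG orchestrating collaboration between sub-agents); ADK v1.0 stable across four languages
Agent Garden	Prebuilt agent templates (finance / code modernization / customer service / data analysis / creative tasks)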

Scale (running at scale)

Component	Role
Agent Runtime	Sub-second cold starts; long-running agents that run for multiple days
Agent Sandbox	Hardened execution environment for model-generated code
Memory Bank	Long-term memory service (see its own page)
Memory Profiles	Memory Bank sub-capability: schema-based structured profiles (defined with Pydantic) + low-latency recall + scope isolation
Agent Sessions	Tracking by custom session ID
Bidirectional Streaming	WebSocket protocol
Agent-to-agent orchestration	A2A protocol v1.0, in production at 150+ orgs
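To make "schema-based structured profiles + scope isolation" concrete, here is a toy sketch. It is not the Memory Bank API: the source says profiles are defined with Pydantic, while this example swaps in standard-library dataclasses so it runs without dependencies. The class names, field names, and scope strings are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class UserProfile:
    """Hypothetical structured profile; the fields are illustrative only."""
    preferred_language: str
    timezone: str
    interests: Tuple[str, ...] = ()


class ProfileStore:
    """Toy in-memory store that keeps one profile per scope."""

    def __init__(self) -> None:
        self._by_scope: Dict[str, UserProfile] = {}

    def put(self, scope: str, profile: UserProfile) -> None:
        self._by_scope[scope] = profile

    def get(self, scope: str) -> Optional[UserProfile]:
        # A lookup never crosses scopes: an unknown scope yields None.
        return self._by_scope.get(scope)


store = ProfileStore()
store.put("user:alice", UserProfile("en", "Europe/Berlin", ("hiking",)))
print(store.get("user:alice"))
print(store.get("user:bob"))
```

The point of a fixed schema is that recall returns typed fields rather than free text, so downstream code can read `profile.timezone` directly instead of parsing a memory string.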

Govern ⭐ Sentino currently lacks the entire pillar

Component	Role	Key technology
Agent Identity	A unique cryptographic ID per agent, usable as an IAM principal	SPIFFE standard + X.509 certificates; mTLS (end-to-end cert-bound tokens) + DPoP (Demonstrating Proof of Possession); no Service Account impersonation / no long-lived keys; Credential Vault manages third-party OAuth
Agent Registry	Central catalog indexing every agent / tool / skill	Includes third-party MCP servers; registers automatically from deployments
Agent Gateway	Described by Thomas Kurian as "air traffic control for your agent ecosystem"	Preventative enforcement (blocks an action before it executes); multi-protocol support for MCP / A2A / REST / gRPC; only MCP supports authorization policy conditions based on protocol attributes; least-privilege by default; Agent-to-Anywhere (egress) mode
Model Armor	Protection against prompt injection / data leakage	Inline in the Gateway
Agent Anomaly Detection	Real-time detection of anomalous behavior	Statistical models + LLM-as-a-judge
Agent Threat Detection	Visibility into malicious activity	Integrated with Security Command Center
Agent Security Dashboard	Security Command Center integration	n/a

Industry-commentary analogies (not Google's official wording):
- Agent Identity ≈ "Active Directory for AI era"; Entra ID in Microsoft Agent 365 points the same way
- Agent Registry ≈ "container registry for agents"
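The Agent Gateway behaviors listed in the table (preventative enforcement, least-privilege by default, attribute conditions only for MCP) can be illustrated with a small default-deny check. This is a conceptual sketch, not Google's policy language or API: `Request`, `Rule`, and both functions are hypothetical names invented for the example.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Request:
    agent_id: str
    protocol: str  # "mcp", "a2a", "rest", or "grpc"
    target: str
    attributes: Dict[str, str] = field(default_factory=dict)


@dataclass
class Rule:
    agent_id: str
    protocol: str
    target: str
    # Optional condition on protocol attributes; honored only for MCP here.
    required_attributes: Dict[str, str] = field(default_factory=dict)


def is_allowed(request: Request, rules: List[Rule]) -> bool:
    for rule in rules:
        same_route = (rule.agent_id, rule.protocol, rule.target) == (
            request.agent_id, request.protocol, request.target)
        if not same_route:
            continue
        if rule.required_attributes:
            if request.protocol != "mcp":
                continue  # attribute conditions are not evaluated for other protocols
            if any(request.attributes.get(k) != v
                   for k, v in rule.required_attributes.items()):
                continue
        return True
    return False  # least privilege: no matching rule means deny


def call_through_gateway(request: Request, rules: List[Rule],
                         action: Callable[[], str]) -> str:
    # Preventative enforcement: the check happens before the action runs.
    if not is_allowed(request, rules):
        raise PermissionError(f"{request.agent_id} -> {request.target} denied")
    return action()


rules = [Rule("billing-agent", "mcp", "crm-server", {"tool": "lookup_invoice"})]
ok = Request("billing-agent", "mcp", "crm-server", {"tool": "lookup_invoice"})
print(call_through_gateway(ok, rules, lambda: "invoice found"))
```

Because the decision is made before `action` is invoked, a denied request leaves no side effects, which is the difference between preventative enforcement and after-the-fact auditing.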

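Optimize ⭐ Sentino currently lacks the entire pillar

Component	Role
Agent Simulation	Synthetic user interactions + stress tests against virtual tools
Agent Evaluation	Continuous scoring (vs real traffic)
Agent Observability	Visualized reasoning traces + visual tracing + debugging
Agent Optimizer	Failure clustering + automatic suggestions for improved instructions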

Cross-cutting components

Model Garden: 200+ models, including the Anthropic Claude family (Opus / Sonnet / Haiku; the 04-22 announcement did not go as far as naming version 4.7) / Gemma family / Llama / DeepSeek (R1 / V3 / V3-0324 with H200 + vLLM) / Mistral Small 3.1
Workspace Studio: no-code agent builder
Project Mariner: web-browsing agent
Managed MCP servers + Apigee: API-to-agent bridge
A2A protocol v1.0: production at 150+ organizations
Partner agents: Box / Workday / Salesforce / ServiceNow / SAP have committed to native support

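Supporting infrastructure (announced the same day, 2026-04-22)

TPU 8t (training, codename Sunfish, with Broadcom)

9,600-chip superpod, 121 FP4 EFLOPS per pod
8 stacks of 12-high HBM3e (30% higher memory bandwidth vs Ironwood)
At matched precision, the gain over TPUv7 is smaller than the claimed 2.8x (the comparison swaps FP4 for FP8; see ai-vendor-spec-traps)
TSMC 2nm; preview 2H 2026 / GA late 2027
Anthropic is the anchor customer: 3.5 GW of compute coming online in 2027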
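The pod-level figure and the matched-precision caveat above can be checked with simple arithmetic. In this sketch the pod numbers and the 2.8x claim come from the list above, while the 2x FP4-to-FP8 peak-throughput ratio is an assumption used purely for illustration; the source does not state the actual ratio.

```python
POD_FP4_EFLOPS = 121      # from the list above
CHIPS_PER_POD = 9_600     # from the list above
CLAIMED_SPEEDUP = 2.8     # claimed gain over TPUv7 (FP4 vs FP8)


def per_chip_pflops(pod_eflops: float, chips: int) -> float:
    """Convert a pod-level EFLOPS figure into PFLOPS per chip."""
    return pod_eflops * 1_000 / chips


def matched_precision_speedup(claimed: float, fp4_over_fp8: float) -> float:
    """Remove the precision advantage from a cross-precision speedup claim."""
    return claimed / fp4_over_fp8


print(f"8t per chip: {per_chip_pflops(POD_FP4_EFLOPS, CHIPS_PER_POD):.1f} FP4 PFLOPS")

# ASSUMPTION: FP4 peak throughput is 2x FP8 on the same silicon.
print(f"matched-precision gain: {matched_precision_speedup(CLAIMED_SPEEDUP, 2.0):.2f}x")
```

Under that assumed ratio the like-for-like gain would be about half the headline number, which is the kind of gap the note on FP4 vs FP8 points at.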

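TPU 8i (inference + RL, codename Zebrafish, with MediaTek)

80% better perf-per-dollar; targets agentic workflows + MoE
Single compute die + 6 stacks of HBM3e
10.1 FP4 PFLOPS per chip, 384 MB on-chip SRAM (3x prior gen), 288 GB HBM, 8.6 TB/s memory bandwidth
20-30% lower cost than 8t
MediaTek's share price hit a record (TWD 2,090, at the daily limit), reflecting market expectations for commercialization of Google's inference chips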
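A few derived ratios help read the 8i spec line for inference workloads. The sketch below only divides the figures quoted above; it assumes the 288 GB is spread evenly across the 6 HBM stacks and uses decimal units (1 TB = 1e12 bytes), neither of which the source confirms.

```python
PEAK_FP4_FLOPS = 10.1e15   # 10.1 PFLOPS per chip, from the list above
HBM_BYTES = 288e9          # 288 GB, decimal units assumed
HBM_BANDWIDTH = 8.6e12     # 8.6 TB/s, decimal units assumed
HBM_STACKS = 6

# Capacity per stack, assuming an even split across stacks.
gb_per_stack = HBM_BYTES / HBM_STACKS / 1e9

# FLOPs available per byte moved from HBM at peak; workloads with lower
# arithmetic intensity than this are bandwidth-bound rather than compute-bound.
flops_per_byte = PEAK_FP4_FLOPS / HBM_BANDWIDTH

# Time to stream the full HBM contents once, a rough floor on the latency of
# any step that has to touch all resident weights.
full_sweep_ms = HBM_BYTES / HBM_BANDWIDTH * 1e3

print(f"{gb_per_stack:.0f} GB per stack")
print(f"{flops_per_byte:.0f} FLOPs per HBM byte at peak")
print(f"{full_sweep_ms:.1f} ms to read all of HBM once")
```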

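→ The 8t/8i dual SKU continues the "training-inference hardware split" described in google-tpu-v8. v8 was already dual-SKU (v8t / v8i), and 8t/8i builds on it: the industry direction in training-inference-bifurcation is confirmed, with no retreat from the route.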

Other supporting pieces

Cross-Cloud Infrastructure: Agent Gateway / Cloud Network Insights / NGFW / Cloud Armor
Agentic Data Cloud: Knowledge Catalog evolves from Dataplex Universal Catalog
$240B backlog (exiting Q4 2025)

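Strategic implications

Signal 1: an official vocabulary for the platform category appears

Google defines an agent platform as 4 pillars × 18 components. Whenever sentino-agent says "we are an agent platform too", this is the reference frame it has to be compared against. The 4-pillar vocabulary can be borrowed to describe Sentino:

Pillar	Sentino today
Build	Standalone Agent + tool-style integration mode + Agora ConvoAI integration
Scale	Memex + Agent Loop + workflow engine
Govern	Entire pillar currently missing (the same kind of signal as ③ Revision API + IAM in 2026-04-28-memex-v2-design-options)
Optimize	Entire pillar currently missing (no Simulation / Evaluation / Observability / Optimizer)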

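Signal 2: the Govern trio is the next "must-have" for enterprise customers

Agent Identity / Registry / Gateway, together with Entra ID in Microsoft Agent 365, mark the direction in which the industry is standardizing. Sentino cannot catch up in the short term, but it needs an answer when enterprise customers ask; at minimum, sales materials should include a "governance roadmap" section. Points to watch:

Whether customer compliance integration with SPIFFE / X.509 / mTLS becomes a hard requirement
Whether agent behavior audit requirements get pushed down to middle-layer vendors such as Sentino
The Memory revisions API (already recorded as P1 in 2026-04-28-memex-v2-design-options) is the minimum viable starting point for a Sentino Govern pillar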

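Signal 3: ADK moves to graph-based sub-agent networks

This answers LangGraph and reflects the 2026 industry shift from "single agent + tool-calling" to "orchestrated multi-agent". It runs opposite to the orchestration-free-agents idea of "file locks + Git instead of orchestration": Google is betting that explicit orchestration is right. Three paths coexist in the industry in 2026:

Path	Representative	Philosophy
Explicit graph orchestration	Google ADK 2026 / LangGraph	Collaboration between sub-agents needs DAG orchestration
No orchestration (plain infrastructure)	orchestration-free-agents, Carlini's C compiler	File locks + Git + a self-organizing task pool
Amphiflow (workflow by default + agent fallback)	amphiflow-pattern, Zhang Tielei (张铁蕾)	Workflows freeze the common paths + agents catch the exceptions

Sentino today is single agent + tool-style integration + the Standalone Agent data-producer mode, with no explicit graph-based sub-agent orchestration. Which path to take should be decided once customer scenario requirements are clear.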
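To show what "explicit graph orchestration" means in practice, here is a minimal sketch of running sub-agents in DAG order. It does not use the ADK or LangGraph APIs: sub-agents are plain Python callables, the agent names are hypothetical, and ordering comes from the standard-library `graphlib` module (Python 3.9+).

```python
from graphlib import TopologicalSorter
from typing import Callable, Dict, Set

# A sub-agent takes the task plus the outputs of its upstream agents.
SubAgent = Callable[[str, Dict[str, str]], str]


def run_graph(agents: Dict[str, SubAgent],
              dependencies: Dict[str, Set[str]],
              task: str) -> Dict[str, str]:
    """Run each sub-agent after all of its upstream agents have finished."""
    results: Dict[str, str] = {}
    # static_order() yields every node with predecessors first and raises
    # graphlib.CycleError if the dependency graph is not a DAG.
    for name in TopologicalSorter(dependencies).static_order():
        upstream = {dep: results[dep] for dep in dependencies.get(name, ())}
        results[name] = agents[name](task, upstream)
    return results


# Hypothetical three-step pipeline: research -> draft -> review.
agents: Dict[str, SubAgent] = {
    "research": lambda task, up: f"notes on {task}",
    "draft": lambda task, up: f"draft from [{up['research']}]",
    "review": lambda task, up: f"approved: {up['draft']}",
}
dependencies = {"draft": {"research"}, "review": {"draft"}}

results = run_graph(agents, dependencies, "TPU pricing")
print(results["review"])
```

The contrast with the no-orchestration path is that the dependency map is declared up front and enforced by the runner, instead of emerging from agents claiming tasks through file locks.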

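Signal 4: Model Garden's 200+ models = differentiation through "open model choice"

Google is betting on "open model choice" as its differentiator vs Anthropic (tied to Claude) and OpenAI (tied to GPT). Sentino's "no LLM vendor lock-in" stance (invariant 1 of sentino-memex) points the same way as Google's Model Garden strategy, but for a different motive:

Google's motive: infrastructure is the real lock-in (TPU + GCP + data), and open model choice reinforces full-stack stickiness
Sentino's motive: as a middle layer it cannot be locked in by any model vendor, so multi-model support is a condition of survival

Same direction, different motives: short-term cooperation between the two is possible (Sentino connecting to Gemini and Anthropic Claude on Vertex AI), but once Google sells Memory Bank and the Govern trio as "first-class citizens of the full stack", Sentino will face structural pressure with GCP full-stack customers.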

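Mapping to existing wiki pages

Existing page	Relationship
google-memory-bank	One component of Agent Platform (Scale pillar); the earlier research treated the part as the whole
claude-agent-sdk	Same category: an agent SDK from a model vendor; ADK is Google's counterpart to the Claude Agent SDK
agent-runtime-architecture	Claude Code's 7 layers vs Google AP's 18 components: two "agent runtimes" at different levels of abstraction
platform-layer-collapse	Strongest evidence: 18 agent tools absorbed into a single platform
agent-creator-refiner-operator	Agent Platform is a Creator + Refiner platform (not directly an Operator); Garden templates + ADK let enterprises build their own Operators
mcp-protocol	Among the protocols Agent Gateway handles, MCP is the only one that supports authorization policy conditions based on protocol attributes
google-tpu-v8	TPU 8t/8i continues the v8 training-inference split
training-inference-bifurcation	v8 → 8t/8i continues the split; the industry direction is confirmed
2026-04-28-memex-v2-design-options	The Govern trio is a fuller reference for borrowable point ③ Revision API + IAM
orchestration-free-agents	ADK's graph-based sub-agent networks are industry evidence pointing the opposite way
agora-convoai-join-api	Agora ConvoAI builds mcp_servers into the LLM config layer = MCP reaching Layer 3; Agent Gateway is multi-protocol policy enforcement one level higher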

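Scope and limits

Not a direct Sentino competitor: Sentino is a middle-layer agent platform + voice IoT vertical, while Google AP is GCP's full-stack infrastructure layer. The two overlap on the concept of "agent platform", but customer scenarios, pricing, and control differ completely
How to judge when a customer brings up Google AP: if the customer is a GCP full-stack customer (already on Gemini / Vertex / Workspace), Google AP is a reasonable first-class choice; if the customer is hybrid-cloud / multi-model / sensitive to data sovereignty, Sentino's self-built path still makes sense
Do not treat the 4 pillars as generic terminology: it is Google's commercial framework. Borrowing it as a reference for internal alignment and sales collateral is fine, but Sentino should not passively let Google's vocabulary define it externally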

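China counterpart: Alibaba Token Hub (ATH)

On 2026-03-16 Alibaba set up the ATH business group, reporting directly to CEO Wu Yongming (吴泳铭). It is the Chinese "full-stack AI framework" comparable to Google AP, announced about five weeks before Google AP itself. The two frameworks cut along different axes:

Dimension	Google Agent Platform	Alibaba Token Hub
Partitioning axis	4 pillars × 18 components (Build / Scale / Govern / Optimize)	Three stages of the token (creation / delivery / application)
Core abstraction	Agent platform	Token economy
Model strategy	Model Garden, 200+ models, open	Tongyi Lab models promoted first + third-party compatible
Govern handling	Govern is its own pillar (the Identity / Registry / Gateway trio)	Govern is not a separate part of the three stages, a clear gap in Alibaba's scheme
Customer layer	Native integration with Box / Workday / Salesforce / ServiceNow	DingTalk's 700 million users + Wukong (悟空) B2B workflows
Date	2026-04-22 Cloud Next	2026-03-16 official Alibaba announcement
Trigger	Consolidation of the Vertex AI business + platform war with Microsoft / OpenAI	Responding to ByteDance's two-pronged push (Doubao + Seed) + repeated upheaval in the Qwen team (Zhou Chang 周畅 in 2024 / Lin Junyang 林俊旸 in 2026-03, left for ByteDance)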

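Wu Yongming's claim: "Tokens are the electricity of the future". It renames the token from a "technical unit" to an "economic commodity" and likens the position of a MaaS platform to the "power grid". This is naming-shapes-thinking applied to China's industry: same direction as Google's "4 pillars" vocabulary (naming shapes the industry framework), but from a different angle.

Key judgment: the difference between the two frameworks is not about "which is more right"; it reflects how each country's AI industry is organized:
- Google AP starts from a model company extending into cloud, so it partitions by "agent platform" (emphasis on orthogonal components)
- Alibaba ATH starts from a cloud company and a model company being one and the same, so it partitions by "token economy" (emphasis on lifecycle + billing)

In the mid-market AI deployment race, the two frameworks form a dual track of "US model vs China model": the US goes through Google AP / Anthropic / OpenAI (model companies extending into cloud + roll-up via PE channels), China through ATH / Volcano Engine (cloud and model company naturally unified + aggregation at the top MaaS providers). Same direction, different paths.

See china-maas-market-2026 for details.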

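To watch

Actual enterprise adoption of ADK's graph-based sub-agent orchestration (vs LangGraph)
Whether Agent Identity's SPIFFE + X.509 becomes the RFP standard for B2B agent integrations
When TPU 8t reaches GA in late 2027, revisit how Anthropic's 3.5 GW deployment affects the Claude model roadmap
The first public enterprise customer case after Memory Bank GA (Payhawk is already an early proof point; more are needed)
Whether Sentino's sales materials should add a "governance roadmap" section: the answer framework for when customers ask about the Govern trio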
