Claude Fable 5 API Pricing: Input, Output, Caching, and Per-Platform Costs

Quick Answer

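Claude Fable 5 is Anthropic's most capable publicly available model, built for complex reasoning and long-running agent tasks. The official API price is $10 / 1M input tokens and $50 / 1M output tokens, so output costs 5 times as much as input. With Prompt Caching, a cache read costs $1 / 1M tokens, 90% below the standard input rate. Claude Fable 5 supports a 1M-token context window and up to 128k output tokens, and it launched on June 9, 2026. Pricing on AWS Bedrock, Google Vertex AI, and Microsoft Foundry matches the official rates, though the exact rate is whatever each platform bills. Prices listed on OpenRouter are a third-party reference only and are not official pricing. Developers in mainland China can check their API configuration with AI API Doctor and then run a small paid test through the LinkAI sign-up link.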

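What Is Claude Fable 5

Claude Fable 5 is the most capable public model Anthropic has released, aimed at complex knowledge work, long-running reasoning, and coding tasks. Key specifications:

API model ID: claude-fable-5
Context window: 1M tokens (default)
Maximum output per request: 128k tokens
Supported capabilities: multimodal input (image understanding), tool use, structured output, Prompt Caching, Batch API, Adaptive Thinking
Launch date: June 9, 2026
Available platforms: Anthropic Direct API, AWS Bedrock, Google Vertex AI, Microsoft Foundry

Output tokens cost 5 times as much as input tokens, so long replies, long code generation, and deep reasoning tasks cost noticeably more than ordinary chat.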

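Claude Fable 5 Official API Price Table

The prices below are the official standard rates for the Anthropic Direct API (effective June 9, 2026):

Billing item	Official price	Unit	Notes
Input tokens (standard input)	$10.00	per 1M tokens	Covers the prompt, system message, conversation history, and tool definitions
Output tokens (standard output)	$50.00	per 1M tokens	Covers everything the model generates: reply text, reasoning, and tool call arguments
Cache read / cache hit	$1.00	per 1M tokens	Reuses cached context at 90% below standard input; requires at least 512 tokens
Cache write, 5-minute TTL	$12.50	per 1M tokens	Cache write that expires after 5 minutes; requires at least 512 tokens
Cache write, 1-hour TTL	$20.00	per 1M tokens	Cache write that expires after 1 hour; requires at least 512 tokens
Batch API input	$5.00	per 1M tokens	Asynchronous batch processing, 50% below standard input
Batch API output	$25.00	per 1M tokens	Asynchronous batch processing, 50% below standard output
US-only inference premium	1.1x	input / output	For workloads that must run inside the United States; both input and output are multiplied by 1.1

Prices and model availability may change. Treat Anthropic's official API pricing page as authoritative.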
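As a rough illustration of how the table turns into a per-request estimate, here is a minimal Python sketch. The function name `estimate_cost` and its parameters are hypothetical, not part of any SDK. The source does not say whether the Batch discount or the US-only multiplier also applies to cache tokens, so this sketch assumes they apply to standard input and output only.

```python
# Prices in USD per 1M tokens, copied from the table above.
PRICES = {
    "input": 10.00,
    "output": 50.00,
    "cache_read": 1.00,
    "cache_write_5m": 12.50,
    "cache_write_1h": 20.00,
}
BATCH_DISCOUNT = 0.5       # Batch API: 50% off input and output
US_ONLY_MULTIPLIER = 1.1   # US-only inference premium


def estimate_cost(input_tokens=0, output_tokens=0, cache_read_tokens=0,
                  cache_write_5m_tokens=0, cache_write_1h_tokens=0,
                  batch=False, us_only=False):
    """Return an estimated request cost in USD (hypothetical helper)."""
    in_out = (input_tokens * PRICES["input"]
              + output_tokens * PRICES["output"])
    # Assumption: the discount and the premium touch input/output only.
    if batch:
        in_out *= BATCH_DISCOUNT
    if us_only:
        in_out *= US_ONLY_MULTIPLIER
    cache = (cache_read_tokens * PRICES["cache_read"]
             + cache_write_5m_tokens * PRICES["cache_write_5m"]
             + cache_write_1h_tokens * PRICES["cache_write_1h"])
    return (in_out + cache) / 1_000_000


# 200k input tokens plus 20k output tokens at standard rates.
print(estimate_cost(input_tokens=200_000, output_tokens=20_000))  # 3.0
```

The same request sent through the Batch API would come to half that amount under these assumptions.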

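How to Read Input Tokens and Output Tokens

Input tokens are the tokens sent to the model, including:

User prompts and instructions
System message (system prompt)
Conversation history and context
Tool definitions
Tool results
Cached context (billed at the lower cache read rate on a cache hit)

Output tokens are the tokens the model generates, including:

Visible reply text
Internal reasoning (Adaptive Thinking is always on for Claude Fable 5)
Tool call arguments
Field values in structured JSON output

Because Claude Fable 5 output costs 5 times as much as input, output-heavy tasks (code generation, long document analysis, multi-turn agent operations) cost far more than short replies.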
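The output list above implies that reasoning and tool call arguments are billed at the same rate as visible text. A small sketch of that arithmetic follows. The function `output_cost` and the token counts are illustrative assumptions, not measured values.

```python
OUTPUT_PRICE = 50.00  # USD per 1M output tokens, from the price table


def output_cost(visible_tokens, thinking_tokens=0, tool_argument_tokens=0):
    """Cost in USD of one response's output (hypothetical helper).

    Visible text, reasoning, and tool call arguments are all summed
    and charged at the single output rate.
    """
    billed = visible_tokens + thinking_tokens + tool_argument_tokens
    return billed * OUTPUT_PRICE / 1_000_000


# A 500-token reply on its own, then the same reply preceded by
# 4,000 reasoning tokens (an illustrative figure).
print(output_cost(500))
print(output_cost(500, thinking_tokens=4_000))
```

In this example the reasoning tokens, which the reader never sees, account for most of the output charge.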

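How Cache Pricing Affects Long-Task Costs

Prompt Caching can cut the cost of long-context tasks substantially, but only if you understand the difference between a cache write and a cache read:

Cache write: the first request writes the context to the server-side cache. It costs $12.50 (5-minute TTL) or $20 (1-hour TTL) per 1M tokens, which is 25% or 100% above the standard input rate, so the write premium has to be weighed against the savings from later cache hits.
Cache read / hit: later requests reuse the cached context for only $1 / 1M tokens, 90% cheaper than standard input.
Good fits: long-running agent tasks, multi-turn code editing, complex knowledge-base queries, and other workloads that reuse a large context repeatedly.

Note: caching requires at least 512 tokens, and the TTL (time to live) is counted from the first request. Long-running agent tasks need to budget for rewriting the cache after it expires.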
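To make the write-versus-read trade-off concrete, the sketch below computes how many cache hits are needed before caching pays for itself, using the rates from the price table. The function names are hypothetical, and the comparison assumes every reuse lands inside the TTL, so no rewrite is needed.

```python
import math

INPUT = 10.00                              # USD per 1M tokens
CACHE_READ = 1.00
CACHE_WRITE = {"5m": 12.50, "1h": 20.00}


def break_even_reads(ttl):
    """Smallest number of cache hits that recovers the write premium."""
    extra_write_cost = CACHE_WRITE[ttl] - INPUT   # premium paid once
    saving_per_read = INPUT - CACHE_READ          # saved on every hit
    return math.ceil(extra_write_cost / saving_per_read)


def cached_vs_uncached(prefix_tokens, requests, ttl="5m"):
    """USD cost of sending the same prefix `requests` times.

    Returns (uncached, cached). The cached path is one write followed
    by `requests - 1` hits, all assumed to fall within the TTL.
    """
    uncached = prefix_tokens * requests * INPUT / 1_000_000
    cached = prefix_tokens * (CACHE_WRITE[ttl]
                              + (requests - 1) * CACHE_READ) / 1_000_000
    return uncached, cached


print(break_even_reads("5m"))   # 1
print(break_even_reads("1h"))   # 2
print(cached_vs_uncached(100_000, 10, "5m"))
```

Under these rates a 5-minute cache pays off on the first hit, while a 1-hour cache needs two hits before it is cheaper than resending the context.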

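Pricing and Availability by Platform

Claude Fable 5 launched on June 9, 2026 and is available on the following platforms:

Platform	Availability	Price	Notes
Anthropic Direct API	✅ Available	$10 / $50 per 1M (official)	First-party source; supports Prompt Caching, Batch API, and Adaptive Thinking
AWS Bedrock	✅ Available	$10 / $50 per 1M (same as official)	Supports global and regional endpoints; note the 30-day data retention requirement; check the AWS console for region-specific pricing
Google Vertex AI	✅ Available	$10 / $50 per 1M (same as official)	Supports global, multi-region, and regional endpoints; check the Google Cloud pricing page for exact pricing
Microsoft Foundry	✅ Available	$10 / $50 per 1M (same as official)	Billed through Microsoft's billing system; check the Azure AI Foundry pricing page for exact pricing
OpenRouter	Available (third party)	Reference only	OpenRouter prices are a third-party reference, not Anthropic's official pricing; they include additional routing and service fees

Cloud platform list prices match Anthropic's official rates but are settled under each cloud vendor's regional billing rules. For region-specific pricing, see the vendor's pricing page or console.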

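Cost Difference Between Claude Fable 5 and Opus / Sonnet

Claude Fable 5 sits at the top capability tier of the Claude family and carries the highest price. Compared with Opus 4.8:

Billing item	Claude Fable 5	Claude Opus 4.8
Input tokens	$10 / 1M	$5 / 1M
Output tokens	$50 / 1M	$25 / 1M
Cache read	$1 / 1M	$0.50 / 1M
5m cache write	$12.50 / 1M	$6.25 / 1M
Batch input	$5 / 1M	$2.50 / 1M
Batch output	$25 / 1M	$12.50 / 1M

Every Claude Fable 5 rate is 2 times the corresponding Opus 4.8 rate. Whether the cost is justified depends on whether the task needs Fable 5's extra capability, how many output tokens it consumes, and the business value of the result. Start with small test requests, record the usage, and decide from that data.

For more Claude model prices, see the Claude API pricing page.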
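One way to apply the comparison table to a recorded workload is sketched below. The dictionary keys are display names, not API model IDs, and the helper `workload_cost` and the sample token counts are hypothetical.

```python
# USD per 1M tokens, from the comparison table above.
RATES = {
    "Claude Fable 5": {"input": 10.00, "output": 50.00,
                       "cache_read": 1.00, "cache_write_5m": 12.50},
    "Claude Opus 4.8": {"input": 5.00, "output": 25.00,
                        "cache_read": 0.50, "cache_write_5m": 6.25},
}


def workload_cost(model, tokens):
    """USD cost of a token breakdown under one model's rates."""
    rates = RATES[model]
    total = sum(tokens.get(item, 0) * rate for item, rate in rates.items())
    return total / 1_000_000


# Illustrative usage totals, e.g. summed from a day of test requests.
workload = {"input": 300_000, "output": 40_000, "cache_read": 500_000}

for model in RATES:
    print(model, workload_cost(model, workload))
```

Because every rate doubles, the ratio stays at 2 for any identical token mix. The totals only diverge from that ratio if the two models consume different numbers of tokens on the same task, which is what the small test requests are meant to measure.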

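Why Claude Code / Agent Tasks Need to Watch Fable 5 Costs

The high output token price of Claude Fable 5 has a significant effect on agent and Claude Code workloads:

Long context: coding tasks involve many file reads, tool results, and conversation history, so input token usage is high.
Long output: output costs 5 times as much as input, which amplifies the cost of code generation and long documents.
Tool call amplification: in agent tasks, the arguments of every tool call are billed as output tokens, and each tool result is sent back as input tokens on the following request, so frequent calls add up quickly.
Multi-turn loops: Claude Code's multi-turn edit loop accumulates a large number of output tokens.
Adaptive Thinking: Fable 5's always-on Adaptive Thinking is billed entirely as output tokens, so it costs more than a model that only emits the response.

For details on token costs in Claude Code scenarios, see the Claude Code token cost page.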
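The compounding effect of a multi-turn loop is easier to see in a small simulation. The sketch below is a simplified model, not a description of how Claude Code actually manages context: it assumes the full history is resent on every turn, that each turn adds a fixed amount of output and tool results, and that with caching enabled every turn falls within the 5-minute TTL. The function name and all token counts are hypothetical.

```python
INPUT, OUTPUT = 10.00, 50.00             # USD per 1M tokens
CACHE_READ, CACHE_WRITE_5M = 1.00, 12.50


def agent_run_cost(turns, prefix_tokens, output_per_turn,
                   tool_result_per_turn, use_cache=False):
    """Estimated USD cost of an agent loop that resends its history."""
    context = prefix_tokens   # tokens sent as input on the current turn
    cached = 0                # tokens already written to the cache
    total = 0.0
    for _ in range(turns):
        if use_cache:
            # Earlier context is read from cache; only new tokens are written.
            new_tokens = context - cached
            total += cached * CACHE_READ + new_tokens * CACHE_WRITE_5M
            cached = context
        else:
            total += context * INPUT
        total += output_per_turn * OUTPUT
        # The model's output and the tool result join the history.
        context += output_per_turn + tool_result_per_turn
    return total / 1_000_000


args = dict(turns=20, prefix_tokens=20_000,
            output_per_turn=1_000, tool_result_per_turn=2_000)
print(agent_run_cost(**args))
print(agent_run_cost(**args, use_cache=True))
```

With these illustrative numbers the run costs about $10.70 without caching and about $2.86 with it. Nearly all of the uncached cost comes from resending the growing history as input, not from the output itself.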

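How Users in Mainland China Can Check Their Actual API Configuration

The Anthropic API is not directly reachable from mainland China, so Claude Fable 5 cannot be called directly. To test it:

Use AI API Doctor to check that the Base URL, API Key, and model name are configured correctly
Confirm that the API Key has permission to access the claude-fable-5 model
Send a small request, record the returned usage, and check it against the amount charged
If you need a relay service, sign up through the LinkAI registration link to claim a small credit for testing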
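The third step, checking recorded usage against the charge, can be sketched as follows. The field names `input_tokens`, `output_tokens`, `cache_creation_input_tokens`, and `cache_read_input_tokens` follow the usage object of the Anthropic Messages API. The fallback to `prompt_tokens` and `completion_tokens` assumes an OpenAI-compatible relay. The helper names, the 1% tolerance, and the choice to price all cache writes at the 5-minute rate are assumptions made for illustration, and relays that add their own markup will not match official rates.

```python
# Official rates in USD per 1M tokens, from the price table.
PRICES = {"input": 10.00, "output": 50.00,
          "cache_write_5m": 12.50, "cache_read": 1.00}


def expected_charge(usage):
    """Expected USD charge for one response's usage record."""
    input_tokens = usage.get("input_tokens", usage.get("prompt_tokens", 0))
    output_tokens = usage.get("output_tokens",
                              usage.get("completion_tokens", 0))
    cache_write = usage.get("cache_creation_input_tokens", 0)
    cache_read = usage.get("cache_read_input_tokens", 0)
    total = (input_tokens * PRICES["input"]
             + output_tokens * PRICES["output"]
             + cache_write * PRICES["cache_write_5m"]   # assumes 5m TTL
             + cache_read * PRICES["cache_read"])
    return total / 1_000_000


def charge_matches(usage, billed_usd, tolerance=0.01):
    """True if the billed amount is within `tolerance` of the estimate."""
    expected = expected_charge(usage)
    return abs(billed_usd - expected) <= tolerance * max(expected, 1e-9)


usage = {"input_tokens": 1_200, "output_tokens": 300}
print(expected_charge(usage))
print(charge_matches(usage, billed_usd=0.027))
```

A mismatch is not proof of an error on its own. It is a prompt to look at the request_id, the cache records, and the console bill for that request.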

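Sources

Anthropic: Claude Fable product page (official model source)
Anthropic: Claude API Pricing docs (official API pricing source)
Anthropic: Introducing Claude Fable 5 and Claude Mythos 5 (official model documentation)
Anthropic: Claude Fable 5 and Claude Mythos 5 launch post (official launch announcement)
AWS Bedrock: Claude Fable 5 (official cloud vendor source; regional pricing per the AWS console)
Google Cloud Vertex AI (official cloud vendor source; regional pricing per the GCP pricing page)
Microsoft Azure AI Foundry (official cloud vendor source; regional pricing per the Azure console)
OpenRouter: Claude Fable 5 (third-party price reference, not official pricing)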

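Pricing and availability note: Claude Fable 5 API prices and model availability may change. Check Anthropic's official documentation, the model list in your provider console, and the latest price updates regularly, and treat the provider's official information as authoritative. To judge whether a charge is correct, combine the request_id, usage, raw quota, completion_tokens, cache records, stream status, and the billing records in the console.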

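Run an API Check Before You Start Testing

Not sure whether the API works or whether it will charge you? Use AI API Doctor first to check the Base URL, API Key, model permissions, and the returned usage.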
