<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Token 预算怎么拆，别让系统提示被业务内容挤掉]]></title><description><![CDATA[<p dir="auto">大家做多轮问答时 token 预算怎么拆？我们现在就是超了就从最早消息开始删，感觉偶尔会丢关键约束。</p>
]]></description><link>https://localaihub.com/topic/81/token-预算怎么拆-别让系统提示被业务内容挤掉</link><generator>RSS for Node</generator><lastBuildDate>Wed, 03 Jun 2026 19:44:25 GMT</lastBuildDate><atom:link href="https://localaihub.com/topic/81.rss" rel="self" type="application/rss+xml"/><pubDate>Mon, 04 May 2026 18:55:00 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Tue, 05 May 2026 15:28:00 GMT]]></title><description><![CDATA[<p dir="auto">对，先能解释每次预算决策，再谈优化。</p>
]]></description><link>https://localaihub.com/post/498</link><guid isPermaLink="true">https://localaihub.com/post/498</guid><dc:creator><![CDATA[林小北]]></dc:creator><pubDate>Tue, 05 May 2026 15:28:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Tue, 05 May 2026 12:50:00 GMT]]></title><description><![CDATA[<p dir="auto">我打算把上下文分成必保、可压缩、可丢三档。日志里记录每次删了什么，不在用户界面显示。</p>
]]></description><link>https://localaihub.com/post/497</link><guid isPermaLink="true">https://localaihub.com/post/497</guid><dc:creator><![CDATA[橘子汽水]]></dc:creator><pubDate>Tue, 05 May 2026 12:50:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Tue, 05 May 2026 10:46:00 GMT]]></title><description><![CDATA[<p dir="auto">摘要里可以有“情绪状态=强烈不满，已道歉一次，不要重复解释政策”。不是只存事实。</p>
]]></description><link>https://localaihub.com/post/496</link><guid isPermaLink="true">https://localaihub.com/post/496</guid><dc:creator><![CDATA[nora]]></dc:creator><pubDate>Tue, 05 May 2026 10:46:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Tue, 05 May 2026 08:16:00 GMT]]></title><description><![CDATA[<p dir="auto">但摘要会不会丢情绪？客服投诉里情绪很重要。</p>
]]></description><link>https://localaihub.com/post/495</link><guid isPermaLink="true">https://localaihub.com/post/495</guid><dc:creator><![CDATA[普通网友A]]></dc:creator><pubDate>Tue, 05 May 2026 08:16:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Tue, 05 May 2026 07:21:00 GMT]]></title><description><![CDATA[<p dir="auto">历史消息我建议存结构化状态：用户身份、已确认事实、未解决问题、风险边界。聊天原文只在需要追溯时检索。</p>
]]></description><link>https://localaihub.com/post/494</link><guid isPermaLink="true">https://localaihub.com/post/494</guid><dc:creator><![CDATA[阿远]]></dc:creator><pubDate>Tue, 05 May 2026 07:21:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Tue, 05 May 2026 07:12:00 GMT]]></title><description><![CDATA[<p dir="auto">面向用户别说 token。可以说“我会基于当前问题和关键记录回答”。内部日志再记压缩比例。</p>
]]></description><link>https://localaihub.com/post/493</link><guid isPermaLink="true">https://localaihub.com/post/493</guid><dc:creator><![CDATA[葡萄冰]]></dc:creator><pubDate>Tue, 05 May 2026 07:12:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Tue, 05 May 2026 04:10:00 GMT]]></title><description><![CDATA[<p dir="auto">预算表要不要暴露到前端？比如告诉用户“内容太长已压缩”。</p>
]]></description><link>https://localaihub.com/post/492</link><guid isPermaLink="true">https://localaihub.com/post/492</guid><dc:creator><![CDATA[小吴]]></dc:creator><pubDate>Tue, 05 May 2026 04:10:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Tue, 05 May 2026 02:58:00 GMT]]></title><description><![CDATA[<p dir="auto">能用 prompt caching 的静态部分就拆出来。系统提示、工具说明、固定政策，这些别每次都当新内容烧。</p>
]]></description><link>https://localaihub.com/post/491</link><guid isPermaLink="true">https://localaihub.com/post/491</guid><dc:creator><![CDATA[rootless]]></dc:creator><pubDate>Tue, 05 May 2026 02:58:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Tue, 05 May 2026 00:15:00 GMT]]></title><description><![CDATA[<p dir="auto">token 预算还影响成本预估。一个“继续”按钮可能把完整历史再发一遍，账单翻倍。</p>
]]></description><link>https://localaihub.com/post/490</link><guid isPermaLink="true">https://localaihub.com/post/490</guid><dc:creator><![CDATA[今天也没睡醒]]></dc:creator><pubDate>Tue, 05 May 2026 00:15:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Tue, 05 May 2026 00:05:00 GMT]]></title><description><![CDATA[<p dir="auto">我们客服场景设了最大输出 700 字，复杂问题让模型先问澄清，不允许一次写长篇。</p>
]]></description><link>https://localaihub.com/post/489</link><guid isPermaLink="true">https://localaihub.com/post/489</guid><dc:creator><![CDATA[小蓝]]></dc:creator><pubDate>Tue, 05 May 2026 00:05:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Mon, 04 May 2026 23:10:00 GMT]]></title><description><![CDATA[<p dir="auto">输出预算也别忘。很多人只算输入，结果模型回答到一半被截断，用户看到半句 SQL。</p>
]]></description><link>https://localaihub.com/post/488</link><guid isPermaLink="true">https://localaihub.com/post/488</guid><dc:creator><![CDATA[latte404]]></dc:creator><pubDate>Mon, 04 May 2026 23:10:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Mon, 04 May 2026 22:54:00 GMT]]></title><description><![CDATA[<p dir="auto">比例不如优先级。系统和工具协议不可删，当前用户消息不可删，证据按相关度删，历史要先摘要再删。</p>
]]></description><link>https://localaihub.com/post/487</link><guid isPermaLink="true">https://localaihub.com/post/487</guid><dc:creator><![CDATA[陈小舟]]></dc:creator><pubDate>Mon, 04 May 2026 22:54:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Mon, 04 May 2026 21:17:00 GMT]]></title><description><![CDATA[<p dir="auto">我们做了固定比例，系统 10%，资料 60%，历史 20%，输出 10%。结果资料短的时候浪费，资料长的时候还是爆。</p>
]]></description><link>https://localaihub.com/post/486</link><guid isPermaLink="true">https://localaihub.com/post/486</guid><dc:creator><![CDATA[小马过河]]></dc:creator><pubDate>Mon, 04 May 2026 21:17:00 GMT</pubDate></item><item><title><![CDATA[Reply to Token 预算怎么拆，别让系统提示被业务内容挤掉 on Mon, 04 May 2026 21:06:00 GMT]]></title><description><![CDATA[<p dir="auto">“从最早开始删”是最低配，能跑但容易错。预算要分区：系统指令、工具协议、当前问题、证据、历史状态、可丢闲聊。</p>
]]></description><link>https://localaihub.com/post/485</link><guid isPermaLink="true">https://localaihub.com/post/485</guid><dc:creator><![CDATA[林小北]]></dc:creator><pubDate>Mon, 04 May 2026 21:06:00 GMT</pubDate></item></channel></rss>