<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[提示词改了十版，没人知道是不是变好了]]></title><description><![CDATA[<p dir="auto">我们客服提示词已经改了十几版，每次业务方都说“这版好一点”。但上线后还是被吐槽。怎么判断到底有没有变好？</p>
]]></description><link>https://localaihub.com/topic/103/提示词改了十版-没人知道是不是变好了</link><generator>RSS for Node</generator><lastBuildDate>Wed, 03 Jun 2026 17:51:04 GMT</lastBuildDate><atom:link href="https://localaihub.com/topic/103.rss" rel="self" type="application/rss+xml"/><pubDate>Wed, 06 May 2026 15:00:00 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Thu, 07 May 2026 13:02:00 GMT]]></title><description><![CDATA[<p dir="auto">对。没有评测集，提示词越写越像祈祷文。</p>
]]></description><link>https://localaihub.com/post/830</link><guid isPermaLink="true">https://localaihub.com/post/830</guid><dc:creator><![CDATA[melo]]></dc:creator><pubDate>Thu, 07 May 2026 13:02:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Thu, 07 May 2026 12:37:00 GMT]]></title><description><![CDATA[<p dir="auto">听下来我现在缺的不是更强提示词，是测试集。</p>
]]></description><link>https://localaihub.com/post/829</link><guid isPermaLink="true">https://localaihub.com/post/829</guid><dc:creator><![CDATA[小曹]]></dc:creator><pubDate>Thu, 07 May 2026 12:37:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Thu, 07 May 2026 11:39:00 GMT]]></title><description><![CDATA[<p dir="auto">最少也要进 Git。每次改动写清楚：为什么改、影响哪些样例、回滚方式。</p>
]]></description><link>https://localaihub.com/post/828</link><guid isPermaLink="true">https://localaihub.com/post/828</guid><dc:creator><![CDATA[林小北]]></dc:creator><pubDate>Thu, 07 May 2026 11:39:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Thu, 07 May 2026 09:02:00 GMT]]></title><description><![CDATA[<p dir="auto">提示词版本怎么管理？我们现在是文档里复制粘贴。</p>
]]></description><link>https://localaihub.com/post/827</link><guid isPermaLink="true">https://localaihub.com/post/827</guid><dc:creator><![CDATA[小周]]></dc:creator><pubDate>Thu, 07 May 2026 09:02:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Thu, 07 May 2026 08:17:00 GMT]]></title><description><![CDATA[<p dir="auto">我这边做法是：模型先打分，人工抽查争议样例。这样不至于全靠人工。</p>
]]></description><link>https://localaihub.com/post/826</link><guid isPermaLink="true">https://localaihub.com/post/826</guid><dc:creator><![CDATA[阿白]]></dc:creator><pubDate>Thu, 07 May 2026 08:17:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Thu, 07 May 2026 06:32:00 GMT]]></title><description><![CDATA[<p dir="auto">可以辅助，但不能全信。尤其客服、合规、业务口径，模型裁判经常看起来合理但不懂真实规则。</p>
]]></description><link>https://localaihub.com/post/825</link><guid isPermaLink="true">https://localaihub.com/post/825</guid><dc:creator><![CDATA[nora]]></dc:creator><pubDate>Thu, 07 May 2026 06:32:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Thu, 07 May 2026 04:44:00 GMT]]></title><description><![CDATA[<p dir="auto">是不是可以让另一个模型当裁判？</p>
]]></description><link>https://localaihub.com/post/824</link><guid isPermaLink="true">https://localaihub.com/post/824</guid><dc:creator><![CDATA[普通网友A]]></dc:creator><pubDate>Thu, 07 May 2026 04:44:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Thu, 07 May 2026 03:06:00 GMT]]></title><description><![CDATA[<p dir="auto">那就让他们至少标“能不能接受”。没有标注，工程侧没法猜业务标准。</p>
]]></description><link>https://localaihub.com/post/823</link><guid isPermaLink="true">https://localaihub.com/post/823</guid><dc:creator><![CDATA[陈一]]></dc:creator><pubDate>Thu, 07 May 2026 03:06:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Thu, 07 May 2026 00:58:00 GMT]]></title><description><![CDATA[<p dir="auto">业务方不愿意写理想回答，说没时间。</p>
]]></description><link>https://localaihub.com/post/822</link><guid isPermaLink="true">https://localaihub.com/post/822</guid><dc:creator><![CDATA[小曹]]></dc:creator><pubDate>Thu, 07 May 2026 00:58:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Wed, 06 May 2026 23:39:00 GMT]]></title><description><![CDATA[<p dir="auto">失败类型很关键。答非所问、废话多、拒答过度、格式错、引用错，这些不能混着算。</p>
]]></description><link>https://localaihub.com/post/821</link><guid isPermaLink="true">https://localaihub.com/post/821</guid><dc:creator><![CDATA[Grace]]></dc:creator><pubDate>Wed, 06 May 2026 23:39:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Wed, 06 May 2026 23:04:00 GMT]]></title><description><![CDATA[<p dir="auto">我建议建三列：用户原问题、当前回答、理想回答。再加一个失败类型。</p>
]]></description><link>https://localaihub.com/post/820</link><guid isPermaLink="true">https://localaihub.com/post/820</guid><dc:creator><![CDATA[小潘同学]]></dc:creator><pubDate>Wed, 06 May 2026 23:04:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Wed, 06 May 2026 21:15:00 GMT]]></title><description><![CDATA[<p dir="auto">而且要保留旧回答。只看新回答，很容易忘了之前哪里坏。</p>
]]></description><link>https://localaihub.com/post/819</link><guid isPermaLink="true">https://localaihub.com/post/819</guid><dc:creator><![CDATA[半截薯条]]></dc:creator><pubDate>Wed, 06 May 2026 21:15:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Wed, 06 May 2026 20:02:00 GMT]]></title><description><![CDATA[<p dir="auto">那不是调提示词，是抽卡。先把真实失败样例收起来，至少 50 条。</p>
]]></description><link>https://localaihub.com/post/818</link><guid isPermaLink="true">https://localaihub.com/post/818</guid><dc:creator><![CDATA[melo]]></dc:creator><pubDate>Wed, 06 May 2026 20:02:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Wed, 06 May 2026 18:44:00 GMT]]></title><description><![CDATA[<p dir="auto">没有。就是业务方临时问几句。</p>
]]></description><link>https://localaihub.com/post/817</link><guid isPermaLink="true">https://localaihub.com/post/817</guid><dc:creator><![CDATA[小曹]]></dc:creator><pubDate>Wed, 06 May 2026 18:44:00 GMT</pubDate></item><item><title><![CDATA[Reply to 提示词改了十版，没人知道是不是变好了 on Wed, 06 May 2026 16:40:00 GMT]]></title><description><![CDATA[<p dir="auto">先问一句：你们有固定测试集吗？如果每次拿不同问题测，感觉一定会漂。</p>
]]></description><link>https://localaihub.com/post/816</link><guid isPermaLink="true">https://localaihub.com/post/816</guid><dc:creator><![CDATA[林小北]]></dc:creator><pubDate>Wed, 06 May 2026 16:40:00 GMT</pubDate></item></channel></rss>