<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[How to actually build a RAG test set without tuning by gut feeling]]></title><description><![CDATA[<p dir="auto">Every time we change chunking, embeddings, or the reranker, someone on the product team tries a few queries and says "seems better." How do we make this more rigorous?</p>
]]></description><link>https://localaihub.com/topic/66/rag-测试集到底怎么建-不想只靠感觉调参</link><generator>RSS for Node</generator><lastBuildDate>Wed, 03 Jun 2026 20:31:25 GMT</lastBuildDate><atom:link href="https://localaihub.com/topic/66.rss" rel="self" type="application/rss+xml"/><pubDate>Sun, 03 May 2026 14:47:00 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Mon, 04 May 2026 16:13:00 GMT]]></title><description><![CDATA[<p dir="auto">OK, I'll start by pulling 50 failed queries and sorting them into retrieval failures, generation failures, and permission failures.</p>
]]></description><link>https://localaihub.com/post/273</link><guid isPermaLink="true">https://localaihub.com/post/273</guid><dc:creator><![CDATA[不想写周报]]></dc:creator><pubDate>Mon, 04 May 2026 16:13:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Mon, 04 May 2026 13:27:00 GMT]]></title><description><![CDATA[<p dir="auto">When tuning, change only one variable at a time; otherwise you can't tell which change made the difference.</p>
]]></description><link>https://localaihub.com/post/272</link><guid isPermaLink="true">https://localaihub.com/post/272</guid><dc:creator><![CDATA[nora]]></dc:creator><pubDate>Mon, 04 May 2026 13:27:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Mon, 04 May 2026 10:46:00 GMT]]></title><description><![CDATA[<p dir="auto">Then label the evidence first; there's no rush to label perfect answers. An evidence set is more stable than canonical answer wording.</p>
]]></description><link>https://localaihub.com/post/271</link><guid isPermaLink="true">https://localaihub.com/post/271</guid><dc:creator><![CDATA[阿白]]></dc:creator><pubDate>Mon, 04 May 2026 10:46:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Mon, 04 May 2026 08:26:00 GMT]]></title><description><![CDATA[<p dir="auto">We have a ticket history we can sample real questions from, but the answers there aren't necessarily canonical.</p>
]]></description><link>https://localaihub.com/post/270</link><guid isPermaLink="true">https://localaihub.com/post/270</guid><dc:creator><![CDATA[不想写周报]]></dc:creator><pubDate>Mon, 04 May 2026 08:26:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Mon, 04 May 2026 06:25:00 GMT]]></title><description><![CDATA[<p dir="auto">Right. Many RAG demos only show the questions they can answer, then fall apart in production on the questions where the right response is "I don't know."</p>
]]></description><link>https://localaihub.com/post/269</link><guid isPermaLink="true">https://localaihub.com/post/269</guid><dc:creator><![CDATA[林小北]]></dc:creator><pubDate>Mon, 04 May 2026 06:25:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Mon, 04 May 2026 04:37:00 GMT]]></title><description><![CDATA[<p dir="auto">There's also a refusal metric: when a user asks about something they don't have permission for, or something that isn't in the knowledge base, can the system say it doesn't know?</p>
]]></description><link>https://localaihub.com/post/268</link><guid isPermaLink="true">https://localaihub.com/post/268</guid><dc:creator><![CDATA[小潘同学]]></dc:creator><pubDate>Mon, 04 May 2026 04:37:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Mon, 04 May 2026 03:20:00 GMT]]></title><description><![CDATA[<p dir="auto">I'd suggest saving a copy of the results for every version. Don't just look at the current score; regressions are what matter.</p>
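<p dir="auto">A minimal sketch of why the per-version results matter, assuming each run is stored as a mapping from case ID to pass/fail. The function name <code>find_regressions</code>, the case IDs, and the storage format are all hypothetical. Two versions can have the same pass rate while one of them breaks a case that used to work:</p>

```python
def find_regressions(previous, current):
    """Compare two runs, each a dict of case_id -> bool (passed).

    Returns (regressions, fixes): cases that passed before and fail now,
    and cases that failed before and pass now.
    """
    regressions = sorted(
        cid for cid, ok in previous.items() if ok and current.get(cid) is False
    )
    fixes = sorted(
        cid for cid, ok in current.items() if ok and previous.get(cid) is False
    )
    return regressions, fixes


def pass_rate(run):
    """Fraction of cases that passed in one run."""
    return sum(run.values()) / len(run)


# Hypothetical results for two versions of the pipeline.
v1 = {"q1": True, "q2": False, "q3": True}
v2 = {"q1": True, "q2": True, "q3": False}

regressions, fixes = find_regressions(v1, v2)
print(pass_rate(v1) == pass_rate(v2))  # True: the headline score did not move
print(regressions, fixes)              # ['q3'] ['q2']
```

<p dir="auto">Here the overall score is unchanged, but <code>q3</code> regressed, which a single current-version number would hide.</p>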
]]></description><link>https://localaihub.com/post/267</link><guid isPermaLink="true">https://localaihub.com/post/267</guid><dc:creator><![CDATA[rootless]]></dc:creator><pubDate>Mon, 04 May 2026 03:20:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Mon, 04 May 2026 03:09:00 GMT]]></title><description><![CDATA[<p dir="auto">Slow, but worth it. If you don't label, every future release runs on guesswork.</p>
]]></description><link>https://localaihub.com/post/266</link><guid isPermaLink="true">https://localaihub.com/post/266</guid><dc:creator><![CDATA[小路灯]]></dc:creator><pubDate>Mon, 04 May 2026 03:09:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Mon, 04 May 2026 01:28:00 GMT]]></title><description><![CDATA[<p dir="auto">Manual labeling is too slow.</p>
]]></description><link>https://localaihub.com/post/265</link><guid isPermaLink="true">https://localaihub.com/post/265</guid><dc:creator><![CDATA[小树]]></dc:creator><pubDate>Mon, 04 May 2026 01:28:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Mon, 04 May 2026 00:21:00 GMT]]></title><description><![CDATA[<p dir="auto">RAGAS and DeepEval are both worth looking at, but for Chinese enterprise documents it's best to add manual labels.</p>
]]></description><link>https://localaihub.com/post/264</link><guid isPermaLink="true">https://localaihub.com/post/264</guid><dc:creator><![CDATA[米饭]]></dc:creator><pubDate>Mon, 04 May 2026 00:21:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Sun, 03 May 2026 21:31:00 GMT]]></title><description><![CDATA[<p dir="auto">Look at the metrics separately: did retrieval fetch the evidence, is the generated answer faithful to it, and do the citations match.</p>
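<p dir="auto">A rough sketch of scoring retrieval and citations separately, assuming each test case carries a labeled set of expected document IDs. The function names and the <code>kb-…</code> IDs are made up for illustration. Faithfulness is left out because it needs a human or model judge rather than set arithmetic:</p>

```python
def retrieval_recall(retrieved_ids, expected_ids):
    """Share of the labeled evidence documents that retrieval returned.

    Returns None when the case has no expected documents
    (for example, an unanswerable question).
    """
    expected = set(expected_ids)
    if not expected:
        return None
    return len(expected & set(retrieved_ids)) / len(expected)


def citation_precision(cited_ids, expected_ids):
    """Share of the documents cited in the answer that are labeled evidence.

    Returns None when the answer cites nothing.
    """
    cited = set(cited_ids)
    if not cited:
        return None
    return len(cited & set(expected_ids)) / len(cited)


# Hypothetical single test case.
expected = ["kb-102", "kb-9"]
retrieved = ["kb-102", "kb-7", "kb-33"]
cited = ["kb-102", "kb-7"]

print(retrieval_recall(retrieved, expected))  # 0.5
print(citation_precision(cited, expected))    # 0.5
```

<p dir="auto">Keeping the two numbers apart shows whether a bad answer came from missing evidence or from the answer citing the wrong source.</p>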
]]></description><link>https://localaihub.com/post/263</link><guid isPermaLink="true">https://localaihub.com/post/263</guid><dc:creator><![CDATA[MingK]]></dc:creator><pubDate>Sun, 03 May 2026 21:31:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Sun, 03 May 2026 20:28:00 GMT]]></title><description><![CDATA[<p dir="auto">We feed failed production queries back into the test set; that works better than making up questions ourselves.</p>
]]></description><link>https://localaihub.com/post/262</link><guid isPermaLink="true">https://localaihub.com/post/262</guid><dc:creator><![CDATA[小林]]></dc:creator><pubDate>Sun, 03 May 2026 20:28:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Sun, 03 May 2026 17:58:00 GMT]]></title><description><![CDATA[<p dir="auto">The test set should include answers, the documents that should be retrieved, unanswerable questions, and permission-boundary questions. Testing only answerable questions is pointless.</p>
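<p dir="auto">One possible shape for such a test case, as a sketch only. The <code>EvalCase</code> class, its field names, the three <code>kind</code> values, and the sample questions are all assumptions for illustration, not a standard schema:</p>

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class EvalCase:
    question: str
    kind: str  # "answerable", "unanswerable", or "permission_boundary"
    expected_doc_ids: List[str] = field(default_factory=list)
    reference_answer: Optional[str] = None
    user_role: str = "employee"  # role the question is asked under


def validate(case):
    """Return a list of labeling problems for one case (empty if none)."""
    problems = []
    if case.kind == "answerable":
        if not case.expected_doc_ids:
            problems.append("answerable case needs expected_doc_ids")
    elif case.kind in ("unanswerable", "permission_boundary"):
        if case.reference_answer is not None:
            problems.append("refusal case should not carry a reference answer")
    else:
        problems.append("unknown kind: " + case.kind)
    return problems


# Hypothetical cases covering all three kinds.
cases = [
    EvalCase("How do I reset my VPN token?", "answerable",
             ["kb-102"], "Use the self-service portal."),
    EvalCase("What is the Q3 salary budget?", "permission_boundary",
             user_role="intern"),
    EvalCase("Where is our Mars office?", "unanswerable"),
]

coverage = Counter(case.kind for case in cases)
print(coverage["answerable"], coverage["unanswerable"],
      coverage["permission_boundary"])  # 1 1 1
```

<p dir="auto">Counting cases per kind makes it obvious when the set has drifted back to answerable questions only.</p>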
]]></description><link>https://localaihub.com/post/261</link><guid isPermaLink="true">https://localaihub.com/post/261</guid><dc:creator><![CDATA[nora]]></dc:creator><pubDate>Sun, 03 May 2026 17:58:00 GMT</pubDate></item><item><title><![CDATA[Reply to How to actually build a RAG test set without tuning by gut feeling on Sun, 03 May 2026 15:47:00 GMT]]></title><description><![CDATA[<p dir="auto">Start with a small, realistic test set. Even 30 cases beat ad hoc questions.</p>
]]></description><link>https://localaihub.com/post/260</link><guid isPermaLink="true">https://localaihub.com/post/260</guid><dc:creator><![CDATA[阿航]]></dc:creator><pubDate>Sun, 03 May 2026 15:47:00 GMT</pubDate></item></channel></rss>