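<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Testing Chinese-language models: why you can't just ask about idioms and classical poetry]]></title><description><![CDATA[<p dir="auto">I see a lot of people testing Chinese-language models with idioms, classical poems, and brain teasers. Does that actually tell you anything about a model's Chinese ability in business scenarios?</p>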
]]></description><link>https://localaihub.com/topic/208/中文模型测试-为什么不能只问成语和古诗</link><generator>RSS for Node</generator><lastBuildDate>Wed, 03 Jun 2026 19:24:00 GMT</lastBuildDate><atom:link href="https://localaihub.com/topic/208.rss" rel="self" type="application/rss+xml"/><pubDate>Fri, 15 May 2026 10:10:00 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Sat, 16 May 2026 01:04:00 GMT]]></title><description><![CDATA[<p dir="auto">So the eval set should be a bit rough around the edges. The closer it is to real usage, the better.</p>
]]></description><link>https://localaihub.com/post/2351</link><guid isPermaLink="true">https://localaihub.com/post/2351</guid><dc:creator><![CDATA[melo]]></dc:creator><pubDate>Sat, 16 May 2026 01:04:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Sat, 16 May 2026 00:11:00 GMT]]></title><description><![CDATA[<p dir="auto">That example is all too real.</p>
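]]></description><link>https://localaihub.com/post/2350</link><guid isPermaLink="true">https://localaihub.com/post/2350</guid><dc:creator><![CDATA[小谢]]></dc:creator><pubDate>Sat, 16 May 2026 00:11:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 21:43:00 GMT]]></title><description><![CDATA[<p dir="auto">Our model can recite classical poetry, but it read “补卡” (in workplace usage, typically making up a missed attendance punch) as replacing a bus card.</p>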
]]></description><link>https://localaihub.com/post/2349</link><guid isPermaLink="true">https://localaihub.com/post/2349</guid><dc:creator><![CDATA[半截薯条]]></dc:creator><pubDate>Fri, 15 May 2026 21:43:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 20:59:00 GMT]]></title><description><![CDATA[<p dir="auto">It's useful for initial screening, but business Chinese is something you have to test yourself.</p>
]]></description><link>https://localaihub.com/post/2348</link><guid isPermaLink="true">https://localaihub.com/post/2348</guid><dc:creator><![CDATA[陈一]]></dc:creator><pubDate>Fri, 15 May 2026 20:59:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 19:25:00 GMT]]></title><description><![CDATA[<p dir="auto">Is a high Chinese score on the leaderboards still worth anything as a reference?</p>
]]></description><link>https://localaihub.com/post/2347</link><guid isPermaLink="true">https://localaihub.com/post/2347</guid><dc:creator><![CDATA[小谢]]></dc:creator><pubDate>Fri, 15 May 2026 19:25:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 19:17:00 GMT]]></title><description><![CDATA[<p dir="auto">You also need to include internal company vocabulary: project code names, system abbreviations, department jargon.</p>
]]></description><link>https://localaihub.com/post/2346</link><guid isPermaLink="true">https://localaihub.com/post/2346</guid><dc:creator><![CDATA[小吴]]></dc:creator><pubDate>Fri, 15 May 2026 19:17:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 18:53:00 GMT]]></title><description><![CDATA[<p dir="auto">Watch out for privacy. De-identifying real questions has to be done carefully.</p>
]]></description><link>https://localaihub.com/post/2345</link><guid isPermaLink="true">https://localaihub.com/post/2345</guid><dc:creator><![CDATA[nora]]></dc:creator><pubDate>Fri, 15 May 2026 18:53:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 16:54:00 GMT]]></title><description><![CDATA[<p dir="auto">Sample real questions, de-identify them, then stratify. Don't sit in the office making up test questions yourself.</p>
]]></description><link>https://localaihub.com/post/2344</link><guid isPermaLink="true">https://localaihub.com/post/2344</guid><dc:creator><![CDATA[林小北]]></dc:creator><pubDate>Fri, 15 May 2026 16:54:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 15:19:00 GMT]]></title><description><![CDATA[<p dir="auto">So how should a Chinese eval be designed?</p>
]]></description><link>https://localaihub.com/post/2343</link><guid isPermaLink="true">https://localaihub.com/post/2343</guid><dc:creator><![CDATA[普通网友A]]></dc:creator><pubDate>Fri, 15 May 2026 15:19:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 15:04:00 GMT]]></title><description><![CDATA[<p dir="auto">Chinese ability has to be tested per scenario. Writing, customer service, policy question answering, and code explanation are not the same thing.</p>
]]></description><link>https://localaihub.com/post/2342</link><guid isPermaLink="true">https://localaihub.com/post/2342</guid><dc:creator><![CDATA[Grace]]></dc:creator><pubDate>Fri, 15 May 2026 15:04:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 13:40:00 GMT]]></title><description><![CDATA[<p dir="auto">For customer service you also have to look at tone. Overly formal, written-style language doesn't work either.</p>
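]]></description><link>https://localaihub.com/post/2341</link><guid isPermaLink="true">https://localaihub.com/post/2341</guid><dc:creator><![CDATA[小蓝]]></dc:creator><pubDate>Fri, 15 May 2026 13:40:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 11:56:00 GMT]]></title><description><![CDATA[<p dir="auto">There are also “half-sentence questions.” Users will ask something like “那个报销还能补吗” (roughly, “can that reimbursement still be made up?”) rather than writing a well-formed question.</p>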
]]></description><link>https://localaihub.com/post/2340</link><guid isPermaLink="true">https://localaihub.com/post/2340</guid><dc:creator><![CDATA[melo]]></dc:creator><pubDate>Fri, 15 May 2026 11:56:00 GMT</pubDate></item><item><title><![CDATA[Reply to Testing Chinese-language models: why you can't just ask about idioms and classical poetry on Fri, 15 May 2026 11:35:00 GMT]]></title><description><![CDATA[<p dir="auto">It only tells you a small part. Enterprise Chinese is mostly policies, colloquial speech, abbreviations, typos, tables, and long sentences.</p>
]]></description><link>https://localaihub.com/post/2339</link><guid isPermaLink="true">https://localaihub.com/post/2339</guid><dc:creator><![CDATA[陈一]]></dc:creator><pubDate>Fri, 15 May 2026 11:35:00 GMT</pubDate></item></channel></rss>