<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[中文能力评测，不要只看古诗和成语]]></title><description><![CDATA[<p dir="auto">我想测模型中文能力，网上很多题是成语、古诗、文言文。企业应用也要这么测吗？</p>
]]></description><link>https://localaihub.com/topic/90/中文能力评测-不要只看古诗和成语</link><generator>RSS for Node</generator><lastBuildDate>Wed, 03 Jun 2026 18:50:43 GMT</lastBuildDate><atom:link href="https://localaihub.com/topic/90.rss" rel="self" type="application/rss+xml"/><pubDate>Tue, 05 May 2026 14:46:00 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Wed, 06 May 2026 15:36:00 GMT]]></title><description><![CDATA[<p dir="auto">最后别忘人工盲评，去掉模型名。品牌滤镜会影响判断。</p>
]]></description><link>https://localaihub.com/post/633</link><guid isPermaLink="true">https://localaihub.com/post/633</guid><dc:creator><![CDATA[葡萄冰]]></dc:creator><pubDate>Wed, 06 May 2026 15:36:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Wed, 06 May 2026 12:29:00 GMT]]></title><description><![CDATA[<p dir="auto">对。模型中文能力不是考试作文，是能不能在你的产品里帮用户办事。</p>
]]></description><link>https://localaihub.com/post/632</link><guid isPermaLink="true">https://localaihub.com/post/632</guid><dc:creator><![CDATA[陈一]]></dc:creator><pubDate>Wed, 06 May 2026 12:29:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Wed, 06 May 2026 10:35:00 GMT]]></title><description><![CDATA[<p dir="auto">我把古诗成语降到很小比例，主测真实工单、会议纪要、财务术语、口语投诉。</p>
]]></description><link>https://localaihub.com/post/631</link><guid isPermaLink="true">https://localaihub.com/post/631</guid><dc:creator><![CDATA[小曹]]></dc:creator><pubDate>Wed, 06 May 2026 10:35:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Wed, 06 May 2026 09:31:00 GMT]]></title><description><![CDATA[<p dir="auto">多轮也要测。第一轮能中文，后面被英文文档带跑变英文，这种很常见。</p>
]]></description><link>https://localaihub.com/post/630</link><guid isPermaLink="true">https://localaihub.com/post/630</guid><dc:creator><![CDATA[nora]]></dc:creator><pubDate>Wed, 06 May 2026 09:31:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Wed, 06 May 2026 06:41:00 GMT]]></title><description><![CDATA[<p dir="auto">要测。拼音缩写、语音转文字错字、半句输入都要有。中文产品别只测干净书面语。</p>
]]></description><link>https://localaihub.com/post/629</link><guid isPermaLink="true">https://localaihub.com/post/629</guid><dc:creator><![CDATA[林小北]]></dc:creator><pubDate>Wed, 06 May 2026 06:41:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Wed, 06 May 2026 03:51:00 GMT]]></title><description><![CDATA[<p dir="auto">错别字要不要测？真实用户会打错。</p>
]]></description><link>https://localaihub.com/post/628</link><guid isPermaLink="true">https://localaihub.com/post/628</guid><dc:creator><![CDATA[小周]]></dc:creator><pubDate>Wed, 06 May 2026 03:51:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Wed, 06 May 2026 02:22:00 GMT]]></title><description><![CDATA[<p dir="auto">准确性、业务口径、语气、简洁度、是否追问、是否引用证据、是否越权。每项 1-5 分。</p>
]]></description><link>https://localaihub.com/post/627</link><guid isPermaLink="true">https://localaihub.com/post/627</guid><dc:creator><![CDATA[zeroOne]]></dc:creator><pubDate>Wed, 06 May 2026 02:22:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Wed, 06 May 2026 00:57:00 GMT]]></title><description><![CDATA[<p dir="auto">那评分维度怎么写？</p>
]]></description><link>https://localaihub.com/post/626</link><guid isPermaLink="true">https://localaihub.com/post/626</guid><dc:creator><![CDATA[普通网友A]]></dc:creator><pubDate>Wed, 06 May 2026 00:57:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Tue, 05 May 2026 22:26:00 GMT]]></title><description><![CDATA[<p dir="auto">我还会测“不要说官话”。模型很喜欢“感谢您的理解与支持”，用户看多了烦。</p>
]]></description><link>https://localaihub.com/post/625</link><guid isPermaLink="true">https://localaihub.com/post/625</guid><dc:creator><![CDATA[阿宁]]></dc:creator><pubDate>Tue, 05 May 2026 22:26:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Tue, 05 May 2026 20:49:00 GMT]]></title><description><![CDATA[<p dir="auto">中文评测要区分“理解”和“表达”。有的模型理解对了，输出像翻译腔；有的表达自然，但证据引用错。</p>
]]></description><link>https://localaihub.com/post/624</link><guid isPermaLink="true">https://localaihub.com/post/624</guid><dc:creator><![CDATA[leaf_1997]]></dc:creator><pubDate>Tue, 05 May 2026 20:49:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Tue, 05 May 2026 20:15:00 GMT]]></title><description><![CDATA[<p dir="auto">Qwen/GLM/Kimi 在中文材料里通常容易上手，但 GPT/Claude 在复杂指令和跨语言资料上也很强。不能预设结论。</p>
]]></description><link>https://localaihub.com/post/623</link><guid isPermaLink="true">https://localaihub.com/post/623</guid><dc:creator><![CDATA[小蓝]]></dc:creator><pubDate>Tue, 05 May 2026 20:15:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Tue, 05 May 2026 19:58:00 GMT]]></title><description><![CDATA[<p dir="auto">还要测地域和行业词。比如“抬头”“红冲”“对账”“工单挂起”，通用中文好不代表业务中文好。</p>
]]></description><link>https://localaihub.com/post/622</link><guid isPermaLink="true">https://localaihub.com/post/622</guid><dc:creator><![CDATA[melo]]></dc:creator><pubDate>Tue, 05 May 2026 19:58:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Tue, 05 May 2026 17:58:00 GMT]]></title><description><![CDATA[<p dir="auto">我们测客服中文，会放“你们这破系统又扣我钱了”这种句子，看模型能不能既不顶嘴也不乱承诺。</p>
]]></description><link>https://localaihub.com/post/621</link><guid isPermaLink="true">https://localaihub.com/post/621</guid><dc:creator><![CDATA[葡萄冰]]></dc:creator><pubDate>Tue, 05 May 2026 17:58:00 GMT</pubDate></item><item><title><![CDATA[Reply to 中文能力评测，不要只看古诗和成语 on Tue, 05 May 2026 15:21:00 GMT]]></title><description><![CDATA[<p dir="auto">除非你的产品就是语文老师，否则不够。企业中文更常见的是政策、合同、客服、会议、表格、口语错别字。</p>
]]></description><link>https://localaihub.com/post/620</link><guid isPermaLink="true">https://localaihub.com/post/620</guid><dc:creator><![CDATA[陈一]]></dc:creator><pubDate>Tue, 05 May 2026 15:21:00 GMT</pubDate></item></channel></rss>