# Filter by pattern
A survey of 2,000 UK adults, which Lovehoney is calling "The Great British Cliteracy Test", found an alarming gap between confidence and anatomical knowledge. Ninety percent of participants said they know where the clitoris is, but only 30 percent could correctly locate it on a diagram. Women were only one percentage point more likely than men (30 percent to 29 percent) to find it.
The same gap applies to LLM-generated evaluation. Ask the same LLM to review the code it generated and it will tell you the architecture is sound, the module boundaries are clean and the error handling is thorough. It will sometimes even praise the test coverage. Unless asked, it will not notice that every query does a full table scan. The same RLHF reward that makes the model generate what you want to hear also makes it tell you what you want to hear when it evaluates. You should not rely on the tool alone to audit itself: it has the same bias as a reviewer that it has as an author.
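One way to check the full-table-scan claim without asking the model is to ask the database itself. The following is a minimal sketch, assuming SQLite as the backend. The `users` table, the `idx_users_email` index and the helper `full_scan_steps` are hypothetical names made up for illustration. It relies on SQLite's `EXPLAIN QUERY PLAN`, whose detail text begins with `SCAN` for a full scan and `SEARCH` for an indexed lookup.

```python
import sqlite3


def full_scan_steps(conn, sql, params=()):
    """Return the query-plan steps that walk an entire table or index."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The last column of each plan row is a human-readable detail string.
    return [row[-1] for row in rows if row[-1].startswith("SCAN")]


# Hypothetical schema, used only to illustrate the check.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT)")
query = "SELECT id FROM users WHERE email = ?"

# Without an index on email, the lookup has to scan the whole table.
before = full_scan_steps(conn, query, ("a@example.com",))

# With an index, the planner can search instead of scan.
conn.execute("CREATE INDEX idx_users_email ON users (email)")
after = full_scan_steps(conn, query, ("a@example.com",))

print("before index:", before)
print("after index:", after)
```

A check like this can run in a test suite against every query the generated code issues, so the audit does not depend on the model's own opinion of its work. Other databases expose similar plan output, but the exact syntax and wording differ.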