AI.SkillsAI.Skills
Browse SkillsDashboardDocs
Get Started
AI.SkillsAI.Skills

The trusted marketplace for verified OpenClaw AI agent skills. Built by CogniWatch.

Marketplace

  • Browse Skills
  • Categories
  • Verified Skills
  • Free Skills

Creators

  • Become a Creator
  • Upload Skills
  • Creator Guide
  • Pricing

Resources

  • Documentation
  • OpenClaw GitHub
  • CogniWatch
  • Security

© 2026 AI.Skills — A CogniWatch Project. All rights reserved.

PrivacyTermsContact
Home/Skills/Development & DevOps/advanced-evaluation
💻

advanced-evaluation

This skill covers production-grade techniques for evaluating LLM outputs using LLMs as judges. It synthesizes research from academic papers, industry practices, and practical implementation experience...

Free

Community use case

View on GitHub

Security Scan

No security scan has been run for this skill yet.

to run a security scan.

CogniWatch
+
ΣVirusTotal

Details

Category💻 Development & DevOps
PriceFree
SourceCommunity
LicenseOpen Source