LLM Evaluation
Verified by community
(0 reviews)
66,557 installs
Updated Mar 2026
Description
Deep LLM evaluation workflow: quality dimensions, golden sets, human vs automatic metrics, regression suites, offline/online signals, and safe rollout gates f...
Security Analysis
⚠️ Warning: 63/100
Open Source
Code is publicly available for audit.
Community Verified
Reviewed by the ClawHub community.
User Reviews
No ratings yet
No reviews yet. Be the first!
Community Signal
⭐ ClawHub Score: 2.83 / 5.00
📥 Installs: 66,557
🔄 Last Update: Mar 25, 2026
🟢 Actively maintained (6d ago)