L

Llm Evaluation

VERIFIED

by community

(0 reviews)
66,557installs
Updated Mar 2026

Description

Deep LLM evaluation workflow—quality dimensions, golden sets, human vs automatic metrics, regression suites, offline/online signals, and safe rollout gates f...

Security Analysis

⚠️警告63/100

Open Source

Code is publicly available for audit.

Community Verified

Reviewed by the ClawHub community.

User Reviews

No ratings yet

No reviews yet. Be the first!

Community Signal

ClawHub Score2.83 / 5.00
📥 Installs66,557
🔄 Last UpdateMar 25, 2026
🟢 Actively maintained (6d ago)
View on ClawHub →

Submit Your Review

Share your experience with the community and help others find the best skills.