agent-evaluation
VERIFIED
by community
No community reviews yet
32,970 installs
Updated Apr 2026
Description
Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents...
Security Analysis
❌ Dangerous 46/100
Open Source
Code is publicly available for audit.
Community Verified
Reviewed by the ClawHub community.
Community Reviews
Real user ratings only — separate from the editorial assessment and ClawHub signal.
No community reviews yet
Community Signal
⭐ ClawHub Community Score: 1.40 / 5.00
📥 Installs: 32,970
🔄 Last Update: Apr 13, 2026
🟢 Actively maintained (0d ago)
ClawHub community score is a third-party marketplace signal. It is shown separately from SkillsReview editorial assessment and real user review averages.