A

agent-evaluation

VERIFIED

by community

No community reviews yet
32,970installs
Updated Apr 2026

Description

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents...

Security Analysis

危险46/100

Open Source

Code is publicly available for audit.

Community Verified

Reviewed by the ClawHub community.

Community Reviews

Real user ratings only — separate from the editorial assessment and ClawHub signal.

No community reviews yet

Be the first to share your experience!

Community Signal

ClawHub Community Score1.40 / 5.00
📥 Installs32,970
🔄 Last UpdateApr 13, 2026
🟢 Actively maintained (0d ago)
ClawHub community score is a third-party marketplace signal. It is shown separately from SkillsReview editorial assessment and real user review averages.
View on ClawHub →

Submit Your Review

Share your experience with the community and help others find the best skills.

agent-evaluation — OpenClaw AgentSkill Review | SkillsReview