Modularized Evaluation: Measure each module in the pipeline with tailored metrics. Comprehensive Metric Library: Covers Retrieval-Augmented Generation (RAG), Code Generation, Agent Tool Use, ...