Commit 83089ae5 by nanziyuan

add evaluate

parent 22f6d271
......@@ -58,6 +58,7 @@ The agent trajectory and some runtime information are saved to run.traj.json. It is recommended to ...
## RealBench
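First, follow the README at https://github.com/IPRC-DIP/RealBench to set up the environment and extract the compressed files.
`agent_bench.py` and `evaluate.py` must be copied into the realbench directory and run from there.
For detailed usage, see the documentation at the top of `agent_bench.py`.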
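The copy step can be scripted. The following is a minimal sketch, not part of this repository: the helper name `copy_scripts` and both directory arguments are hypothetical, and it assumes the RealBench checkout already exists locally.

```python
import shutil
from pathlib import Path


def copy_scripts(src_dir, realbench_dir, names=("agent_bench.py", "evaluate.py")):
    """Copy the named scripts from src_dir into realbench_dir.

    Hypothetical helper: both paths are assumptions supplied by the caller.
    Returns the list of destination paths.
    """
    src_dir = Path(src_dir)
    realbench_dir = Path(realbench_dir)
    # Fail early if the target directory is missing instead of creating it,
    # since it should come from the RealBench setup described above.
    if not realbench_dir.is_dir():
        raise FileNotFoundError(f"realbench directory not found: {realbench_dir}")
    copied = []
    for name in names:
        # shutil.copy returns the path of the newly created file.
        copied.append(Path(shutil.copy(src_dir / name, realbench_dir / name)))
    return copied


# Example call (paths are placeholders, adjust to your layout):
# copy_scripts(".", "../RealBench")
```

After copying, run the scripts from inside the realbench directory; the arguments they accept are described in the documentation at the top of `agent_bench.py`.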
```python
......