You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[Metrics for Assessing the Effectiveness of Simulated Evaluation Pipelines](#metrics-for-assessing-the-effectiveness-of-simulated-evaluation-pipelines)
23
24
-[Code Structure](#code-structure)
24
25
-[Adding New Policies](#adding-new-policies)
25
26
-[Adding New Real-to-Sim Evaluation Environments and Robots](#adding-new-real-to-sim-evaluation-environments-and-robots)
@@ -131,7 +132,9 @@ By default, Google Robot environments use a control frequency of 3hz, and Bridge
131
132
132
133
Please see `scripts/` for examples of how to customize evaluation configs. The inference script `simpler_env/main_inference.py` supports advanced environment building and logging. For example, you can perform a sweep over object and robot poses for evaluation. (Note, however, varying robot poses is not meaningful under the visual matching evaluation setup.)
133
134
135
+
## Metrics for Assessing the Effectiveness of Simulated Evaluation Pipelines
134
136
137
+
In our paper, we use the Mean Maximum Rank Violation (MMRV) metric and the Pearson Correlation Coefficient metric to assess the correlation between real and simulated evaluation results. You can reproduce the metrics in `tools/calc_metrics.py` and assess your own real-to-sim evaluation pipeline.
135
138
136
139
## Code Structure
137
140
@@ -165,7 +168,7 @@ simpler_env/
165
168
tools/
166
169
robot_object_visualization/: tools for visualizing robots and objects when creating new environments
167
170
sysid/: tools for system identification when adding new robots
168
-
calc_metrics.py: tools for summarizing eval results and calculating metrics, such as Normalized Rank Loss, Pearson Correlation, and Kruskal-Wallis test, to reproduce our paper results
171
+
calc_metrics.py: tools for summarizing eval results and calculating metrics, such as Mean Maximum Rank Violation (MMRV) and Pearson Correlation
169
172
coacd_process_mesh.py: tools for generating convex collision meshes through CoACD when adding new assets
170
173
merge_videos.py: tools for merging videos into one
0 commit comments