You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -99,7 +109,10 @@ If the user wants to manually start a frontend page, you need to enter the follo
99
109
python -m dingo.run.vsl --input xxx
100
110
```
101
111
102
-
The input followed is the directory of the quality inspection results. Users need to ensure that there is a summary.json file when the directory is opened.
112
+
The input followed is the directory of the quality inspection results. Users need to ensure that there is a summary.json file when the directory is opened. Frontend page of output looks like:
113
+
114
+
## Online Demo
115
+
Try dingo on our online demo: [(Hugging Face)🤗](https://huggingface.co/spaces/DataEval/dingo)
103
116
104
117
# Feature List
105
118
@@ -153,17 +166,17 @@ then you can refer to: [Install Dependencies](requirements)
153
166
154
167
## Register Rules/Prompts/Models
155
168
156
-
If the heuristic rules inside the project do not meet the user's quality inspection requirements, users can also customize rules or models.
169
+
If the heuristic rules inside the project do not meet the user's quality inspection requirements, users can also customize rules or models.
157
170
158
171
### Register Rules
159
172
160
-
If the user wants to create a new rule `CommonPatternDemo`, then the first step is to add a decorator to the rule to inject the rule into the project.
161
-
Secondly, the `metric_type` type, such as `QUALITY_BAD_RELEVANCE`, needs to be set for the rule, and `group` does not need to be set.
162
-
Then the user needs to define the `DynamicRuleConfig` object, so that the properties of the rule can be configured dynamically.
163
-
In addition, the method name of the rule must be `eval` and it needs to be a class method.
164
-
The return value of the last step should be a `ModelRes` object.
173
+
If the user wants to create a new rule `CommonPatternDemo`, then the first step is to add a decorator to the rule to inject the rule into the project.
174
+
Secondly, the `metric_type` type, such as `QUALITY_BAD_RELEVANCE`, needs to be set for the rule, and `group` does not need to be set.
175
+
Then the user needs to define the `DynamicRuleConfig` object, so that the properties of the rule can be configured dynamically.
176
+
In addition, the method name of the rule must be `eval` and it needs to be a class method.
177
+
The return value of the last step should be a `ModelRes` object.
165
178
166
-
For example: [Register Rules](examples/register/sdk_register_rule.py)
179
+
For example: [Register Rules](examples/register/sdk_register_rule.py)
167
180
168
181
### Register Prompts
169
182
@@ -173,8 +186,8 @@ For example: [Register Prompts](examples/register/sdk_register_prompt.py)
173
186
174
187
### Register Models
175
188
176
-
The way to register models is slightly different, users need to implement a call_api method, accept MetaData type parameters, and return ModelRes type results.
177
-
There are already implemented basic model classes [BaseOpenAI](dingo/model/llm/base_openai.py) in the project, users can directly inherit.
189
+
The way to register models is slightly different, users need to implement a call_api method, accept MetaData type parameters, and return ModelRes type results.
190
+
There are already implemented basic model classes [BaseOpenAI](dingo/model/llm/base_openai.py) in the project, users can directly inherit.
178
191
If the user has special functions to implement, then you can rewrite the corresponding methods.
179
192
180
193
For example: [Register Models](examples/register/sdk_register_llm.py)
@@ -185,7 +198,7 @@ For example: [Register Models](examples/register/sdk_register_llm.py)
185
198
186
199
## Execution Engine
187
200
188
-
`Dingo` can run locally or on a spark cluster.
201
+
`Dingo` can run locally or on a spark cluster.
189
202
Regardless of the choice of engine, the executor supports some common methods:
190
203
191
204
| function name | description |
@@ -203,9 +216,9 @@ When choosing the spark engine, users can freely choose rules, models for qualit
203
216
204
217
### Spark Mode
205
218
206
-
When choosing the spark engine, users can only choose rules for quality inspection, and models cannot be used.
207
-
And only `eval_group`,`save_data`,`save_correct`,`custom_config` in `InputArgs` are still valid.
208
-
Therefore, the user needs to input `spark_session` to initialize spark, and input `spark_rdd` (composed of `MetaData` structure) as data for quality inspection.
219
+
When choosing the spark engine, users can only choose rules for quality inspection, and models cannot be used.
220
+
And only `eval_group`,`save_data`,`save_correct`,`custom_config` in `InputArgs` are still valid.
221
+
Therefore, the user needs to input `spark_session` to initialize spark, and input `spark_rdd` (composed of `MetaData` structure) as data for quality inspection.
209
222
It should be noted that if `save_data` is `False`, then the data in memory will be cleared immediately after the quality inspection is completed, and `spark_session` will also stop immediately.
210
223
211
224
[Spark Example](examples/spark/sdk_spark.py)
@@ -275,7 +288,8 @@ If you find this project useful, please consider citing our tool:
275
288
```
276
289
@misc{dingo,
277
290
title={Dingo: A Comprehensive Data Quality Evaluation Tool for Large Models},
0 commit comments