update metrics
app.py
CHANGED
@@ -22,8 +22,8 @@ The evaluation is conducted on 8 datasets across 4 tasks:
 - TyDiQA (Thai only), contains 763 test samples, https://huggingface.co/datasets/chompk/tydiqa-goldp-th
 ## Metrics
 The evaluation metrics for each task are as follows:
-1. STS: Spearman’s
-2. Text Classification: F1
+1. STS: Spearman’s Rank Correlation
+2. Text Classification: F1 Score
 3. Pair Classification: Average Precision
 3. Retrieval: MMR@10
 """
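For reference, two of the metrics named in this hunk can be sketched in plain Python. This is only an illustration, not the code the app uses: the function names `spearman` and `mrr_at_k` are hypothetical, and the sketch assumes that "MMR@10" in the list refers to mean reciprocal rank at 10 (commonly written MRR@10), which the diff itself does not confirm.

```python
from statistics import mean


def _ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    rx, ry = _ranks(x), _ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


def mrr_at_k(ranked_ids, relevant_ids, k=10):
    """Mean over queries of 1/rank of the first relevant hit in the top k.

    Assumption: this is what the list entry "MMR@10" is meant to denote.
    Queries with no relevant hit in the top k contribute 0.
    """
    total = 0.0
    for ranking, relevant in zip(ranked_ids, relevant_ids):
        for pos, doc_id in enumerate(ranking[:k], start=1):
            if doc_id in relevant:
                total += 1.0 / pos
                break
    return total / len(ranked_ids)
```

In this sketch, `spearman` takes two equal-length score lists (for example, predicted and gold similarity scores for STS), and `mrr_at_k` takes one ranked list of document ids per query plus a matching set of relevant ids per query.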