Update README.md
README.md
CHANGED
@@ -13,7 +13,7 @@ widget:
 ## Overview
 <!-- Provide a quick summary of what the model is/does. -->
 We continually pretrain `uitnlp/visobert` on a merged 14GB dataset, the training dataset includes:
--
+- Crawled data (100M comments and 15M posts on Facebook)
 - UIT data, which is used to pretrain `uitnlp/visobert`
 - MC4 ecommerce
 