diff --git a/README.md b/README.md index 841bc091e..4ecc0251d 100644 --- a/README.md +++ b/README.md @@ -41,6 +41,8 @@ Welcome to join our [Slack channel](https://join.slack.com/t/data-juicer/shared_ ---- ## News +- ![new](https://img.alicdn.com/imgextra/i4/O1CN01kUiDtl1HVxN6G56vN_!!6000000000764-2-tps-43-19.png) [2024-01-10] Discover new horizons in "Data Mixture" — Our second data-centric LLM competition has kicked off! Please visit the competition's [official website](https://tianchi.aliyun.com/competition/entrance/532174) for more information. + - ![new](https://img.alicdn.com/imgextra/i4/O1CN01kUiDtl1HVxN6G56vN_!!6000000000764-2-tps-43-19.png) [2024-01-05] We release **Data-Juicer v0.1.3** now! In this new version, we support **more Python versions** (3.7-3.10), and support **multimodal** dataset [converting](tools/multimodal/README.md)/[processing](docs/Operators.md) (Including texts, images, and audios. More modalities will be supported in the future). Besides, our paper is also updated to [v3](https://arxiv.org/abs/2309.02033). diff --git a/README_ZH.md b/README_ZH.md index 496f8e1a1..1b5b9f50e 100644 --- a/README_ZH.md +++ b/README_ZH.md @@ -39,6 +39,8 @@ Data-Juicer 是一个一站式数据处理系统,旨在为大语言模型 (LLM ---- ## 新消息 +- ![new](https://img.alicdn.com/imgextra/i4/O1CN01kUiDtl1HVxN6G56vN_!!6000000000764-2-tps-43-19.png) [2024-01-10] 开启“数据混合”新视界——第二届Data-Juicer大模型数据挑战赛已经正式启动!立即访问[竞赛官网](https://tianchi.aliyun.com/competition/entrance/532174),了解赛事详情。 + - ![new](https://img.alicdn.com/imgextra/i4/O1CN01kUiDtl1HVxN6G56vN_!!6000000000764-2-tps-43-19.png) [2024-01-05] 现在,我们发布了 **Data-Juicer v0.1.3** 版本! 在这个新版本中,我们支持了**更多Python版本**(3.7-3.10),同时支持了**多模态**数据集的[转换](tools/multimodal/README_ZH.md)和[处理](docs/Operators_ZH.md)(包括文本、图像和音频。更多模态也将会在之后支持)。 此外,我们的论文也更新到了[第三版](https://arxiv.org/abs/2309.02033) 。