EVA-中文开放域对话预训练模型
EVA 是目前最大的中文开放域对话预训练模型,拥有28亿参数,在 WDC-Dialogue 上预训练而成。该数据包含14亿个多领域的上文-回复对。实验表明 EVA 在自动指标和人工指标上都超越了现在其他的中文预训练对话模型。
官网: 智源开源开放平台 (wudaoai.cn)
github:GitHub - BAAI-WuDao/EVA
Paper link: https://arxiv.org/abs/2108.01547.
2 Dataset
We construct a dataset named WDC-Dialogue from Chinese social media to train EVA. Specifically, conversations from various sources are gathered and a rigorous data cleaning pipeline is designed to enforce the quality of WDC-Dialogue. We mainly fo
共有 0 条评论