v1.0: Release of Chinese-LLaMA-2-7B, Chinese-Alpaca-2-7B #18
Release Note for v1.0
We are pleased to announce that the Chinese LLaMA-2-7B and Alpaca-2-7B models have been officially released.
Main features compared to the first-generation project
📖 Optimized Chinese vocabulary
⚡ Efficient attention based on FlashAttention-2
🚄 Adaptive context extension based on NTK
🤖 Simplified bilingual (Chinese-English) system prompt
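To make the NTK-based context extension item more concrete, here is a minimal sketch of the widely used "NTK-aware" rescaling of the rotary position embedding (RoPE) base. It is not taken from this project's code: the function names, the default base of 10000, the assumed trained length of 4096, and the rule for choosing `alpha` from the sequence length are all illustrative assumptions.

```python
def ntk_scaled_base(base: float, head_dim: int, alpha: float) -> float:
    """Rescale the RoPE base so the lowest frequency is stretched by `alpha`."""
    return base * alpha ** (head_dim / (head_dim - 2))


def rope_inv_freq(head_dim: int, base: float = 10000.0, alpha: float = 1.0) -> list:
    """Inverse frequencies for RoPE, one per pair of dimensions."""
    scaled = ntk_scaled_base(base, head_dim, alpha)
    return [scaled ** (-2 * i / head_dim) for i in range(head_dim // 2)]


def adaptive_alpha(seq_len: int, trained_len: int = 4096) -> float:
    """Hypothetical rule: grow alpha only when the input exceeds the trained length."""
    return max(1.0, seq_len / trained_len)


if __name__ == "__main__":
    alpha = adaptive_alpha(seq_len=8192)
    freqs = rope_inv_freq(head_dim=128, alpha=alpha)
    print(alpha, freqs[0], freqs[-1])
```

With this scaling the highest frequency is left unchanged while the lowest frequency is divided by `alpha`, so short-range position information is preserved and long-range positions are interpolated.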
Model Performance
Subjective evaluation
To give a more intuitive sense of the models' generation quality, this project has launched an online model battle platform modeled on the FastChat Chatbot Arena, where you can browse and rate model responses. The platform reports metrics such as win rate and Elo rating, and also shows pairwise win rates between models. The tested models include:
📊 Model Online Battle: http://chinese-alpaca-arena.ymcui.com/
Objective evaluation
Objective evaluation was conducted on C-Eval, with results shown in the following table. The second-generation models are significantly better than the first-generation models, and even surpass the 13B version on some metrics.
Comparison between LLaMA series models:
Comparison between Alpaca series models:
Where can I download the new vocabulary?
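We are pleased to announce that the Chinese LLaMA-2-7B and Alpaca-2-7B large models have been officially released.
Main features compared to the first-generation project
📖 Optimized Chinese vocabulary
⚡ Efficient attention based on FlashAttention-2
🚄 Adaptive context extension based on NTK
🤖 Simplified bilingual (Chinese-English) system prompt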
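Model Performance
Subjective evaluation
To give a more intuitive sense of the models' generation quality, this project has launched an online model battle platform modeled on the FastChat Chatbot Arena, where you can browse and rate model responses. The platform reports metrics such as win rate and Elo rating, and also shows pairwise win rates between models. The tested models include:
📊 Model Online Battle: http://chinese-alpaca-arena.ymcui.com/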
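The win rate and Elo rating mentioned above can be illustrated with the standard Elo update rule. This is only a sketch: the battle platform does not state its exact formula here, so the K-factor of 32, the starting rating of 1000, and the function names are assumptions chosen for illustration.

```python
def expected_score(r_a: float, r_b: float) -> float:
    """Probability that model A beats model B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400))


def update_elo(r_a: float, r_b: float, score_a: float, k: float = 32.0):
    """Return updated ratings; score_a is 1.0 (A wins), 0.5 (tie), or 0.0 (A loses)."""
    e_a = expected_score(r_a, r_b)
    new_a = r_a + k * (score_a - e_a)
    new_b = r_b + k * ((1.0 - score_a) - (1.0 - e_a))
    return new_a, new_b


def win_rate(wins: int, battles: int) -> float:
    """Fraction of battles won; 0.0 when no battles have been played."""
    return wins / battles if battles else 0.0


if __name__ == "__main__":
    # Two models start at an assumed rating of 1000; model A wins one battle.
    a, b = update_elo(1000.0, 1000.0, score_a=1.0)
    print(a, b, win_rate(wins=1, battles=1))
```

Because each update moves the two ratings by equal and opposite amounts, the sum of ratings stays constant, which makes ratings comparable across many pairwise battles.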
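Objective evaluation
Objective evaluation was conducted on C-Eval, with results shown in the following table. The second-generation models are significantly better than the first-generation models, and even surpass the 13B version on some metrics.
Comparison between LLaMA series models:
Comparison between Alpaca series models: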
For the English release note, please refer to the Discussion.
This discussion was created from the release v1.0: Release of Chinese-LLaMA-2-7B, Chinese-Alpaca-2-7B.