xqy2006/llama2.c-zh

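This repository does not declare an open-source license file (LICENSE). Before use, check the project description and the upstream dependencies of its code.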
config.py 1.03 KB
xqy2006 committed on 2023-09-02 13:55 . update config.py.
"""
Configuration for the tokenizer and the training data
"""
LANGUAGE = "enzh" # [en, zh, enzh]
# https://huggingface.co/baichuan-inc/Baichuan-7B/blob/main/tokenizer.model
TOKENIZER_MODEL = "tokenizers/baichuan/tokenizer.model" # the baichuan sentencepiece tokenizer model
TOKENIZER_BIN = "tokenizers/baichuan/tokenizer.bin" # binary version of the tokenizer for inference in C
# https://huggingface.co/ziqingyang/chinese-llama-2-7b/blob/main/tokenizer.model
# TOKENIZER_MODEL = "tokenizers/llama2enzh/tokenizer.model" # the llama2-enzh sentencepiece tokenizer model
# TOKENIZER_BIN = "tokenizers/llama2enzh/tokenizer.bin" # binary version of the tokenizer for inference in C
# base llama2 (English only)
# TOKENIZER_MODEL = "tokenizers/llama2en/tokenizer.model" # the base llama2 sentencepiece tokenizer model
# TOKENIZER_BIN = "tokenizers/llama2en/tokenizer.bin" # binary version of the tokenizer for inference in C
# custom Chinese vocabulary (红楼梦.txt)
# TOKENIZER_MODEL = "tokenizers/custom_tokenizer/meng.model" # the custom Chinese (zh) tokenizer model
# TOKENIZER_BIN = "tokenizers/custom_tokenizer/meng.bin" # binary version of the tokenizer for inference in C
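Because switching tokenizers means commenting and uncommenting pairs of lines by hand, it is easy to leave `TOKENIZER_MODEL` and `TOKENIZER_BIN` pointing at different tokenizers. The following is a minimal sketch of a sanity check for these settings. The names `check_config` and `SUPPORTED_LANGUAGES` are hypothetical and not part of the repository; the sketch only assumes the three options listed next to `LANGUAGE` and the `.model` / `.bin` naming used in the paths above.

```python
from pathlib import Path

# Hypothetical helper, not part of the repository.
# Mirrors the options listed in the comment next to LANGUAGE.
SUPPORTED_LANGUAGES = ("en", "zh", "enzh")


def check_config(language, tokenizer_model, tokenizer_bin):
    """Return a list of problems found; an empty list means the settings look consistent."""
    problems = []
    if language not in SUPPORTED_LANGUAGES:
        problems.append(f"unsupported LANGUAGE: {language!r}")

    model, binary = Path(tokenizer_model), Path(tokenizer_bin)
    if model.suffix != ".model":
        problems.append(f"TOKENIZER_MODEL should end in .model: {tokenizer_model}")
    if binary.suffix != ".bin":
        problems.append(f"TOKENIZER_BIN should end in .bin: {tokenizer_bin}")

    # Assumption: both files of one tokenizer sit in the same directory,
    # as they do for every pair listed in this config file.
    if model.parent != binary.parent:
        problems.append("TOKENIZER_MODEL and TOKENIZER_BIN are in different directories")
    return problems
```

Calling `check_config(LANGUAGE, TOKENIZER_MODEL, TOKENIZER_BIN)` with the active values above returns an empty list. The check only inspects the strings and does not test whether the files exist on disk.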