沈家麒/Chinese-BertWord-Embedding

data_process.py 526 Bytes
singaln committed on 2020-12-28 17:30 . Add files via upload
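# Convert CoNLL-style data (one "token label" pair per line, blank line
# between sentences) into one line per sentence: "tokens<TAB>labels".
with open("example.train", "r", encoding="utf-8") as f:
    sentences = f.read().strip().split("\n\n")

# "a+" appends, so rerunning the script adds to an existing train.txt.
with open("train.txt", "a+", encoding="utf-8") as file:
    for sentence in sentences:
        contents = []
        labels = []
        for sent in sentence.split("\n"):
            parts = sent.strip().split()
            contents.append(parts[0])  # token (first column)
            labels.append(parts[1])    # label (second column)
        data = " ".join(contents) + "\t" + " ".join(labels) + "\n"
        file.write(data)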
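The same conversion can be sketched as a pure function, which makes it easy to check on a small sample without touching any files. This is a minimal sketch, not part of the repository: the function name `conll_to_lines`, the sample sentences, and the `B-LOC`/`I-LOC` labels are all hypothetical, and rows with fewer than two columns are skipped as an assumption about how malformed input should be handled.

```python
def conll_to_lines(text):
    """Turn blank-line-separated blocks of 'token label' rows into
    'tokens<TAB>labels' strings, one per sentence."""
    lines = []
    for block in text.strip().split("\n\n"):
        tokens, labels = [], []
        for row in block.split("\n"):
            parts = row.strip().split()
            if len(parts) < 2:
                # Assumption: silently skip rows without a label column.
                continue
            tokens.append(parts[0])
            labels.append(parts[1])
        if tokens:
            lines.append(" ".join(tokens) + "\t" + " ".join(labels))
    return lines


# Hypothetical sample in the same two-column layout as example.train.
sample = "我 O\n在 O\n北 B-LOC\n京 I-LOC\n\n你 O\n好 O\n"
for line in conll_to_lines(sample):
    print(line)
```

Each returned string has the space-joined tokens, a tab, then the space-joined labels, so writing the list to train.txt with a trailing newline per entry gives the same file layout as the script.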