Meng Wei/CMUdict

Alex Rudnicky committed on 2015-01-24 18:23: updates (more words)
cmudict refurb project
----------------------
[20130331] (air)
Starts with a version that was modified during the setup of the Google
AI AMT verification project.
Some changes will be made before the AMT data is vetted and folded
in. Specifically, the dict was run through Sequitur and the deletion
errors were examined. For the most part these are mistakes in the dict
and would not need AMT verification, though AMT would still provide a
redundant check.
[201412] (air)
Major refurb, part 2.
I.
1. Start incorporating Nickolay words
2. Harvest lmtool LtoS invocations
3. Add color to reduced vowels (which are mostly AH0)
4. Weed out variants, especially AH0/IH0 (really a part of 3).
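
As an illustration of items 3 and 4, the following is a minimal sketch of
how AH0/IH0 variant pairs might be located. It is not the procedure that
was actually used. It assumes the usual cmudict line layout (headword,
then phones, with variants written as WORD(1) and comment lines starting
with ";;;"). The function names parse_cmudict and reduced_vowel_variants
are hypothetical.

```python
import re
from collections import defaultdict


def parse_cmudict(lines):
    """Group pronunciations by headword; variants look like WORD(1)."""
    entries = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line or line.startswith(";;;"):  # skip blanks and comments
            continue
        head, *phones = line.split()
        word = re.sub(r"\(\d+\)$", "", head)  # drop the variant index
        entries[word].append(tuple(phones))
    return entries


def reduced_vowel_variants(entries):
    """Return words with two pronunciations differing only in AH0 vs IH0."""

    def normalize(pron):
        # Assumption: treat IH0 as AH0 so such variants collapse together.
        return tuple("AH0" if p == "IH0" else p for p in pron)

    hits = {}
    for word, prons in entries.items():
        seen = {}
        for pron in prons:
            key = normalize(pron)
            if key in seen and seen[key] != pron:
                hits.setdefault(word, []).append((seen[key], pron))
            else:
                seen.setdefault(key, pron)
    return hits
```

The pairs returned are only candidates for review; which member of a pair
to keep is a judgment the sketch does not attempt to make.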
II.
Bring the Sequitur g2p testing stage back. Generalize the testing setup for better throughput.
- doing this on an i7 octo is too slow
- revise on aspen/birch to be able to run parallel experiments
[201501] (air)
Weed out acronyms to allow for a cleaner dictionary (for g2p training).
Keep them in acronym-0.7b; the full dict is still the distribution.
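
How the acronyms were identified is not recorded here. One possible
heuristic, sketched below purely as an assumption, is to flag an entry
when its pronunciation (ignoring stress digits) is exactly the
concatenation of the letter names of its spelling. The LETTER_NAMES table
and the function names are hypothetical and are not taken from the
project's scripts.

```python
import re

# Assumed letter-name pronunciations in ARPAbet, without stress marks.
LETTER_NAMES = {
    "A": "EY", "B": "B IY", "C": "S IY", "D": "D IY", "E": "IY",
    "F": "EH F", "G": "JH IY", "H": "EY CH", "I": "AY", "J": "JH EY",
    "K": "K EY", "L": "EH L", "M": "EH M", "N": "EH N", "O": "OW",
    "P": "P IY", "Q": "K Y UW", "R": "AA R", "S": "EH S", "T": "T IY",
    "U": "Y UW", "V": "V IY", "W": "D AH B AH L Y UW", "X": "EH K S",
    "Y": "W AY", "Z": "Z IY",
}


def strip_stress(phones):
    """Remove the trailing stress digit from each phone (AY1 -> AY)."""
    return [re.sub(r"\d$", "", p) for p in phones]


def is_spelled_out(word, phones):
    """True if the phones are the letter names of the word, in order."""
    if len(word) < 2:  # single letters such as A or I are ordinary words
        return False
    expected = []
    for ch in word.upper():
        if ch not in LETTER_NAMES:
            return False
        expected.extend(LETTER_NAMES[ch].split())
    return strip_stress(phones) == expected


def split_acronyms(lines):
    """Split dictionary lines into (kept, acronyms) using the heuristic."""
    kept, acronyms = [], []
    for line in lines:
        stripped = line.strip()
        if not stripped or stripped.startswith(";;;"):
            kept.append(line)  # pass blanks and comments through
            continue
        head, *phones = stripped.split()
        word = re.sub(r"\(\d+\)$", "", head)  # drop the variant index
        (acronyms if is_spelled_out(word, phones) else kept).append(line)
    return kept, acronyms
```

Under this sketch the acronyms list would go to a separate file and the
kept list would feed g2p training. Acronyms that are pronounced as words
would not be caught by this test.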