cmudict refurb project
----------------------
[20130331] (air)
Starts with a version that was modified during the setup of the Google
AI AMT verification project.
Some changes will be made before the AMT data is vetted and folded
in. Specifically, the dict was run through Sequitur and deletion errors
were examined; for the most part these are mistakes in the dict and
would not need AMT verification; on the other hand it performs a
redundant check. Still...
[201412] (air)
Major refurb, part the 2.
I.
1. Start incorporating Nickolay words
2. Harvest lmtool LtoS invocations
3. Add color to reduced vowels (which are mostly AH0)
4. Weed out variants, especially AH0/IH0 (really a part of 3).
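
One way the AH0/IH0 variant check in items 3 and 4 could be approached is sketched below. It is only an illustration: it assumes the usual cmudict line layout ("WORD  PH PH ...", with alternates written as WORD(1), WORD(2), ... and ";;;" comment lines), and the function names parse_entries and reduced_vowel_variants are hypothetical, not part of any existing tool. The sample entries in the usage note are made up.

```python
import re
from collections import defaultdict

# Alternate pronunciations are assumed to be marked WORD(1), WORD(2), ...
VARIANT_SUFFIX = re.compile(r"\(\d+\)$")

def parse_entries(lines):
    """Group pronunciations by base word, folding WORD(n) into WORD."""
    prons = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line or line.startswith(";;;"):  # skip blanks and comments
            continue
        head, *phones = line.split()
        prons[VARIANT_SUFFIX.sub("", head)].append(tuple(phones))
    return prons

def reduced_vowel_variants(prons, a="AH0", b="IH0"):
    """Return {word: [(pron1, pron2), ...]} for variant pairs that differ
    only in the choice between phones a and b."""
    def fold(pron):
        # Map b onto a so that a/b differences disappear.
        return tuple(a if p == b else p for p in pron)

    hits = {}
    for word, variants in prons.items():
        pairs = [
            (p, q)
            for i, p in enumerate(variants)
            for q in variants[i + 1:]
            if p != q and fold(p) == fold(q)
        ]
        if pairs:
            hits[word] = pairs
    return hits
```

Usage: feed it the lines of a dict file, e.g. reduced_vowel_variants(parse_entries(open(path))), and review the reported pairs by hand; the sketch only flags candidates and does not decide which variant to keep.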
II.
Bring the Sequitur g2p testing stage back. Generalize testing setup for better throughput.
- doing this on an i7 octo is too slow
- revise on aspen/birch to be able to run parallel experiments
[201501] (air)
Weed out acronyms to allow for a cleaner dictionary (for g2p training).
Keep them in acronym-0.7b; the full dict is still the distribution.
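
The log does not say how acronyms were identified. As a rough sketch of one possible heuristic, the code below treats an entry as a spelled-out acronym when its pronunciation, with stress digits removed, is exactly the concatenation of the letter names of its spelling. The LETTER_NAMES table and the function names are assumptions made for illustration, and the sample entries in the usage note are made up.

```python
import re

# Assumed ARPAbet letter-name pronunciations, stress digits omitted.
LETTER_NAMES = {
    "A": "EY", "B": "B IY", "C": "S IY", "D": "D IY", "E": "IY",
    "F": "EH F", "G": "JH IY", "H": "EY CH", "I": "AY", "J": "JH EY",
    "K": "K EY", "L": "EH L", "M": "EH M", "N": "EH N", "O": "OW",
    "P": "P IY", "Q": "K Y UW", "R": "AA R", "S": "EH S", "T": "T IY",
    "U": "Y UW", "V": "V IY", "W": "D AH B AH L Y UW", "X": "EH K S",
    "Y": "W AY", "Z": "Z IY",
}
VARIANT_SUFFIX = re.compile(r"\(\d+\)$")  # WORD(1), WORD(2), ...

def strip_stress(phones):
    """Drop the trailing stress digit from each phone (AH0 -> AH)."""
    return [re.sub(r"\d$", "", p) for p in phones]

def is_spelled_acronym(word, phones):
    """True if phones are exactly the letter names of word, ignoring stress."""
    base = VARIANT_SUFFIX.sub("", word).upper()
    if len(base) < 2:
        return False  # single letters are not treated as acronyms here
    expected = []
    for ch in base:
        name = LETTER_NAMES.get(ch)
        if name is None:  # digits, apostrophes, periods, etc.
            return False
        expected.extend(name.split())
    return strip_stress(phones) == expected

def split_dict(entries):
    """Split (word, phones) pairs into (kept, acronyms)."""
    kept, acronyms = [], []
    for word, phones in entries:
        target = acronyms if is_spelled_acronym(word, phones) else kept
        target.append((word, phones))
    return kept, acronyms
```

Usage: split_dict([("FBI", "EH2 F B IY2 AY1".split()), ("CAT", "K AE1 T".split())]) places the first entry in the acronym list and keeps the second. Acronyms that are pronounced as words would not be caught by this heuristic and would need separate handling.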