CMU Pronouncing Dictionary
From Wikipedia, the free encyclopedia
CMU Pronouncing Dictionary | |
---|---|
Developed by | Carnegie Mellon University |
Latest release | 0.6 / 11 August 1998 |
Available in | English |
Development status | Unmaintained |
License | Public Domain |
Website | Homepage |
The CMU Pronouncing Dictionary (also known as cmudict) is a public domain pronouncing dictionary created by Carnegie Mellon University. It is used as the American lexicon for the Festival Speech Synthesis System. The latest release is 0.6, which contains 127,069 entries.
Contents |
[edit] Database Format
The database is distributed as a text file of the format word <two spaces> pronunciation. If there are multiple pronunciations available for a word, all subsequent entries are followed by an index in parentheses. The pronunciation is encoded using the Arpabet system. For example, the following pronunciations are available for encyclopedia:
ENCYCLOPEDIA AH0 N S AY2 K L AH0 P IY1 D IY0 AH0 ENCYCLOPEDIA(2) AH0 N S AY2 K L OW0 P IY1 D IY0 AH0
[edit] History
Version | Release date [1] |
---|---|
0.1 | 16 September 1993 |
0.2 | 10 March 1994 |
0.3 | 28 September 1994 |
0.4 | 8 November 1995 |
0.5 | No public release |
0.6 | 11 August 1998 |
[edit] License
The dictionary is released under the following license:
The Carnegie Mellon Pronouncing Dictionary [cmudict.0.6] is Copyright 1998 by Carnegie Mellon University. Use of this dictionary, for any research or commercial purpose, is completely unrestricted. If you make use of or redistribute this material, we would appreciate acknowledgement of its origin.
[edit] References
[edit] External links
- Homepage - includes database search
- RDF converted to Resource Description Framework by the open source Texai project.