python版汉字转拼音脚本【转】

实际上是字典举穷,把汉字码表和拼音对应起来了

[python]#!/usr/bin/env python
# encoding: utf-8
"""
Created by Eric Lo on 2010-05-20.
Copyright (c) 2010 __lxneng@gmail.com__. http://lxneng.com All rights reserved.
"""
class Pinyin():
def __init__(self, data_path='./Mandarin.dat'):
self.dict = {}
for line in open(data_path):
k, v = line.split('\t')
self.dict[k] = v
self.splitter = ''
def get_pinyin(self, chars=u"你好吗"):
result = []
for char in chars:
key = "%X" % ord(char)
try:
result.append(self.dict[key].split(" ")[0].strip()[:-1].lower())
except:
result.append(char)
return self.splitter.join(result)
def get_initials(self, char=u'你'):
try:
return self.dict["%X" % ord(char)].split(" ")[0][0]
except:
return char[/python]

[python]from xpinyin import Pinyin
p = Pinyin()
p.get_pinyin(u"上海")
#输出: 'shanghai'
p.get_initials(u"上")
#输出 'S'[/python]

下载数据库:http://github.com/lxneng/xpinyin/raw/master/Mandarin.dat

From :http://blog.lxneng.com/?p=87

Author Info :
  • From:python版汉字转拼音脚本【转】
  • URL:https://blog.ihipop.info/2010/10/1729.html
  • Please Reserve This Link,Thanks!
  • 《python版汉字转拼音脚本【转】》上有2条评论

    发表评论

    邮箱地址不会被公开。 必填项已用*标注