Kana / romaji
Python
cited
closed
pykakasi fails to romanize half-width katakana with voiced marks
Half-width katakana fails to romanize, especially with voiced marks
pykakasi · miurahr/pykakasi
Symptom
pykakasi does not romanize half-width katakana correctly, particularly when a half-width voiced or semi-voiced mark (U+FF9E / U+FF9F) follows the base kana.
Minimal repro
Convert a half-width katakana string such as ｶﾞ (half-width ka, U+FF76, followed by the half-width dakuten, U+FF9E). The expected romanization is 'ga', but pykakasi returns something else.
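A minimal sketch of the repro input, built from explicit code points so that an editor or font cannot silently swap in full-width forms. The `describe` and `romanize` helpers are hypothetical names for illustration. `romanize` assumes the pykakasi 2.x `kakasi().convert()` API, which returns a list of dicts with a `hepburn` key, and imports pykakasi lazily, so the rest of the snippet runs without it.

```python
import unicodedata

# Build the input from explicit code points so the half-width forms are unambiguous.
HALFWIDTH_KA = "\uff76"       # HALFWIDTH KATAKANA LETTER KA
HALFWIDTH_DAKUTEN = "\uff9e"  # HALFWIDTH KATAKANA VOICED SOUND MARK
text = HALFWIDTH_KA + HALFWIDTH_DAKUTEN


def describe(s):
    """Return (code point, Unicode name) pairs for each character in s."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch)) for ch in s]


def romanize(s):
    """Join pykakasi's Hepburn output for s (assumes pykakasi 2.x is installed)."""
    import pykakasi  # lazy import: only needed when actually romanizing

    return "".join(item["hepburn"] for item in pykakasi.kakasi().convert(s))
```

Calling `romanize(text)` on an affected pykakasi version should show the mismatch against the expected 'ga'; the exact wrong output is not recorded here.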
Fix
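Apply NFKC normalization to the input before romanization. This converts half-width katakana to full-width and composes a following half-width or combining voiced mark with its base kana, so half-width ka plus dakuten becomes the single character ガ (U+30AC).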
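The normalization step can be sketched with the standard library alone. `normalize_kana` is a hypothetical helper name, and this shows a pre-processing step applied by the caller, not necessarily how pykakasi itself implements the fix.

```python
import unicodedata


def normalize_kana(text: str) -> str:
    """Return text in NFKC form.

    Half-width katakana become full-width, and a trailing half-width
    voiced (U+FF9E) or semi-voiced (U+FF9F) mark is composed with its
    base kana into one precomposed character.
    """
    return unicodedata.normalize("NFKC", text)


# Half-width ka + half-width dakuten -> full-width GA (U+30AC)
ga = normalize_kana("\uff76\uff9e")
# Half-width ha + half-width handakuten -> full-width PA (U+30D1)
pa = normalize_kana("\uff8a\uff9f")
```

The normalized string can then be passed to the romanizer. Note that NFKC also folds other compatibility characters, for example full-width Latin letters to ASCII, so it may be too broad if the rest of the input must be preserved exactly.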