Transliteration program to convert roman script(singlish) to Unicode Sinhala. Can also transliterate text files written in roman script to Sinhala.
This Roman class is a subclass of int and supports the same methods int does, but any special methods that would normally return ints are return a new instance of Roman. You can use instances of this class in math expressions and a Roman instance...
Safely convert any given string type (text or binary) to unicode. You won't get UnicodeDecodeError error, at the cost of ignoring those errors during conversion, which is useful for debugging and logging. This recipe requires the six package.
It's a simple recipe to convert a str type string with pure unicode code point (e.g string = "\u5982\u679c\u7231" ) to an unicode type string.
Actually, this method has the same effect with 'u' prefix. But differently, it allows...
N = ROMAN2ARABIC( S ) produces the Arabic numeral, N, for a given Roman numeral, S. S is a string. N returns a scalar number.
S = ARABIC2ROMAN( N ) produces the Roman numeral, S, for a given Arabic numeral, N. N is a scalar number. S returns a string.
Limitation: Does not handle Arabic values greater than 3999
Unicode Conversion Gateway is a web-based proxy server to convert some of the Indian language web pages encoded in proprietary encodings into Unicode.
Padma, a popular Firefox extension, is extended and reimplemented in PHP to create...
Unicode Utils - Unicode algorithms for Ruby 1.9
Conversion of Unicode and Punycode on web based text is demonstrated on this ASP tutorial with working demo. And from this tutorial you will be able to know the terms ByteArray, Punycode, HexString, Base64 and a lot more in text string...
WinPST Ansi PST to Unicode Converter is simple yet powerful PST conversion tool to convert Ansi PST to Unicode PST and Vice versa. Software efficiently performs:
Convert ANSI PST file to Unicode PST file:
Microsoft Outlook 2003,...
Recipe for using unicode files (i.e. files opened with codecs.open) with the csv module.
Python 3 makes a clean separation between unicode text strings (str) and byte
strings (bytes). However, for some tasks (notably networking), it makes sense
to apply the same process to str and bytes, usually relying on the byte string
A replacement for "print" that will safely handle unicode conversion.
Simple routine for dumping any kind of string, ascii, encoded, or unicode, to a standard hex dump. Plus read/write of unicode and encoded strings.
Sometimes you want to pass XML document as unicode object which later should be encoded for output. Unfortunately very often you don't know the output encoding and can't set XML declaration properly. UnicodeXML adds XML declaration right on...
Python's built in function str() and unicode() return a string representation of the object in byte string and unicode string respectively. This enhanced version of str() and unicode() can be used as handy functions to convert between byte string...
You are processing unicode strings. You want to print the string but run into UnicodeEncodeError all the time. This recipe show you some simple steps to visualize unicode strings.
Convert decimals to Roman numerials.
I use a recursive algorithm here since it reflects the thinking clearly in code.
A non-recursive algothrithm can be done as well.
There are many cool word-wrap recipes, but most don't support unicode, such as Chinese characters. Note: you should use python2.4 to test the recipe for the 'gbk' encoding.
latin1_to_ascii -- The UNICODE Hammer -- AKA "The Stupid American"
This takes a UNICODE string and replaces Latin-1 characters with
something equivalent in 7-bit ASCII and returns a plain ASCII string.