Vietnamese character in .NET Console Application (UTF-8)

Posted by DucDigital on Stack Overflow See other posts from Stack Overflow or by DucDigital
Published on 2010-02-06T15:11:46Z Indexed on 2010/04/04 11:03 UTC
Read the original article Hit count: 388

Filed under:
|
|
|

Im trying to write down an UTF8 string (Vietnamese) into C# Console but no success. Im running on windows 7.

I tried to use the Encoding class that convert string to char[] to byte[] and then to String, but no help, the string is input directly fron the database.

Here is some example

Tôi tên là Ð?c, cu?c s?ng th?t vui v? tuy?t v?i

It does not show the special character like : Ð or ?... instead it show up ?, much worse with the Encoding class.

Does anyone can try this out or know about this problem?

Thank you


My code static void Main(string[] args) { XDataContext _new = new XDataContext(); Console.OutputEncoding = Encoding.GetEncoding("UTF-8"); string srcString = _new.Posts.First().TITLE;

        Console.WriteLine(srcString);
        // Convert the UTF-16 encoded source string to UTF-8 and ASCII.
        byte[] utf8String = Encoding.UTF8.GetBytes(srcString);
        byte[] asciiString = Encoding.ASCII.GetBytes(srcString);

        // Write the UTF-8 and ASCII encoded byte arrays. 
        Console.WriteLine("UTF-8  Bytes: {0}", BitConverter.ToString(utf8String));
        Console.WriteLine("ASCII  Bytes: {0}", BitConverter.ToString(asciiString));


        // Convert UTF-8 and ASCII encoded bytes back to UTF-16 encoded  
        // string and write.
        Console.WriteLine("UTF-8  Text : {0}", Encoding.UTF8.GetString(utf8String));
        Console.WriteLine("ASCII  Text : {0}", Encoding.ASCII.GetString(asciiString));

        Console.WriteLine(Encoding.UTF8.GetString(utf8String));
        Console.WriteLine(Encoding.ASCII.GetString(asciiString));
    }

and here is the outstanding output

Nhà báo đi hội báo Xuân
UTF-8  Bytes: 4E-68-C3-A0-20-62-C3-A1-6F-20-C4-91-69-20-68-E1-BB-99-69-20-62-C3-
A1-6F-20-58-75-C3-A2-6E
ASCII  Bytes: 4E-68-3F-20-62-3F-6F-20-3F-69-20-68-3F-69-20-62-3F-6F-20-58-75-3F-
6E
UTF-8  Text : Nhà báo đi hội báo Xuân
ASCII  Text : Nh? b?o ?i h?i b?o Xu?n
Nhà báo đi hội báo Xuân
Nh? b?o ?i h?i b?o Xu?n


Press any key to continue . . .

© Stack Overflow or respective owner

Related posts about utf-8

Related posts about console-application