Unicode or UTF-8 Confusion and confuse post

Products and tips

Moderator: Site Mods

Post Reply
User avatar
admsupport
Bronze Member
Bronze Member
Posts: 142
Joined: 2008 Dec 11, 00:18
Location: Japan

Unicode or UTF-8 Confusion and confuse post

Post by admsupport » 2009 Jan 02, 08:03

(sorry, wrong place, this post should go to computer and internet)

Can anyone give me a word of advice or basic explanation?

In IE Encodings:
European ISO (Unicode)
Unicode (UTF-8 )
Western European (Windows)
...

I am confused with these encoding formats (First are Encoding similar to  character sets like Morse, Baudot or ASCII?)

Then when I save a document in mix language (French/German, English/Japanese) or if I publish it on the internet, I usually chose Unicode (to preserve the accents). Is there a difference between Unicode and UTF-8 (I can see both options in notepad.exe) and what is the particular difference in a document saved in both format?

On MS Office US version, the default encoding is Western European (Windows).  So in short what the best encoding to save/publish document in mix languages in Windows and to write emails? Unicode, UTF-8, ANSI.

Thanks

User avatar
nikos
Site Admin
Site Admin
Posts: 14962
Joined: 2002 Feb 07, 15:57
Location: UK
Contact:

Post by nikos » 2009 Jan 03, 17:46

for mixed languages better use full unicode

User avatar
admsupport
Bronze Member
Bronze Member
Posts: 142
Joined: 2008 Dec 11, 00:18
Location: Japan

Post by admsupport » 2009 Jan 08, 06:27

nikos wrote:for mixed languages better use full unicode
Unicode or UT-F8? Does that make a difference when I save a document (.txt or .html) in mix language?

I found this on Wikipedia but I am not advanced...
[...]Unicode can be implemented by different character encodings. The most commonly used encodings are UTF-8[...]

User avatar
nikos
Site Admin
Site Admin
Posts: 14962
Joined: 2002 Feb 07, 15:57
Location: UK
Contact:

Post by nikos » 2009 Jan 08, 07:17

there are similarities, unicode is 2-byte whereas UTF-8 is multibyte. Probably you'd get smaller files with UTF-8. As with all things, you can experiment yourself and see how it goes ;)

ie save the same file in unicode then in utf8 and see which you like the most

User avatar
admsupport
Bronze Member
Bronze Member
Posts: 142
Joined: 2008 Dec 11, 00:18
Location: Japan

Post by admsupport » 2009 Jan 10, 01:10

Thanks, now I understand a little better.
The output was the same when I saved the same file either in Unicode/UTF-8, but I did not think about checking the size difference.

Post Reply