WC3 Encoding

Arhowk · Jun 10, 2013

What encoding does Warcraft 3 use?

I tried using UTF-8 to copy and paste unicode symbols into warcraft 3 for use in other-language games when commands are needed, but they aren't working.

Theres the program aclled UTFizer where if you click the button it sets it to the encoding and u can paste it to wc3 fine, but I dont know what encoding it uses

e/ UnicodeBig doesnt work either

Nestharus · Jun 10, 2013

US wc3 uses Ascii 256, but it depends on the region

.

Arhowk · Jun 10, 2013

Nestharus said:
US wc3 uses Ascii 256, but it depends on the region .

Okay, but that doesn't work :\

What im trying to figure out is how to convert the ascii into a UTF-8 type that non-English Warcraft 3s use.

I know its possible because UTFizer does it, makes the letters appear even when the charset is ASCII

(java)

Code:

            String c = "UTF-8";
            byte[] b = s.getBytes(c);
            Toolkit.getDefaultToolkit().getSystemClipboard().setContents(new StringSelection(new String(b, c)), null);

will just replace ?'s for the other characters

Dr Super Good · Jun 11, 2013

What im trying to figure out is how to convert the ascii into a UTF-8 type that non-English Warcraft 3s use.

This cannot be done. WC3 uses a character map system to convert 8 bit characters into localization specific text. This is why when you play Asian/Russian maps the text appears as garbage in English mode since the characters are mapping to "unused" characters or nonsense characters.

This means that text localization is mutually exclusive, you cannot have a single piece of text containing all localizations without the others appearing as garbage.

Warcraft III was designed with localization in mind however. The Strings file that holds virtually every tooltip and GUI trigger text piece is localized via an extension. By importing multiple into a map, you should be able to provide every region with targeted localization. The big problem is the extra map size as well as the slower load time (in-lining these strings speeds up load time).

The simplest test you can do is to paste in 256 characters and see how WC3 interprets them. Obviously you remove the control characters from those as they make WC3 act strangely.

Also maybe provide a sample of a localized tooltip containing the characters you want as then values could be extracted and tested for which character set is used. I am guessing some custom character set may be used for bit 7 characters which requires special mapping (and thus how the tool gets its claim to fame).

Arhowk · Jun 11, 2013

Dr Super Good said:
This cannot be done. WC3 uses a character map system to convert 8 bit characters into localization specific text. This is why when you play Asian/Russian maps the text appears as garbage in English mode since the characters are mapping to "unused" characters or nonsense characters.

This means that text localization is mutually exclusive, you cannot have a single piece of text containing all localizations without the others appearing as garbage.

Warcraft III was designed with localization in mind however. The Strings file that holds virtually every tooltip and GUI trigger text piece is localized via an extension. By importing multiple into a map, you should be able to provide every region with targeted localization. The big problem is the extra map size as well as the slower load time (in-lining these strings speeds up load time).

The simplest test you can do is to paste in 256 characters and see how WC3 interprets them. Obviously you remove the control characters from those as they make WC3 act strangely.

Also maybe provide a sample of a localized tooltip containing the characters you want as then values could be extracted and tested for which character set is used. I am guessing some custom character set may be used for bit 7 characters which requires special mapping (and thus how the tool gets its claim to fame).

Thats the funny thing though... Heres what i decoded one of the strings into

Code:

If i try to assemble a string with those same bytes, it gives me the same bytes back if i use .getBytes() but when i paste it in WC3, it just pastes a question mark.

The text of the bytes above is 시야. When encoded in UTFizer and pasted here, it looks like ?쒖빞 in Windows but works fine in game. When re-assembling it by byte in Java, it looks like ?쒖빞 outside of windows but ? inside of wc3

Dr Super Good · Jun 12, 2013

Could you post example maps with this working? As far as I am aware, WC3 will not render characters like ?쒖빞 because I have not set my character set to include Asian languages.

Arhowk · Jun 12, 2013

Dr Super Good said:
Could you post example maps with this working? As far as I am aware, WC3 will not render characters like ?쒖빞 because I have not set my character set to include Asian languages.

Sorry, it turns out to be a bug with Netbeans compiling .jars

http://stackoverflow.com/questions/17055689/netbeans-editor-not-building-properly

WC3 Encoding

Arhowk

Arhowk

Nestharus

Nestharus

Resources

Arhowk

Arhowk

Dr Super Good

Dr Super Good

Resources

Arhowk

Arhowk

Dr Super Good

Dr Super Good

Resources

Arhowk

Arhowk

Similar threads