2-byte code conversion

DataMagic allows the conversion of kanji code consisting of multi-byte characters.

In this manual, kanji code consisting of multi-byte characters is called 2-byte code.

In 2-byte code conversion, JIS basic kanji (JIS level 1 and 2 characters) is converted by default. Other characters that are not JIS basic kanji can also be converted.

Figure 4.5 Code conversion (2-byte codes)

The following shows how to specify the setting to convert characters that are not defined in JIS level 1 and 2. You can specify the setting for the DataMagic Server grade.

Select the Convert characters other than of JIS Level 1 or JIS Level 2 as well check box on the Code conversion tab of the Input Settings screen and Output Settings screen.

The following describes the combination of 2-byte code conversion when the Convert characters other than of JIS Level 1 or JIS Level 2 as well check box is selected.

Table 4.1 Availability of 2-byte code conversion

Input

Output

SHIFT-JIS

JEF

EUC

IBM kanji

KEIS

NEC kanji

UTF-16

UTF-8

JIS

SHIFT-JIS

N

N

A

N

N

A

A

N

JEF

N

N

N

N

N

N

N

N

EUC

N

N

N

N

N

N

N

N

IBM kanji

B

N

N

N

N

A

A

N

KEIS

N

N

N

N

N

N

N

N

NEC kanji

N

N

N

N

N

N

N

N

UTF-16

B

N

N

B

N

N

A

N

UTF-8

B

N

N

B

N

N

A

N

JIS

N

N

N

N

N

N

N

N

A

:

Code conversion in which codes that are handled as external characters do not occur

B

:

Code conversion in which codes that are handled as external characters occur

N

:

The characters other than of JIS Level 1 or JIS Level 2 are treated as external characters.

-

:

Code conversion does not occur

 

If the code conversion is performed between different kanji codes, the range of characters you can convert is the range of the kanji codes after the conversion. (For example, the range of character encodings when converting Shift JIS to KEIS is JIS basic kanji.)

Table 4.2 Input kanji code, output kanji code, and the range of characters that can be converted

Input kanji code

Output kanji code

Range of characters that can be converted

SHIFT-JIS

SHIFT-JIS

All characters that are defined in the Windows-31J

EUC

JIS basic kanji

IBM kanji

All characters that are defined in the IBM Japanese

JEF

JIS basic kanji

KEIS

JIS basic kanji

NEC kanji

JIS basic kanji

UTF-8

All characters that are defined in the Windows-31J

UTF-16

All characters that are defined in the Windows-31J

JIS

JIS basic kanji

SHIFT-JIS

EUC

JIS basic kanji

EUC

JIS basic kanji

IBM kanji

JIS basic kanji

JEF

JIS basic kanji

KEIS

JIS basic kanji

NEC kanji

JIS basic kanji

UTF-8

JIS basic kanji

UTF-16

JIS basic kanji

JIS

JIS basic kanji

SHIFT-JIS

IBM kanji

All characters that are defined in the IBM Japanese

EUC

JIS basic kanji

IBM kanji

All characters that are defined in the IBM Japanese

JEF

JIS basic kanji

KEIS

JIS basic kanji

NEC kanji

JIS basic kanji

UTF-8

All characters that are defined in the IBM Japanese

UTF-16

All characters that are defined in the IBM Japanese

JIS

JIS basic kanji

SHIFT-JIS

JEF

JIS basic kanji

EUC

JIS basic kanji

IBM kanji

JIS basic kanji

JEF

JIS basic kanji

KEIS

JIS basic kanji

NEC kanji

JIS basic kanji

UTF-8

JIS basic kanji

UTF-16

JIS basic kanji

JIS

JIS basic kanji

SHIFT-JIS

KEIS

JIS basic kanji

EUC

JIS basic kanji

IBM kanji

JIS basic kanji

JEF

JIS basic kanji

KEIS

JIS basic kanji

NEC kanji

JIS basic kanji

UTF-8

JIS basic kanji

UTF-16

JIS basic kanji

JIS

JIS basic kanji

SHIFT-JIS

NEC kanji

JIS basic kanji

EUC

JIS basic kanji

IBM kanji

JIS basic kanji

JEF

JIS basic kanji

KEIS

JIS basic kanji

NEC kanji

JIS basic kanji

UTF-8

JIS basic kanji

UTF-16

JIS basic kanji

JIS

JIS basic kanji

SHIFT-JIS

UTF-8

All characters that are defined in the Windows-31J

EUC

JIS basic kanji

IBM kanji

All characters that are defined in the IBM Japanese

JEF

JIS basic kanji

KEIS

JIS basic kanji

NEC kanji

JIS basic kanji

UTF-8

All characters

UTF-16

All characters

JIS

JIS basic kanji

SHIFT-JIS

UTF-16

All characters that are defined in the Windows-31J

EUC

JIS basic kanji

IBM kanji

All characters that are defined in the IBM Japanese

JEF

JIS basic kanji

KEIS

JIS basic kanji

NEC kanji

JIS basic kanji

UTF-8

All characters

UTF-16

All characters

JIS

JIS basic kanji

SHIFT-JIS

JIS

JIS basic kanji

EUC

JIS basic kanji

IBM kanji

JIS basic kanji

JEF

JIS basic kanji

KEIS

JIS basic kanji

NEC kanji

JIS basic kanji

UTF-8

JIS basic kanji

UTF-16

JIS basic kanji

JIS

JIS basic kanji

Characters not listed in the above table are handled as external characters and converted codes by setting an external character table.