WebLinks Developer's Document

Character encoding of RSS

In general, RSS feeds are encoded in UTF-8.

XOOPS uses a different character encoding for each language.
English sites use US-ASCII.
Japanese sites use EUC-JP.
Some web sites also output RSS feeds in an encoding other than UTF-8.

Moreover, the PHP XML parser functions can handle only ISO-8859-1, US-ASCII, and UTF-8.

Processing for displaying RSS

A fetched RSS feed is displayed in the following steps.

(1) Convert the fetched RSS to UTF-8.
Determine the character encoding from the encoding attribute of the XML declaration.
Assume UTF-8 when no encoding is specified.

<?xml version="1.0" encoding="***" ?>


(2) Parse the RSS with the PHP XML parser functions.

(3) Convert the parsed result from UTF-8 to the XOOPS character encoding.

(4) Display the result in the XOOPS character encoding.
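
The display steps can be sketched in PHP roughly as follows. This is only an illustration, not the module's actual code: the function names detect_rss_encoding and rss_titles_for_display are hypothetical, the mbstring and XML extensions are assumed to be loaded, and the feed's declared encoding is assumed to be one that mbstring knows. Rewriting the XML declaration after the conversion is an extra precaution added here so that the declaration agrees with the converted bytes. Step (4) is left to the caller, who prints the returned strings.

```php
<?php
// Hypothetical helper: read the encoding from the XML declaration.
// Falls back to UTF-8 when no encoding is given.
function detect_rss_encoding($xml)
{
    if (preg_match('/^\s*<\?xml[^>]*encoding\s*=\s*["\']([^"\']+)["\']/i', $xml, $m)) {
        return strtoupper($m[1]);
    }
    return 'UTF-8';
}

// Hypothetical helper covering steps (1) to (3).
// Returns the item titles converted to the site (XOOPS) encoding.
function rss_titles_for_display($xml, $site_encoding)
{
    // (1) Convert the feed to UTF-8 and update the declaration to match.
    $from = detect_rss_encoding($xml);
    if ($from !== 'UTF-8') {
        $xml = mb_convert_encoding($xml, 'UTF-8', $from);
        $xml = preg_replace('/(<\?xml[^>]*encoding\s*=\s*["\'])[^"\']+/i', '${1}UTF-8', $xml, 1);
    }

    // (2) Parse with the PHP XML parser functions.
    $parser = xml_parser_create('UTF-8');
    xml_parser_set_option($parser, XML_OPTION_CASE_FOLDING, 0);
    xml_parser_set_option($parser, XML_OPTION_TARGET_ENCODING, 'UTF-8');
    $values = array();
    if (!xml_parse_into_struct($parser, $xml, $values)) {
        return array(); // not well-formed; nothing to display
    }

    // (3) Collect item titles and convert them from UTF-8 to the site encoding.
    $titles  = array();
    $in_item = false;
    foreach ($values as $node) {
        if ($node['tag'] === 'item') {
            if ($node['type'] === 'open') {
                $in_item = true;
            } elseif ($node['type'] === 'close') {
                $in_item = false;
            }
        } elseif ($in_item && $node['tag'] === 'title'
                  && $node['type'] === 'complete' && isset($node['value'])) {
            $titles[] = mb_convert_encoding($node['value'], $site_encoding, 'UTF-8');
        }
    }
    return $titles;
}
```

A caller on a Japanese site would pass 'EUC-JP' as the second argument and print each returned title as-is.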

Processing for outputting RSS


(1) Convert the RSS from the XOOPS character encoding to UTF-8.
(2) Output the RSS in UTF-8.
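
A minimal sketch of the output side, under the same assumptions (mbstring loaded). The helper name build_rss_utf8 is hypothetical, and the feed it builds is only a skeleton: a real RSS channel also needs elements such as link and description.

```php
<?php
// Hypothetical helper: build a skeleton RSS document in UTF-8 from
// strings held in the site (XOOPS) encoding.
function build_rss_utf8($channel_title, array $item_titles, $site_encoding)
{
    // (1) Convert each string from the site encoding to UTF-8,
    //     then escape it for use as XML text.
    $to_xml = function ($text) use ($site_encoding) {
        $utf8 = mb_convert_encoding($text, 'UTF-8', $site_encoding);
        return htmlspecialchars($utf8, ENT_QUOTES, 'UTF-8');
    };

    $out  = '<?xml version="1.0" encoding="UTF-8"?>' . "\n";
    $out .= '<rss version="2.0"><channel>';
    $out .= '<title>' . $to_xml($channel_title) . '</title>';
    foreach ($item_titles as $title) {
        $out .= '<item><title>' . $to_xml($title) . '</title></item>';
    }
    $out .= '</channel></rss>';
    return $out;
}

// (2) Output in UTF-8. In a real page the header is sent before any body:
// header('Content-Type: text/xml; charset=utf-8');
// echo build_rss_utf8($channel_title, $item_titles, 'EUC-JP');
```

Converting before escaping keeps multi-byte sequences intact, because htmlspecialchars is then told the correct encoding of its input.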

Conversion of the character encoding


The PHP multi-byte (mbstring) functions are used to convert the character encoding.
However, in English-speaking regions, many servers do not include the multi-byte functions.

Therefore, by default, the multi-byte functions are not used.
This causes garbled characters in languages other than English.

In a Japanese environment, the multi-byte functions must be available.
If they are not, characters are garbled.

Whether the multi-byte functions are used is selected by language code: Japanese or others.
The selection is written in language/xxx/language_convert.php
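
The selection could look roughly like the sketch below. This is an assumption for illustration, not the contents of language_convert.php: the function name weblinks_sketch_convert and its $use_multibyte flag are hypothetical. The flag stands for the per-language choice (true for Japanese, false otherwise), and the function_exists check guards servers where mbstring is missing.

```php
<?php
// Hypothetical helper, not the module's actual code.
// Converts only when the language setting asks for it and mbstring is loaded;
// otherwise the text is passed through unchanged.
function weblinks_sketch_convert($text, $to, $from, $use_multibyte)
{
    if (!$use_multibyte || $to === $from) {
        return $text;
    }
    if (!function_exists('mb_convert_encoding')) {
        // No mbstring: non-ASCII text will stay in its original encoding.
        return $text;
    }
    return mb_convert_encoding($text, $to, $from);
}
```

With the flag off, ASCII-only text is displayed correctly, while text in other languages is passed through unconverted, which is the garbling described above.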


$Id: rss_code.html,v 1.1 2005/07/19 03:54:34 ohwada Exp $