|
|
|
|
|
- Conversion rules
- Supported tags
- Managing the character set
HTMLToRTF (Function) In french: HTMLVersRTF Converts an HTML string or an HTML buffer to an RTF string. The following operations are performed during the conversion: - Deletion of HTML tags
- Conversion of HTML special characters
- Conversion of CR characters (Carriage Return) to spaces
- Conversion of multiple spaces to single spaces
The formatting is kept "as best as possible".
MonTexteHTML is string = "<!--test-->'''''"e;Bonjour!"e;''"
Texte is string = HTMLVersRTF(MonTexteHTML)
Syntax
<Result> = HTMLToRTF(<Text in HTML format> [, <Charset used>])
<Result>: Character string RTF text corresponding to the result of the HTML conversion. The encoding used is the one of the current character set of WINDEV or WEBDEV. <Text in HTML format>: String or buffer Text to convert. <Charset used>: Optional Integer constant Constant identifying the character set used to write the <Text in HTML format>. The current character set of WINDEV or WEBDEV is used by default (charsetCurrent constant). If any information about the character set used is found in the <Text in HTML format>, this information has priority over this parameter. For more details on these constants, see Correspondence between languages, sub-languages, character sets and nations. Remarks - The HTML tags are analyzed to keep the best possible formatting in the output text (CR characters, spaces, tabs, etc.). Formatting is preserved as far as possible: bold, italics, colors, etc.
- Do not appear in RTF output:
- HTML tags
- content of the "header" (information in the <HEAD> tag)
- comments
- control texts
- scripts
- SSL definitions
- CSS styles (except "color" attributes)
- Management of CR characters
- 2 Carriage Returns are inserted to replace the following tags: <P>, <H1> to <H6>, <TABLE>, <UL> or <OL>
- 1 Carriage Return is inserted to replace the following tags: <BR>, <TR>, <LI>, <DD> or <DIV>
- 1 single Carriage Return is inserted if several identical tags (<TR>, <LI>, <DD> or <DIV>) follow one another (except for <BR> tags)
- Management of arrays
- A CR character is inserted for each array row (<TR> tag).
- A tab is inserted for each array column (<TD> tag).
- Management of special characters
A special character is a character defined in the HTML standard. For example, a space can be written as " ". This standard is automatically used.
Unmanaged tags are ignored: their content is treated as text.. The supported tags are as follows:
This page is also available for…
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|