分享
 
 
 

RFC1874 - SGML Media Types

王朝other·作者佚名  2008-05-31
窄屏简体版  字體: |||超大  

Network Working Group E. Levinson

Request for Comments: 1874 Accurate Information Systems, Inc.

Category: EXPerimental December 1995

SGML Media Types

Status of this Memo

This memo defines an Experimental Protocol for the Internet

community. This memo does not specify an Internet standard of any

kind. Discussion and suggestions for improvement are requested.

Distribution of this memo is unlimited.

Abstract

This document proposes new media sub-types of Text/SGML and

Application/SGML. These media types can be used in the exchange of

SGML documents and their entities. Specific details for the exchange

or encapsulation of groups of related SGML entities using MIME are

currently being considered by the mimesgml Working Group <sgml-

internet@eBT.com>.

1. IntrodUCtion

A need exists for the transfer the elements of documents constructed

using the Standard Generalized Markup Language (SGML) [ISO-8879].

While the specific details of such transfers are being considered

general agreement exists on the need to register basic media types

for the SGML entities not covered by existing types.

The Standard Generalized Markup Language (SGML) is used to encode

document structure and a rigorous description of it is left to [ISO-

8879]. The terms used in the present document attempt to be

consistent with SGML terminology and usage.

2. The SGML Media-Types

There are two media-types for SGML parsable entities, Text/SGML and

Application/SGML. Both have the same optional parameters. Text/SGML

provides a fallback to Text/Plain for those without SGML capability.

Senders should base the choice between text and application media-

types on the entity's content. Text is suggested for entities that

would be meaningful to a human being without SGML processing.

Application/SGML is recommended for all others.

2.1. Text/SGML

MIME type name: Text

MIME subtype name: SGML

Required parameters: none

Optional parameters: charset, SGML-bctf, SGML-boot

Encoding considerations: may be encoded

Security considerations: see section 4 below

Published specification: ISO 8879:1986

Person and email address to contact for further information:

E. Levinson <ELevinson@Accurate.com>

The Text/SGML media-type can be employed when the contents of the

SGML entity is intended to be read by a human and is in a readily

comprehensible form. That is the content can be easily discerned by

someone without SGML display software. Each record in the SGML

entity, delimited by record start (RS) and record end (RE) codes,

must correspond to a line in the Text/SGML body part.

SGML entities that do not meet the above requirements should use the

Application/SGML media-type.

See section 2.3 for a description of the parameters.

2.2. Application/SGML

MIME type name: Application

MIME subtype name: SGML

Required parameters: none

Optional parameters: SGML-bctf, SGML-boot

Encoding considerations: may be encoded

Security considerations: see section 4 below

Published specification: ISO-8879

Person and email address to contact for further information:

E. Levinson <ELevinson@Accurate.com>

Use the Application/SGML media-type for SGML text entities that are

not appropriate for Text/SGML. When used, each record start (RS) and

record end (RE) character shall be explicitly represented by the bit

combination specified in the SGML declaration.

The parameters are described in the next section.

2.3. SGML Sub-type Parameters

The parameters for the Text/ and Application/SGML subtypes are

defined below.

charset The charset parameter for Text/SGML is defined in

[RFC-1521], the valid values and their meaning are

registered by the Internet Assigned Numbers

Authority (IANA) [RFC-1590]. The default charset

value for all Text content-types is "us-ascii"

[RFC-1521].

The charset parameter is provided to permit non-

SGML capable systems to provide reasonable

behavior when Text/SGML defaults to Text/Plain.

SGML capable systems will use the SGML-bctf param-

eter.

SGML-bctf The SGML-bctf (SGML bit combination transformation

format) parameter describes the method used to

transform the entity's sequence of constant width

binary numbers (called "bit combinations" in [ISO

8879, 4.24]) into the octet stream contained in

the MIME body part.

Valid values for SGML-bctf are the BCTF notation

names defined in Annex C of [ISO-10744] and are

reproduced for convenience in the Appendix. The

default value is "identity", i.e. perform no

transformation.

SGML-boot The SGML-boot parameter value is the content-ID of

a MIME body part (Application/Octet-stream) that

satisfies the requirements of the boot attribute

in [ISO-10744]. The Appendix contains a summary

of those requirements. The SGML-boot parameter is

only applicable if the SGML entity is a document

entity.

3. Security Considerations

SGML entities contain information to be parsed and processed by the

recipient's SGML system. Those entities may contain and such systems

may permit explicit system level commands to be execute while

processing the data. To the extent that an SGML system will execute

arbitrary command strings recipients of SGML entities may be at risk.

Parsable SGML entities may also contain explicit processing

instructions for a presentation or composition system; use of such

instructions present concerns similar to those of

Application/PostScript.

4. References

[ISO-8879]

Information processing -- 8-bit Single-Byte Coded Graphic

Character Sets -- Part 1: Latin Alphabet No. 1, ISO

8859-1:1987.

[ISO-8879]

ISO 8879:1986, Information processing -- Text and Office

systems -- Standard Generalized Markup Language (SGML).

[ISO-10744]

ISO/IEC 10744:1992, Information technology --

Hypermedia/Time-based Structuring Language (HyTime) (as

modified by First Proposed Technical Corrigendum, ISO/IEC

JTC1/SC18 N5027)

[RFC-1521]

Borenstein, N., and N. Freed, "MIME (Multipurpose Internet

Mail Extensions) Part One: Mechanisms for Specifying and

Describing the Format of Internet Message Bodies", RFC

1521, Bellcore, Innosoft, September 1993.

[RFC-1590]

Postel, J., "Media Type Registration Procedure", RFC1590,

USC/Information Sciences Institute, March 1994.

[RFC-1642]

Goldsmith, D., and M. Davis, "UTF-7, A Mail-Safe

Transformation Format of UNICODE", RFC1642, Taligent,

Inc., July 1994.

5. Author's Address

Ed Levinson

Accurate Information Systems, Inc.

2 Industrial Way

Eatontown, NJ 07724

EMail: ELevinson@Accurate.com

APPENDIX

ISO-10744 BCTF Values and Boot Attribute

A.1. Bit Combination Transformation Format (BCTF) Values

The following list of Bit Combination Transformation Format (BCTF)

values is provided as a convenience. The authoritative source is

[ISO-10744].

identity Each bit combination is represented by a single

octet; this BCTF can be used only for entities all

of whose bit combinations have a value not exceeding

255.

fixed-2 Each bit combination is represented by exactly 2

octets, with the more significant octet first; this

BCTF can be used only for entities all of whose bit

combinations have a value not exceeding 65535.

fixed-3 Each bit combination is represented by exactly 3

octets, with a more significant octet preceding any

less significant octets; this BCTF can be used only

for entities all of whose bit combinations have a

value not exceeding 16777215.

fixed-4 Each bit combination is represented by exactly 4

octets, with a more significant octet preceding any

less significant octets.

utf-8 Each bit combination is represented by a variable

number of octets according to UCS Transformation

Format 8 defined in Annex P to be added by the first

proposed drafted amendment (PDAM 1) to ISO/IEC

10646-1:1993.

utf-7 Each bit combination is represented by a variable

number of octets in the range 0 through 127 as

described in [RFC-1642]; this BCTF can be used only

for entities all of whose bit combinations have a

value not exceeding 65535.

euc-jp Each bit combination is treated as a pair of octets,

most significant octet first, encoding a character

using the

Extended_UNIX_Code_Fixed_Width_for_Japanese charset,

and is transformed into the variable length sequence

of octets that would encode that character using the

Extended_UNIX_Code_Packed_Format_for_Japanese char-

set.

sjis Each bit combination is treated as a pair of octets,

most significant octet first, encoding a character

using the

Extended_UNIX_Code_Fixed_Width_for_Japanese charset,

and is transformed into the variable length sequence

of octets that would encode that character using the

Shift_JIS charset.

A.2. The Boot Attribute

The body part specified by the SGML-boot parameter contains a

sequence of triplets of positive integers separated by white space.

The triplets correspond to the described character set portion [IS0-

8879, 13.1.1.2] of the SGML declaration. SGML-boot provides the

capability to identify the character set of the document's SGML

declaration when it uses significant SGML characters [ibid., 4.298]

in the SGML reference concrete syntax [ibid., 13.4] that have a

character number [ibid., 4.44] in the document's character set that

differs from us-ascii. The default value is "0 128 0", all

characters are us-ascii.

Notes: (1) The triplet, <dscn noc bscn> has the following meaning.

Starting with character number dscn in the us-ascii character set,

renumber noc characters starting at bscn and incrementing by one.

Thus, 0 128 0, represents the identity mapping. (2) The document's

declaration itself may also redefine the significant SGML characters;

the boot attribute is intended to bootstrap the SGML system's parse

of the declaration.

 
 
 
免责声明:本文为网络用户发布,其观点仅代表作者个人观点,与本站无关,本站仅提供信息存储服务。文中陈述内容未经本站证实,其真实性、完整性、及时性本站不作任何保证或承诺,请读者仅作参考,并请自行核实相关内容。
2023年上半年GDP全球前十五强
 百态   2023-10-24
美众议院议长启动对拜登的弹劾调查
 百态   2023-09-13
上海、济南、武汉等多地出现不明坠落物
 探索   2023-09-06
印度或要将国名改为“巴拉特”
 百态   2023-09-06
男子为女友送行,买票不登机被捕
 百态   2023-08-20
手机地震预警功能怎么开?
 干货   2023-08-06
女子4年卖2套房花700多万做美容:不但没变美脸,面部还出现变形
 百态   2023-08-04
住户一楼被水淹 还冲来8头猪
 百态   2023-07-31
女子体内爬出大量瓜子状活虫
 百态   2023-07-25
地球连续35年收到神秘规律性信号,网友:不要回答!
 探索   2023-07-21
全球镓价格本周大涨27%
 探索   2023-07-09
钱都流向了那些不缺钱的人,苦都留给了能吃苦的人
 探索   2023-07-02
倩女手游刀客魅者强控制(强混乱强眩晕强睡眠)和对应控制抗性的关系
 百态   2020-08-20
美国5月9日最新疫情:美国确诊人数突破131万
 百态   2020-05-09
荷兰政府宣布将集体辞职
 干货   2020-04-30
倩女幽魂手游师徒任务情义春秋猜成语答案逍遥观:鹏程万里
 干货   2019-11-12
倩女幽魂手游师徒任务情义春秋猜成语答案神机营:射石饮羽
 干货   2019-11-12
倩女幽魂手游师徒任务情义春秋猜成语答案昆仑山:拔刀相助
 干货   2019-11-12
倩女幽魂手游师徒任务情义春秋猜成语答案天工阁:鬼斧神工
 干货   2019-11-12
倩女幽魂手游师徒任务情义春秋猜成语答案丝路古道:单枪匹马
 干货   2019-11-12
倩女幽魂手游师徒任务情义春秋猜成语答案镇郊荒野:与虎谋皮
 干货   2019-11-12
倩女幽魂手游师徒任务情义春秋猜成语答案镇郊荒野:李代桃僵
 干货   2019-11-12
倩女幽魂手游师徒任务情义春秋猜成语答案镇郊荒野:指鹿为马
 干货   2019-11-12
倩女幽魂手游师徒任务情义春秋猜成语答案金陵:小鸟依人
 干货   2019-11-12
倩女幽魂手游师徒任务情义春秋猜成语答案金陵:千金买邻
 干货   2019-11-12
 
推荐阅读
 
 
 
>>返回首頁<<
 
靜靜地坐在廢墟上,四周的荒凉一望無際,忽然覺得,淒涼也很美
© 2005- 王朝網路 版權所有