INTERNET DRAFT Paul Burchard, Princeton CS Expires in six months Dave Raggett, W3 Consortium Compound Documents in HTML Status of this Memo This document is an Internet draft. Internet drafts are working documents of the Internet Engineering Task Force (IETF), its areas and its working groups. Note that other groups may also distribute working information as Internet drafts. Internet Drafts are draft documents valid for a maximum of six months and can be updated, replaced or obsoleted by other documents at any time. It is inappropriate to use Internet drafts as reference material or to cite them as other than as "work in progress". To learn the current status of any Internet draft please check the "lid-abstracts.txt" listing contained in the Internet drafts shadow directories on ftp.is.co.za (Africa), nic.nordu.net (Europe), munnari.oz.au (Pacific Rim), ds.internic.net (US East coast) or ftp.isi.edu (US West coast). Further information about the IETF can be found at URL: http://www.cnri.reston.va.us/ Distribution of this document is unlimited. Please send comments to the HTML working group (HTML-WG) of the Internet Engineering Task Force (IETF) at . Discussions of this group are archived at URL: http://www.acl.lanl.gov/HTML-WG/archives.html. 1. Abstract This specification provides an HTML implementation of a simple compound document architecture for the World Wide Web, based on a new

element. By not restricting itself to a limited class of media types or media handler implementations, this element enables portable compound document markup, and encourages the modular design of user-agents. Although this specification does not presume to define a concrete API between extensible user-agents and their media handlers, some high-level requirements are imposed on the embedding semantics in order to ensure support for the full linking and embedding model. By making

a container element, rich alternative text with links and images is enabled. Moreover, the container element provides superior extensibility, setting the stage for structured enhancement of SGML content models, rather than sole dependence on proliferation of attributes. ------------------------------------------------------------------------------ Burchard and Raggett Page 1 Compound Documents in HTML Contents 1. Abstract .......................................................... 1 2. Examples and Rationale ............................................ 2 3. Compound Document Architecture .................................... 4 4. Geometry Negotiation .............................................. 5 5. HTML Markup ....................................................... 5 a) Elements ...................................................... 5 b) Attribute Value Types ......................................... 8 c) Attribute Sets ................................................ 9 6. Summary of DTD Changes for HTML .................................. 14 7. Transition Issues ................................................ 18 a) Sun's APPLET Tag ............................................. 18 b) Netscape's Early Implementations of EMBED .................... 18 8. Security Considerations .......................................... 18 9. Acknowledgments .................................................. 18 10. References ....................................................... 19 11. Authors' Addresses ............................................... 19 2. Examples and Rationale The only compound document feature defined in HTML 2.0 [1] was the tag for embedding image media into HTML. Although arguably, next to , the single most influential tag in the explosive growth of the Web, its shortcomings are now causing a proliferation of media- and user-agent-specific attempts at extending HTML. The most serious shortcoming of the tag is its arbitrary restriction to image media. Many of the proposed embedding extensions also use names which suggest specialized functionality (e.g. , ). But as modular, extensible user agents become the norm, such restrictions become untenable. We propose that the name EMBED is both broad and intuitive enough to denote a generic embedding element. In order to avoid implementation-dependent markup, it is essential that this EMBED tag should cover all embeddable Internet media types [2], however the Burchard and Raggett Page 2 Compound Documents in HTML corresponding media handlers are implemented (`built-ins', `plug-ins', etc.). The second key shortcoming of the tag is that it is `empty' (meaning that it doesn't use a closing ). Since SGML attributes are the only mechanism then available for the expression of element properties, all the powerful structuring capabilities of SGML are lost. In particular, the inability to nest elements inside an empty element means that properties with rich text values (such as would be desirable for the alternative text of ) cannot be implemented. Moreover, extensibility of empty elements suffers because everything occurs in the a single, flat attribute namespace. Whereas empty elements with complex properties create an "attribute soup" that may not even be legal SGML: [applet]