The 'application/xhtml+xml' Media Type
draft-baker-xhtml-media-reg-01.txt
Mark A. Baker
Status of this Memo
This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC2026.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as
Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six
months and may be updated, replaced, or obsoleted by other
documents at any time. It is inappropriate to use Internet-
Drafts as reference material or to cite them other than as
"work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
This Internet-Draft will expire on August 5, 2001.
Abstract
This document defines the 'application/xhtml+xml' MIME media type
for XHTML based markup languages; it is not intended to obsolete
any previous IETF documents, in particular RFC 2854 which registers
'text/html'.
This document was prepared by members of the W3C HTML working group
based on the structure, and some of the content, of RFC 2854, the
registration of 'text/html'. Please send comments to
www-html@w3.org, a public mailing list (requiring subscription)
with archives at .
1. Introduction
In 1998, the W3C HTML working group began work on reformulating HTML
in terms of XML 1.0 [XML] and XML Namespaces [XMLNS]. The first
part of that work concluded in January 2000 with the publication of
the XHTML 1.0 Recommendation [XHTML1], the reformulation for HTML
4.01 [HTML401].
Work continues in the HTML WG on XHTML Modularization (see
http://www.w3.org/TR/xhtml-modularization), the decomposition of
XHTML 1.0 into modules that can be used to compose new XHTML based
languages, plus a framework for supporting this composition.
As of February 2001, the HTML WG has taken no official position on
what MIME media type should be used to describe XHTML 1.0 or any
other XHTML based language, except in the case where XHTML 1.0
documents satisfy certain additional requirements (see [XHTML1]
section 5.1) and can be described with "text/html" (see [TEXTHTML]).
This document only registers a new MIME media type,
'application/xhtml+xml'. It does not define anything more than is
required to perform this registration. The HTML WG expects to
publish further documentation on this subject, including but not
limited to, information about rules for which documents should and
should not be described with this new media type, and further
information about recognizing XHTML documents.
This document follows the convention set out in [XMLMIME] for the
MIME subtype name; attaching the suffix "+xml" to denote that the
entity being described conforms to the XML syntax as defined in XML
1.0 [XML].
2. Registration of MIME media type application/xhtml+xml
MIME media type name: application
MIME subtype name: xhtml+xml
Required parameters: none
Optional parameters:
charset
This parameter has identical semantics to the charset parameter
of the "application/xml" media type as specified in [XMLMIME].
profile
See Section 8 of this document.
Encoding considerations:
See Section 4 of this document.
Security considerations:
See Section 7 of this document.
Interoperability considerations:
XHTML 1.0 [XHTML10] specifies user agent conformance rules that
dictate behaviour that must be followed when dealing with, amoung
other things, unrecognized elements.
With respect to XHTML Modularization [XHTMLMOD] and the existence
of XHTML based languages (referred to as XHTML family members)
that are not XHTML 1.0 conformant languages, it is possible that
'application/xhtml+xml' may be used to describe some of these
documents. The HTML WG will be releasing further guidelines about
what documents should and should not be described with this type.
However, it should suffice for now for the purposes of
interoperability that user agents accepting
'application/xhtml+xml' content use the user agent conformance
rules in [XHTML1].
Although conformant 'application/xhtml+xml' interpreters can
expect that content received is well-formed XML (as defined in
[XML]), it cannot be guaranteed that the content is valid XHTML
(as defined in [XHTML1]. This is in large part due to the reasons
in the preceding paragraph.
Published specification:
XHTML 1.0 is now defined by W3C Recommendation; the latest
published version is [XHTML1]. It provides for the description of
some types of conformant content as "text/html", but also doesn't
disallow the use with other content types (effectively allowing
for the possibility of this new type).
Applications which use this media type:
Some content authors have already begun hand and tool
authoring on the Web with XHTML 1.0. However that content
is currently described as "text/html", allowing existing
Web browsers to process it without reconfiguration for a
new media type.
There is no experimental, vendor specific, or personal tree
predecessor to 'application/xhtml+xml', reflecting the fact that
no applications currently recognize it. This new type is being
registered in order to allow for the expected deployment of XHTML
on the World Wide Web, as a first class XML application where
authors can expect that user agents are conformant XML 1.0 [XML]
processors.
Additional information:
Magic number:
There is no single initial byte sequence that is always present
for XHTML files. However, Section 5 below gives some guidelines
for recognizing XHTML files.
File extension:
There are two known file extensions that are currently in use
for XHTML 1.0; ".xht" and ".xhtml".
It is not recommended that the ".xml" extension (defined in
[XMLMIME]) be used, as web servers may be configured to
distribute such content as type "text/xml" or "application/xml".
[XMLMIME] discusses the unreliability of this approach in
section 3. Of course, should the author desire this behaviour,
then the ".xml" extension can be used.
Macintosh File Type code: TEXT
Person & email address to contact for further information:
Mark Baker
Intended usage: COMMON
Author/Change controller:
The XHTML specifications are a work product of the World
Wide Web Consortium's HTML Working Group. The W3C has change
control over these specifications.
3. Fragment identifiers
For documents labeled as 'application/xhtml+xml', the fragment
identifier notation is exactly that for application/xml, as
specified in [XMLMIME].
4. Encoding considerations
By virtue of XHTML content being XML, it has the same considerations
when sent as 'application/xhtml+xml' as does XML. See [XMLMIME],
section 3.2.
5. Recognizing XHTML files
All XHTML files will have the string "
(or ).
[MIME] Freed, N., and Borenstein, N., "Multipurpose Internet Mail
Extensions (MIME) Part Two: Media Types", RFC 2046, November
1996.
[XHTML1] "XHTML 1.0: The Extensible HyperText Markup Language: A
Reformulation of HTML 4 in XML 1.0", W3C Recommendation,
January 2000. Available at .
[XML] "Extensible Markup Language (XML) 1.0", W3C Recommendation,
February 1998. Available at
(or ).
[TEXTHTML] Connolly, D., Masinter, L., "The 'text/html' Media Type",
RFC 2854, June 2000.
[XMLMIME] Murata, M., St.Laurent, S., Kohn, D., "XML Media Types",
RFC 3023, January 2001.
Appendix A. Revision History (to be removed before publication)
draft-baker-xhtml-media-reg-01: Cleaned up text around "//DTD XHTML"
(thanks to Arjun Ray). Elaborated on the need to keep schema-location
in light of Content-feature and UAPROF. Updated draft-murata-xml
reference to RFC 3023. Added mention of needing to subscribe to
www-html. Added clarification on recommendation against using ".xml".
Changed "schema-location" to "profile", changed description of how it
was used, and added reference to the HTML HEAD parameter of the same
name.