idnits 2.17.1 

draft-ietf-httpbis-p3-payload-07.txt:

  Checking boilerplate required by RFC 5378 and the IETF Trust (see
  https://trustee.ietf.org/license-info):
  ----------------------------------------------------------------------------

  ** The document seems to lack a License Notice according IETF Trust
     Provisions of 28 Dec 2009, Section 6.b.i or Provisions of 12 Sep 2009
     Section 6.b -- however, there's a paragraph with a matching beginning.
     Boilerplate error?

     (You're using the IETF Trust Provisions' Section 6.b License Notice from
     12 Feb 2009 rather than one of the newer Notices.  See
     https://trustee.ietf.org/license-info/.)


  Checking nits according to https://www.ietf.org/id-info/1id-guidelines.txt:
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/checklist :
  ----------------------------------------------------------------------------

     No issues found here.

  Miscellaneous warnings:
  ----------------------------------------------------------------------------

  == The copyright year in the IETF Trust and authors Copyright Line does not
     match the current year

  -- The document seems to contain a disclaimer for pre-RFC5378 work, and may
     have content which was first submitted before 10 November 2008.  The
     disclaimer is necessary when there are original authors that you have
     been unable to contact, or if some do not wish to grant the BCP78 rights
     to the IETF Trust.  If you are able to get all authors (current and
     original) to grant those rights, you can and should remove the
     disclaimer; otherwise, the disclaimer is needed and you can ignore this
     comment. (See the Legal Provisions document at
     https://trustee.ietf.org/license-info for more information.)

  -- The document date (July 13, 2009) is 5400 days in the past.  Is this
     intentional?

  -- Found something which looks like a code comment -- if you have code
     sections in the document, please surround them with '<CODE BEGINS>' and
     '<CODE ENDS>' lines.


  Checking references for intended status: Proposed Standard
  ----------------------------------------------------------------------------

     (See RFCs 3967 and 4897 for information about using normative references
     to lower-maturity documents in RFCs)

  -- Possible downref: Non-RFC (?) normative reference: ref. 'ISO-8859-1'

  == Outdated reference: A later version (-26) exists of
     draft-ietf-httpbis-p1-messaging-07

  == Outdated reference: A later version (-26) exists of
     draft-ietf-httpbis-p2-semantics-07

  == Outdated reference: A later version (-26) exists of
     draft-ietf-httpbis-p4-conditional-07

  == Outdated reference: A later version (-26) exists of
     draft-ietf-httpbis-p5-range-07

  == Outdated reference: A later version (-26) exists of
     draft-ietf-httpbis-p6-cache-07

  ** Obsolete normative reference: RFC 1766 (Obsoleted by RFC 3066, RFC 3282)

  ** Downref: Normative reference to an Informational RFC: RFC 1950

  ** Downref: Normative reference to an Informational RFC: RFC 1951

  ** Downref: Normative reference to an Informational RFC: RFC 1952

  -- Obsolete informational reference (is this intentional?): RFC 2068
     (Obsoleted by RFC 2616)

  -- Obsolete informational reference (is this intentional?): RFC 2388
     (Obsoleted by RFC 7578)

  -- Obsolete informational reference (is this intentional?): RFC 2616
     (Obsoleted by RFC 7230, RFC 7231, RFC 7232, RFC 7233, RFC 7234, RFC 7235)

  -- Obsolete informational reference (is this intentional?): RFC 4288
     (Obsoleted by RFC 6838)


     Summary: 5 errors (**), 0 flaws (~~), 6 warnings (==), 8 comments (--).

     Run idnits with the --verbose option for more detailed information about
     the items above.

--------------------------------------------------------------------------------


2	HTTPbis Working Group                                   R. Fielding, Ed.
3	Internet-Draft                                              Day Software
4	Obsoletes: 2616 (if approved)                                  J. Gettys
5	Intended status: Standards Track                    One Laptop per Child
6	Expires: January 14, 2010                                       J. Mogul
7	                                                                      HP
8	                                                              H. Frystyk
9	                                                               Microsoft
10	                                                             L. Masinter
11	                                                           Adobe Systems
12	                                                                P. Leach
13	                                                               Microsoft
14	                                                          T. Berners-Lee
15	                                                                 W3C/MIT
16	                                                           Y. Lafon, Ed.
17	                                                                     W3C
18	                                                         J. Reschke, Ed.
19	                                                              greenbytes
20	                                                           July 13, 2009

22	       HTTP/1.1, part 3: Message Payload and Content Negotiation
23	                    draft-ietf-httpbis-p3-payload-07

25	Status of this Memo

27	   This Internet-Draft is submitted to IETF in full conformance with the
28	   provisions of BCP 78 and BCP 79.  This document may contain material
29	   from IETF Documents or IETF Contributions published or made publicly
30	   available before November 10, 2008.  The person(s) controlling the
31	   copyright in some of this material may not have granted the IETF
32	   Trust the right to allow modifications of such material outside the
33	   IETF Standards Process.  Without obtaining an adequate license from
34	   the person(s) controlling the copyright in such materials, this
35	   document may not be modified outside the IETF Standards Process, and
36	   derivative works of it may not be created outside the IETF Standards
37	   Process, except to format it for publication as an RFC or to
38	   translate it into languages other than English.

40	   Internet-Drafts are working documents of the Internet Engineering
41	   Task Force (IETF), its areas, and its working groups.  Note that
42	   other groups may also distribute working documents as Internet-
43	   Drafts.

45	   Internet-Drafts are draft documents valid for a maximum of six months
46	   and may be updated, replaced, or obsoleted by other documents at any
47	   time.  It is inappropriate to use Internet-Drafts as reference
48	   material or to cite them other than as "work in progress."
49	   The list of current Internet-Drafts can be accessed at
50	   http://www.ietf.org/ietf/1id-abstracts.txt.

52	   The list of Internet-Draft Shadow Directories can be accessed at
53	   http://www.ietf.org/shadow.html.

55	   This Internet-Draft will expire on January 14, 2010.

57	Copyright Notice

59	   Copyright (c) 2009 IETF Trust and the persons identified as the
60	   document authors.  All rights reserved.

62	   This document is subject to BCP 78 and the IETF Trust's Legal
63	   Provisions Relating to IETF Documents in effect on the date of
64	   publication of this document (http://trustee.ietf.org/license-info).
65	   Please review these documents carefully, as they describe your rights
66	   and restrictions with respect to this document.

68	Abstract

70	   The Hypertext Transfer Protocol (HTTP) is an application-level
71	   protocol for distributed, collaborative, hypermedia information
72	   systems.  HTTP has been in use by the World Wide Web global
73	   information initiative since 1990.  This document is Part 3 of the
74	   seven-part specification that defines the protocol referred to as
75	   "HTTP/1.1" and, taken together, obsoletes RFC 2616.  Part 3 defines
76	   HTTP message content, metadata, and content negotiation.

78	Editorial Note (To be removed by RFC Editor)

80	   Discussion of this draft should take place on the HTTPBIS working
81	   group mailing list (ietf-http-wg@w3.org).  The current issues list is
82	   at <http://tools.ietf.org/wg/httpbis/trac/report/11> and related
83	   documents (including fancy diffs) can be found at
84	   <http://tools.ietf.org/wg/httpbis/>.

86	   The changes in this draft are summarized in Appendix E.8.

88	Table of Contents

90	   1.  Introduction . . . . . . . . . . . . . . . . . . . . . . . . .  5
91	     1.1.  Requirements . . . . . . . . . . . . . . . . . . . . . . .  5
92	     1.2.  Syntax Notation  . . . . . . . . . . . . . . . . . . . . .  5
93	       1.2.1.  Core Rules . . . . . . . . . . . . . . . . . . . . . .  6
94	       1.2.2.  ABNF Rules defined in other Parts of the
95	               Specification  . . . . . . . . . . . . . . . . . . . .  6
96	   2.  Protocol Parameters  . . . . . . . . . . . . . . . . . . . . .  6
97	     2.1.  Character Sets . . . . . . . . . . . . . . . . . . . . . .  6
98	       2.1.1.  Missing Charset  . . . . . . . . . . . . . . . . . . .  7
99	     2.2.  Content Codings  . . . . . . . . . . . . . . . . . . . . .  7
100	     2.3.  Media Types  . . . . . . . . . . . . . . . . . . . . . . .  9
101	       2.3.1.  Canonicalization and Text Defaults . . . . . . . . . .  9
102	       2.3.2.  Multipart Types  . . . . . . . . . . . . . . . . . . . 10
103	     2.4.  Language Tags  . . . . . . . . . . . . . . . . . . . . . . 11
104	   3.  Entity . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
105	     3.1.  Entity Header Fields . . . . . . . . . . . . . . . . . . . 12
106	     3.2.  Entity Body  . . . . . . . . . . . . . . . . . . . . . . . 12
107	       3.2.1.  Type . . . . . . . . . . . . . . . . . . . . . . . . . 12
108	       3.2.2.  Entity Length  . . . . . . . . . . . . . . . . . . . . 13
109	   4.  Content Negotiation  . . . . . . . . . . . . . . . . . . . . . 13
110	     4.1.  Server-driven Negotiation  . . . . . . . . . . . . . . . . 14
111	     4.2.  Agent-driven Negotiation . . . . . . . . . . . . . . . . . 15
112	     4.3.  Transparent Negotiation  . . . . . . . . . . . . . . . . . 15
113	   5.  Header Field Definitions . . . . . . . . . . . . . . . . . . . 16
114	     5.1.  Accept . . . . . . . . . . . . . . . . . . . . . . . . . . 16
115	     5.2.  Accept-Charset . . . . . . . . . . . . . . . . . . . . . . 18
116	     5.3.  Accept-Encoding  . . . . . . . . . . . . . . . . . . . . . 19
117	     5.4.  Accept-Language  . . . . . . . . . . . . . . . . . . . . . 20
118	     5.5.  Content-Encoding . . . . . . . . . . . . . . . . . . . . . 22
119	     5.6.  Content-Language . . . . . . . . . . . . . . . . . . . . . 23
120	     5.7.  Content-Location . . . . . . . . . . . . . . . . . . . . . 24
121	     5.8.  Content-MD5  . . . . . . . . . . . . . . . . . . . . . . . 24
122	     5.9.  Content-Type . . . . . . . . . . . . . . . . . . . . . . . 26
123	   6.  IANA Considerations  . . . . . . . . . . . . . . . . . . . . . 26
124	     6.1.  Message Header Registration  . . . . . . . . . . . . . . . 26
125	   7.  Security Considerations  . . . . . . . . . . . . . . . . . . . 27
126	     7.1.  Privacy Issues Connected to Accept Headers . . . . . . . . 27
127	     7.2.  Content-Disposition Issues . . . . . . . . . . . . . . . . 28
128	   8.  Acknowledgments  . . . . . . . . . . . . . . . . . . . . . . . 28
129	   9.  References . . . . . . . . . . . . . . . . . . . . . . . . . . 28
130	     9.1.  Normative References . . . . . . . . . . . . . . . . . . . 28
131	     9.2.  Informative References . . . . . . . . . . . . . . . . . . 30
132	   Appendix A.  Differences Between HTTP Entities and RFC 2045
133	                Entities  . . . . . . . . . . . . . . . . . . . . . . 31
134	     A.1.  MIME-Version . . . . . . . . . . . . . . . . . . . . . . . 31
135	     A.2.  Conversion to Canonical Form . . . . . . . . . . . . . . . 32
136	     A.3.  Conversion of Date Formats . . . . . . . . . . . . . . . . 32
137	     A.4.  Introduction of Content-Encoding . . . . . . . . . . . . . 32
138	     A.5.  No Content-Transfer-Encoding . . . . . . . . . . . . . . . 33
139	     A.6.  Introduction of Transfer-Encoding  . . . . . . . . . . . . 33
140	     A.7.  MHTML and Line Length Limitations  . . . . . . . . . . . . 33
141	   Appendix B.  Additional Features . . . . . . . . . . . . . . . . . 33
142	     B.1.  Content-Disposition  . . . . . . . . . . . . . . . . . . . 34
143	   Appendix C.  Compatibility with Previous Versions  . . . . . . . . 34
144	     C.1.  Changes from RFC 2068  . . . . . . . . . . . . . . . . . . 34
145	     C.2.  Changes from RFC 2616  . . . . . . . . . . . . . . . . . . 35
146	   Appendix D.  Collected ABNF  . . . . . . . . . . . . . . . . . . . 35
147	   Appendix E.  Change Log (to be removed by RFC Editor before
148	                publication)  . . . . . . . . . . . . . . . . . . . . 37
149	     E.1.  Since RFC2616  . . . . . . . . . . . . . . . . . . . . . . 37
150	     E.2.  Since draft-ietf-httpbis-p3-payload-00 . . . . . . . . . . 37
151	     E.3.  Since draft-ietf-httpbis-p3-payload-01 . . . . . . . . . . 38
152	     E.4.  Since draft-ietf-httpbis-p3-payload-02 . . . . . . . . . . 38
153	     E.5.  Since draft-ietf-httpbis-p3-payload-03 . . . . . . . . . . 39
154	     E.6.  Since draft-ietf-httpbis-p3-payload-04 . . . . . . . . . . 39
155	     E.7.  Since draft-ietf-httpbis-p3-payload-05 . . . . . . . . . . 39
156	     E.8.  Since draft-ietf-httpbis-p3-payload-06 . . . . . . . . . . 40
157	   Index  . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
158	   Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 42

160	1.  Introduction

162	   This document defines HTTP/1.1 message payloads (a.k.a., content),
163	   the associated metadata header fields that define how the payload is
164	   intended to be interpreted by a recipient, the request header fields
165	   that may influence content selection, and the various selection
166	   algorithms that are collectively referred to as HTTP content
167	   negotiation.

169	   This document is currently disorganized in order to minimize the
170	   changes between drafts and enable reviewers to see the smaller errata
171	   changes.  The next draft will reorganize the sections to better
172	   reflect the content.  In particular, the sections on entities will be
173	   renamed payload and moved to the first half of the document, while
174	   the sections on content negotiation and associated request header
175	   fields will be moved to the second half.  The current mess reflects
176	   how widely dispersed these topics and associated requirements had
177	   become in [RFC2616].

179	1.1.  Requirements

181	   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
182	   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
183	   document are to be interpreted as described in [RFC2119].

185	   An implementation is not compliant if it fails to satisfy one or more
186	   of the MUST or REQUIRED level requirements for the protocols it
187	   implements.  An implementation that satisfies all the MUST or
188	   REQUIRED level and all the SHOULD level requirements for its
189	   protocols is said to be "unconditionally compliant"; one that
190	   satisfies all the MUST level requirements but not all the SHOULD
191	   level requirements for its protocols is said to be "conditionally
192	   compliant."

194	1.2.  Syntax Notation

196	   This specification uses the ABNF syntax defined in Section 1.2 of
197	   [Part1] (which extends the syntax defined in [RFC5234] with a list
198	   rule).  Appendix D shows the collected ABNF, with the list rule
199	   expanded.

201	   The following core rules are included by reference, as defined in
202	   [RFC5234], Appendix B.1: ALPHA (letters), CR (carriage return), CRLF
203	   (CR LF), CTL (controls), DIGIT (decimal 0-9), DQUOTE (double quote),
204	   HEXDIG (hexadecimal 0-9/A-F/a-f), LF (line feed), OCTET (any 8-bit
205	   sequence of data), SP (space), VCHAR (any visible USASCII character),
206	   and WSP (whitespace).

208	1.2.1.  Core Rules

210	   The core rules below are defined in Section 1.2.2 of [Part1]:

212	     quoted-string  = <quoted-string, defined in [Part1], Section 1.2.2>
213	     token          = <token, defined in [Part1], Section 1.2.2>
214	     OWS            = <OWS, defined in [Part1], Section 1.2.2>

216	1.2.2.  ABNF Rules defined in other Parts of the Specification

218	   The ABNF rules below are defined in other parts:

220	     absolute-URI   = <absolute-URI, defined in [Part1], Section 2.1>
221	     Content-Length = <Content-Length, defined in [Part1], Section 8.2>
222	     message-header = <message-header, defined in [Part1], Section 4.2>
223	     partial-URI    = <partial-URI, defined in [Part1], Section 2.1>
224	     qvalue         = <qvalue, defined in [Part1], Section 3.5>

226	     Last-Modified  = <Last-Modified, defined in [Part4], Section 6.6>

228	     Content-Range  = <Content-Range, defined in [Part5], Section 5.2>

230	     Expires        = <Expires, defined in [Part6], Section 3.3>

232	2.  Protocol Parameters

234	2.1.  Character Sets

236	   HTTP uses the same definition of the term "character set" as that
237	   described for MIME:

239	   The term "character set" is used in this document to refer to a
240	   method used with one or more tables to convert a sequence of octets
241	   into a sequence of characters.  Note that unconditional conversion in
242	   the other direction is not required, in that not all characters may
243	   be available in a given character set and a character set may provide
244	   more than one sequence of octets to represent a particular character.
245	   This definition is intended to allow various kinds of character
246	   encoding, from simple single-table mappings such as US-ASCII to
247	   complex table switching methods such as those that use ISO-2022's
248	   techniques.  However, the definition associated with a MIME character
249	   set name MUST fully specify the mapping to be performed from octets
250	   to characters.  In particular, use of external profiling information
251	   to determine the exact mapping is not permitted.

253	      Note: This use of the term "character set" is more commonly
254	      referred to as a "character encoding."  However, since HTTP and
255	      MIME share the same registry, it is important that the terminology
256	      also be shared.

258	   HTTP character sets are identified by case-insensitive tokens.  The
259	   complete set of tokens is defined by the IANA Character Set registry
260	   (<http://www.iana.org/assignments/character-sets>).

262	     charset = token

264	   Although HTTP allows an arbitrary token to be used as a charset
265	   value, any token that has a predefined value within the IANA
266	   Character Set registry MUST represent the character set defined by
267	   that registry.  Applications SHOULD limit their use of character sets
268	   to those defined by the IANA registry.

270	   HTTP uses charset in two contexts: within an Accept-Charset request
271	   header (in which the charset value is an unquoted token) and as the
272	   value of a parameter in a Content-Type header (within a request or
273	   response), in which case the parameter value of the charset parameter
274	   may be quoted.

276	   Implementors should be aware of IETF character set requirements
277	   [RFC3629] [RFC2277].

279	2.1.1.  Missing Charset

281	   Some HTTP/1.0 software has interpreted a Content-Type header without
282	   charset parameter incorrectly to mean "recipient should guess."
283	   Senders wishing to defeat this behavior MAY include a charset
284	   parameter even when the charset is ISO-8859-1 ([ISO-8859-1]) and
285	   SHOULD do so when it is known that it will not confuse the recipient.

287	   Unfortunately, some older HTTP/1.0 clients did not deal properly with
288	   an explicit charset parameter.  HTTP/1.1 recipients MUST respect the
289	   charset label provided by the sender; and those user agents that have
290	   a provision to "guess" a charset MUST use the charset from the
291	   content-type field if they support that charset, rather than the
292	   recipient's preference, when initially displaying a document.  See
293	   Section 2.3.1.

295	2.2.  Content Codings

297	   Content coding values indicate an encoding transformation that has
298	   been or can be applied to an entity.  Content codings are primarily
299	   used to allow a document to be compressed or otherwise usefully
300	   transformed without losing the identity of its underlying media type
301	   and without loss of information.  Frequently, the entity is stored in
302	   coded form, transmitted directly, and only decoded by the recipient.

304	     content-coding   = token

306	   All content-coding values are case-insensitive.  HTTP/1.1 uses
307	   content-coding values in the Accept-Encoding (Section 5.3) and
308	   Content-Encoding (Section 5.5) header fields.  Although the value
309	   describes the content-coding, what is more important is that it
310	   indicates what decoding mechanism will be required to remove the
311	   encoding.

313	   The Internet Assigned Numbers Authority (IANA) acts as a registry for
314	   content-coding value tokens.  Initially, the registry contains the
315	   following tokens:

317	   gzip

319	      An encoding format produced by the file compression program "gzip"
320	      (GNU zip) as described in [RFC1952].  This format is a Lempel-Ziv
321	      coding (LZ77) with a 32 bit CRC.

323	   compress

325	      The encoding format produced by the common UNIX file compression
326	      program "compress".  This format is an adaptive Lempel-Ziv-Welch
327	      coding (LZW).

329	      Use of program names for the identification of encoding formats is
330	      not desirable and is discouraged for future encodings.  Their use
331	      here is representative of historical practice, not good design.
332	      For compatibility with previous implementations of HTTP,
333	      applications SHOULD consider "x-gzip" and "x-compress" to be
334	      equivalent to "gzip" and "compress" respectively.

336	   deflate

338	      The "zlib" format defined in [RFC1950] in combination with the
339	      "deflate" compression mechanism described in [RFC1951].

341	   identity

343	      The default (identity) encoding; the use of no transformation
344	      whatsoever.  This content-coding is used only in the Accept-
345	      Encoding header, and SHOULD NOT be used in the Content-Encoding
346	      header.

348	   New content-coding value tokens SHOULD be registered; to allow
349	   interoperability between clients and servers, specifications of the
350	   content coding algorithms needed to implement a new value SHOULD be
351	   publicly available and adequate for independent implementation, and
352	   conform to the purpose of content coding defined in this section.

354	2.3.  Media Types

356	   HTTP uses Internet Media Types [RFC2046] in the Content-Type
357	   (Section 5.9) and Accept (Section 5.1) header fields in order to
358	   provide open and extensible data typing and type negotiation.

360	     media-type = type "/" subtype *( OWS ";" OWS parameter )
361	     type       = token
362	     subtype    = token

364	   Parameters MAY follow the type/subtype in the form of attribute/value
365	   pairs.

367	     parameter      = attribute "=" value
368	     attribute      = token
369	     value          = token / quoted-string

371	   The type, subtype, and parameter attribute names are case-
372	   insensitive.  Parameter values might or might not be case-sensitive,
373	   depending on the semantics of the parameter name.  The presence or
374	   absence of a parameter might be significant to the processing of a
375	   media-type, depending on its definition within the media type
376	   registry.

378	   A parameter value that matches the token production may be
379	   transmitted as either a token or within a quoted-string.  The quoted
380	   and unquoted values are equivalent.

382	   Note that some older HTTP applications do not recognize media type
383	   parameters.  When sending data to older HTTP applications,
384	   implementations SHOULD only use media type parameters when they are
385	   required by that type/subtype definition.

387	   Media-type values are registered with the Internet Assigned Number
388	   Authority (IANA).  The media type registration process is outlined in
389	   [RFC4288].  Use of non-registered media types is discouraged.

391	2.3.1.  Canonicalization and Text Defaults

393	   Internet media types are registered with a canonical form.  An
394	   entity-body transferred via HTTP messages MUST be represented in the
395	   appropriate canonical form prior to its transmission except for
396	   "text" types, as defined in the next paragraph.

398	   When in canonical form, media subtypes of the "text" type use CRLF as
399	   the text line break.  HTTP relaxes this requirement and allows the
400	   transport of text media with plain CR or LF alone representing a line
401	   break when it is done consistently for an entire entity-body.  HTTP
402	   applications MUST accept CRLF, bare CR, and bare LF as being
403	   representative of a line break in text media received via HTTP.  In
404	   addition, if the text is represented in a character set that does not
405	   use octets 13 and 10 for CR and LF respectively, as is the case for
406	   some multi-byte character sets, HTTP allows the use of whatever octet
407	   sequences are defined by that character set to represent the
408	   equivalent of CR and LF for line breaks.  This flexibility regarding
409	   line breaks applies only to text media in the entity-body; a bare CR
410	   or LF MUST NOT be substituted for CRLF within any of the HTTP control
411	   structures (such as header fields and multipart boundaries).

413	   If an entity-body is encoded with a content-coding, the underlying
414	   data MUST be in a form defined above prior to being encoded.

416	   The "charset" parameter is used with some media types to define the
417	   character set (Section 2.1) of the data.  When no explicit charset
418	   parameter is provided by the sender, media subtypes of the "text"
419	   type are defined to have a default charset value of "ISO-8859-1" when
420	   received via HTTP.  Data in character sets other than "ISO-8859-1" or
421	   its subsets MUST be labeled with an appropriate charset value.  See
422	   Section 2.1.1 for compatibility problems.

424	2.3.2.  Multipart Types

426	   MIME provides for a number of "multipart" types -- encapsulations of
427	   one or more entities within a single message-body.  All multipart
428	   types share a common syntax, as defined in Section 5.1.1 of
429	   [RFC2046], and MUST include a boundary parameter as part of the media
430	   type value.  The message body is itself a protocol element and MUST
431	   therefore use only CRLF to represent line breaks between body-parts.
432	   Unlike in RFC 2046, the epilogue of any multipart message MUST be
433	   empty; HTTP applications MUST NOT transmit the epilogue (even if the
434	   original multipart contains an epilogue).  These restrictions exist
435	   in order to preserve the self-delimiting nature of a multipart
436	   message-body, wherein the "end" of the message-body is indicated by
437	   the ending multipart boundary.

439	   In general, HTTP treats a multipart message-body no differently than
440	   any other media type: strictly as payload.  The one exception is the
441	   "multipart/byteranges" type (Appendix A of [Part5]) when it appears
442	   in a 206 (Partial Content) response.  In all other cases, an HTTP
443	   user agent SHOULD follow the same or similar behavior as a MIME user
444	   agent would upon receipt of a multipart type.  The MIME header fields
445	   within each body-part of a multipart message-body do not have any
446	   significance to HTTP beyond that defined by their MIME semantics.

448	   In general, an HTTP user agent SHOULD follow the same or similar
449	   behavior as a MIME user agent would upon receipt of a multipart type.
450	   If an application receives an unrecognized multipart subtype, the
451	   application MUST treat it as being equivalent to "multipart/mixed".

453	      Note: The "multipart/form-data" type has been specifically defined
454	      for carrying form data suitable for processing via the POST
455	      request method, as described in [RFC2388].

457	2.4.  Language Tags

459	   A language tag identifies a natural language spoken, written, or
460	   otherwise conveyed by human beings for communication of information
461	   to other human beings.  Computer languages are explicitly excluded.
462	   HTTP uses language tags within the Accept-Language and Content-
463	   Language fields.

465	   The syntax and registry of HTTP language tags is the same as that
466	   defined by [RFC1766].  In summary, a language tag is composed of 1 or
467	   more parts: A primary language tag and a possibly empty series of
468	   subtags:

470	     language-tag  = primary-tag *( "-" subtag )
471	     primary-tag   = 1*8ALPHA
472	     subtag        = 1*8ALPHA

474	   White space is not allowed within the tag and all tags are case-
475	   insensitive.  The name space of language tags is administered by the
476	   IANA.  Example tags include:

478	     en, en-US, en-cockney, i-cherokee, x-pig-latin

480	   where any two-letter primary-tag is an ISO-639 language abbreviation
481	   and any two-letter initial subtag is an ISO-3166 country code.  (The
482	   last three tags above are not registered tags; all but the last are
483	   examples of tags which could be registered in future.)

485	3.  Entity

487	   Request and Response messages MAY transfer an entity if not otherwise
488	   restricted by the request method or response status code.  An entity
489	   consists of entity-header fields and an entity-body, although some
490	   responses will only include the entity-headers.

492	   In this section, both sender and recipient refer to either the client
493	   or the server, depending on who sends and who receives the entity.

495	3.1.  Entity Header Fields

497	   Entity-header fields define metainformation about the entity-body or,
498	   if no body is present, about the resource identified by the request.

500	     entity-header  = Content-Encoding         ; Section 5.5
501	                    / Content-Language         ; Section 5.6
502	                    / Content-Length           ; [Part1], Section 8.2
503	                    / Content-Location         ; Section 5.7
504	                    / Content-MD5              ; Section 5.8
505	                    / Content-Range            ; [Part5], Section 5.2
506	                    / Content-Type             ; Section 5.9
507	                    / Expires                  ; [Part6], Section 3.3
508	                    / Last-Modified            ; [Part4], Section 6.6
509	                    / extension-header

511	     extension-header = message-header

513	   The extension-header mechanism allows additional entity-header fields
514	   to be defined without changing the protocol, but these fields cannot
515	   be assumed to be recognizable by the recipient.  Unrecognized header
516	   fields SHOULD be ignored by the recipient and MUST be forwarded by
517	   transparent proxies.

519	3.2.  Entity Body

521	   The entity-body (if any) sent with an HTTP request or response is in
522	   a format and encoding defined by the entity-header fields.

524	     entity-body    = *OCTET

526	   An entity-body is only present in a message when a message-body is
527	   present, as described in Section 4.3 of [Part1].  The entity-body is
528	   obtained from the message-body by decoding any Transfer-Encoding that
529	   might have been applied to ensure safe and proper transfer of the
530	   message.

532	3.2.1.  Type

534	   When an entity-body is included with a message, the data type of that
535	   body is determined via the header fields Content-Type and Content-
536	   Encoding.  These define a two-layer, ordered encoding model:

538	     entity-body := Content-Encoding( Content-Type( data ) )

540	   Content-Type specifies the media type of the underlying data.  Any
541	   HTTP/1.1 message containing an entity-body SHOULD include a Content-
542	   Type header field defining the media type of that body, unless that
543	   information is unknown.  If the Content-Type header field is not
544	   present, it indicates that the sender does not know the media type of
545	   the data; recipients MAY either assume that it is "application/
546	   octet-stream" ([RFC2046], Section 4.5.1) or examine the content to
547	   determine its type.

549	   Content-Encoding may be used to indicate any additional content
550	   codings applied to the data, usually for the purpose of data
551	   compression, that are a property of the requested resource.  There is
552	   no default encoding.

554	   Note that neither the interpretation of the data type of a message
555	   nor the behaviors caused by it are defined by HTTP; this potentially
556	   includes examination of the content to override any indicated type
557	   ("sniffing").

559	3.2.2.  Entity Length

561	   The entity-length of a message is the length of the message-body
562	   before any transfer-codings have been applied.  Section 4.4 of
563	   [Part1] defines how the transfer-length of a message-body is
564	   determined.

566	4.  Content Negotiation

568	   Most HTTP responses include an entity which contains information for
569	   interpretation by a human user.  Naturally, it is desirable to supply
570	   the user with the "best available" entity corresponding to the
571	   request.  Unfortunately for servers and caches, not all users have
572	   the same preferences for what is "best," and not all user agents are
573	   equally capable of rendering all entity types.  For that reason, HTTP
574	   has provisions for several mechanisms for "content negotiation" --
575	   the process of selecting the best representation for a given response
576	   when there are multiple representations available.

578	      Note: This is not called "format negotiation" because the
579	      alternate representations may be of the same media type, but use
580	      different capabilities of that type, be in different languages,
581	      etc.

583	   Any response containing an entity-body MAY be subject to negotiation,
584	   including error responses.

586	   There are two kinds of content negotiation which are possible in
587	   HTTP: server-driven and agent-driven negotiation.  These two kinds of
588	   negotiation are orthogonal and thus may be used separately or in
589	   combination.  One method of combination, referred to as transparent
590	   negotiation, occurs when a cache uses the agent-driven negotiation
591	   information provided by the origin server in order to provide server-
592	   driven negotiation for subsequent requests.

594	4.1.  Server-driven Negotiation

596	   If the selection of the best representation for a response is made by
597	   an algorithm located at the server, it is called server-driven
598	   negotiation.  Selection is based on the available representations of
599	   the response (the dimensions over which it can vary; e.g. language,
600	   content-coding, etc.) and the contents of particular header fields in
601	   the request message or on other information pertaining to the request
602	   (such as the network address of the client).

604	   Server-driven negotiation is advantageous when the algorithm for
605	   selecting from among the available representations is difficult to
606	   describe to the user agent, or when the server desires to send its
607	   "best guess" to the client along with the first response (hoping to
608	   avoid the round-trip delay of a subsequent request if the "best
609	   guess" is good enough for the user).  In order to improve the
610	   server's guess, the user agent MAY include request header fields
611	   (Accept, Accept-Language, Accept-Encoding, etc.) which describe its
612	   preferences for such a response.

614	   Server-driven negotiation has disadvantages:

616	   1.  It is impossible for the server to accurately determine what
617	       might be "best" for any given user, since that would require
618	       complete knowledge of both the capabilities of the user agent and
619	       the intended use for the response (e.g., does the user want to
620	       view it on screen or print it on paper?).

622	   2.  Having the user agent describe its capabilities in every request
623	       can be both very inefficient (given that only a small percentage
624	       of responses have multiple representations) and a potential
625	       violation of the user's privacy.

627	   3.  It complicates the implementation of an origin server and the
628	       algorithms for generating responses to a request.

630	   4.  It may limit a public cache's ability to use the same response
631	       for multiple user's requests.

633	   HTTP/1.1 includes the following request-header fields for enabling
634	   server-driven negotiation through description of user agent
635	   capabilities and user preferences: Accept (Section 5.1), Accept-
636	   Charset (Section 5.2), Accept-Encoding (Section 5.3), Accept-Language
637	   (Section 5.4), and User-Agent (Section 9.9 of [Part2]).  However, an
638	   origin server is not limited to these dimensions and MAY vary the
639	   response based on any aspect of the request, including information
640	   outside the request-header fields or within extension header fields
641	   not defined by this specification.

643	   The Vary header field (Section 3.5 of [Part6]) can be used to express
644	   the parameters the server uses to select a representation that is
645	   subject to server-driven negotiation.

647	4.2.  Agent-driven Negotiation

649	   With agent-driven negotiation, selection of the best representation
650	   for a response is performed by the user agent after receiving an
651	   initial response from the origin server.  Selection is based on a
652	   list of the available representations of the response included within
653	   the header fields or entity-body of the initial response, with each
654	   representation identified by its own URI.  Selection from among the
655	   representations may be performed automatically (if the user agent is
656	   capable of doing so) or manually by the user selecting from a
657	   generated (possibly hypertext) menu.

659	   Agent-driven negotiation is advantageous when the response would vary
660	   over commonly-used dimensions (such as type, language, or encoding),
661	   when the origin server is unable to determine a user agent's
662	   capabilities from examining the request, and generally when public
663	   caches are used to distribute server load and reduce network usage.

665	   Agent-driven negotiation suffers from the disadvantage of needing a
666	   second request to obtain the best alternate representation.  This
667	   second request is only efficient when caching is used.  In addition,
668	   this specification does not define any mechanism for supporting
669	   automatic selection, though it also does not prevent any such
670	   mechanism from being developed as an extension and used within
671	   HTTP/1.1.

673	   HTTP/1.1 defines the 300 (Multiple Choices) and 406 (Not Acceptable)
674	   status codes for enabling agent-driven negotiation when the server is
675	   unwilling or unable to provide a varying response using server-driven
676	   negotiation.

678	4.3.  Transparent Negotiation

680	   Transparent negotiation is a combination of both server-driven and
681	   agent-driven negotiation.  When a cache is supplied with a form of
682	   the list of available representations of the response (as in agent-
683	   driven negotiation) and the dimensions of variance are completely
684	   understood by the cache, then the cache becomes capable of performing
685	   server-driven negotiation on behalf of the origin server for
686	   subsequent requests on that resource.

688	   Transparent negotiation has the advantage of distributing the
689	   negotiation work that would otherwise be required of the origin
690	   server and also removing the second request delay of agent-driven
691	   negotiation when the cache is able to correctly guess the right
692	   response.

694	   This specification does not define any mechanism for transparent
695	   negotiation, though it also does not prevent any such mechanism from
696	   being developed as an extension that could be used within HTTP/1.1.

698	5.  Header Field Definitions

700	   This section defines the syntax and semantics of HTTP/1.1 header
701	   fields related to the payload of messages.

703	   For entity-header fields, both sender and recipient refer to either
704	   the client or the server, depending on who sends and who receives the
705	   entity.

707	5.1.  Accept

709	   The request-header field "Accept" can be used to specify certain
710	   media types which are acceptable for the response.  Accept headers
711	   can be used to indicate that the request is specifically limited to a
712	   small set of desired types, as in the case of a request for an in-
713	   line image.

715	     Accept   = "Accept" ":" OWS Accept-v
716	     Accept-v = #( media-range [ accept-params ] )

718	     media-range    = ( "*/*"
719	                      / ( type "/" "*" )
720	                      / ( type "/" subtype )
721	                      ) *( OWS ";" OWS parameter )
722	     accept-params  = OWS ";" OWS "q=" qvalue *( accept-ext )
723	     accept-ext     = OWS ";" OWS token
724	                      [ "=" ( token / quoted-string ) ]

726	   The asterisk "*" character is used to group media types into ranges,
727	   with "*/*" indicating all media types and "type/*" indicating all
728	   subtypes of that type.  The media-range MAY include media type
729	   parameters that are applicable to that range.

731	   Each media-range MAY be followed by one or more accept-params,
732	   beginning with the "q" parameter for indicating a relative quality
733	   factor.  The first "q" parameter (if any) separates the media-range
734	   parameter(s) from the accept-params.  Quality factors allow the user
735	   or user agent to indicate the relative degree of preference for that
736	   media-range, using the qvalue scale from 0 to 1 (Section 3.5 of
737	   [Part1]).  The default value is q=1.

739	      Note: Use of the "q" parameter name to separate media type
740	      parameters from Accept extension parameters is due to historical
741	      practice.  Although this prevents any media type parameter named
742	      "q" from being used with a media range, such an event is believed
743	      to be unlikely given the lack of any "q" parameters in the IANA
744	      media type registry and the rare usage of any media type
745	      parameters in Accept.  Future media types are discouraged from
746	      registering any parameter named "q".

748	   The example

750	     Accept: audio/*; q=0.2, audio/basic

752	   SHOULD be interpreted as "I prefer audio/basic, but send me any audio
753	   type if it is the best available after an 80% mark-down in quality."

755	   If no Accept header field is present, then it is assumed that the
756	   client accepts all media types.  If an Accept header field is
757	   present, and if the server cannot send a response which is acceptable
758	   according to the combined Accept field value, then the server SHOULD
759	   send a 406 (Not Acceptable) response.

761	   A more elaborate example is

763	     Accept: text/plain; q=0.5, text/html,
764	             text/x-dvi; q=0.8, text/x-c

766	   Verbally, this would be interpreted as "text/html and text/x-c are
767	   the preferred media types, but if they do not exist, then send the
768	   text/x-dvi entity, and if that does not exist, send the text/plain
769	   entity."

771	   Media ranges can be overridden by more specific media ranges or
772	   specific media types.  If more than one media range applies to a
773	   given type, the most specific reference has precedence.  For example,

775	     Accept: text/*, text/html, text/html;level=1, */*

777	   have the following precedence:

779	   1.  text/html;level=1

781	   2.  text/html

783	   3.  text/*

785	   4.  */*

787	   The media type quality factor associated with a given type is
788	   determined by finding the media range with the highest precedence
789	   which matches that type.  For example,

791	     Accept: text/*;q=0.3, text/html;q=0.7, text/html;level=1,
792	             text/html;level=2;q=0.4, */*;q=0.5

794	   would cause the following values to be associated:

796	   +-------------------+---------------+
797	   | Media Type        | Quality Value |
798	   +-------------------+---------------+
799	   | text/html;level=1 | 1             |
800	   | text/html         | 0.7           |
801	   | text/plain        | 0.3           |
802	   | image/jpeg        | 0.5           |
803	   | text/html;level=2 | 0.4           |
804	   | text/html;level=3 | 0.7           |
805	   +-------------------+---------------+

807	   Note: A user agent might be provided with a default set of quality
808	   values for certain media ranges.  However, unless the user agent is a
809	   closed system which cannot interact with other rendering agents, this
810	   default set ought to be configurable by the user.

812	5.2.  Accept-Charset

814	   The request-header field "Accept-Charset" can be used to indicate
815	   what character sets are acceptable for the response.  This field
816	   allows clients capable of understanding more comprehensive or
817	   special-purpose character sets to signal that capability to a server
818	   which is capable of representing documents in those character sets.

820	     Accept-Charset   = "Accept-Charset" ":" OWS
821	             Accept-Charset-v
822	     Accept-Charset-v = 1#( ( charset / "*" )
823	                            [ OWS ";" OWS "q=" qvalue ] )

825	   Character set values are described in Section 2.1.  Each charset MAY
826	   be given an associated quality value which represents the user's
827	   preference for that charset.  The default value is q=1.  An example
828	   is

830	     Accept-Charset: iso-8859-5, unicode-1-1;q=0.8

832	   The special value "*", if present in the Accept-Charset field,
833	   matches every character set (including ISO-8859-1) which is not
834	   mentioned elsewhere in the Accept-Charset field.  If no "*" is
835	   present in an Accept-Charset field, then all character sets not
836	   explicitly mentioned get a quality value of 0, except for ISO-8859-1,
837	   which gets a quality value of 1 if not explicitly mentioned.

839	   If no Accept-Charset header is present, the default is that any
840	   character set is acceptable.  If an Accept-Charset header is present,
841	   and if the server cannot send a response which is acceptable
842	   according to the Accept-Charset header, then the server SHOULD send
843	   an error response with the 406 (Not Acceptable) status code, though
844	   the sending of an unacceptable response is also allowed.

846	5.3.  Accept-Encoding

848	   The request-header field "Accept-Encoding" is similar to Accept, but
849	   restricts the content-codings (Section 2.2) that are acceptable in
850	   the response.

852	     Accept-Encoding    = "Accept-Encoding" ":" OWS
853	                        Accept-Encoding-v
854	     Accept-Encoding-v  =
855	                        #( codings [ OWS ";" OWS "q=" qvalue ] )
856	     codings            = ( content-coding / "*" )

858	   Each codings value MAY be given an associated quality value which
859	   represents the preference for that encoding.  The default value is
860	   q=1.

862	   Examples of its use are:

864	     Accept-Encoding: compress, gzip
865	     Accept-Encoding:
866	     Accept-Encoding: *
867	     Accept-Encoding: compress;q=0.5, gzip;q=1.0
868	     Accept-Encoding: gzip;q=1.0, identity; q=0.5, *;q=0

870	   A server tests whether a content-coding is acceptable, according to
871	   an Accept-Encoding field, using these rules:

873	   1.  If the content-coding is one of the content-codings listed in the
874	       Accept-Encoding field, then it is acceptable, unless it is
875	       accompanied by a qvalue of 0.  (As defined in Section 3.5 of
876	       [Part1], a qvalue of 0 means "not acceptable.")

878	   2.  The special "*" symbol in an Accept-Encoding field matches any
879	       available content-coding not explicitly listed in the header
880	       field.

882	   3.  If multiple content-codings are acceptable, then the acceptable
883	       content-coding with the highest non-zero qvalue is preferred.

885	   4.  The "identity" content-coding is always acceptable, unless
886	       specifically refused because the Accept-Encoding field includes
887	       "identity;q=0", or because the field includes "*;q=0" and does
888	       not explicitly include the "identity" content-coding.  If the
889	       Accept-Encoding field-value is empty, then only the "identity"
890	       encoding is acceptable.

892	   If an Accept-Encoding field is present in a request, and if the
893	   server cannot send a response which is acceptable according to the
894	   Accept-Encoding header, then the server SHOULD send an error response
895	   with the 406 (Not Acceptable) status code.

897	   If no Accept-Encoding field is present in a request, the server MAY
898	   assume that the client will accept any content coding.  In this case,
899	   if "identity" is one of the available content-codings, then the
900	   server SHOULD use the "identity" content-coding, unless it has
901	   additional information that a different content-coding is meaningful
902	   to the client.

904	      Note: If the request does not include an Accept-Encoding field,
905	      and if the "identity" content-coding is unavailable, then content-
906	      codings commonly understood by HTTP/1.0 clients (i.e., "gzip" and
907	      "compress") are preferred; some older clients improperly display
908	      messages sent with other content-codings.  The server might also
909	      make this decision based on information about the particular user-
910	      agent or client.

912	      Note: Most HTTP/1.0 applications do not recognize or obey qvalues
913	      associated with content-codings.  This means that qvalues will not
914	      work and are not permitted with x-gzip or x-compress.

916	5.4.  Accept-Language

918	   The request-header field "Accept-Language" is similar to Accept, but
919	   restricts the set of natural languages that are preferred as a
920	   response to the request.  Language tags are defined in Section 2.4.

922	     Accept-Language   = "Accept-Language" ":" OWS
923	                       Accept-Language-v
924	     Accept-Language-v =
925	                       1#( language-range [ OWS ";" OWS "q=" qvalue ] )
926	     language-range    =
927	               <language-range, defined in [RFC4647], Section 2.1>

929	   Each language-range can be given an associated quality value which
930	   represents an estimate of the user's preference for the languages
931	   specified by that range.  The quality value defaults to "q=1".  For
932	   example,

934	     Accept-Language: da, en-gb;q=0.8, en;q=0.7

936	   would mean: "I prefer Danish, but will accept British English and
937	   other types of English."

939	   For matching, the "Basic Filtering" matching scheme, defined in
940	   Section 3.3.1 of [RFC4647], is used:

942	      A language range matches a particular language tag if, in a case-
943	      insensitive comparison, it exactly equals the tag, or if it
944	      exactly equals a prefix of the tag such that the first character
945	      following the prefix is "-".

947	   The special range "*", if present in the Accept-Language field,
948	   matches every tag not matched by any other range present in the
949	   Accept-Language field.

951	      Note: This use of a prefix matching rule does not imply that
952	      language tags are assigned to languages in such a way that it is
953	      always true that if a user understands a language with a certain
954	      tag, then this user will also understand all languages with tags
955	      for which this tag is a prefix.  The prefix rule simply allows the
956	      use of prefix tags if this is the case.

958	   The language quality factor assigned to a language-tag by the Accept-
959	   Language field is the quality value of the longest language-range in
960	   the field that matches the language-tag.  If no language-range in the
961	   field matches the tag, the language quality factor assigned is 0.  If
962	   no Accept-Language header is present in the request, the server
963	   SHOULD assume that all languages are equally acceptable.  If an
964	   Accept-Language header is present, then all languages which are
965	   assigned a quality factor greater than 0 are acceptable.

967	   It might be contrary to the privacy expectations of the user to send
968	   an Accept-Language header with the complete linguistic preferences of
969	   the user in every request.  For a discussion of this issue, see
970	   Section 7.1.

972	   As intelligibility is highly dependent on the individual user, it is
973	   recommended that client applications make the choice of linguistic
974	   preference available to the user.  If the choice is not made
975	   available, then the Accept-Language header field MUST NOT be given in
976	   the request.

978	      Note: When making the choice of linguistic preference available to
979	      the user, we remind implementors of the fact that users are not
980	      familiar with the details of language matching as described above,
981	      and should provide appropriate guidance.  As an example, users
982	      might assume that on selecting "en-gb", they will be served any
983	      kind of English document if British English is not available.  A
984	      user agent might suggest in such a case to add "en" to get the
985	      best matching behavior.

987	5.5.  Content-Encoding

989	   The entity-header field "Content-Encoding" is used as a modifier to
990	   the media-type.  When present, its value indicates what additional
991	   content codings have been applied to the entity-body, and thus what
992	   decoding mechanisms must be applied in order to obtain the media-type
993	   referenced by the Content-Type header field.  Content-Encoding is
994	   primarily used to allow a document to be compressed without losing
995	   the identity of its underlying media type.

997	     Content-Encoding   = "Content-Encoding" ":" OWS Content-Encoding-v
998	     Content-Encoding-v = 1#content-coding

1000	   Content codings are defined in Section 2.2.  An example of its use is

1002	     Content-Encoding: gzip

1004	   The content-coding is a characteristic of the entity identified by
1005	   the request-target.  Typically, the entity-body is stored with this
1006	   encoding and is only decoded before rendering or analogous usage.
1007	   However, a non-transparent proxy MAY modify the content-coding if the
1008	   new coding is known to be acceptable to the recipient, unless the
1009	   "no-transform" cache-control directive is present in the message.

1011	   If the content-coding of an entity is not "identity", then the
1012	   response MUST include a Content-Encoding entity-header (Section 5.5)
1013	   that lists the non-identity content-coding(s) used.

1015	   If the content-coding of an entity in a request message is not
1016	   acceptable to the origin server, the server SHOULD respond with a
1017	   status code of 415 (Unsupported Media Type).

1019	   If multiple encodings have been applied to an entity, the content
1020	   codings MUST be listed in the order in which they were applied.
1021	   Additional information about the encoding parameters MAY be provided
1022	   by other entity-header fields not defined by this specification.

1024	5.6.  Content-Language

1026	   The entity-header field "Content-Language" describes the natural
1027	   language(s) of the intended audience for the enclosed entity.  Note
1028	   that this might not be equivalent to all the languages used within
1029	   the entity-body.

1031	     Content-Language   = "Content-Language" ":" OWS Content-Language-v
1032	     Content-Language-v = 1#language-tag

1034	   Language tags are defined in Section 2.4.  The primary purpose of
1035	   Content-Language is to allow a user to identify and differentiate
1036	   entities according to the user's own preferred language.  Thus, if
1037	   the body content is intended only for a Danish-literate audience, the
1038	   appropriate field is

1040	     Content-Language: da

1042	   If no Content-Language is specified, the default is that the content
1043	   is intended for all language audiences.  This might mean that the
1044	   sender does not consider it to be specific to any natural language,
1045	   or that the sender does not know for which language it is intended.

1047	   Multiple languages MAY be listed for content that is intended for
1048	   multiple audiences.  For example, a rendition of the "Treaty of
1049	   Waitangi," presented simultaneously in the original Maori and English
1050	   versions, would call for

1052	     Content-Language: mi, en

1054	   However, just because multiple languages are present within an entity
1055	   does not mean that it is intended for multiple linguistic audiences.
1056	   An example would be a beginner's language primer, such as "A First
1057	   Lesson in Latin," which is clearly intended to be used by an English-
1058	   literate audience.  In this case, the Content-Language would properly
1059	   only include "en".

1061	   Content-Language MAY be applied to any media type -- it is not
1062	   limited to textual documents.

1064	5.7.  Content-Location

1066	   The entity-header field "Content-Location" MAY be used to supply the
1067	   resource location for the entity enclosed in the message when that
1068	   entity is accessible from a location separate from the requested
1069	   resource's URI.  A server SHOULD provide a Content-Location for the
1070	   variant corresponding to the response entity; especially in the case
1071	   where a resource has multiple entities associated with it, and those
1072	   entities actually have separate locations by which they might be
1073	   individually accessed, the server SHOULD provide a Content-Location
1074	   for the particular variant which is returned.

1076	     Content-Location   = "Content-Location" ":" OWS
1077	                       Content-Location-v
1078	     Content-Location-v =
1079	                       absolute-URI / partial-URI

1081	   The value of Content-Location also defines the base URI for the
1082	   entity.

1084	   The Content-Location value is not a replacement for the original
1085	   requested URI; it is only a statement of the location of the resource
1086	   corresponding to this particular entity at the time of the request.
1087	   Future requests MAY specify the Content-Location URI as the request-
1088	   target if the desire is to identify the source of that particular
1089	   entity.

1091	   A cache cannot assume that an entity with a Content-Location
1092	   different from the URI used to retrieve it can be used to respond to
1093	   later requests on that Content-Location URI.  However, the Content-
1094	   Location can be used to differentiate between multiple entities
1095	   retrieved from a single requested resource, as described in Section
1096	   2.6 of [Part6].

1098	   If the Content-Location is a relative URI, the relative URI is
1099	   interpreted relative to the request-target.

1101	   The meaning of the Content-Location header in requests is undefined;
1102	   servers are free to ignore it in those cases.

1104	5.8.  Content-MD5

1106	   The entity-header field "Content-MD5", as defined in [RFC1864], is an
1107	   MD5 digest of the entity-body for the purpose of providing an end-to-
1108	   end message integrity check (MIC) of the entity-body.  (Note: a MIC
1109	   is good for detecting accidental modification of the entity-body in
1110	   transit, but is not proof against malicious attacks.)
1111	     Content-MD5   = "Content-MD5" ":" OWS Content-MD5-v
1112	     Content-MD5-v = <base64 of 128 bit MD5 digest as per [RFC1864]>

1114	   The Content-MD5 header field MAY be generated by an origin server or
1115	   client to function as an integrity check of the entity-body.  Only
1116	   origin servers or clients MAY generate the Content-MD5 header field;
1117	   proxies and gateways MUST NOT generate it, as this would defeat its
1118	   value as an end-to-end integrity check.  Any recipient of the entity-
1119	   body, including gateways and proxies, MAY check that the digest value
1120	   in this header field matches that of the entity-body as received.

1122	   The MD5 digest is computed based on the content of the entity-body,
1123	   including any content-coding that has been applied, but not including
1124	   any transfer-encoding applied to the message-body.  If the message is
1125	   received with a transfer-encoding, that encoding MUST be removed
1126	   prior to checking the Content-MD5 value against the received entity.

1128	   This has the result that the digest is computed on the octets of the
1129	   entity-body exactly as, and in the order that, they would be sent if
1130	   no transfer-encoding were being applied.

1132	   HTTP extends RFC 1864 to permit the digest to be computed for MIME
1133	   composite media-types (e.g., multipart/* and message/rfc822), but
1134	   this does not change how the digest is computed as defined in the
1135	   preceding paragraph.

1137	   There are several consequences of this.  The entity-body for
1138	   composite types MAY contain many body-parts, each with its own MIME
1139	   and HTTP headers (including Content-MD5, Content-Transfer-Encoding,
1140	   and Content-Encoding headers).  If a body-part has a Content-
1141	   Transfer-Encoding or Content-Encoding header, it is assumed that the
1142	   content of the body-part has had the encoding applied, and the body-
1143	   part is included in the Content-MD5 digest as is -- i.e., after the
1144	   application.  The Transfer-Encoding header field is not allowed
1145	   within body-parts.

1147	   Conversion of all line breaks to CRLF MUST NOT be done before
1148	   computing or checking the digest: the line break convention used in
1149	   the text actually transmitted MUST be left unaltered when computing
1150	   the digest.

1152	      Note: while the definition of Content-MD5 is exactly the same for
1153	      HTTP as in RFC 1864 for MIME entity-bodies, there are several ways
1154	      in which the application of Content-MD5 to HTTP entity-bodies
1155	      differs from its application to MIME entity-bodies.  One is that
1156	      HTTP, unlike MIME, does not use Content-Transfer-Encoding, and
1157	      does use Transfer-Encoding and Content-Encoding.  Another is that
1158	      HTTP more frequently uses binary content types than MIME, so it is
1159	      worth noting that, in such cases, the byte order used to compute
1160	      the digest is the transmission byte order defined for the type.
1161	      Lastly, HTTP allows transmission of text types with any of several
1162	      line break conventions and not just the canonical form using CRLF.

1164	5.9.  Content-Type

1166	   The entity-header field "Content-Type" indicates the media type of
1167	   the entity-body sent to the recipient or, in the case of the HEAD
1168	   method, the media type that would have been sent had the request been
1169	   a GET.

1171	     Content-Type   = "Content-Type" ":" OWS Content-Type-v
1172	     Content-Type-v = media-type

1174	   Media types are defined in Section 2.3.  An example of the field is

1176	     Content-Type: text/html; charset=ISO-8859-4

1178	   Further discussion of methods for identifying the media type of an
1179	   entity is provided in Section 3.2.1.

1181	6.  IANA Considerations

1183	6.1.  Message Header Registration

1185	   The Message Header Registry located at <http://www.iana.org/
1186	   assignments/message-headers/message-header-index.html> should be
1187	   updated with the permanent registrations below (see [RFC3864]):

1189	   +---------------------+----------+----------+--------------+
1190	   | Header Field Name   | Protocol | Status   | Reference    |
1191	   +---------------------+----------+----------+--------------+
1192	   | Accept              | http     | standard | Section 5.1  |
1193	   | Accept-Charset      | http     | standard | Section 5.2  |
1194	   | Accept-Encoding     | http     | standard | Section 5.3  |
1195	   | Accept-Language     | http     | standard | Section 5.4  |
1196	   | Content-Disposition | http     |          | Appendix B.1 |
1197	   | Content-Encoding    | http     | standard | Section 5.5  |
1198	   | Content-Language    | http     | standard | Section 5.6  |
1199	   | Content-Location    | http     | standard | Section 5.7  |
1200	   | Content-MD5         | http     | standard | Section 5.8  |
1201	   | Content-Type        | http     | standard | Section 5.9  |
1202	   | MIME-Version        | http     |          | Appendix A.1 |
1203	   +---------------------+----------+----------+--------------+

1205	   The change controller is: "IETF (iesg@ietf.org) - Internet
1206	   Engineering Task Force".

1208	7.  Security Considerations

1210	   This section is meant to inform application developers, information
1211	   providers, and users of the security limitations in HTTP/1.1 as
1212	   described by this document.  The discussion does not include
1213	   definitive solutions to the problems revealed, though it does make
1214	   some suggestions for reducing security risks.

1216	7.1.  Privacy Issues Connected to Accept Headers

1218	   Accept request-headers can reveal information about the user to all
1219	   servers which are accessed.  The Accept-Language header in particular
1220	   can reveal information the user would consider to be of a private
1221	   nature, because the understanding of particular languages is often
1222	   strongly correlated to the membership of a particular ethnic group.
1223	   User agents which offer the option to configure the contents of an
1224	   Accept-Language header to be sent in every request are strongly
1225	   encouraged to let the configuration process include a message which
1226	   makes the user aware of the loss of privacy involved.

1228	   An approach that limits the loss of privacy would be for a user agent
1229	   to omit the sending of Accept-Language headers by default, and to ask
1230	   the user whether or not to start sending Accept-Language headers to a
1231	   server if it detects, by looking for any Vary response-header fields
1232	   generated by the server, that such sending could improve the quality
1233	   of service.

1235	   Elaborate user-customized accept header fields sent in every request,
1236	   in particular if these include quality values, can be used by servers
1237	   as relatively reliable and long-lived user identifiers.  Such user
1238	   identifiers would allow content providers to do click-trail tracking,
1239	   and would allow collaborating content providers to match cross-server
1240	   click-trails or form submissions of individual users.  Note that for
1241	   many users not behind a proxy, the network address of the host
1242	   running the user agent will also serve as a long-lived user
1243	   identifier.  In environments where proxies are used to enhance
1244	   privacy, user agents ought to be conservative in offering accept
1245	   header configuration options to end users.  As an extreme privacy
1246	   measure, proxies could filter the accept headers in relayed requests.
1247	   General purpose user agents which provide a high degree of header
1248	   configurability SHOULD warn users about the loss of privacy which can
1249	   be involved.

1251	7.2.  Content-Disposition Issues

1253	   [RFC2183], from which the often implemented Content-Disposition (see
1254	   Appendix B.1) header in HTTP is derived, has a number of very serious
1255	   security considerations.  Content-Disposition is not part of the HTTP
1256	   standard, but since it is widely implemented, we are documenting its
1257	   use and risks for implementors.  See Section 5 of [RFC2183] for
1258	   details.

1260	8.  Acknowledgments

1262	9.  References

1264	9.1.  Normative References

1266	   [ISO-8859-1]
1267	              International Organization for Standardization,
1268	              "Information technology -- 8-bit single-byte coded graphic
1269	              character sets -- Part 1: Latin alphabet No. 1", ISO/
1270	              IEC 8859-1:1998, 1998.

1272	   [Part1]    Fielding, R., Ed., Gettys, J., Mogul, J., Frystyk, H.,
1273	              Masinter, L., Leach, P., Berners-Lee, T., Lafon, Y., Ed.,
1274	              and J. Reschke, Ed., "HTTP/1.1, part 1: URIs, Connections,
1275	              and Message Parsing", draft-ietf-httpbis-p1-messaging-07
1276	              (work in progress), July 2009.

1278	   [Part2]    Fielding, R., Ed., Gettys, J., Mogul, J., Frystyk, H.,
1279	              Masinter, L., Leach, P., Berners-Lee, T., Lafon, Y., Ed.,
1280	              and J. Reschke, Ed., "HTTP/1.1, part 2: Message
1281	              Semantics", draft-ietf-httpbis-p2-semantics-07 (work in
1282	              progress), July 2009.

1284	   [Part4]    Fielding, R., Ed., Gettys, J., Mogul, J., Frystyk, H.,
1285	              Masinter, L., Leach, P., Berners-Lee, T., Lafon, Y., Ed.,
1286	              and J. Reschke, Ed., "HTTP/1.1, part 4: Conditional
1287	              Requests", draft-ietf-httpbis-p4-conditional-07 (work in
1288	              progress), July 2009.

1290	   [Part5]    Fielding, R., Ed., Gettys, J., Mogul, J., Frystyk, H.,
1291	              Masinter, L., Leach, P., Berners-Lee, T., Lafon, Y., Ed.,
1292	              and J. Reschke, Ed., "HTTP/1.1, part 5: Range Requests and
1293	              Partial Responses", draft-ietf-httpbis-p5-range-07 (work
1294	              in progress), July 2009.

1296	   [Part6]    Fielding, R., Ed., Gettys, J., Mogul, J., Frystyk, H.,
1297	              Masinter, L., Leach, P., Berners-Lee, T., Lafon, Y., Ed.,
1298	              Nottingham, M., Ed., and J. Reschke, Ed., "HTTP/1.1, part
1299	              6: Caching", draft-ietf-httpbis-p6-cache-07 (work in
1300	              progress), July 2009.

1302	   [RFC1766]  Alvestrand, H., "Tags for the Identification of
1303	              Languages", RFC 1766, March 1995.

1305	   [RFC1864]  Myers, J. and M. Rose, "The Content-MD5 Header Field",
1306	              RFC 1864, October 1995.

1308	   [RFC1950]  Deutsch, L. and J-L. Gailly, "ZLIB Compressed Data Format
1309	              Specification version 3.3", RFC 1950, May 1996.

1311	              RFC 1950 is an Informational RFC, thus it may be less
1312	              stable than this specification.  On the other hand, this
1313	              downward reference was present since the publication of
1314	              RFC 2068 in 1997 ([RFC2068]), therefore it is unlikely to
1315	              cause problems in practice.  See also [BCP97].

1317	   [RFC1951]  Deutsch, P., "DEFLATE Compressed Data Format Specification
1318	              version 1.3", RFC 1951, May 1996.

1320	              RFC 1951 is an Informational RFC, thus it may be less
1321	              stable than this specification.  On the other hand, this
1322	              downward reference was present since the publication of
1323	              RFC 2068 in 1997 ([RFC2068]), therefore it is unlikely to
1324	              cause problems in practice.  See also [BCP97].

1326	   [RFC1952]  Deutsch, P., Gailly, J-L., Adler, M., Deutsch, L., and G.
1327	              Randers-Pehrson, "GZIP file format specification version
1328	              4.3", RFC 1952, May 1996.

1330	              RFC 1952 is an Informational RFC, thus it may be less
1331	              stable than this specification.  On the other hand, this
1332	              downward reference was present since the publication of
1333	              RFC 2068 in 1997 ([RFC2068]), therefore it is unlikely to
1334	              cause problems in practice.  See also [BCP97].

1336	   [RFC2045]  Freed, N. and N. Borenstein, "Multipurpose Internet Mail
1337	              Extensions (MIME) Part One: Format of Internet Message
1338	              Bodies", RFC 2045, November 1996.

1340	   [RFC2046]  Freed, N. and N. Borenstein, "Multipurpose Internet Mail
1341	              Extensions (MIME) Part Two: Media Types", RFC 2046,
1342	              November 1996.

1344	   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
1345	              Requirement Levels", BCP 14, RFC 2119, March 1997.

1347	   [RFC4647]  Phillips, A., Ed. and M. Davis, Ed., "Matching of Language
1348	              Tags", BCP 47, RFC 4647, September 2006.

1350	   [RFC5234]  Crocker, D., Ed. and P. Overell, "Augmented BNF for Syntax
1351	              Specifications: ABNF", STD 68, RFC 5234, January 2008.

1353	9.2.  Informative References

1355	   [BCP97]    Klensin, J. and S. Hartman, "Handling Normative References
1356	              to Standards-Track Documents", BCP 97, RFC 4897,
1357	              June 2007.

1359	   [RFC1945]  Berners-Lee, T., Fielding, R., and H. Nielsen, "Hypertext
1360	              Transfer Protocol -- HTTP/1.0", RFC 1945, May 1996.

1362	   [RFC2049]  Freed, N. and N. Borenstein, "Multipurpose Internet Mail
1363	              Extensions (MIME) Part Five: Conformance Criteria and
1364	              Examples", RFC 2049, November 1996.

1366	   [RFC2068]  Fielding, R., Gettys, J., Mogul, J., Nielsen, H., and T.
1367	              Berners-Lee, "Hypertext Transfer Protocol -- HTTP/1.1",
1368	              RFC 2068, January 1997.

1370	   [RFC2076]  Palme, J., "Common Internet Message Headers", RFC 2076,
1371	              February 1997.

1373	   [RFC2183]  Troost, R., Dorner, S., and K. Moore, "Communicating
1374	              Presentation Information in Internet Messages: The
1375	              Content-Disposition Header Field", RFC 2183, August 1997.

1377	   [RFC2277]  Alvestrand, H., "IETF Policy on Character Sets and
1378	              Languages", BCP 18, RFC 2277, January 1998.

1380	   [RFC2388]  Masinter, L., "Returning Values from Forms:  multipart/
1381	              form-data", RFC 2388, August 1998.

1383	   [RFC2557]  Palme, F., Hopmann, A., Shelness, N., and E. Stefferud,
1384	              "MIME Encapsulation of Aggregate Documents, such as HTML
1385	              (MHTML)", RFC 2557, March 1999.

1387	   [RFC2616]  Fielding, R., Gettys, J., Mogul, J., Frystyk, H.,
1388	              Masinter, L., Leach, P., and T. Berners-Lee, "Hypertext
1389	              Transfer Protocol -- HTTP/1.1", RFC 2616, June 1999.

1391	   [RFC3629]  Yergeau, F., "UTF-8, a transformation format of ISO
1392	              10646", RFC 3629, STD 63, November 2003.

1394	   [RFC3864]  Klyne, G., Nottingham, M., and J. Mogul, "Registration
1395	              Procedures for Message Header Fields", BCP 90, RFC 3864,
1396	              September 2004.

1398	   [RFC4288]  Freed, N. and J. Klensin, "Media Type Specifications and
1399	              Registration Procedures", BCP 13, RFC 4288, December 2005.

1401	   [RFC5322]  Resnick, P., "Internet Message Format", RFC 5322,
1402	              October 2008.

1404	Appendix A.  Differences Between HTTP Entities and RFC 2045 Entities

1406	   HTTP/1.1 uses many of the constructs defined for Internet Mail
1407	   ([RFC5322]) and the Multipurpose Internet Mail Extensions (MIME
1408	   [RFC2045]) to allow entities to be transmitted in an open variety of
1409	   representations and with extensible mechanisms.  However, RFC 2045
1410	   discusses mail, and HTTP has a few features that are different from
1411	   those described in RFC 2045.  These differences were carefully chosen
1412	   to optimize performance over binary connections, to allow greater
1413	   freedom in the use of new media types, to make date comparisons
1414	   easier, and to acknowledge the practice of some early HTTP servers
1415	   and clients.

1417	   This appendix describes specific areas where HTTP differs from RFC
1418	   2045.  Proxies and gateways to strict MIME environments SHOULD be
1419	   aware of these differences and provide the appropriate conversions
1420	   where necessary.  Proxies and gateways from MIME environments to HTTP
1421	   also need to be aware of the differences because some conversions
1422	   might be required.

1424	A.1.  MIME-Version

1426	   HTTP is not a MIME-compliant protocol.  However, HTTP/1.1 messages
1427	   MAY include a single MIME-Version general-header field to indicate
1428	   what version of the MIME protocol was used to construct the message.
1429	   Use of the MIME-Version header field indicates that the message is in
1430	   full compliance with the MIME protocol (as defined in [RFC2045]).
1431	   Proxies/gateways are responsible for ensuring full compliance (where
1432	   possible) when exporting HTTP messages to strict MIME environments.

1434	     MIME-Version   = "MIME-Version" ":" OWS MIME-Version-v
1435	     MIME-Version-v = 1*DIGIT "." 1*DIGIT

1437	   MIME version "1.0" is the default for use in HTTP/1.1.  However,
1438	   HTTP/1.1 message parsing and semantics are defined by this document
1439	   and not the MIME specification.

1441	A.2.  Conversion to Canonical Form

1443	   [RFC2045] requires that an Internet mail entity be converted to
1444	   canonical form prior to being transferred, as described in Section 4
1445	   of [RFC2049].  Section 2.3.1 of this document describes the forms
1446	   allowed for subtypes of the "text" media type when transmitted over
1447	   HTTP.  [RFC2046] requires that content with a type of "text"
1448	   represent line breaks as CRLF and forbids the use of CR or LF outside
1449	   of line break sequences.  HTTP allows CRLF, bare CR, and bare LF to
1450	   indicate a line break within text content when a message is
1451	   transmitted over HTTP.

1453	   Where it is possible, a proxy or gateway from HTTP to a strict MIME
1454	   environment SHOULD translate all line breaks within the text media
1455	   types described in Section 2.3.1 of this document to the RFC 2049
1456	   canonical form of CRLF.  Note, however, that this might be
1457	   complicated by the presence of a Content-Encoding and by the fact
1458	   that HTTP allows the use of some character sets which do not use
1459	   octets 13 and 10 to represent CR and LF, as is the case for some
1460	   multi-byte character sets.

1462	   Implementors should note that conversion will break any cryptographic
1463	   checksums applied to the original content unless the original content
1464	   is already in canonical form.  Therefore, the canonical form is
1465	   recommended for any content that uses such checksums in HTTP.

1467	A.3.  Conversion of Date Formats

1469	   HTTP/1.1 uses a restricted set of date formats (Section 3.2 of
1470	   [Part1]) to simplify the process of date comparison.  Proxies and
1471	   gateways from other protocols SHOULD ensure that any Date header
1472	   field present in a message conforms to one of the HTTP/1.1 formats
1473	   and rewrite the date if necessary.

1475	A.4.  Introduction of Content-Encoding

1477	   RFC 2045 does not include any concept equivalent to HTTP/1.1's
1478	   Content-Encoding header field.  Since this acts as a modifier on the
1479	   media type, proxies and gateways from HTTP to MIME-compliant
1480	   protocols MUST either change the value of the Content-Type header
1481	   field or decode the entity-body before forwarding the message.  (Some
1482	   experimental applications of Content-Type for Internet mail have used
1483	   a media-type parameter of ";conversions=<content-coding>" to perform
1484	   a function equivalent to Content-Encoding.  However, this parameter
1485	   is not part of RFC 2045).

1487	A.5.  No Content-Transfer-Encoding

1489	   HTTP does not use the Content-Transfer-Encoding field of RFC 2045.
1490	   Proxies and gateways from MIME-compliant protocols to HTTP MUST
1491	   remove any Content-Transfer-Encoding prior to delivering the response
1492	   message to an HTTP client.

1494	   Proxies and gateways from HTTP to MIME-compliant protocols are
1495	   responsible for ensuring that the message is in the correct format
1496	   and encoding for safe transport on that protocol, where "safe
1497	   transport" is defined by the limitations of the protocol being used.
1498	   Such a proxy or gateway SHOULD label the data with an appropriate
1499	   Content-Transfer-Encoding if doing so will improve the likelihood of
1500	   safe transport over the destination protocol.

1502	A.6.  Introduction of Transfer-Encoding

1504	   HTTP/1.1 introduces the Transfer-Encoding header field (Section 8.7
1505	   of [Part1]).  Proxies/gateways MUST remove any transfer-coding prior
1506	   to forwarding a message via a MIME-compliant protocol.

1508	A.7.  MHTML and Line Length Limitations

1510	   HTTP implementations which share code with MHTML [RFC2557]
1511	   implementations need to be aware of MIME line length limitations.
1512	   Since HTTP does not have this limitation, HTTP does not fold long
1513	   lines.  MHTML messages being transported by HTTP follow all
1514	   conventions of MHTML, including line length limitations and folding,
1515	   canonicalization, etc., since HTTP transports all message-bodies as
1516	   payload (see Section 2.3.2) and does not interpret the content or any
1517	   MIME header lines that might be contained therein.

1519	Appendix B.  Additional Features

1521	   [RFC1945] and [RFC2068] document protocol elements used by some
1522	   existing HTTP implementations, but not consistently and correctly
1523	   across most HTTP/1.1 applications.  Implementors are advised to be
1524	   aware of these features, but cannot rely upon their presence in, or
1525	   interoperability with, other HTTP/1.1 applications.  Some of these
1526	   describe proposed experimental features, and some describe features
1527	   that experimental deployment found lacking that are now addressed in
1528	   the base HTTP/1.1 specification.

1530	   A number of other headers, such as Content-Disposition and Title,
1531	   from SMTP and MIME are also often implemented (see [RFC2076]).

1533	B.1.  Content-Disposition

1535	   The Content-Disposition response-header field has been proposed as a
1536	   means for the origin server to suggest a default filename if the user
1537	   requests that the content is saved to a file.  This usage is derived
1538	   from the definition of Content-Disposition in [RFC2183].

1540	     content-disposition = "Content-Disposition" ":" OWS
1541	                           content-disposition-v
1542	     content-disposition-v = disposition-type
1543	                             *( OWS ";" OWS disposition-parm )
1544	     disposition-type = "attachment" / disp-extension-token
1545	     disposition-parm = filename-parm / disp-extension-parm
1546	     filename-parm = "filename" "=" quoted-string
1547	     disp-extension-token = token
1548	     disp-extension-parm = token "=" ( token / quoted-string )

1550	   An example is

1552	     Content-Disposition: attachment; filename="fname.ext"

1554	   The receiving user agent SHOULD NOT respect any directory path
1555	   information present in the filename-parm parameter, which is the only
1556	   parameter believed to apply to HTTP implementations at this time.
1557	   The filename SHOULD be treated as a terminal component only.

1559	   If this header is used in a response with the application/
1560	   octet-stream content-type, the implied suggestion is that the user
1561	   agent should not display the response, but directly enter a `save
1562	   response as...' dialog.

1564	   See Section 7.2 for Content-Disposition security issues.

1566	Appendix C.  Compatibility with Previous Versions

1568	C.1.  Changes from RFC 2068

1570	   Transfer-coding and message lengths all interact in ways that
1571	   required fixing exactly when chunked encoding is used (to allow for
1572	   transfer encoding that may not be self delimiting); it was important
1573	   to straighten out exactly how message lengths are computed.
1574	   (Section 3.2.2, see also [Part1], [Part5] and [Part6]).

1576	   Charset wildcarding is introduced to avoid explosion of character set
1577	   names in accept headers.  (Section 5.2)

1579	   Content-Base was deleted from the specification: it was not
1580	   implemented widely, and there is no simple, safe way to introduce it
1581	   without a robust extension mechanism.  In addition, it is used in a
1582	   similar, but not identical fashion in MHTML [RFC2557].

1584	   A content-coding of "identity" was introduced, to solve problems
1585	   discovered in caching.  (Section 2.2)

1587	   The Alternates, Content-Version, Derived-From, Link, URI, Public and
1588	   Content-Base header fields were defined in previous versions of this
1589	   specification, but not commonly implemented.  See Section 19.6.2 of
1590	   [RFC2068].

1592	C.2.  Changes from RFC 2616

1594	   Clarify contexts that charset is used in.  (Section 2.1)

1596	   Remove reference to non-existant identity transfer-coding value
1597	   tokens.  (Appendix A.5)

1599	Appendix D.  Collected ABNF

1601	   Accept = "Accept:" OWS Accept-v
1602	   Accept-Charset = "Accept-Charset:" OWS Accept-Charset-v
1603	   Accept-Charset-v = *( "," OWS ) ( charset / "*" ) [ OWS ";" OWS "q="
1604	    qvalue ] *( OWS "," [ OWS ( charset / "*" ) [ OWS ";" OWS "q="
1605	    qvalue ] ] )
1606	   Accept-Encoding = "Accept-Encoding:" OWS Accept-Encoding-v
1607	   Accept-Encoding-v = [ ( "," / ( codings [ OWS ";" OWS "q=" qvalue ] )
1608	    ) *( OWS "," [ OWS codings [ OWS ";" OWS "q=" qvalue ] ] ) ]
1609	   Accept-Language = "Accept-Language:" OWS Accept-Language-v
1610	   Accept-Language-v = *( "," OWS ) language-range [ OWS ";" OWS "q="
1611	    qvalue ] *( OWS "," [ OWS language-range [ OWS ";" OWS "q=" qvalue ]
1612	    ] )
1613	   Accept-v = [ ( "," / ( media-range [ accept-params ] ) ) *( OWS "," [
1614	    OWS media-range [ accept-params ] ] ) ]

1616	   Content-Encoding = "Content-Encoding:" OWS Content-Encoding-v
1617	   Content-Encoding-v = *( "," OWS ) content-coding *( OWS "," [ OWS
1618	    content-coding ] )
1619	   Content-Language = "Content-Language:" OWS Content-Language-v
1620	   Content-Language-v = *( "," OWS ) language-tag *( OWS "," [ OWS
1621	    language-tag ] )
1622	   Content-Length = <Content-Length, defined in [Part1], Section 8.2>
1623	   Content-Location = "Content-Location:" OWS Content-Location-v
1624	   Content-Location-v = absolute-URI / partial-URI
1625	   Content-MD5 = "Content-MD5:" OWS Content-MD5-v
1626	   Content-MD5-v = <base64 of 128 bit MD5 digest as per [RFC1864]>
1627	   Content-Range = <Content-Range, defined in [Part5], Section 5.2>
1628	   Content-Type = "Content-Type:" OWS Content-Type-v
1629	   Content-Type-v = media-type

1631	   Expires = <Expires, defined in [Part6], Section 3.3>

1633	   Last-Modified = <Last-Modified, defined in [Part4], Section 6.6>

1635	   MIME-Version = "MIME-Version:" OWS MIME-Version-v
1636	   MIME-Version-v = 1*DIGIT "." 1*DIGIT

1638	   OWS = <OWS, defined in [Part1], Section 1.2.2>

1640	   absolute-URI = <absolute-URI, defined in [Part1], Section 2.1>
1641	   accept-ext = OWS ";" OWS token [ "=" ( token / quoted-string ) ]
1642	   accept-params = OWS ";" OWS "q=" qvalue *accept-ext
1643	   attribute = token

1645	   charset = token
1646	   codings = ( content-coding / "*" )
1647	   content-coding = token
1648	   content-disposition = "Content-Disposition:" OWS
1649	    content-disposition-v
1650	   content-disposition-v = disposition-type *( OWS ";" OWS
1651	    disposition-parm )

1653	   disp-extension-parm = token "=" ( token / quoted-string )
1654	   disp-extension-token = token
1655	   disposition-parm = filename-parm / disp-extension-parm
1656	   disposition-type = "attachment" / disp-extension-token

1658	   entity-body = *OCTET
1659	   entity-header = Content-Encoding / Content-Language / Content-Length
1660	    / Content-Location / Content-MD5 / Content-Range / Content-Type /
1661	    Expires / Last-Modified / extension-header
1662	   extension-header = message-header

1664	   filename-parm = "filename=" quoted-string

1666	   language-range = <language-range, defined in [RFC4647], Section 2.1>
1667	   language-tag = primary-tag *( "-" subtag )

1669	   media-range = ( "*/*" / ( type "/*" ) / ( type "/" subtype ) ) *( OWS
1670	    ";" OWS parameter )
1671	   media-type = type "/" subtype *( OWS ";" OWS parameter )
1672	   message-header = <message-header, defined in [Part1], Section 4.2>

1674	   parameter = attribute "=" value
1675	   partial-URI = <partial-URI, defined in [Part1], Section 2.1>
1676	   primary-tag = 1*8ALPHA

1678	   quoted-string = <quoted-string, defined in [Part1], Section 1.2.2>
1679	   qvalue = <qvalue, defined in [Part1], Section 3.5>

1681	   subtag = 1*8ALPHA
1682	   subtype = token

1684	   token = <token, defined in [Part1], Section 1.2.2>
1685	   type = token

1687	   value = token / quoted-string

1689	   ABNF diagnostics:

1691	   ; Accept defined but not used
1692	   ; Accept-Charset defined but not used
1693	   ; Accept-Encoding defined but not used
1694	   ; Accept-Language defined but not used
1695	   ; MIME-Version defined but not used
1696	   ; content-disposition defined but not used
1697	   ; entity-body defined but not used
1698	   ; entity-header defined but not used

1700	Appendix E.  Change Log (to be removed by RFC Editor before publication)

1702	E.1.  Since RFC2616

1704	   Extracted relevant partitions from [RFC2616].

1706	E.2.  Since draft-ietf-httpbis-p3-payload-00

1708	   Closed issues:

1710	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/8>: "Media Type
1711	      Registrations" (<http://purl.org/NET/http-errata#media-reg>)

1713	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/14>: "Clarification
1714	      regarding quoting of charset values"
1715	      (<http://purl.org/NET/http-errata#charactersets>)

1717	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/16>: "Remove
1718	      'identity' token references"
1719	      (<http://purl.org/NET/http-errata#identity>)

1721	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/25>: "Accept-
1722	      Encoding BNF"

1724	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/35>: "Normative and
1725	      Informative references"

1727	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/46>: "RFC1700
1728	      references"

1730	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/55>: "Updating to
1731	      RFC4288"

1733	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/65>: "Informative
1734	      references"

1736	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/66>: "ISO-8859-1
1737	      Reference"

1739	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/68>: "Encoding
1740	      References Normative"

1742	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/86>: "Normative up-
1743	      to-date references"

1745	E.3.  Since draft-ietf-httpbis-p3-payload-01

1747	   Ongoing work on ABNF conversion
1748	   (<http://tools.ietf.org/wg/httpbis/trac/ticket/36>):

1750	   o  Add explicit references to BNF syntax and rules imported from
1751	      other parts of the specification.

1753	E.4.  Since draft-ietf-httpbis-p3-payload-02

1755	   Closed issues:

1757	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/67>: "Quoting
1758	      Charsets"

1760	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/105>:
1761	      "Classification for Allow header"

1763	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/115>: "missing
1764	      default for qvalue in description of Accept-Encoding"

1766	   Ongoing work on IANA Message Header Registration
1767	   (<http://tools.ietf.org/wg/httpbis/trac/ticket/40>):

1769	   o  Reference RFC 3984, and update header registrations for headers
1770	      defined in this document.

1772	E.5.  Since draft-ietf-httpbis-p3-payload-03

1774	   Closed issues:

1776	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/67>: "Quoting
1777	      Charsets"

1779	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/113>: "language tag
1780	      matching (Accept-Language) vs RFC4647"

1782	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/121>: "RFC 1806 has
1783	      been replaced by RFC2183"

1785	   Other changes:

1787	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/68>: "Encoding
1788	      References Normative" -- rephrase the annotation and reference
1789	      [BCP97].

1791	E.6.  Since draft-ietf-httpbis-p3-payload-04

1793	   Closed issues:

1795	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/132>: "RFC 2822 is
1796	      updated by RFC 5322"

1798	   Ongoing work on ABNF conversion
1799	   (<http://tools.ietf.org/wg/httpbis/trac/ticket/36>):

1801	   o  Use "/" instead of "|" for alternatives.

1803	   o  Introduce new ABNF rules for "bad" whitespace ("BWS"), optional
1804	      whitespace ("OWS") and required whitespace ("RWS").

1806	   o  Rewrite ABNFs to spell out whitespace rules, factor out header
1807	      value format definitions.

1809	E.7.  Since draft-ietf-httpbis-p3-payload-05

1811	   Closed issues:

1813	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/118>: "Join
1814	      "Differences Between HTTP Entities and RFC 2045 Entities"?"

1816	   Final work on ABNF conversion
1817	   (<http://tools.ietf.org/wg/httpbis/trac/ticket/36>):

1819	   o  Add appendix containing collected and expanded ABNF, reorganize
1820	      ABNF introduction.

1822	   Other changes:

1824	   o  Move definition of quality values into Part 1.

1826	E.8.  Since draft-ietf-httpbis-p3-payload-06

1828	   Closed issues:

1830	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/80>: "Content-
1831	      Location isn't special"

1833	   o  <http://tools.ietf.org/wg/httpbis/trac/ticket/155>: "Content
1834	      Sniffing"

1836	Index

1838	   A
1839	      Accept header  16
1840	      Accept-Charset header  18
1841	      Accept-Encoding header  19
1842	      Accept-Language header  20
1843	      Alternates header  35

1845	   C
1846	      compress  8
1847	      Content Type Sniffing  13
1848	      Content-Base header  35
1849	      Content-Disposition header  34
1850	      Content-Encoding header  22
1851	      Content-Language header  23
1852	      Content-Location header  24
1853	      Content-MD5 header  24
1854	      Content-Type header  26
1855	      Content-Version header  35

1857	   D
1858	      deflate  8
1859	      Derived-From header  35

1861	   G
1862	      Grammar
1863	         Accept  16
1864	         Accept-Charset  18
1865	         Accept-Charset-v  18
1866	         Accept-Encoding  19
1867	         Accept-Encoding-v  19
1868	         accept-ext  16
1869	         Accept-Language  21
1870	         Accept-Language-v  21
1871	         accept-params  16
1872	         Accept-v  16
1873	         attribute  9
1874	         charset  7
1875	         codings  19
1876	         content-coding  8
1877	         content-disposition  34
1878	         content-disposition-v  34
1879	         Content-Encoding  22
1880	         Content-Encoding-v  22
1881	         Content-Language  23
1882	         Content-Language-v  23
1883	         Content-Location  24
1884	         Content-Location-v  24
1885	         Content-MD5  25
1886	         Content-MD5-v  25
1887	         Content-Type  26
1888	         Content-Type-v  26
1889	         disp-extension-parm  34
1890	         disp-extension-token  34
1891	         disposition-parm  34
1892	         disposition-type  34
1893	         entity-body  12
1894	         entity-header  12
1895	         extension-header  12
1896	         filename-parm  34
1897	         language-range  21
1898	         language-tag  11
1899	         media-range  16
1900	         media-type  9
1901	         MIME-Version  31
1902	         MIME-Version-v  31
1903	         parameter  9
1904	         primary-tag  11
1905	         subtag  11
1906	         subtype  9
1907	         type  9
1908	         value  9
1909	      gzip  8

1911	   H
1912	      Headers
1913	         Accept  16
1914	         Accept-Charset  18
1915	         Accept-Encoding  19
1916	         Accept-Language  20
1917	         Alternate  35
1918	         Content-Base  35
1919	         Content-Disposition  34
1920	         Content-Encoding  22
1921	         Content-Language  23
1922	         Content-Location  24
1923	         Content-MD5  24
1924	         Content-Type  26
1925	         Content-Version  35
1926	         Derived-From  35
1927	         Link  35
1928	         MIME-Version  31
1929	         Public  35
1930	         URI  35

1932	   I
1933	      identity  8

1935	   L
1936	      Link header  35

1938	   M
1939	      MIME-Version header  31

1941	   P
1942	      Public header  35

1944	   U
1945	      URI header  35

1947	Authors' Addresses

1949	   Roy T. Fielding (editor)
1950	   Day Software
1951	   23 Corporate Plaza DR, Suite 280
1952	   Newport Beach, CA  92660
1953	   USA

1955	   Phone: +1-949-706-5300
1956	   Fax:   +1-949-706-5305
1957	   Email: fielding@gbiv.com
1958	   URI:   http://roy.gbiv.com/

1960	   Jim Gettys
1961	   One Laptop per Child
1962	   21 Oak Knoll Road
1963	   Carlisle, MA  01741
1964	   USA

1966	   Email: jg@laptop.org
1967	   URI:   http://www.laptop.org/

1969	   Jeffrey C. Mogul
1970	   Hewlett-Packard Company
1971	   HP Labs, Large Scale Systems Group
1972	   1501 Page Mill Road, MS 1177
1973	   Palo Alto, CA  94304
1974	   USA

1976	   Email: JeffMogul@acm.org

1978	   Henrik Frystyk Nielsen
1979	   Microsoft Corporation
1980	   1 Microsoft Way
1981	   Redmond, WA  98052
1982	   USA

1984	   Email: henrikn@microsoft.com
1985	   Larry Masinter
1986	   Adobe Systems, Incorporated
1987	   345 Park Ave
1988	   San Jose, CA  95110
1989	   USA

1991	   Email: LMM@acm.org
1992	   URI:   http://larry.masinter.net/

1994	   Paul J. Leach
1995	   Microsoft Corporation
1996	   1 Microsoft Way
1997	   Redmond, WA  98052

1999	   Email: paulle@microsoft.com

2001	   Tim Berners-Lee
2002	   World Wide Web Consortium
2003	   MIT Computer Science and Artificial Intelligence Laboratory
2004	   The Stata Center, Building 32
2005	   32 Vassar Street
2006	   Cambridge, MA  02139
2007	   USA

2009	   Email: timbl@w3.org
2010	   URI:   http://www.w3.org/People/Berners-Lee/

2012	   Yves Lafon (editor)
2013	   World Wide Web Consortium
2014	   W3C / ERCIM
2015	   2004, rte des Lucioles
2016	   Sophia-Antipolis, AM  06902
2017	   France

2019	   Email: ylafon@w3.org
2020	   URI:   http://www.raubacapeu.net/people/yves/
2021	   Julian F. Reschke (editor)
2022	   greenbytes GmbH
2023	   Hafenweg 16
2024	   Muenster, NW  48155
2025	   Germany

2027	   Phone: +49 251 2807760
2028	   Fax:   +49 251 2807761
2029	   Email: julian.reschke@greenbytes.de
2030	   URI:   http://greenbytes.de/tech/webdav/