idnits 2.17.1 

draft-ietf-clue-data-model-schema-11.txt:

  Checking boilerplate required by RFC 5378 and the IETF Trust (see
  https://trustee.ietf.org/license-info):
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/1id-guidelines.txt:
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/checklist :
  ----------------------------------------------------------------------------

  ** There are 49 instances of too long lines in the document, the longest
     one being 7 characters in excess of 72.


  Miscellaneous warnings:
  ----------------------------------------------------------------------------

  == The copyright year in the IETF Trust and authors Copyright Line does not
     match the current year

  -- The document date (October 19, 2015) is 3105 days in the past.  Is this
     intentional?


  Checking references for intended status: Proposed Standard
  ----------------------------------------------------------------------------

     (See RFCs 3967 and 4897 for information about using normative references
     to lower-maturity documents in RFCs)

  == Missing Reference: '0-9' is mentioned on line 1113, but not defined

  == Outdated reference: A later version (-18) exists of
     draft-ietf-clue-datachannel-10

  == Outdated reference: A later version (-25) exists of
     draft-ietf-clue-framework-23

  == Outdated reference: A later version (-19) exists of
     draft-ietf-clue-protocol-06

  == Outdated reference: A later version (-38) exists of
     draft-ietf-ecrit-additional-data-37

  -- Obsolete informational reference (is this intentional?): RFC 5117
     (Obsoleted by RFC 7667)


     Summary: 1 error (**), 0 flaws (~~), 6 warnings (==), 2 comments (--).

     Run idnits with the --verbose option for more detailed information about
     the items above.

--------------------------------------------------------------------------------


2	CLUE Working Group                                             R. Presta
3	Internet-Draft                                               S P. Romano
4	Intended status: Standards Track                    University of Napoli
5	Expires: April 21, 2016                                 October 19, 2015

7	                 An XML Schema for the CLUE data model
8	                  draft-ietf-clue-data-model-schema-11

10	Abstract

12	   This document provides an XML schema file for the definition of CLUE
13	   data model types.

15	Status of This Memo

17	   This Internet-Draft is submitted in full conformance with the
18	   provisions of BCP 78 and BCP 79.

20	   Internet-Drafts are working documents of the Internet Engineering
21	   Task Force (IETF).  Note that other groups may also distribute
22	   working documents as Internet-Drafts.  The list of current Internet-
23	   Drafts is at http://datatracker.ietf.org/drafts/current/.

25	   Internet-Drafts are draft documents valid for a maximum of six months
26	   and may be updated, replaced, or obsoleted by other documents at any
27	   time.  It is inappropriate to use Internet-Drafts as reference
28	   material or to cite them other than as "work in progress."

30	   This Internet-Draft will expire on April 21, 2016.

32	Copyright Notice

34	   Copyright (c) 2015 IETF Trust and the persons identified as the
35	   document authors.  All rights reserved.

37	   This document is subject to BCP 78 and the IETF Trust's Legal
38	   Provisions Relating to IETF Documents
39	   (http://trustee.ietf.org/license-info) in effect on the date of
40	   publication of this document.  Please review these documents
41	   carefully, as they describe your rights and restrictions with respect
42	   to this document.  Code Components extracted from this document must
43	   include Simplified BSD License text as described in Section 4.e of
44	   the Trust Legal Provisions and are provided without warranty as
45	   described in the Simplified BSD License.

47	Table of Contents
48	   1.  Introduction . . . . . . . . . . . . . . . . . . . . . . . . .  4
49	   2.  Terminology  . . . . . . . . . . . . . . . . . . . . . . . . .  4
50	   3.  Definitions  . . . . . . . . . . . . . . . . . . . . . . . . .  4
51	   4.  XML Schema . . . . . . . . . . . . . . . . . . . . . . . . . .  6
52	   5.  <mediaCaptures>  . . . . . . . . . . . . . . . . . . . . . . . 17
53	   6.  <encodingGroups> . . . . . . . . . . . . . . . . . . . . . . . 17
54	   7.  <captureScenes>  . . . . . . . . . . . . . . . . . . . . . . . 18
55	   8.  <simultaneousSets> . . . . . . . . . . . . . . . . . . . . . . 18
56	   9.  <globalViews>  . . . . . . . . . . . . . . . . . . . . . . . . 18
57	   10. <captureEncodings> . . . . . . . . . . . . . . . . . . . . . . 18
58	   11. <mediaCapture> . . . . . . . . . . . . . . . . . . . . . . . . 18
59	     11.1.  captureID attribute . . . . . . . . . . . . . . . . . . . 19
60	     11.2.  mediaType attribute . . . . . . . . . . . . . . . . . . . 20
61	     11.3.  <captureSceneIDREF> . . . . . . . . . . . . . . . . . . . 20
62	     11.4.  <encGroupIDREF> . . . . . . . . . . . . . . . . . . . . . 20
63	     11.5.  <spatialInformation>  . . . . . . . . . . . . . . . . . . 20
64	       11.5.1.  <captureOrigin> . . . . . . . . . . . . . . . . . . . 21
65	       11.5.2.  <captureArea> . . . . . . . . . . . . . . . . . . . . 22
66	     11.6.  <nonSpatiallyDefinable> . . . . . . . . . . . . . . . . . 23
67	     11.7.  <content> . . . . . . . . . . . . . . . . . . . . . . . . 23
68	     11.8.  <synchronizationID> . . . . . . . . . . . . . . . . . . . 24
69	     11.9.  <allowSubsetChoice> . . . . . . . . . . . . . . . . . . . 24
70	     11.10. <policy>  . . . . . . . . . . . . . . . . . . . . . . . . 24
71	     11.11. <maxCaptures> . . . . . . . . . . . . . . . . . . . . . . 25
72	     11.12. <individual>  . . . . . . . . . . . . . . . . . . . . . . 26
73	     11.13. <description> . . . . . . . . . . . . . . . . . . . . . . 26
74	     11.14. <priority>  . . . . . . . . . . . . . . . . . . . . . . . 27
75	     11.15. <lang>  . . . . . . . . . . . . . . . . . . . . . . . . . 27
76	     11.16. <mobility>  . . . . . . . . . . . . . . . . . . . . . . . 27
77	     11.17. <relatedTo> . . . . . . . . . . . . . . . . . . . . . . . 27
78	     11.18. <view>  . . . . . . . . . . . . . . . . . . . . . . . . . 28
79	     11.19. <presentation>  . . . . . . . . . . . . . . . . . . . . . 28
80	       11.19.1. <embeddedText>  . . . . . . . . . . . . . . . . . . . 28
81	       11.19.2. <capturedPeople>  . . . . . . . . . . . . . . . . . . 29
82	     11.20. Audio captures  . . . . . . . . . . . . . . . . . . . . . 29
83	       11.20.1. <sensitivityPattern>  . . . . . . . . . . . . . . . . 30
84	     11.21. Video captures  . . . . . . . . . . . . . . . . . . . . . 30
85	     11.22. Text captures . . . . . . . . . . . . . . . . . . . . . . 31
86	     11.23. Other capture types . . . . . . . . . . . . . . . . . . . 31
87	     11.24. <captureScene>  . . . . . . . . . . . . . . . . . . . . . 32
88	       11.24.1. <sceneInformation>  . . . . . . . . . . . . . . . . . 33
89	       11.24.2. <sceneViews>  . . . . . . . . . . . . . . . . . . . . 33
90	       11.24.3. sceneID attribute . . . . . . . . . . . . . . . . . . 33
91	       11.24.4. scale attribute . . . . . . . . . . . . . . . . . . . 33
92	     11.25. <sceneView> . . . . . . . . . . . . . . . . . . . . . . . 34
93	       11.25.1. <mediaCaptureIDs> . . . . . . . . . . . . . . . . . . 35
94	       11.25.2. sceneViewID attribute . . . . . . . . . . . . . . . . 35
95	     11.26. <encodingGroup> . . . . . . . . . . . . . . . . . . . . . 35
96	       11.26.1. <maxGroupBandwidth> . . . . . . . . . . . . . . . . . 36
97	       11.26.2. <encodingIDList>  . . . . . . . . . . . . . . . . . . 36
98	       11.26.3. encodingGroupID attribute . . . . . . . . . . . . . . 36
99	     11.27. <simultaneousSet> . . . . . . . . . . . . . . . . . . . . 36
100	       11.27.1. setID attribute . . . . . . . . . . . . . . . . . . . 37
101	       11.27.2. mediaType attribute . . . . . . . . . . . . . . . . . 37
102	       11.27.3. <mediaCaptureIDREF> . . . . . . . . . . . . . . . . . 38
103	       11.27.4. <sceneViewIDREF>  . . . . . . . . . . . . . . . . . . 38
104	       11.27.5. <captureSceneIDREF> . . . . . . . . . . . . . . . . . 38
105	     11.28. <globalView>  . . . . . . . . . . . . . . . . . . . . . . 38
106	     11.29. <people>  . . . . . . . . . . . . . . . . . . . . . . . . 38
107	       11.29.1. <person>  . . . . . . . . . . . . . . . . . . . . . . 39
108	   12. <captureEncoding>  . . . . . . . . . . . . . . . . . . . . . . 40
109	     12.1.  <captureID> . . . . . . . . . . . . . . . . . . . . . . . 41
110	     12.2.  <encodingID>  . . . . . . . . . . . . . . . . . . . . . . 41
111	     12.3.  <configuredContent> . . . . . . . . . . . . . . . . . . . 41
112	   13. <clueInfo> . . . . . . . . . . . . . . . . . . . . . . . . . . 41
113	   14. XML Schema extensibility . . . . . . . . . . . . . . . . . . . 42
114	     14.1.  Example of extension  . . . . . . . . . . . . . . . . . . 43
115	   15. Security considerations  . . . . . . . . . . . . . . . . . . . 44
116	   16. IANA considerations  . . . . . . . . . . . . . . . . . . . . . 45
117	     16.1.  XML namespace registration  . . . . . . . . . . . . . . . 45
118	     16.2.  XML Schema registration . . . . . . . . . . . . . . . . . 46
119	     16.3.  MIME Media Type Registration for
120	            'application/clue_info+xml' . . . . . . . . . . . . . . . 46
121	   17. Sample XML file  . . . . . . . . . . . . . . . . . . . . . . . 47
122	   18. MCC example  . . . . . . . . . . . . . . . . . . . . . . . . . 54
123	   19. Diff with draft-ietf-clue-data-model-schema-10 version . . . . 60
124	   20. Diff with draft-ietf-clue-data-model-schema-09 version . . . . 61
125	   21. Diff with draft-ietf-clue-data-model-schema-08 version . . . . 61
126	   22. Diff with draft-ietf-clue-data-model-schema-07 version . . . . 61
127	   23. Diff with draft-ietf-clue-data-model-schema-06 version . . . . 61
128	   24. Diff with draft-ietf-clue-data-model-schema-04 version . . . . 62
129	   25. Diff with draft-ietf-clue-data-model-schema-03 version . . . . 63
130	   26. Diff with draft-ietf-clue-data-model-schema-02 version . . . . 63
131	   27. Acknowledgments  . . . . . . . . . . . . . . . . . . . . . . . 63
132	   28. Informative References . . . . . . . . . . . . . . . . . . . . 63

134	1.  Introduction

136	   This document provides an XML schema file for the definition of CLUE
137	   data model types.

139	   The schema is based on information contained in
140	   [I-D.ietf-clue-framework].  It encodes information and constraints
141	   defined in the aforementioned document in order to provide a formal
142	   representation of the concepts therein presented.

144	   The document aims at the definition of a coherent structure for
145	   information associated with the description of a telepresence
146	   scenario.  Such information is used within the CLUE protocol messages
147	   ([I-D.ietf-clue-protocol]) enabling the dialogue between a Media
148	   Provider and a Media Consumer.  CLUE protocol messages, indeed, are
149	   XML messages allowing (i) a Media Provider to advertise its
150	   telepresence capabilities in terms of media captures, capture scenes,
151	   and other features envisioned in the CLUE framework, according to the
152	   format herein defined and (ii) a Media Consumer to request the
153	   desired telepresence options in the form of capture encodings,
154	   represented as described in this document.

156	2.  Terminology

158	   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
159	   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
160	   document are to be interpreted as described in [RFC2119].

162	3.  Definitions

164	   This document refers to the same definitions used in
165	   [I-D.ietf-clue-framework], except for the "CLUE Participant"
166	   definition.  We briefly recall herein some of the main terms used in
167	   the document.

169	   Audio Capture:  Media Capture for audio.  Denoted as ACn in the
170	      examples in this document.

172	   Capture:  Same as Media Capture.

174	   Capture Device:  A device that converts physical input, such as
175	      audio, video or text, into an electrical signal, in most cases to
176	      be fed into a media encoder.

178	   Capture Encoding:  A specific encoding of a Media Capture, to be sent
179	      by a Media Provider to a Media Consumer via RTP.

181	   Capture Scene:  A structure representing a spatial region captured by
182	      one or more Capture Devices, each capturing media representing a
183	      portion of the region.  The spatial region represented by a
184	      Capture Scene MAY correspond to a real region in physical space,
185	      such as a room.  A Capture Scene includes attributes and one or
186	      more Capture Scene Views, with each view including one or more
187	      Media Captures.

189	   Capture Scene View:  A list of Media Captures of the same media type
190	      that together form one way to represent the entire Capture Scene..

192	   CLUE Participant:  This term is not imported from the framework
193	      terminology.  A CLUE Participant identifies a generic entity
194	      (either an Endpoint or a MCU) making use of the CLUE protocol.

196	   Consumer:  Short for Media Consumer.

198	   Encoding or Individual Encoding:  A set of parameters representing a
199	      way to encode a Media Capture to become a Capture Encoding.

201	   Encoding Group:  A set of encoding parameters representing a total
202	      media encoding capability to be sub-divided across potentially
203	      multiple Individual Encodings.

205	   Endpoint  A CLUE-capable device which is the logical point of final
206	      termination through receiving, decoding and rendering, and/or
207	      initiation through capturing, encoding, and sending of media
208	      streams.  An endpoint consists of one or more physical devices
209	      which source and sink media streams, and exactly one [RFC4353]
210	      Participant (which, in turn, includes exactly one SIP User Agent).
211	      Endpoints can be anything from multiscreen/multicamera rooms to
212	      handheld devices.

214	   Media:  Any data that, after suitable encoding, can be conveyed over
215	      RTP, including audio, video or timed text.

217	   Media Capture:  A source of Media, such as from one or more Capture
218	      Devices or constructed from other Media streams.

220	   Media Consumer:  A CLUE-capable device that intends to receive
221	      Capture Encodings.

223	   Media Provider:  A CLUE-capable device that intends to send Capture
224	      Encodings.

226	   Multiple Content Capture:  A Capture that mixes and/or switches other
227	      Captures of a single type.  (E.g. all audio or all video.)
228	      Particular Media Captures may or may not be present in the
229	      resultant Capture Encoding depending on time or space.  Denoted as
230	      MCCn in the example cases in this document.

232	   Multipoint Control Unit (MCU):  A CLUE-capable device that connects
233	      two or more endpoints together into one single multimedia
234	      conference [RFC5117].  An MCU includes an [RFC4353] like Mixer,
235	      without the [RFC4353] requirement to send media to each
236	      participant.

238	   Plane of Interest:  The spatial plane containing the most relevant
239	      Subject matter.

241	   Provider:  Same as Media Provider.

243	   Render:   The process of reproducing the received Streams like, for
244	      instance, displaying of the remote video on the Media Consumer's
245	      screens, or playing of the remote audio through loudspeakers.

247	   Scene:  Same as Capture Scene.

249	   Simultaneous Transmission Set:  A set of Media Captures that can be
250	      transmitted simultaneously from a Media Provider.

252	   Single Media Capture:  A capture which contains media from a single
253	      source capture device, e.g. an audio capture from a single
254	      microphone, a video capture from a single camera.

256	   Spatial Relation:  The arrangement in space of two objects, in
257	      contrast to relation in time or other relationships.

259	   Stream:  A Capture Encoding sent from a Media Provider to a Media
260	      Consumer via RTP [RFC3550].

262	   Stream Characteristics:  The union of the features used to describe a
263	      Stream in the CLUE environment and in the SIP-SDP environment.

265	   Video Capture:  A Media Capture for video.

267	4.  XML Schema

269	   This section contains the CLUE data model schema definition.

271	   The element and attribute definitions are formal representations of
272	   the concepts needed to describe the capabilities of a Media Provider
273	   and the streams that are requested by a Media Consumer given the
274	   Media Provider's ADVERTISEMENT ([I-D.ietf-clue-protocol]).

276	   The main groups of information are:

278	      <mediaCaptures>: the list of media captures available (Section 5)

280	      <encodingGroups>: the list of encoding groups (Section 6)

282	      <captureScenes>: the list of capture scenes (Section 7)

284	      <simultaneousSets>: the list of simultaneous transmission sets
285	      (Section 8)

287	      <globalViews>: the list of global views sets (Section 9)

289	      <people>: meta data about the participants represented in the
290	      telepresence session (Section 11.29).

292	      <captureEncodings>: the list of instantiated capture encodings
293	      (Section 10)

295	   All of the above refers to concepts that have been introduced in
296	   [I-D.ietf-clue-framework] and further detailed in this document.

298	<?xml version="1.0" encoding="UTF-8" ?>
299	<xs:schema
300	   targetNamespace="urn:ietf:params:xml:ns:clue-info"
301	   xmlns:tns="urn:ietf:params:xml:ns:clue-info"
302	   xmlns:xs="http://www.w3.org/2001/XMLSchema"
303	   xmlns="urn:ietf:params:xml:ns:clue-info"
304	   xmlns:xcard="urn:ietf:params:xml:ns:vcard-4.0"
305	   elementFormDefault="qualified"
306	   attributeFormDefault="unqualified"
307	   version="1.0">

309	<!-- Import xcard XML schema -->
310	<xs:import namespace="urn:ietf:params:xml:ns:vcard-4.0"
311	schemaLocation="xcard.xsd"/>

313	<!-- ELEMENT DEFINITIONS -->
314	<xs:element name="mediaCaptures" type="mediaCapturesType"/>
315	<xs:element name="encodingGroups" type="encodingGroupsType"/>
316	<xs:element name="captureScenes" type="captureScenesType"/>
317	<xs:element name="simultaneousSets" type="simultaneousSetsType"/>
318	<xs:element name="globalViews" type="globalViewsType"/>
319	<xs:element name="people" type="peopleType"/>
320	<xs:element name="captureEncodings" type="captureEncodingsType"/>

322	<!-- MEDIA CAPTURES TYPE -->
323	<!-- envelope of media captures -->
324	<xs:complexType name="mediaCapturesType">
325	 <xs:sequence>
326	   <xs:element name="mediaCapture" type="mediaCaptureType"
327	   maxOccurs="unbounded"/>
328	 </xs:sequence>
329	</xs:complexType>

331	<!-- DESCRIPTION element -->
332	<xs:element name="description">
333	 <xs:complexType>
334	  <xs:simpleContent>
335	   <xs:extension base="xs:string">
336	     <xs:attribute name="lang" type="xs:language"/>
337	   </xs:extension>
338	  </xs:simpleContent>
339	 </xs:complexType>
340	</xs:element>

342	<!-- MEDIA CAPTURE TYPE -->
343	<xs:complexType name="mediaCaptureType" abstract="true">
344	  <xs:sequence>
345	    <!-- mandatory fields -->
346	    <xs:element name="captureSceneIDREF" type="xs:IDREF"/>
347	    <xs:choice>
348	      <xs:sequence>
349	        <xs:element name="spatialInformation"
350	                    type="tns:spatialInformationType"/>
351	      </xs:sequence>
352	      <xs:element name="nonSpatiallyDefinable" type="xs:boolean" fixed="true"/>
353	    </xs:choice>
354	    <!-- for handling multi-content captures: -->
355	    <xs:choice>
356	      <xs:sequence>
357	        <xs:element name="synchronizationID" type="xs:ID" minOccurs="0"/>
358	        <xs:element name="content" type="contentType" minOccurs="0"/>
359	        <xs:element name="policy" type="policyType" minOccurs="0"/>
360	        <xs:element name="maxCaptures" type="maxCapturesType" minOccurs="0"/>
361	        <xs:element name="allowSubsetChoice" type="xs:boolean" minOccurs="0"/>
362	      </xs:sequence>
363	      <xs:element name="individual" type="xs:boolean" fixed="true"/>
364	    </xs:choice>
365	    <!-- optional fields -->
366	    <xs:element name="encGroupIDREF" type="xs:IDREF" minOccurs="0"/>
367	    <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
368	    <xs:element name="priority" type="xs:unsignedInt" minOccurs="0"/>
369	    <xs:element name="lang" type="xs:language" minOccurs="0"
370	     maxOccurs="unbounded"/>
371	    <xs:element name="mobility" type="mobilityType" minOccurs="0"/>
372	    <xs:element name="presentation" type="presentationType" minOccurs="0"/>
373	    <xs:element ref="embeddedText" minOccurs="0"/>
374	    <xs:element name="view" type="viewType" minOccurs="0"/>
375	    <xs:element name="capturedPeople" type="capturedPeopleType" minOccurs="0"/>
376	    <xs:element name="relatedTo" type="xs:IDREF" minOccurs="0"/>
377	  </xs:sequence>
378	  <xs:attribute name="captureID" type="xs:ID" use="required"/>
379	  <xs:attribute name="mediaType" type="xs:string" use="required"/>

381	</xs:complexType>

383	<!-- POLICY TYPE -->
384	<xs:simpleType name="policyType">
385	 <xs:restriction base="xs:string">
386	      <xs:pattern value="([a-zA-Z0-9])+[:]([0-9])+"/>
387	    </xs:restriction>
388	</xs:simpleType>

390	<!-- CONTENT TYPE -->
391	<xs:complexType name="contentType">
392	 <xs:sequence>
393	   <xs:element name="captureIDREF" type="xs:IDREF"
394	   minOccurs="0" maxOccurs="unbounded"/>
395	   <xs:element name="sceneViewIDREF" type="xs:IDREF"
396	   minOccurs="0" maxOccurs="unbounded"/>
397	   <xs:any namespace="##other" processContents="lax" minOccurs="0"
398	   maxOccurs="unbounded"/>
399	 </xs:sequence>
400	 <xs:anyAttribute namespace="##other" processContents="lax"/>
401	</xs:complexType>

403	<!-- MAX CAPTURES TYPE -->
404	<xs:complexType name="maxCapturesType">
405	  <xs:simpleContent>
406	    <xs:extension base="xs:positiveInteger">
407	      <xs:attribute name="exactNumber" type="xs:boolean"/>
408	    </xs:extension>
409	  </xs:simpleContent>
410	</xs:complexType>

412	<!-- CAPTURED PEOPLE TYPE -->
413	<xs:complexType name="capturedPeopleType">
414	<xs:sequence>
415	<xs:element name="personIDREF" type="xs:IDREF" maxOccurs="unbounded"/>
416	</xs:sequence>
417	</xs:complexType>

419	<!-- PEOPLE TYPE -->
420	<xs:complexType name="peopleType">
421	<xs:sequence>
422	<xs:element name="person" type="personType"
423	                    maxOccurs="unbounded"/>
424	</xs:sequence>
425	</xs:complexType>

427	<!-- PERSON TYPE -->
428	<xs:complexType name="personType">
429	  <xs:sequence>
430	     <xs:element name="personInfo" type="xcard:vcardType" maxOccurs="1"
431	                         minOccurs="0"/>
432	     <xs:element name="personType" type="personTypeType"
433	                         minOccurs="0"
434	                 maxOccurs="unbounded"/>
435	     <xs:any namespace="##other" processContents="lax" minOccurs="0"
436	                 maxOccurs="unbounded"/>
437	  </xs:sequence>
438	  <xs:attribute name="personID" type="xs:ID" use="required"/>
439	  <xs:anyAttribute namespace="##other" processContents="lax"/>
440	</xs:complexType>

442	<!-- PERSON TYPE TYPE -->
443	<xs:simpleType name="personTypeType">
444	        <xs:restriction base="xs:string">
445	        <xs:enumeration value="chairman"/>
446	        <xs:enumeration value="vice-chairman"/>
447	        <xs:enumeration value="minute taker"/>
448	        <xs:enumeration value="presenter"/>
449	        <xs:enumeration value="translator"/>
450	        <xs:enumeration value="timekeeper"/>
451	        <xs:enumeration value="attendee"/>
452	        <xs:enumeration value="observer"/>
453	        </xs:restriction>
454	</xs:simpleType>

456	<!-- VIEW TYPE -->
457	<xs:simpleType name="viewType">
458	 <xs:restriction base="xs:string">
459	  <xs:enumeration value="room"/>
460	  <xs:enumeration value="table"/>
461	  <xs:enumeration value="lectern"/>
462	  <xs:enumeration value="individual"/>
463	  <xs:enumeration value="audience"/>
464	 </xs:restriction>
465	</xs:simpleType>

467	<!-- PRESENTATION TYPE -->
468	<xs:simpleType name="presentationType">
469	 <xs:restriction base="xs:string">
470	  <xs:enumeration value="slides"/>
471	  <xs:enumeration value="image"/>
472	 </xs:restriction>
473	</xs:simpleType>

475	<!-- SPATIAL INFORMATION TYPE -->
476	<xs:complexType name="spatialInformationType">
477	 <xs:sequence>
478	  <xs:element name="captureOrigin" type="captureOriginType" minOccurs="0"/>
479	  <xs:element name="captureArea" type="captureAreaType" minOccurs="0"/>
480	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
481	  maxOccurs="unbounded"/>
482	 </xs:sequence>
483	 <xs:anyAttribute namespace="##other" processContents="lax"/>
484	</xs:complexType>

486	<!-- POINT TYPE -->
487	<xs:complexType name="pointType">
488	 <xs:sequence>
489	  <xs:element name="x" type="xs:decimal"/>
490	  <xs:element name="y" type="xs:decimal"/>
491	  <xs:element name="z" type="xs:decimal"/>
492	 </xs:sequence>
493	</xs:complexType>

495	<!-- CAPTURE ORIGIN TYPE -->
496	<xs:complexType name="captureOriginType">
497	  <xs:sequence>
498	        <xs:element name="capturePoint" type="pointType"></xs:element>
499	        <xs:element name="lineOfCapturePoint" type="pointType" minOccurs="0">
500	        </xs:element>
501	  </xs:sequence>
502	  <xs:anyAttribute namespace="##any" processContents="lax"/>
503	</xs:complexType>

505	<!-- CAPTURE AREA TYPE -->
506	<xs:complexType name="captureAreaType">
507	 <xs:sequence>
508	  <xs:element name="bottomLeft" type="pointType"/>
509	  <xs:element name="bottomRight" type="pointType"/>
510	  <xs:element name="topLeft" type="pointType"/>
511	  <xs:element name="topRight" type="pointType"/>
512	 </xs:sequence>
513	</xs:complexType>

515	<!-- MOBILITY TYPE -->
516	<xs:simpleType name="mobilityType">
517	 <xs:restriction base="xs:string">
518	  <xs:enumeration value="static"/>
519	  <xs:enumeration value="dynamic"/>
520	  <xs:enumeration value="highly-dynamic"/>
521	 </xs:restriction>
522	</xs:simpleType>

524	<!-- TEXT CAPTURE TYPE -->
525	<xs:complexType name="textCaptureType">
526	 <xs:complexContent>
527	  <xs:extension base="tns:mediaCaptureType">
528	  <xs:sequence>
529	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
530	    maxOccurs="unbounded"/>
531	  </xs:sequence>
532	  <xs:anyAttribute namespace="##other" processContents="lax"/>
533	  </xs:extension>
534	 </xs:complexContent>
535	</xs:complexType>

537	<!-- OTHER CAPTURE TYPE -->
538	<xs:complexType name="otherCaptureType">
539	 <xs:complexContent>
540	  <xs:extension base="tns:mediaCaptureType">
541	  <xs:sequence>
542	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
543	    maxOccurs="unbounded"/>
544	  </xs:sequence>
545	  <xs:anyAttribute namespace="##other" processContents="lax"/>
546	  </xs:extension>
547	 </xs:complexContent>
548	</xs:complexType>

550	<!-- AUDIO CAPTURE TYPE -->
551	<xs:complexType name="audioCaptureType">
552	 <xs:complexContent>
553	  <xs:extension base="tns:mediaCaptureType">
554	   <xs:sequence>
555	    <xs:element name="sensitivyPattern" type="sensitivyPatternType"
556	    minOccurs="0"/>
557	    <xs:any namespace="##other" processContents="lax" minOccurs="0"
558	    maxOccurs="unbounded"/>
559	   </xs:sequence>
560	   <xs:anyAttribute namespace="##other" processContents="lax"/>
561	  </xs:extension>
562	 </xs:complexContent>
563	</xs:complexType>

565	<!-- SENSITIVITY PATTERN TYPE -->
566	<xs:simpleType name="sensitivyPatternType">
567	 <xs:restriction base="xs:string">
568	  <xs:enumeration value="uni"/>
569	  <xs:enumeration value="shotgun"/>
570	  <xs:enumeration value="omni"/>
571	  <xs:enumeration value="figure8"/>
572	  <xs:enumeration value="cardioid"/>
573	  <xs:enumeration value="hyper-cardioid"/>
574	 </xs:restriction>
575	</xs:simpleType>

577	<!-- VIDEO CAPTURE TYPE -->
578	<xs:complexType name="videoCaptureType">
579	 <xs:complexContent>
580	  <xs:extension base="tns:mediaCaptureType">
581	   <xs:sequence>
582	    <xs:any namespace="##other" processContents="lax" minOccurs="0"
583	    maxOccurs="unbounded"/>
584	   </xs:sequence>
585	   <xs:anyAttribute namespace="##other" processContents="lax"/>
586	  </xs:extension>
587	 </xs:complexContent>
588	</xs:complexType>

590	<!-- EMBEDDED TEXT ELEMENT -->
591	<xs:element name="embeddedText">
592	 <xs:complexType>
593	  <xs:simpleContent>
594	   <xs:extension base="xs:boolean">
595	    <xs:attribute name="lang" type="xs:language"/>
596	   </xs:extension>
597	  </xs:simpleContent>
598	 </xs:complexType>
599	</xs:element>
600	<!-- CAPTURE SCENES TYPE -->
601	<!-- envelope of capture scenes -->
602	<xs:complexType name="captureScenesType">
603	 <xs:sequence>
604	  <xs:element name="captureScene" type="captureSceneType"
605	  maxOccurs="unbounded"/>
606	 </xs:sequence>
607	</xs:complexType>

609	<!-- CAPTURE SCENE TYPE -->
610	<xs:complexType name="captureSceneType">
611	 <xs:sequence>
612	  <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
613	  <xs:element name="sceneInformation" type="xcard:vcardType" minOccurs="0"/>
614	  <xs:element name="sceneViews" type="sceneViewsType" minOccurs="0"/>
615	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
616	  maxOccurs="unbounded"/>
617	 </xs:sequence>
618	 <xs:attribute name="sceneID" type="xs:ID" use="required"/>
619	 <xs:attribute name="scale" type="scaleType" use="required"/>
620	 <xs:anyAttribute namespace="##other" processContents="lax"/>
621	</xs:complexType>

623	<!-- SCALE TYPE -->
624	<xs:simpleType name="scaleType">
625	 <xs:restriction base="xs:string">
626	  <xs:enumeration value="mm"/>
627	  <xs:enumeration value="unknown"/>
628	  <xs:enumeration value="noscale"/>
629	 </xs:restriction>
630	</xs:simpleType>

632	<!-- SCENE VIEWS TYPE -->
633	<!-- envelope of scene views of a capture scene -->
634	<xs:complexType name="sceneViewsType">
635	 <xs:sequence>
636	  <xs:element name="sceneView" type="sceneViewType"
637	  maxOccurs="unbounded"/>
638	 </xs:sequence>
639	</xs:complexType>

641	<!-- SCENE VIEW TYPE -->
642	<xs:complexType name="sceneViewType">
643	 <xs:sequence>
644	  <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
645	  <xs:element name="mediaCaptureIDs" type="captureIDListType"/>

647	 </xs:sequence>
648	 <xs:attribute name="sceneViewID" type="xs:ID" use="required"/>
649	</xs:complexType>

651	<!-- CAPTURE ID LIST TYPE -->
652	<xs:complexType name="captureIDListType">
653	 <xs:sequence>
654	  <xs:element name="captureIDREF" type="xs:IDREF"
655	  maxOccurs="unbounded"/>
656	 </xs:sequence>
657	</xs:complexType>

659	<!-- ENCODING GROUPS TYPE -->
660	<xs:complexType name="encodingGroupsType">
661	 <xs:sequence>
662	  <xs:element name="encodingGroup" type="tns:encodingGroupType"
663	  maxOccurs="unbounded"/>
664	 </xs:sequence>
665	</xs:complexType>

667	<!-- ENCODING GROUP TYPE -->
668	<xs:complexType name="encodingGroupType">
669	 <xs:sequence>
670	  <xs:element name="maxGroupBandwidth" type="xs:unsignedLong"/>
671	  <xs:element name="encodingIDList" type="encodingIDListType"/>
672	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
673	  maxOccurs="unbounded"/>
674	 </xs:sequence>
675	 <xs:attribute name="encodingGroupID" type="xs:ID" use="required"/>
676	 <xs:anyAttribute namespace="##any" processContents="lax"/>
677	</xs:complexType>

679	<!-- ENCODING ID LIST TYPE -->
680	<xs:complexType name="encodingIDListType">
681	 <xs:sequence>
682	  <xs:element name="encID" type="xs:string" maxOccurs="unbounded"/>
683	 </xs:sequence>
684	</xs:complexType>

686	<!-- SIMULTANEOUS SETS TYPE -->
687	<xs:complexType name="simultaneousSetsType">
688	 <xs:sequence>
689	  <xs:element name="simultaneousSet" type="simultaneousSetType"
690	  maxOccurs="unbounded"/>
691	 </xs:sequence>
692	</xs:complexType>
693	<!-- SIMULTANEOUS SET TYPE -->
694	<xs:complexType name="simultaneousSetType">
695	 <xs:sequence>
696	   <xs:element name="mediaCaptureIDREF" type="xs:IDREF"
697	   minOccurs="0" maxOccurs="unbounded"/>
698	   <xs:element name="sceneViewIDREF" type="xs:IDREF"
699	   minOccurs="0" maxOccurs="unbounded"/>
700	   <xs:element name="captureSceneIDREF" type="xs:IDREF"
701	   minOccurs="0" maxOccurs="unbounded"/>
702	   <xs:any namespace="##other" processContents="lax" minOccurs="0"
703	   maxOccurs="unbounded"/>
704	 </xs:sequence>
705	 <xs:attribute name="setID" type="xs:ID" use="required"/>
706	 <xs:attribute name="mediaType" type="xs:string"/>
707	 <xs:anyAttribute namespace="##any" processContents="lax"/>
708	</xs:complexType>

710	<!-- GLOBAL VIEWS TYPE -->
711	<xs:complexType name="globalViewsType">
712	 <xs:sequence>
713	  <xs:element name="globalView" type="globalViewType"
714	  maxOccurs="unbounded"/>
715	 </xs:sequence>
716	</xs:complexType>

718	<!-- GLOBAL VIEW TYPE -->
719	<xs:complexType name="globalViewType">
720	 <xs:sequence>
721	   <xs:element name="sceneViewIDREF" type="xs:IDREF"
722	    maxOccurs="unbounded"/>
723	   <xs:any namespace="##other" processContents="lax" minOccurs="0"
724	   maxOccurs="unbounded"/>
725	 </xs:sequence>
726	 <xs:attribute name="globalViewID" type="xs:ID"/>
727	 <xs:anyAttribute namespace="##any" processContents="lax"/>
728	</xs:complexType>

730	<!-- CAPTURE ENCODINGS TYPE -->
731	<xs:complexType name="captureEncodingsType">
732	 <xs:sequence>
733	  <xs:element name="captureEncoding" type="captureEncodingType"
734	  maxOccurs="unbounded"/>
735	 </xs:sequence>
736	</xs:complexType>

738	<!-- CAPTURE ENCODING TYPE -->
739	<xs:complexType name="captureEncodingType">
740	 <xs:sequence>
741	  <xs:element name="captureID" type="xs:string"/>
742	  <xs:element name="encodingID" type="xs:string"/>
743	  <xs:element name="configuredContent" type="contentType" minOccurs="0"/>
744	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
745	   maxOccurs="unbounded"/>
746	 </xs:sequence>
747	 <xs:attribute name="ID" type="xs:ID" use="required"/>
748	 <xs:anyAttribute namespace="##any" processContents="lax"/>
749	</xs:complexType>

751	<!-- CLUE INFO ELEMENT -->
752	<xs:element name="clueInfo" type="clueInfoType"/>

754	<!-- CLUE INFO TYPE -->
755	<xs:complexType name="clueInfoType">
756	  <xs:sequence>
757	   <xs:element ref="mediaCaptures"/>
758	   <xs:element ref="encodingGroups"/>
759	   <xs:element ref="captureScenes"/>
760	   <xs:element ref="simultaneousSets" minOccurs="0"/>
761	   <xs:element ref="globalViews" minOccurs="0"/>
762	   <xs:element ref="people" minOccurs="0"/>
763	   <xs:any namespace="##other" processContents="lax" minOccurs="0"
764	   maxOccurs="unbounded"/>
765	  </xs:sequence>
766	  <xs:attribute name="clueInfoID" type="xs:ID" use="required"/>
767	  <xs:anyAttribute namespace="##other" processContents="lax"/>
768	</xs:complexType>
769	</xs:schema>

771	   Following sections describe the XML schema in more detail.  As a
772	   general remark, please notice that optional elements that don't
773	   define what their absence means are intended to be associated with
774	   undefined properties.

776	5.  <mediaCaptures>

778	   <mediaCaptures> represents the list of one or more media captures
779	   available at the Media Provider's side.  Each media capture is
780	   represented by a <mediaCapture> element (Section 11).

782	6.  <encodingGroups>

784	   <encodingGroups> represents the list of the encoding groups organized
785	   on the Media Provider's side.  Each encoding group is represented by
786	   an <encodingGroup> element (Section 11.26).

788	7.  <captureScenes>

790	   <captureScenes> represents the list of the capture scenes organized
791	   on the Media Provider's side.  Each capture scene is represented by a
792	   <captureScene> element.  (Section 11.24).

794	8.  <simultaneousSets>

796	   <simultaneousSets> contains the simultaneous sets indicated by the
797	   Media Provider.  Each simultaneous set is represented by a
798	   <simultaneousSet> element.  (Section 11.27).

800	9.  <globalViews>

802	   <globalViews> contains a set of alternative representations of all
803	   the scenes that are offered by a Media Provider to a Media Consumer.
804	   Each alternative is named "global view" and it is represented by a
805	   <globalView> element.  (Section 11.28).

807	10.  <captureEncodings>

809	   <captureEncodings> is a list of capture encodings.  It can represent
810	   the list of the desired capture encodings indicated by the Media
811	   Consumer or the list of instantiated captures on the provider's side.
812	   Each capture encoding is represented by a <captureEncoding> element.
813	   (Section 12).

815	11.  <mediaCapture>

817	   A Media Capture is the fundamental representation of a media flow
818	   that is available on the provider's side.  Media captures are
819	   characterized (i) by a set of features that are independent from the
820	   specific type of medium, and (ii) by a set of features that are
821	   media-specific.  The features that are common to all media types
822	   appear within the media capture type, that has been designed as an
823	   abstract complex type.  Media-specific captures, such as video
824	   captures, audio captures and others, are specializations of that
825	   abstract media capture type, as in a typical generalization-
826	   specialization hierarchy.

828	   The following is the XML Schema definition of the media capture type:

830	<!-- MEDIA CAPTURE TYPE -->
831	<xs:complexType name="mediaCaptureType" abstract="true">
832	  <xs:sequence>
833	    <!-- mandatory fields -->
834	    <xs:element name="captureSceneIDREF" type="xs:IDREF"/>
835	    <xs:choice>
836	      <xs:sequence>
837	        <xs:element name="spatialInformation"
838	                    type="tns:spatialInformationType"/>
839	      </xs:sequence>
840	      <xs:element name="nonSpatiallyDefinable" type="xs:boolean" fixed="true"/>
841	    </xs:choice>
842	    <!-- for handling multi-content captures: -->
843	    <xs:choice>
844	      <xs:sequence>
845	        <xs:element name="synchronizationID" type="xs:ID" minOccurs="0"/>
846	        <xs:element name="content" type="contentType" minOccurs="0"/>
847	        <xs:element name="policy" type="policyType" minOccurs="0"/>
848	        <xs:element name="maxCaptures" type="maxCapturesType" minOccurs="0"/>
849	        <xs:element name="allowSubsetChoice" type="xs:boolean" minOccurs="0"/>
850	      </xs:sequence>
851	      <xs:element name="individual" type="xs:boolean" fixed="true"/>
852	    </xs:choice>
853	    <!-- optional fields -->
854	    <xs:element name="encGroupIDREF" type="xs:IDREF" minOccurs="0"/>
855	    <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
856	    <xs:element name="priority" type="xs:unsignedInt" minOccurs="0"/>
857	    <xs:element name="lang" type="xs:language" minOccurs="0"
858	     maxOccurs="unbounded"/>
859	    <xs:element name="mobility" type="mobilityType" minOccurs="0"/>
860	    <xs:element name="presentation" type="presentationType" minOccurs="0"/>
861	    <xs:element ref="embeddedText" minOccurs="0"/>
862	    <xs:element name="view" type="viewType" minOccurs="0"/>
863	    <xs:element name="capturedPeople" type="capturedPeopleType" minOccurs="0"/>
864	    <xs:element name="relatedTo" type="xs:IDREF" minOccurs="0"/>
865	  </xs:sequence>
866	  <xs:attribute name="captureID" type="xs:ID" use="required"/>
867	  <xs:attribute name="mediaType" type="xs:string" use="required"/>
868	</xs:complexType>

870	11.1.  captureID attribute

872	   The "captureID" attribute is a mandatory field containing the
873	   identifier of the media capture.  Such an identifier serves as the
874	   way the capture is referenced from other data model elements (e.g.,
875	   simultaneous sets, capture encodings, and others).

877	11.2.  mediaType attribute

879	   The "mediaType" attribute is a mandatory attribute specifying the
880	   media type of the capture.  Common values are "audio", "video",
881	   "text".  Other values can be provided.  It is assumed that
882	   implementations agree on the interpretation of those other values.

884	11.3.  <captureSceneIDREF>

886	   <captureSceneIDREF> is a mandatory field containing the value of the
887	   identifier of the capture scene the media capture is defined in,
888	   i.e., the value of the sceneID (Section 11.24.3) attribute of that
889	   capture scene.  Indeed, each media capture must be defined within one
890	   and only one capture scene.  When a media capture is spatially
891	   definable, some spatial information is provided along with it in the
892	   form of point coordinates (see Section 11.5).  Such coordinates refer
893	   to the space of coordinates defined for the capture scene containing
894	   the capture.

896	11.4.  <encGroupIDREF>

898	   <encGroupIDREF> is an optional field containing the identifier of the
899	   encoding group the media capture is associated with, i.e., the value
900	   of the encodingGroupID (Section 11.26.3) attribute of that encoding
901	   group.  Media captures that are not associated with any encoding
902	   group can not be instantiated as media streams.

904	11.5.  <spatialInformation>

906	   Media captures are divided into two categories: (i) non spatially
907	   definable captures and (ii) spatially definable captures.

909	   Captures are spatially definable when at least (i) it is possible to
910	   provide the coordinates of the device position within the
911	   telepresence room of origin (capture point) together with its
912	   capturing direction specified by a second point (point on line of
913	   capture), or (ii) it is possible to provide the represented area
914	   within the telepresence room, by listing the coordinates of the four
915	   co-planar points identifying the plane of interest (area of capture).
916	   The coordinates of the abovementioned points must be expressed
917	   according to the coordinate space of the capture scene the media
918	   captures belongs to.

920	   Non spatially definable captures cannot be characterized within the
921	   physical space of the telepresence room of origin.  Captures of this
922	   kind are for example those related to recordings, text captures,
923	   DVDs, registered presentations, or external streams that are played
924	   in the telepresence room and transmitted to remote sites.

926	   Spatially definable captures represent a part of the telepresence
927	   room.  The captured part of the telepresence room is described by
928	   means of the <spatialInformation> element.  By comparing the
929	   <spatialInformation> element of different media captures within the
930	   same capture scene, a consumer can better determine the spatial
931	   relationships between them and render them correctly.  Non spatially
932	   definable captures do not embed such element in their XML
933	   description: they are instead characterized by having the
934	   <nonSpatiallyDefinable> tag set to "true" (see Section 11.6).

936	   The definition of the spatial information type is the following:

938	<!-- SPATIAL INFORMATION TYPE -->
939	<xs:complexType name="spatialInformationType">
940	 <xs:sequence>
941	  <xs:element name="captureOrigin" type="captureOriginType" minOccurs="0"/>
942	  <xs:element name="captureArea" type="captureAreaType" minOccurs="0"/>
943	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
944	  maxOccurs="unbounded"/>
945	 </xs:sequence>
946	 <xs:anyAttribute namespace="##other" processContents="lax"/>
947	</xs:complexType>

949	   The <captureOrigin> contains the coordinates of the capture device
950	   that is taking the capture (i.e., the capture point), as well as,
951	   optionally, the pointing direction (i.e., the point on line of
952	   capture) (see Section 11.5.1).

954	   The <captureArea> is an optional field containing four points
955	   defining the captured area covered by the capture (see
956	   Section 11.5.2).

958	   The scale of the points coordinates is specified in the scale
959	   (Section 11.24.4) attribute of the capture scene the media capture
960	   belongs to.  Indeed, all the spatially definable media captures
961	   referring to the same capture scene share the same coordinate system
962	   and express their spatial information according to the same scale.

964	11.5.1.  <captureOrigin>

966	   The <captureOrigin> element is used to represent the position and
967	   optionally the line of capture of a capture device. <captureOrigin>
968	   MUST be included in spatially definable audio captures, while it is
969	   optional for spatially definable video captures.

971	   The XML Schema definition of the <captureOrigin> element type is the
972	   following:

974	<!-- CAPTURE ORIGIN TYPE -->
975	<xs:complexType name="captureOriginType">
976	  <xs:sequence>
977	        <xs:element name="capturePoint" type="pointType"/>
978	        <xs:element name="lineOfCapturePoint" type="pointType" minOccurs="0"/>
979	  </xs:sequence>
980	  <xs:anyAttribute namespace="##any" processContents="lax"/>
981	</xs:complexType>

983	<!-- POINT TYPE -->
984	<xs:complexType name="pointType">
985	 <xs:sequence>
986	  <xs:element name="x" type="xs:decimal"/>
987	  <xs:element name="y" type="xs:decimal"/>
988	  <xs:element name="z" type="xs:decimal"/>
989	 </xs:sequence>
990	</xs:complexType>

992	   The point type contains three spatial coordinates (x,y,z)
993	   representing a point in the space associated with a certain capture
994	   scene.

996	   The <captureOrigin> element includes a mandatory <capturePoint>
997	   element and an optional <lineOfCapturePoint> element, both of the
998	   type "pointType". <capturePoint> specifies the three coordinates
999	   identifying the position of the capture device. <lineOfCapturePoint>
1000	   is another pointType element representing the "point on line of
1001	   capture", that gives the pointing direction of the capture device.

1003	   The coordinates of the point on line of capture MUST NOT be identical
1004	   to the capture point coordinates.  For a spatially definable video
1005	   capture, if the point on line of capture is provided, it MUST belong
1006	   to the region between the point of capture and the capture area.  For
1007	   a spatially definable audio capture, if the point on line of capture
1008	   is not provided, the sensitivity pattern should be considered
1009	   omnidirectional.

1011	11.5.2.  <captureArea>

1013	   <captureArea> is an optional element that can be contained within the
1014	   spatial information associated with a media capture.  It represents
1015	   the spatial area captured by the media capture. <captureArea> MUST be
1016	   included in the spatial information of spatially definable video
1017	   captures, while it MUST NOT be associated with audio captures.

1019	   The XML representation of that area is provided through a set of four
1020	   point-type elements, <bottomLeft>, <bottomRight>, <topLeft>, and
1021	   <topRight> that MUST be co-planar.  The four coplanar points are
1022	   identified from the perspective of the capture device.  The XML
1023	   schema definition is the following:

1025	   <!-- CAPTURE AREA TYPE -->
1026	   <xs:complexType name="captureAreaType">
1027	    <xs:sequence>
1028	     <xs:element name="bottomLeft" type="pointType"/>
1029	     <xs:element name="bottomRight" type="pointType"/>
1030	     <xs:element name="topLeft" type="pointType"/>
1031	     <xs:element name="topRight" type="pointType"/>
1032	    </xs:sequence>
1033	   </xs:complexType>

1035	11.6.  <nonSpatiallyDefinable>

1037	   When media captures are non spatially definable, they MUST be marked
1038	   with the boolean <nonSpatiallyDefinable> element set to "true" and no
1039	   <spatialInformation> MUST be provided.  Indeed,
1040	   <nonSpatiallyDefinable> and <spatialInformation> are mutually
1041	   exclusive tags, according to the <choice> section within the XML
1042	   Schema definition of the media capture type.

1044	11.7.  <content>

1046	   A media capture can be (i) an individual media capture or (ii) a
1047	   multiple content capture (MCC).  A multiple content capture is made
1048	   by different captures that can be arranged spatially (by a
1049	   composition operation), or temporally (by a switching operation), or
1050	   that can result from the orchestration of both the techniques.  If a
1051	   media capture is an MCC, then it MAY show in its XML data model
1052	   representation the <content> element.  It is composed by a list of
1053	   media capture identifiers ("captureIDREF") and capture scene view
1054	   identifiers ("sceneViewIDREF"), where the last ones are used as
1055	   shortcuts to refer to multiple capture identifiers.  The referenced
1056	   captures are used to create the MCC according to a certain strategy.
1057	   If the <content> element does not appear in a MCC, or it has no child
1058	   elements, then the MCC is assumed to be made of multiple sources but
1059	   no information regarding those sources is provided.

1061	   <!-- CONTENT TYPE -->
1062	   <xs:complexType name="contentType">
1063	    <xs:sequence>
1064	      <xs:element name="captureIDREF" type="xs:IDREF"
1065	      minOccurs="0" maxOccurs="unbounded"/>
1066	      <xs:element name="sceneViewIDREF" type="xs:IDREF"
1067	      minOccurs="0" maxOccurs="unbounded"/>
1068	      <xs:any namespace="##other" processContents="lax" minOccurs="0"
1069	      maxOccurs="unbounded"/>
1070	    </xs:sequence>
1071	    <xs:anyAttribute namespace="##other" processContents="lax"/>
1072	   </xs:complexType>

1074	11.8.  <synchronizationID>

1076	   <synchronizationID> is an optional element for multiple content
1077	   captures that contains a numeric identifier.  Multiple content
1078	   captures marked with the same identifier in the <synchronizationID>
1079	   contain at all times captures coming from the same sources.  It is
1080	   the Media Provider that determines what the source for the captures
1081	   is.  In this way, the Media Provider can choose how to group together
1082	   single captures for the purpose of keeping them synchronized
1083	   according to the <synchronizationID> element.

1085	11.9.  <allowSubsetChoice>

1087	   <allowSubsetChoice> is an optional boolean element for multiple
1088	   content captures.  It indicates whether or not the Provider allows
1089	   the Consumer to choose a specific subset of the captures referenced
1090	   by the MCC.  If this attribute is true, and the MCC references other
1091	   captures, then the Consumer MAY specify in a CONFIGURE message a
1092	   specific subset of those captures to be included in the MCC, and the
1093	   Provider MUST then include only that subset.  If this attribute is
1094	   false, or the MCC does not reference other captures, then the
1095	   Consumer MUST NOT select a subset.  If <allowSubsetChoice> is not
1096	   shown in the XML description of the MCC, its value is to be
1097	   considered "false".

1099	11.10.  <policy>

1101	   <policy> is an optional element that can be used only for multiple
1102	   content captures.  It indicates the criteria applied to build the
1103	   multiple content capture using the media captures referenced in
1104	   <contentCaptureIDs>.  The <policy> value is in the form of a token
1105	   that indicates the policy and an index representing an instance of
1106	   the policy, separated by a ":" (e.g., SoundLevel:2, RoundRobin:0,
1107	   etc.).  The XML schema defining the type of the <policy> element is
1108	   the following:

1110	   <!-- POLICY TYPE -->
1111	   <xs:simpleType name="policyType">
1112	    <xs:restriction base="xs:string">
1113	         <xs:pattern value="([a-zA-Z0-9])+[:]([0-9])+"/>
1114	       </xs:restriction>
1115	   </xs:simpleType>

1117	   At the time of writing, only two switching policies are defined in
1118	   [I-D.ietf-clue-framework]:

1120	      SoundLevel: the content of the MCC is determined by a sound level
1121	      detection algorithm.  The loudest (active) speaker (or a previous
1122	      speaker, depending on the index value) is contained in the MCC.
1123	      Index 0 represents the most current instance of the policy, i.e.,
1124	      the currently active speaker, 1 represents the previous instance,
1125	      i.e., the previous active speaker, and so on.

1127	      RoundRobin: the content of the MCC is determined by a time based
1128	      algorithm.

1130	   Other values for the <policy> element can be used.  In this case, it
1131	   is assumed that implementations agree on the meaning of those other
1132	   values and/or those new switching policies are defined in later
1133	   documents.

1135	11.11.  <maxCaptures>

1137	   <maxCaptures> is an optional element that can be used only for
1138	   multiple content captures (MCC).  It provides information about the
1139	   number of media captures that can be represented in the multiple
1140	   content capture at a time.  If <maxCaptures> is not provided, all the
1141	   media captures listed in the <content> element can appear at a time
1142	   in the capture encoding.  The type definition is provided below.

1144	   <!-- MAX CAPTURES TYPE -->
1145	   <xs:complexType name="maxCapturesType">
1146	     <xs:simpleContent>
1147	       <xs:extension base="xs:positiveInteger">
1148	         <xs:attribute name="exactNumber" type="xs:boolean"/>
1149	       </xs:extension>
1150	     </xs:simpleContent>

1152	   </xs:complexType>

1154	   When the "exactNumber" attribute is set to "true", it means the
1155	   <maxCaptures> element carries the exact number of the media captures
1156	   appearing at a time.  Otherwise, the number of the represented media
1157	   captures MUST be considered "<=" the <maxCaptures> value.

1159	   For instance, an audio MCC having the <maxCaptures> value set to 1
1160	   means that a media stream from the MCC will only contain audio from a
1161	   single one of its constituent captures at a time.  On the other hand,
1162	   if the <maxCaptures> value is set to 4 and the exactNumber attribute
1163	   is set to "true", it would mean that the media stream received from
1164	   the MCC will always contain a mix of audio from exactly four of its
1165	   constituent captures.

1167	11.12.  <individual>

1169	   <individual> is a boolean element that MUST be used for single-
1170	   content captures.  Its value is fixed and set to "true".  Such
1171	   element indicates the capture that is being described is not a
1172	   multiple content capture.  Indeed, <individual> and the
1173	   aforementioned tags related to MCC attributes (from Section 11.7 to
1174	   Section 11.11) are mutually exclusive, according to the <choice>
1175	   section within the XML Schema definition of the media capture type.

1177	11.13.  <description>

1179	   <description> is used to provide human-readable textual information.
1180	   This element is included in the XML definition of media captures,
1181	   capture scenes and capture scene views to the aim of providing human-
1182	   readable description of, respectively, media captures, capture scenes
1183	   and capture scene views.  According to the data model definition of a
1184	   media capture (Section 11)), zero or more <description> elements can
1185	   be used, each providing information in a different language.  The
1186	   <description> element definition is the following:

1188	   <!-- DESCRIPTION element -->
1189	   <xs:element name="description">
1190	    <xs:complexType>
1191	     <xs:simpleContent>
1192	      <xs:extension base="xs:string">
1193	        <xs:attribute name="lang" type="xs:language"/>
1194	      </xs:extension>
1195	     </xs:simpleContent>
1196	    </xs:complexType>

1198	   </xs:element>

1200	   As can be seen, <description> is a string element with an attribute
1201	   ("lang") indicating the language used in the textual description.

1203	11.14.  <priority>

1205	   <priority> is an optional unsigned integer field indicating the
1206	   importance of a media capture according to the Media Provider's
1207	   perspective.  It can be used on the receiver's side to automatically
1208	   identify the most relevant contribution from the Media Provider.  The
1209	   higher the importance, the lower the contained value.  If no priority
1210	   is assigned, no assumptions regarding relative importance of the
1211	   media capture can be assumed.

1213	11.15.  <lang>

1215	   <lang> is an optional element containing the language used in the
1216	   capture.  Zero or more <lang> elements can appear in the XML
1217	   description of a media capture.

1219	11.16.  <mobility>

1221	   <mobility> is an optional element indicating whether or not the
1222	   capture device originating the capture may move during the
1223	   telepresence session.  That optional element can assume one of the
1224	   three following values:

1226	   static  SHOULD NOT change for the duration of the CLUE session,
1227	      across multiple ADVERTISEMENT messages.

1229	   dynamic  MAY change in each new ADVERTISEMENT message.  Can be
1230	      assumed to remain unchanged until there is a new ADVERTISEMENT
1231	      message.

1233	   highly-dinamic  MAY change dynamically, even between consecutive
1234	      ADVERTISEMENT messages.  The spatial information provided in an
1235	      ADVERTISEMENT message is simply a snapshot of the current values
1236	      at the time when the message is sent.

1238	11.17.  <relatedTo>

1240	   The optional <relatedTo> element contains the value of the captureID
1241	   attribute (Section 11.1) of the media capture to which the considered
1242	   media capture refers.  The media capture marked with a <relatedTo>
1243	   element can be for example the translation of the referred media
1244	   capture in a different language.

1246	11.18.  <view>

1248	   The <view> element is an optional tag describing what is represented
1249	   in the spatial area covered by a media capture.  The current possible
1250	   values are: "table", "lectern", "individual", and "audience", as
1251	   listed in the enumerative view type in the following.

1253	   <!-- VIEW TYPE -->
1254	   <xs:simpleType name="viewType">
1255	    <xs:restriction base="xs:string">
1256	     <xs:enumeration value="room"/>
1257	     <xs:enumeration value="table"/>
1258	     <xs:enumeration value="lectern"/>
1259	     <xs:enumeration value="individual"/>
1260	     <xs:enumeration value="audience"/>
1261	    </xs:restriction>
1262	   </xs:simpleType>

1264	11.19.  <presentation>

1266	   The <presentation> element is an optional tag used for media captures
1267	   conveying information about presentations within the telepresence
1268	   session.  The current possible values are "slides" and "images", as
1269	   listed in the enumerative presentation type in the following.

1271	   <!-- PRESENTATION TYPE -->
1272	   <xs:simpleType name="presentationType">
1273	    <xs:restriction base="xs:string">
1274	     <xs:enumeration value="slides"/>
1275	     <xs:enumeration value="image"/>
1276	    </xs:restriction>
1277	   </xs:simpleType>
1278	   </section>

1280	11.19.1.  <embeddedText>

1282	   The <embeddedText> element is a boolean element indicating that there
1283	   is text embedded in the media capture (e.g., in a video capture).
1284	   The language used in such embedded textual description is reported in
1285	   <embeddedText> "lang" attribute.

1287	   The XML Schema definition of the <embeddedText> element is:

1289	   <!-- EMBEDDED TEXT ELEMENT -->
1290	   <xs:element name="embeddedText">
1291	    <xs:complexType>
1292	     <xs:simpleContent>
1293	      <xs:extension base="xs:boolean">
1294	       <xs:attribute name="lang" type="xs:language"/>
1295	      </xs:extension>
1296	     </xs:simpleContent>
1297	    </xs:complexType>
1298	   </xs:element>

1300	11.19.2.  <capturedPeople>

1302	   This optional element is used to indicate which telepresence session
1303	   participants are represented within the media captures.  For each
1304	   participant, a <personIDREF> element is provided.

1306	11.19.2.1.  <personIDREF>

1308	   <personIDREF> contains the identifier of the represented person,
1309	   i.e., the value of the related personID attribute
1310	   (Section 11.29.1.1).  Metadata about the represented participant can
1311	   be retrieved by accessing the <people> list (Section 11.29).

1313	11.20.  Audio captures

1315	   Audio captures inherit all the features of a generic media capture
1316	   and present further audio-specific characteristics.  The XML Schema
1317	   definition of the audio capture type is reported below:

1319	   <!-- AUDIO CAPTURE TYPE -->
1320	   <xs:complexType name="audioCaptureType">
1321	    <xs:complexContent>
1322	     <xs:extension base="tns:mediaCaptureType">
1323	      <xs:sequence>
1324	       <xs:element name="sensitivyPattern" type="sensitivyPatternType"
1325	       minOccurs="0"/>
1326	       <xs:any namespace="##other" processContents="lax" minOccurs="0"
1327	       maxOccurs="unbounded"/>
1328	      </xs:sequence>
1329	      <xs:anyAttribute namespace="##other" processContents="lax"/>
1330	     </xs:extension>
1331	    </xs:complexContent>
1332	   </xs:complexType>
1333	   An example of audio-specific information that can be included is
1334	   represented by the <sensitivyPattern> element.  (Section 11.20.1).

1336	11.20.1.  <sensitivityPattern>

1338	   The <sensitivityPattern> element is an optional field describing the
1339	   characteristics of the nominal sensitivity pattern of the microphone
1340	   capturing the audio signal.

1342	   The XML Schema definition is provided below:

1344	   <xs:simpleType name="sensitivyPatternType">
1345	    <xs:restriction base="xs:string">
1346	     <xs:enumeration value="uni"/>
1347	     <xs:enumeration value="shotgun"/>
1348	     <xs:enumeration value="omni"/>
1349	     <xs:enumeration value="figure8"/>
1350	     <xs:enumeration value="cardioid"/>
1351	     <xs:enumeration value="hyper-cardioid"/>
1352	    </xs:restriction>
1353	   </xs:simpleType>

1355	11.21.  Video captures

1357	   Video captures, similarly to audio captures, extend the information
1358	   of a generic media capture with video-specific features.

1360	   The XML Schema representation of the video capture type is provided
1361	   in the following:

1363	   <!-- VIDEO CAPTURE TYPE -->
1364	   <xs:complexType name="videoCaptureType">
1365	    <xs:complexContent>
1366	     <xs:extension base="tns:mediaCaptureType">
1367	      <xs:sequence>
1368	       <xs:any namespace="##other" processContents="lax" minOccurs="0"
1369	       maxOccurs="unbounded"/>
1370	      </xs:sequence>
1371	      <xs:anyAttribute namespace="##other" processContents="lax"/>
1372	     </xs:extension>
1373	    </xs:complexContent>
1374	   </xs:complexType>

1376	11.22.  Text captures

1378	   Also text captures can be described by extending the generic media
1379	   capture information, similarly to audio captures and video captures.

1381	   The XML Schema representation of the text capture type is currently
1382	   lacking text-specific information, as it can be seen by looking at
1383	   the definition below:

1385	   <!-- TEXT CAPTURE TYPE -->
1386	   <xs:complexType name="textCaptureType">
1387	    <xs:complexContent>
1388	     <xs:extension base="tns:mediaCaptureType">
1389	     <xs:sequence>
1390	     <xs:any namespace="##other" processContents="lax" minOccurs="0"
1391	       maxOccurs="unbounded"/>
1392	     </xs:sequence>
1393	     <xs:anyAttribute namespace="##other" processContents="lax"/>
1394	     </xs:extension>
1395	    </xs:complexContent>
1396	   </xs:complexType>

1398	   Text captures SHOULD be marked as non spatially definable (i.e., they
1399	   should present in their XML description the <nonSpatiallyDefinable>
1400	   (Section 11.6) element set to "true").

1402	11.23.  Other capture types

1404	   Other media capture types can be described by using the CLUE data
1405	   model.  They can be represented by exploiting the "otherCaptureType"
1406	   type.  This media capture type is conceived to be filled in with
1407	   elements defined within extensions of the current schema, i.e., with
1408	   elements defined in other XML schemas (see Section 14 for an
1409	   example).  The otherCaptureType inherits all the features envisioned
1410	   for the abstract mediaCaptureType.

1412	   The XML Schema representation of the otherCaptureType is the
1413	   following:

1415	   <!-- OTHER CAPTURE TYPE -->
1416	   <xs:complexType name="otherCaptureType">
1417	    <xs:complexContent>
1418	     <xs:extension base="tns:mediaCaptureType">
1419	     <xs:sequence>
1420	     <xs:any namespace="##other" processContents="lax" minOccurs="0"
1421	       maxOccurs="unbounded"/>
1422	     </xs:sequence>
1423	     <xs:anyAttribute namespace="##other" processContents="lax"/>
1424	     </xs:extension>
1425	    </xs:complexContent>
1426	   </xs:complexType>

1428	   When defining new media capture types that are going to be described
1429	   by means of the <otherMediaCapture> element, spatial properties of
1430	   such new media capture types SHOULD be defined (e.g., whether or not
1431	   they are spatially definable, whether or not they should be
1432	   associated with an area of capture, etc.).

1434	11.24.  <captureScene>

1436	   A Media Provider organizes the available captures in capture scenes
1437	   in order to help the receiver both in the rendering and in the
1438	   selection of the group of captures.  Capture scenes are made of media
1439	   captures and capture scene views, that are sets of media captures of
1440	   the same media type.  Each capture scene view is an alternative to
1441	   represent completely a capture scene for a fixed media type.

1443	   The XML Schema representation of a <captureScene> element is the
1444	   following:

1446	<!-- CAPTURE SCENE TYPE -->
1447	<xs:complexType name="captureSceneType">
1448	 <xs:sequence>
1449	  <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
1450	  <xs:element name="sceneInformation" type="xcard:vcardType" minOccurs="0"/>
1451	  <xs:element name="sceneViews" type="sceneViewsType" minOccurs="0"/>
1452	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
1453	  maxOccurs="unbounded"/>
1454	 </xs:sequence>
1455	 <xs:attribute name="sceneID" type="xs:ID" use="required"/>
1456	 <xs:attribute name="scale" type="scaleType" use="required"/>
1457	 <xs:anyAttribute namespace="##other" processContents="lax"/>
1458	</xs:complexType>
1459	   Each capture scene is identified by a "sceneID" attribute.  The
1460	   <captureScene> element can contain zero or more textual <description>
1461	   elements, defined as in Section 11.13.  Besides <description>, there
1462	   is the optional <sceneInformation> element (Section 11.24.1), which
1463	   contains structured information about the scene in the vcard format,
1464	   and the optional <sceneViews> element (Section 11.24.2), which is the
1465	   list of the capture scene views.  When no <sceneViews> is provided,
1466	   the capture scene is assumed to be made of all the media captures
1467	   which contain the value of its sceneID attribute in their mandatory
1468	   captureSceneIDREF attribute.

1470	11.24.1.  <sceneInformation>

1472	   The <sceneInformation> element contains optional information about
1473	   the capture scene according to the vcard format.

1475	11.24.2.  <sceneViews>

1477	   The <sceneViews> element is a mandatory field of a capture scene
1478	   containing the list of scene views.  Each scene view is represented
1479	   by a <sceneView> element (Section 11.25).

1481	   <!-- SCENE VIEWS TYPE -->
1482	   <!-- envelope of scene views of a capture scene -->
1483	   <xs:complexType name="sceneViewsType">
1484	    <xs:sequence>
1485	     <xs:element name="sceneView" type="sceneViewType"
1486	     maxOccurs="unbounded"/>
1487	    </xs:sequence>
1488	   </xs:complexType>

1490	11.24.3.  sceneID attribute

1492	   The sceneID attribute is a mandatory attribute containing the
1493	   identifier of the capture scene.

1495	11.24.4.  scale attribute

1497	   The scale attribute is a mandatory attribute that specifies the scale
1498	   of the coordinates provided in the spatial information of the media
1499	   capture belonging to the considered capture scene.  The scale
1500	   attribute can assume three different values:

1502	      "mm" - the scale is in millimeters.  Systems which know their
1503	      physical dimensions (for example professionally installed
1504	      telepresence room systems) should always provide such real-world
1505	      measurements.

1507	      "unknown" - the scale is the same for every media capture in the
1508	      capture scene but the unity of measure is undefined.  Systems
1509	      which are not aware of specific physical dimensions yet still know
1510	      relative distances should select "unknown" in the scale attribute
1511	      of the capture scene to be described.

1513	      "noscale" - there is no common physical scale among the media
1514	      captures of the capture scene.  That means the scale could be
1515	      different for each media capture.

1517	   <!-- SCALE TYPE -->
1518	   <xs:simpleType name="scaleType">
1519	    <xs:restriction base="xs:string">
1520	     <xs:enumeration value="mm"/>
1521	     <xs:enumeration value="unknown"/>
1522	     <xs:enumeration value="noscale"/>
1523	    </xs:restriction>
1524	   </xs:simpleType>

1526	11.25.  <sceneView>

1528	   A <sceneView> element represents a capture scene view, which contains
1529	   a set of media captures of the same media type describing a capture
1530	   scene.

1532	   A <sceneView> element is characterized as follows.

1534	   <!-- SCENE VIEW TYPE -->
1535	   <xs:complexType name="sceneViewType">
1536	    <xs:sequence>
1537	     <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
1538	     <xs:element name="mediaCaptureIDs" type="captureIDListType"/>
1539	    </xs:sequence>
1540	    <xs:attribute name="sceneViewID" type="xs:ID" use="required"/>
1541	   </xs:complexType>
1542	   One or more optional <description> elements provide human-readable
1543	   information about what the scene view contains. <description> is
1544	   defined as already seen in Section 11.13.

1546	   The remaining child elements are described in the following
1547	   subsections.

1549	11.25.1.  <mediaCaptureIDs>

1551	   The <mediaCaptureIDs> is the list of the identifiers of the media
1552	   captures included in the scene view.  It is an element of the
1553	   captureIDListType type, which is defined as a sequence of
1554	   <captureIDREF>, each containing the identifier of a media capture
1555	   listed within the <mediaCaptures> element:

1557	   <!-- CAPTURE ID LIST TYPE -->
1558	   <xs:complexType name="captureIDListType">
1559	    <xs:sequence>
1560	     <xs:element name="captureIDREF" type="xs:IDREF"
1561	     maxOccurs="unbounded"/>
1562	    </xs:sequence>
1563	   </xs:complexType>

1565	11.25.2.  sceneViewID attribute

1567	   The sceneViewID attribute is a mandatory attribute containing the
1568	   identifier of the capture scene view represented by the <sceneView>
1569	   element.

1571	11.26.  <encodingGroup>

1573	   The <encodingGroup> element represents an encoding group, which is
1574	   made by a set of one or more individual encodings and some parameters
1575	   that apply to the group as a whole.  Encoding groups contain
1576	   references to individual encodings that can be applied to media
1577	   captures.  The definition of the <encodingGroup> element is the
1578	   following:

1580	   <!-- ENCODING GROUP TYPE -->
1581	   <xs:complexType name="encodingGroupType">
1582	    <xs:sequence>
1583	     <xs:element name="maxGroupBandwidth" type="xs:unsignedLong"/>
1584	     <xs:element name="encodingIDList" type="encodingIDListType"/>
1585	     <xs:any namespace="##other" processContents="lax" minOccurs="0"
1586	     maxOccurs="unbounded"/>
1587	    </xs:sequence>
1588	    <xs:attribute name="encodingGroupID" type="xs:ID" use="required"/>
1589	    <xs:anyAttribute namespace="##any" processContents="lax"/>
1590	   </xs:complexType>

1592	   In the following, the contained elements are further described.

1594	11.26.1.  <maxGroupBandwidth>

1596	   <maxGroupBandwidth> is an optional field containing the maximum
1597	   bitrate expressed in bits per second that can be shared by the
1598	   individual encodings included in the encoding group.

1600	11.26.2.  <encodingIDList>

1602	   <encodingIDList> is the list of the individual encodings grouped
1603	   together in the encoding group.  Each individual encoding is
1604	   represented through its identifier contained within an <encID>
1605	   element.

1607	   <!-- ENCODING ID LIST TYPE -->
1608	   <xs:complexType name="encodingIDListType">
1609	    <xs:sequence>
1610	     <xs:element name="encID" type="xs:string" maxOccurs="unbounded"/>
1611	    </xs:sequence>
1612	   </xs:complexType>

1614	11.26.3.  encodingGroupID attribute

1616	   The encodingGroupID attribute contains the identifier of the encoding
1617	   group.

1619	11.27.  <simultaneousSet>

1621	   <simultaneousSet> represents a simultaneous transmission set, i.e., a
1622	   list of captures of the same media type that can be transmitted at
1623	   the same time by a Media Provider.  There are different simultaneous
1624	   transmission sets for each media type.

1626	   <!-- SIMULTANEOUS SET TYPE -->
1627	   <xs:complexType name="simultaneousSetType">
1628	    <xs:sequence>
1629	      <xs:element name="mediaCaptureIDREF" type="xs:IDREF"
1630	      minOccurs="0" maxOccurs="unbounded"/>
1631	      <xs:element name="sceneViewIDREF" type="xs:IDREF"
1632	      minOccurs="0" maxOccurs="unbounded"/>
1633	      <xs:element name="captureSceneIDREF" type="xs:IDREF"
1634	      minOccurs="0" maxOccurs="unbounded"/>
1635	      <xs:any namespace="##other" processContents="lax" minOccurs="0"
1636	      maxOccurs="unbounded"/>
1637	    </xs:sequence>
1638	    <xs:attribute name="setID" type="xs:ID" use="required"/>
1639	    <xs:attribute name="mediaType" type="xs:string"/>
1640	    <xs:anyAttribute namespace="##any" processContents="lax"/>
1641	   </xs:complexType>

1643	   Besides the identifiers of the captures (<mediaCaptureIDREF>
1644	   elements), also the identifiers of capture scene views and of capture
1645	   scene can be exploited as shortcuts (<sceneViewIDREF> and
1646	   <captureSceneIDREF> elements).  As an example, let's consider the
1647	   situation where there are two capture scene views (S1 and S7).  S1
1648	   contains captures AC11, AC12, AC13.  S7 contains captures AC71, AC72.
1649	   Provided that AC11, AC12, AC13, AC71, AC72 can be simultaneously sent
1650	   to the media consumer, instead of having 5 <mediaCaptureIDREF>
1651	   elements listed in the simultaneous set (i.e., one
1652	   <mediaCaptureIDREF> for AC11, one for AC12, and so on), there can be
1653	   just two <sceneViewIDREF> elements (one for S1 and one for S7).

1655	11.27.1.  setID attribute

1657	   The "setID" attribute is a mandatory field containing the identifier
1658	   of the simultaneous set.

1660	11.27.2.  mediaType attribute

1662	   The "mediaType" attribute is an optional attribute containing the
1663	   media type of the captures referenced by the simultaneous set.

1665	   When only capture scene identifiers are listed within a simultaneous
1666	   set, the media type attribute MUST appear in the XML description in
1667	   order to determine which media captures can be simultaneously sent
1668	   together.

1670	11.27.3.  <mediaCaptureIDREF>

1672	   <mediaCaptureIDREF> contains the identifier of the media capture that
1673	   belongs to the simultanous set.

1675	11.27.4.  <sceneViewIDREF>

1677	   <sceneViewIDREF> contains the identifier of the scene view containing
1678	   a group of captures that are able to be sent simultaneously with the
1679	   other captures of the simultaneous set.

1681	11.27.5.  <captureSceneIDREF>

1683	   <captureSceneIDREF> contains the identifier of the capture scene
1684	   where all the included captures of a certain media type are able to
1685	   be sent together with the other captures of the simultaneous set.

1687	11.28.  <globalView>

1689	   <globalView> is a set of captures of the same media type representing
1690	   a summary of the complete Media Provider's offer.  The content of a
1691	   global view is expressed by leveraging only scene view identifiers,
1692	   put within <sceneViewIDREF> elements.  Each global view is identified
1693	   by a unique identifier within the "globalViewID" attribute.

1695	   <!-- GLOBAL VIEW TYPE -->
1696	   <xs:complexType name="globalViewType">
1697	    <xs:sequence>
1698	      <xs:element name="sceneViewIDREF" type="xs:IDREF"
1699	       maxOccurs="unbounded"/>
1700	      <xs:any namespace="##other" processContents="lax" minOccurs="0"
1701	      maxOccurs="unbounded"/>
1702	    </xs:sequence>
1703	    <xs:attribute name="globalViewID" type="xs:ID"/>
1704	    <xs:anyAttribute namespace="##any" processContents="lax"/>
1705	   </xs:complexType>

1707	11.29.  <people>

1709	   Information about the participants that are represented in the media
1710	   captures is conveyed via the <people> element.  As it can be seen
1711	   from the XML Schema depicted below, for each participant, a <person>
1712	   element is provided.

1714	   <!-- PEOPLE TYPE -->
1715	   <xs:complexType name="peopleType">
1716	   <xs:sequence>
1717	   <xs:element name="person" type="personType"
1718	                       maxOccurs="unbounded"/>
1719	   </xs:sequence>
1720	   </xs:complexType>

1722	11.29.1.  <person>

1724	   <person> includes all the metadata related to a person represented
1725	   within one or more media captures.  Such element provides the vcard
1726	   of the subject (via the <personInfo> element, see Section 11.29.1.2)
1727	   and his conference role(s) (via one or more <personType> elements,
1728	   see Section 11.29.1.3).  Furthermore, it has a mandatory "personID"
1729	   attribute (Section 11.29.1.1).

1731	 <!-- PERSON TYPE -->
1732	 <xs:complexType name="personType">
1733	   <xs:sequence>
1734	      <xs:element name="personInfo" type="xcard:vcardType" maxOccurs="1"
1735	                          minOccurs="0"/>
1736	      <xs:element name="personType" type="personTypeType"
1737	                          minOccurs="0"
1738	                  maxOccurs="unbounded"/>
1739	      <xs:any namespace="##other" processContents="lax" minOccurs="0"
1740	                  maxOccurs="unbounded"/>
1741	   </xs:sequence>
1742	   <xs:attribute name="personID" type="xs:ID" use="required"/>
1743	   <xs:anyAttribute namespace="##other" processContents="lax"/>
1744	 </xs:complexType>

1746	11.29.1.1.  personID attribute

1748	   The "personID" attribute carries the identifier of a represented
1749	   person.  Such an identifier can be used to refer to the participant,
1750	   as in the <capturedPeople> element in the media captures
1751	   representation (Section 11.19.2).

1753	11.29.1.2.  <personInfo>

1755	   The <personInfo> element is the XML representation of all the fields
1756	   composing a vcard as specified in the Xcard RFC [RFC6351].  The
1757	   vcardType is imported by the Xcard XML Schema provided by

1759	   [I-D.ietf-ecrit-additional-data].  As such schema specifies, the <fn>
1760	   element within <vcard> is mandatory.

1762	11.29.1.3.  <personType>

1764	   The value of the <personType> element determines the role of the
1765	   represented participant within the telepresence session organization.
1766	   It can be one of the following terms, that are defined in the
1767	   framework document: "presenter", "timekeeper", "attendee", "minute
1768	   taker", "translator", "chairman", "vice-chairman".

1770	   A participant can play more than one conference role.  In that case,
1771	   more than one <personType> element will appear in his description.

1773	   <!-- PERSON TYPE TYPE -->
1774	   <xs:simpleType name="personTypeType">
1775	           <xs:restriction base="xs:string">
1776	           <xs:enumeration value="chairman"/>
1777	           <xs:enumeration value="vice-chairman"/>
1778	           <xs:enumeration value="minute taker"/>
1779	           <xs:enumeration value="presenter"/>
1780	           <xs:enumeration value="translator"/>
1781	           <xs:enumeration value="timekeeper"/>
1782	           <xs:enumeration value="attendee"/>
1783	           <xs:enumeration value="observer"/>
1784	           </xs:restriction>
1785	   </xs:simpleType>

1787	12.  <captureEncoding>

1789	   A capture encoding is given from the association of a media capture
1790	   with an individual encoding, to form a capture stream as defined in
1791	   [I-D.ietf-clue-framework].  Capture encodings are used within
1792	   CONFIGURE messages from a Media Consumer to a Media Provider for
1793	   representing the streams desired by the Media Consumer.  For each
1794	   desired stream, the Media Consumer needs to be allowed to specify:
1795	   (i) the capture identifier of the desired capture that has been
1796	   advertised by the Media Provider; (ii) the encoding identifier of the
1797	   encoding to use, among those advertised by the Media Provider; (iii)
1798	   optionally, in case of multi-content captures, the list of the
1799	   capture identifiers of the desired captures.  All the mentioned
1800	   identifiers are intended to be included in the ADVERTISEMENT message
1801	   that the CONFIGURE message refers to.  The XML model of
1802	   <captureEncoding> is provided in the following.

1804	<!-- CAPTURE ENCODING TYPE -->
1805	<xs:complexType name="captureEncodingType">
1806	 <xs:sequence>
1807	  <xs:element name="captureID" type="xs:string"/>
1808	  <xs:element name="encodingID" type="xs:string"/>
1809	  <xs:element name="configuredContent" type="contentType" minOccurs="0"/>
1810	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
1811	   maxOccurs="unbounded"/>
1812	 </xs:sequence>
1813	 <xs:attribute name="ID" type="xs:ID" use="required"/>
1814	 <xs:anyAttribute namespace="##any" processContents="lax"/>
1815	</xs:complexType>

1817	12.1.  <captureID>

1819	   <captureID> is the mandatory element containing the identifier of the
1820	   media capture that has been encoded to form the capture encoding.

1822	12.2.  <encodingID>

1824	   <encodingID> is the mandatory element containing the identifier of
1825	   the applied individual encoding.

1827	12.3.  <configuredContent>

1829	   <configuredContent> is an optional element to be used in case of
1830	   configuration of MCC.  It contains the list of capture identifiers
1831	   and capture scene view identifiers the Media Consumer wants within
1832	   the MCC.  That element is structured as the <content> element used to
1833	   describe the content of an MCC.  The total number of media captures
1834	   listed in the <configuredContent> must be lower than or equal to the
1835	   value carried within the <maxCaptures> attribute of the MCC.

1837	13.  <clueInfo>

1839	   The <clueInfo> element includes all the information needed to
1840	   represent the Media Provider's description of its telepresence
1841	   capabilities according to the CLUE framework.  Indeed, it is made by:

1843	      the list of the available media captures (<mediaCaptures>
1844	      (Section 5))
1845	      the list of encoding groups (<encodingGroups> (Section 6))

1847	      the list of capture scenes (<captureScenes> (Section 7))

1849	      the list of simultaneous transmission sets (<simultaneousSets>
1850	      (Section 8))

1852	      the list of global views sets (<globalViews> (Section 9))

1854	      meta data about the participants represented in the telepresence
1855	      session (<people> (Section 11.29)).

1857	   It has been conceived only for data model testing purposes and,
1858	   though it resembles the body of an ADVERTISEMENT message, it is not
1859	   actually used in the CLUE protocol message definitions.  The
1860	   telepresence capabilities descriptions compliant to this data model
1861	   specification that can be found in Section 17 and Section 18 are
1862	   provided by using the <clueInfo> element.

1864	   <!-- CLUE INFO TYPE -->
1865	   <xs:complexType name="clueInfoType">
1866	     <xs:sequence>
1867	      <xs:element ref="mediaCaptures"/>
1868	      <xs:element ref="encodingGroups"/>
1869	      <xs:element ref="captureScenes"/>
1870	      <xs:element ref="simultaneousSets" minOccurs="0"/>
1871	      <xs:element ref="globalViews" minOccurs="0"/>
1872	      <xs:element ref="people" minOccurs="0"/>
1873	      <xs:any namespace="##other" processContents="lax" minOccurs="0"
1874	      maxOccurs="unbounded"/>
1875	     </xs:sequence>
1876	     <xs:attribute name="clueInfoID" type="xs:ID" use="required"/>
1877	     <xs:anyAttribute namespace="##other" processContents="lax"/>
1878	   </xs:complexType>

1880	14.  XML Schema extensibility

1882	   The telepresence data model defined in this document is meant to be
1883	   extensible.  Extensions are accomplished by defining elements or
1884	   attributes qualified by namespaces other than
1885	   "urn:ietf:params:xml:ns:clue-info" and
1886	   "urn:ietf:params:xml:ns:vcard-4.0" for use wherever the schema allows
1887	   such extensions (i.e., where the XML Schema definition specifies
1888	   "anyAttribute" or "anyElement").  Elements or attributes from unknown
1889	   namespaces MUST be ignored.

1891	14.1.  Example of extension

1893	   When extending the CLUE data model, a new schema with a new namespace
1894	   associated with it needs to be specified.

1896	   In the following, an example of extension is provided.  The extension
1897	   defines a new audio capture attribute ("newAudioFeature") and an
1898	   attribute for characterizing the captures belonging to an
1899	   "otherCaptureType" defined by the user.  An XML document compliant
1900	   with the extension is also included.  The XML file results validated
1901	   against the current CLUE data model schema.

1903	   <?xml version="1.0" encoding="UTF-8" ?>
1904	   <xs:schema
1905	      targetNamespace="urn:ietf:params:xml:ns:clue-info-ext"
1906	      xmlns:tns="urn:ietf:params:xml:ns:clue-info-ext"
1907	      xmlns:clue-ext="urn:ietf:params:xml:ns:clue-info-ext"
1908	      xmlns:xs="http://www.w3.org/2001/XMLSchema"
1909	      xmlns="urn:ietf:params:xml:ns:clue-info-ext"
1910	      xmlns:xcard="urn:ietf:params:xml:ns:vcard-4.0"
1911	      xmlns:info="urn:ietf:params:xml:ns:clue-info"
1912	      elementFormDefault="qualified"
1913	      attributeFormDefault="unqualified">

1915	   <!-- Import xcard XML schema -->
1916	   <xs:import namespace="urn:ietf:params:xml:ns:vcard-4.0"
1917	   schemaLocation="xcard.xsd"/>

1919	   <!-- Import CLUE XML schema -->
1920	   <xs:import namespace="urn:ietf:params:xml:ns:clue-info"
1921	   schemaLocation="clue-data-model-schema.xsd"/>

1923	   <!-- ELEMENT DEFINITIONS -->
1924	   <xs:element name="newAudioFeature" type="xs:string"/>
1925	   <xs:element name="otherMediaCaptureTypeFeature" type="xs:string"/>

1927	   </xs:schema>

1929	<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
1930	<clueInfo xmlns="urn:ietf:params:xml:ns:clue-info"
1931	xmlns:ns2="urn:ietf:params:xml:ns:vcard-4.0"
1932	xmlns:ns3="urn:ietf:params:xml:ns:clue-info-ext"
1933	clueInfoID="NapoliRoom">
1934	    <mediaCaptures>
1935	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1936	        xsi:type="audioCaptureType"
1937	        captureID="AC0"
1938	        mediaType="audio">
1939	            <captureSceneIDREF>CS1</captureSceneIDREF>
1940	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
1941	            <individual>true</individual>
1942	            <encGroupIDREF>EG1</encGroupIDREF>
1943	            <ns3:newAudioFeature>newAudioFeatureValue</ns3:newAudioFeature>
1944	        </mediaCapture>
1945	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1946	        xsi:type="otherCaptureType"
1947	        captureID="OMC0"
1948	        mediaType="other media type">
1949	            <captureSceneIDREF>CS1</captureSceneIDREF>
1950	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
1951	            <encGroupIDREF>EG1</encGroupIDREF>
1952	            <ns3:otherMediaCaptureTypeFeature>OtherValue
1953	            </ns3:otherMediaCaptureTypeFeature>
1954	        </mediaCapture>
1955	    </mediaCaptures>
1956	    <encodingGroups>
1957	        <encodingGroup encodingGroupID="EG1">
1958	            <maxGroupBandwidth>300000</maxGroupBandwidth>
1959	            <encodingIDList>
1960	                <encID>ENC4</encID>
1961	                <encID>ENC5</encID>
1962	            </encodingIDList>
1963	        </encodingGroup>
1964	    </encodingGroups>
1965	    <captureScenes>
1966	        <captureScene scale="unknown" sceneID="CS1"/>
1967	    </captureScenes>
1968	</clueInfo>

1970	15.  Security considerations

1972	   This document defines an XML Schema data model for telepresence
1973	   scenarios.  The modeled information is identified in the CLUE
1974	   framework as necessary in order to enable a full-optional media
1975	   stream negotiation and rendering.  Indeed, the XML elements herein
1976	   defined are used within CLUE protocol messages to describe both the
1977	   media streams representing the Media Provider's telepresence offer
1978	   and the desired selection requested by the Media Consumer.  Security
1979	   concerns described in [I-D.ietf-clue-framework], Section 15, apply to
1980	   this document.

1982	   Data model information carried within CLUE messages SHOULD be
1983	   accessed only by authenticated endpoints.  Indeed, some information
1984	   published by the Media Provider might reveal sensitive data about who
1985	   and what is represented in the transmitted streams.  The vCard
1986	   included in the <personInfo> elements (Section 11.29.1) mandatorily
1987	   contains the identity of the represented person.  Optionally vCards
1988	   can also carry the person's contact addresses, together with his/her
1989	   photo and other personal data.  Similar privacy-critical information
1990	   can be conveyed by means of <sceneInformation> elements
1991	   (Section 11.24.1) describing the capture scenes.  The <description>
1992	   elements (Section 11.13) also can specify details about the content
1993	   of media captures , capture scenes and scene views that should be
1994	   protected.

1996	   Integrity attacks to the data model information encapsulated in CLUE
1997	   messages can invalidate the success of the telepresence session's
1998	   setup by misleading the Media Consumer's and Media Provider's
1999	   interpretation of the offered and desired media streams.

2001	   The assurance of the authenticated access and of the integrity of the
2002	   data model information is up to the involved transport mechanisms,
2003	   namely the CLUE protocol [I-D.ietf-clue-protocol] and the CLUE data
2004	   channel [I-D.ietf-clue-datachannel].

2006	16.  IANA considerations

2008	   This document registers a new XML namespace, a new XML schema and the
2009	   MIME type for the schema.

2011	16.1.  XML namespace registration

2013	   URI: urn:ietf:params:xml:ns:clue-info

2015	   Registrant Contact: IETF CLUE Working Group <clue@ietf.org>, Roberta
2016	   Presta <roberta.presta@unina.it>

2018	   XML:

2020	BEGIN

2022	<?xml version="1.0"?>
2023	<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML Basic 1.0//EN"
2024	 "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd">
2025	<html xmlns="http://www.w3.org/1999/xhtml">
2026	 <head>
2027	  <meta http-equiv="content-type"
2028	        content="text/html;charset=iso-8859-1"/>
2029	  <title> CLUE Data Model Namespace</title>
2030	 </head>
2031	 <body>
2032	  <h1>Namespace for CLUE Data Model</h1>
2033	  <h2>urn:ietf:params:xml:ns:clue-info</h2>
2034	  <p>See <a href="http://www.rfc-editor.org/rfc/rfcXXXX.txt"> RFC XXXX</a>.
2035	  <!--[[NOTE TO IANA/RFC-EDITOR: Please update RFC URL and replace XXXX
2036	    with the RFC number for this specification.]]-->
2037	    </p>
2038	 </body>
2039	</html>

2041	END

2043	16.2.  XML Schema registration

2045	   This section registers an XML schema per the guidelines in [RFC3688].

2047	   URI: urn:ietf:params:xml:schema:clue-info

2049	   Registrant Contact: CLUE working group (clue@ietf.org), Roberta
2050	   Presta (roberta.presta@unina.it).

2052	   Schema: The XML for this schema can be found as the entirety of
2053	   Section 4 of this document.

2055	16.3.  MIME Media Type Registration for 'application/clue_info+xml'

2057	   This section registers the "application/clue_info+xml" MIME type.

2059	   To: ietf-types@iana.org

2061	   Subject: Registration of MIME media type application/clue+xml

2063	   MIME media type name: application

2065	   MIME subtype name: clue_info+xml
2066	   Required parameters: (none)

2068	   Optional parameters: charset
2069	   Same as the charset parameter of "application/xml" as specified in
2070	   [RFC7303], Section 3.2.

2072	   Encoding considerations: Same as the encoding considerations of
2073	   "application/xml" as specified in [RFC7303], Section 3.2.

2075	   Security considerations: This content type is designed to carry data
2076	   related to telepresence information.  Some of the data could be
2077	   considered private.  This media type does not provide any protection
2078	   and thus other mechanisms such as those described in Section 15 are
2079	   required to protect the data.  This media type does not contain
2080	   executable content.

2082	   Interoperability considerations: None.

2084	   Published specification: RFC XXXX [[NOTE TO IANA/RFC-EDITOR: Please
2085	   replace XXXX with the RFC number for this specification.]]

2087	   Applications that use this media type: None.

2089	   Additional Information: Magic Number(s): (none),
2090	   File extension(s): .clue,
2091	   Macintosh File Type Code(s): TEXT.

2093	   Person & email address to contact for further information: Roberta
2094	   Presta (roberta.presta@unina.it).

2096	   Intended usage: LIMITED USE

2098	   Author/Change controller: The IETF

2100	   Other information: This media type is a specialization of
2101	   application/xml [RFC7303], and many of the considerations described
2102	   there also apply to application/clue_info+xml.

2104	17.  Sample XML file

2106	   The following XML document represents a schema compliant example of a
2107	   CLUE telepresence scenario.  Taking inspiration from the examples
2108	   described in the framework draft ([I-D.ietf-clue-framework]), it is
2109	   provided the XML representation of an endpoint-style Media Provider's
2110	   offer.

2112	   There are three cameras, where the central one is also capable of
2113	   capturing a zoomed-out view of the overall telepresence room.

2115	   Besides the three video captures coming from the cameras, the Media
2116	   Provider makes available a further multi-content capture of the
2117	   loudest segment of the room, obtained by switching the video source
2118	   across the three cameras.  For the sake of simplicity, only one audio
2119	   capture is advertised for the audio of the whole room.

2121	   The three cameras are placed in front of three participants (Alice,
2122	   Bob and Ciccio), whose vcard and conference role details are also
2123	   provided.

2125	   Media captures are arranged into four capture scene views:

2127	   1.  (VC0, VC1, VC2) - left, center and right camera video captures

2129	   2.  (VC3) - video capture associated with loudest room segment

2131	   3.  (VC4) - video capture zoomed out view of all people in the room

2133	   4.  (AC0) - main audio

2135	   There are two encoding groups: (i) EG0, for video encodings, and (ii)
2136	   EG1, for audio encodings.

2138	   As to the simultaneous sets, only VC1 and VC4 cannot be transmitted
2139	   simultaneously since they are captured by the same device, i.e., the
2140	   central camera (VC4 is a zoomed-out view while VC1 is a focused view
2141	   of the front participant).  The simultaneous sets would then be the
2142	   following:

2144	   SS1  made by VC3 and all the captures in the first capture scene view
2145	      (VC0,VC1,VC2);

2147	   SS2  made by VC3, VC0, VC2, VC4

2149	<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
2150	<clueInfo xmlns="urn:ietf:params:xml:ns:clue-info"
2151	          xmlns:ns2="urn:ietf:params:xml:ns:vcard-4.0" clueInfoID="NapoliRoom">
2152	    <mediaCaptures>
2153	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2154	         xsi:type="audioCaptureType" captureID="AC0" mediaType="audio">
2155	            <captureSceneIDREF>CS1</captureSceneIDREF>
2156	            <encGroupIDREF>EG1</encGroupIDREF>
2157	            <spatialInformation>
2158	                <captureOrigin>
2159	                        <capturePoint>
2160	                        <x>0.5</x>
2161	                        <y>1.0</y>
2162	                        <z>0.5</z>
2163	                    </capturePoint>
2164	                    <lineOfCapturePoint>
2165	                        <x>0.5</x>
2166	                        <y>0.0</y>
2167	                        <z>0.5</z>
2168	                    </lineOfCapturePoint>
2169	                </captureOrigin>
2170	            </spatialInformation>
2171	            <individual>true</individual>
2172	            <description lang="en">main audio from the room</description>
2173	            <priority>1</priority>
2174	            <lang>it</lang>
2175	            <mobility>static</mobility>
2176	            <view>room</view>
2177	            <capturedPeople>
2178	                <personIDREF>alice</personIDREF>
2179	                <personIDREF>bob</personIDREF>
2180	                <personIDREF>ciccio</personIDREF>
2181	            </capturedPeople>
2182	        </mediaCapture>
2183	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2184	         xsi:type="videoCaptureType" captureID="VC0" mediaType="video">
2185	            <captureSceneIDREF>CS1</captureSceneIDREF>
2186	            <encGroupIDREF>EG0</encGroupIDREF>
2187	            <spatialInformation>
2188	                <captureOrigin>
2189	                        <capturePoint>
2190	                        <x>0.5</x>
2191	                        <y>1.0</y>
2192	                        <z>0.5</z>
2193	                    </capturePoint>
2194	                    <lineOfCapturePoint>
2195	                        <x>0.5</x>
2196	                        <y>0.0</y>
2197	                        <z>0.5</z>
2198	                    </lineOfCapturePoint>
2199	                </captureOrigin>
2200	            </spatialInformation>
2201	            <individual>true</individual>
2202	            <description lang="en">left camera video capture</description>
2203	            <priority>1</priority>
2204	            <lang>it</lang>
2205	            <mobility>static</mobility>
2206	            <view>individual</view>
2207	            <capturedPeople>
2208	                <personIDREF>ciccio</personIDREF>
2209	            </capturedPeople>
2210	        </mediaCapture>
2211	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2212	        xsi:type="videoCaptureType" captureID="VC1" mediaType="video">
2213	            <captureSceneIDREF>CS1</captureSceneIDREF>
2214	            <encGroupIDREF>EG0</encGroupIDREF>
2215	            <spatialInformation>
2216	                <captureOrigin>
2217	                        <capturePoint>
2218	                        <x>0.5</x>
2219	                        <y>1.0</y>
2220	                        <z>0.5</z>
2221	                    </capturePoint>
2222	                    <lineOfCapturePoint>
2223	                        <x>0.5</x>
2224	                        <y>0.0</y>
2225	                        <z>0.5</z>
2226	                    </lineOfCapturePoint>
2227	                </captureOrigin>
2228	            </spatialInformation>
2229	            <individual>true</individual>
2230	            <description lang="en">central camera video capture</description>
2231	            <priority>1</priority>
2232	            <lang>it</lang>
2233	            <mobility>static</mobility>
2234	            <view>individual</view>
2235	            <capturedPeople>
2236	                <personIDREF>alice</personIDREF>
2237	            </capturedPeople>
2238	        </mediaCapture>
2239	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2240	        xsi:type="videoCaptureType" captureID="VC2" mediaType="video">
2241	            <captureSceneIDREF>CS1</captureSceneIDREF>
2242	            <encGroupIDREF>EG0</encGroupIDREF>
2243	            <spatialInformation>
2244	                <captureOrigin>
2245	                        <capturePoint>
2246	                        <x>0.5</x>
2247	                        <y>1.0</y>
2248	                        <z>0.5</z>
2249	                    </capturePoint>
2250	                    <lineOfCapturePoint>
2251	                        <x>0.5</x>
2252	                        <y>0.0</y>
2253	                        <z>0.5</z>
2254	                    </lineOfCapturePoint>
2255	                </captureOrigin>

2257	            </spatialInformation>
2258	            <individual>true</individual>
2259	            <description lang="en">right camera video capture</description>
2260	            <priority>1</priority>
2261	            <lang>it</lang>
2262	            <mobility>static</mobility>
2263	            <view>individual</view>
2264	            <capturedPeople>
2265	                <personIDREF>bob</personIDREF>
2266	            </capturedPeople>
2267	        </mediaCapture>
2268	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2269	        xsi:type="videoCaptureType" captureID="VC3" mediaType="video">
2270	            <captureSceneIDREF>CS1</captureSceneIDREF>
2271	            <encGroupIDREF>EG0</encGroupIDREF>
2272	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
2273	            <policy>Soundlevel:0</policy>
2274	            <description lang="en">loudest room segment</description>
2275	            <priority>1</priority>
2276	            <lang>it</lang>
2277	            <mobility>static</mobility>
2278	            <view>individual</view>
2279	        </mediaCapture>
2280	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2281	        xsi:type="videoCaptureType" captureID="VC4" mediaType="video">
2282	            <captureSceneIDREF>CS1</captureSceneIDREF>
2283	            <encGroupIDREF>EG0</encGroupIDREF>
2284	            <spatialInformation>
2285	                <captureOrigin>
2286	                        <capturePoint>
2287	                        <x>0.5</x>
2288	                        <y>1.0</y>
2289	                        <z>0.5</z>
2290	                    </capturePoint>
2291	                    <lineOfCapturePoint>
2292	                        <x>0.5</x>
2293	                        <y>0.0</y>
2294	                        <z>0.5</z>
2295	                    </lineOfCapturePoint>
2296	                </captureOrigin>
2297	            </spatialInformation>
2298	            <individual>true</individual>
2299	            <description lang="en">zoomed out view of all people in the
2300	            room</description>
2301	            <priority>1</priority>
2302	            <lang>it</lang>
2303	            <mobility>static</mobility>
2304	            <view>room</view>
2305	            <capturedPeople>
2306	                <personIDREF>alice</personIDREF>
2307	                <personIDREF>bob</personIDREF>
2308	                <personIDREF>ciccio</personIDREF>
2309	            </capturedPeople>
2310	        </mediaCapture>
2311	    </mediaCaptures>
2312	    <encodingGroups>
2313	        <encodingGroup encodingGroupID="EG0">
2314	            <maxGroupBandwidth>600000</maxGroupBandwidth>
2315	            <encodingIDList>
2316	                <encID>ENC1</encID>
2317	                <encID>ENC2</encID>
2318	                <encID>ENC3</encID>
2319	            </encodingIDList>
2320	        </encodingGroup>
2321	        <encodingGroup encodingGroupID="EG1">
2322	            <maxGroupBandwidth>300000</maxGroupBandwidth>
2323	            <encodingIDList>
2324	                <encID>ENC4</encID>
2325	                <encID>ENC5</encID>
2326	            </encodingIDList>
2327	        </encodingGroup>
2328	    </encodingGroups>
2329	    <captureScenes>
2330	        <captureScene scale="unknown" sceneID="CS1">
2331	            <sceneViews>
2332	                <sceneView sceneViewID="SE1">
2333	                    <mediaCaptureIDs>
2334	                        <captureIDREF>VC0</captureIDREF>
2335	                        <captureIDREF>VC1</captureIDREF>
2336	                        <captureIDREF>VC2</captureIDREF>
2337	                    </mediaCaptureIDs>
2338	                </sceneView>
2339	                <sceneView sceneViewID="SE2">
2340	                    <mediaCaptureIDs>
2341	                        <captureIDREF>VC3</captureIDREF>
2342	                    </mediaCaptureIDs>
2343	                </sceneView>
2344	                <sceneView sceneViewID="SE3">
2345	                    <mediaCaptureIDs>
2346	                        <captureIDREF>VC4</captureIDREF>
2347	                    </mediaCaptureIDs>
2348	                </sceneView>
2349	                <sceneView sceneViewID="SE4">
2350	                    <mediaCaptureIDs>
2351	                        <captureIDREF>VC4</captureIDREF>
2352	                    </mediaCaptureIDs>

2354	                </sceneView>
2355	            </sceneViews>
2356	        </captureScene>
2357	    </captureScenes>
2358	    <simultaneousSets>
2359	        <simultaneousSet setID="SS1">
2360	            <mediaCaptureIDREF>VC3</mediaCaptureIDREF>
2361	            <sceneViewIDREF>SE1</sceneViewIDREF>
2362	        </simultaneousSet>
2363	        <simultaneousSet setID="SS2">
2364	            <mediaCaptureIDREF>VC0</mediaCaptureIDREF>
2365	            <mediaCaptureIDREF>VC2</mediaCaptureIDREF>
2366	            <mediaCaptureIDREF>VC4</mediaCaptureIDREF>
2367	            <mediaCaptureIDREF>VC3</mediaCaptureIDREF>
2368	        </simultaneousSet>
2369	    </simultaneousSets>
2370	    <people>
2371	        <person personID="bob">
2372	            <personInfo>
2373	                <ns2:fn>
2374	                    <ns2:text>Bob</ns2:text>
2375	                </ns2:fn>
2376	            </personInfo>
2377	            <personType>minute taker</personType>
2378	        </person>
2379	        <person personID="alice">
2380	            <personInfo>
2381	                <ns2:fn>
2382	                    <ns2:text>Alice</ns2:text>
2383	                </ns2:fn>
2384	            </personInfo>
2385	            <personType>presenter</personType>
2386	        </person>
2387	        <person personID="ciccio">
2388	            <personInfo>
2389	                <ns2:fn>
2390	                    <ns2:text>Ciccio</ns2:text>
2391	                </ns2:fn>
2392	            </personInfo>
2393	            <personType>chairman</personType>
2394	            <personType>timekeeper</personType>
2395	        </person>
2396	    </people>
2397	</clueInfo>
2398	18.  MCC example

2400	   Enhancing the scenario presented in the previous example, the Media
2401	   Provider is able to advertise a composed capture VC7 made by a big
2402	   picture representing the current speaker (VC3) and two picture-in-
2403	   picture boxes representing the previous speakers (the previous one
2404	   -VC5- and the oldest one -VC6).  The provider does not want to
2405	   instantiate and send VC5 and VC6, so it does not associate any
2406	   encoding group with them.  Their XML representations are provided for
2407	   enabling the description of VC7.

2409	   A possible description for that scenario could be the following:

2411	<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
2412	<clueInfo xmlns="urn:ietf:params:xml:ns:clue-info"
2413	xmlns:ns2="urn:ietf:params:xml:ns:vcard-4.0" clueInfoID="NapoliRoom">
2414	    <mediaCaptures>
2415	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2416	        xsi:type="audioCaptureType" captureID="AC0" mediaType="audio">
2417	            <captureSceneIDREF>CS1</captureSceneIDREF>
2418	            <encGroupIDREF>EG1</encGroupIDREF>
2419	            <spatialInformation>
2420	                <captureOrigin>
2421	                        <capturePoint>
2422	                        <x>0.5</x>
2423	                        <y>1.0</y>
2424	                        <z>0.5</z>
2425	                    </capturePoint>
2426	                    <lineOfCapturePoint>
2427	                        <x>0.5</x>
2428	                        <y>0.0</y>
2429	                        <z>0.5</z>
2430	                    </lineOfCapturePoint>
2431	                </captureOrigin>
2432	            </spatialInformation>
2433	            <individual>true</individual>
2434	            <description lang="en">main audio from the room</description>
2435	            <priority>1</priority>
2436	            <lang>it</lang>
2437	            <mobility>static</mobility>
2438	            <view>room</view>
2439	            <capturedPeople>
2440	                <personIDREF>alice</personIDREF>
2441	                <personIDREF>bob</personIDREF>
2442	                <personIDREF>ciccio</personIDREF>
2443	            </capturedPeople>

2445	        </mediaCapture>
2446	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2447	        xsi:type="videoCaptureType" captureID="VC0" mediaType="video">
2448	            <captureSceneIDREF>CS1</captureSceneIDREF>
2449	            <encGroupIDREF>EG0</encGroupIDREF>
2450	            <spatialInformation>
2451	                <captureOrigin>
2452	                        <capturePoint>
2453	                        <x>0.5</x>
2454	                        <y>1.0</y>
2455	                        <z>0.5</z>
2456	                    </capturePoint>
2457	                    <lineOfCapturePoint>
2458	                        <x>0.5</x>
2459	                        <y>0.0</y>
2460	                        <z>0.5</z>
2461	                    </lineOfCapturePoint>
2462	                </captureOrigin>
2463	            </spatialInformation>
2464	            <individual>true</individual>
2465	            <description lang="en">left camera video capture</description>
2466	            <priority>1</priority>
2467	            <lang>it</lang>
2468	            <mobility>static</mobility>
2469	            <view>individual</view>
2470	            <capturedPeople>
2471	                <personIDREF>ciccio</personIDREF>
2472	            </capturedPeople>
2473	        </mediaCapture>
2474	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2475	        xsi:type="videoCaptureType" captureID="VC1" mediaType="video">
2476	            <captureSceneIDREF>CS1</captureSceneIDREF>
2477	            <encGroupIDREF>EG0</encGroupIDREF>
2478	            <spatialInformation>
2479	                <captureOrigin>
2480	                        <capturePoint>
2481	                        <x>0.5</x>
2482	                        <y>1.0</y>
2483	                        <z>0.5</z>
2484	                    </capturePoint>
2485	                    <lineOfCapturePoint>
2486	                        <x>0.5</x>
2487	                        <y>0.0</y>
2488	                        <z>0.5</z>
2489	                    </lineOfCapturePoint>
2490	                </captureOrigin>
2491	            </spatialInformation>
2492	            <individual>true</individual>
2493	            <description lang="en">central camera video capture</description>
2494	            <priority>1</priority>
2495	            <lang>it</lang>
2496	            <mobility>static</mobility>
2497	            <view>individual</view>
2498	            <capturedPeople>
2499	                <personIDREF>alice</personIDREF>
2500	            </capturedPeople>
2501	        </mediaCapture>
2502	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2503	        xsi:type="videoCaptureType" captureID="VC2" mediaType="video">
2504	            <captureSceneIDREF>CS1</captureSceneIDREF>
2505	            <encGroupIDREF>EG0</encGroupIDREF>
2506	            <spatialInformation>
2507	                <captureOrigin>
2508	                        <capturePoint>
2509	                        <x>0.5</x>
2510	                        <y>1.0</y>
2511	                        <z>0.5</z>
2512	                    </capturePoint>
2513	                    <lineOfCapturePoint>
2514	                        <x>0.5</x>
2515	                        <y>0.0</y>
2516	                        <z>0.5</z>
2517	                    </lineOfCapturePoint>
2518	                </captureOrigin>
2519	            </spatialInformation>
2520	            <individual>true</individual>
2521	            <description lang="en">right camera video capture</description>
2522	            <priority>1</priority>
2523	            <lang>it</lang>
2524	            <mobility>static</mobility>
2525	            <view>individual</view>
2526	            <capturedPeople>
2527	                <personIDREF>bob</personIDREF>
2528	            </capturedPeople>
2529	        </mediaCapture>
2530	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2531	        xsi:type="videoCaptureType" captureID="VC3" mediaType="video">
2532	            <captureSceneIDREF>CS1</captureSceneIDREF>
2533	            <encGroupIDREF>EG0</encGroupIDREF>
2534	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
2535	            <content>
2536	                <sceneViewIDREF>SE1</sceneViewIDREF>
2537	            </content>
2538	            <policy>Soundlevel:0</policy>
2539	            <description lang="en">loudest room segment</description>
2540	            <priority>1</priority>
2541	            <lang>it</lang>
2542	            <mobility>static</mobility>
2543	            <view>individual</view>
2544	        </mediaCapture>
2545	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2546	        xsi:type="videoCaptureType" captureID="VC4" mediaType="video">
2547	            <captureSceneIDREF>CS1</captureSceneIDREF>
2548	            <encGroupIDREF>EG0</encGroupIDREF>
2549	            <spatialInformation>
2550	                <captureOrigin>
2551	                        <capturePoint>
2552	                        <x>0.5</x>
2553	                        <y>1.0</y>
2554	                        <z>0.5</z>
2555	                    </capturePoint>
2556	                    <lineOfCapturePoint>
2557	                        <x>0.5</x>
2558	                        <y>0.0</y>
2559	                        <z>0.5</z>
2560	                    </lineOfCapturePoint>
2561	                </captureOrigin>
2562	            </spatialInformation>
2563	            <individual>true</individual>
2564	            <description lang="en">zoomed out view of all people in the room
2565	            </description>
2566	            <priority>1</priority>
2567	            <lang>it</lang>
2568	            <mobility>static</mobility>
2569	            <view>room</view>
2570	            <capturedPeople>
2571	                <personIDREF>alice</personIDREF>
2572	                <personIDREF>bob</personIDREF>
2573	                <personIDREF>ciccio</personIDREF>
2574	            </capturedPeople>
2575	        </mediaCapture>
2576	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2577	        xsi:type="videoCaptureType" captureID="VC5" mediaType="video">
2578	            <captureSceneIDREF>CS1</captureSceneIDREF>
2579	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
2580	            <content>
2581	                <sceneViewIDREF>SE1</sceneViewIDREF>
2582	            </content>
2583	            <policy>Soundlevel:1</policy>
2584	            <description lang="en">penultimate loudest room segment
2585	            </description>
2586	            <priority>1</priority>
2587	            <lang>it</lang>
2588	            <mobility>static</mobility>
2589	            <view>individual</view>
2590	        </mediaCapture>
2591	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2592	        xsi:type="videoCaptureType" captureID="VC6" mediaType="video">
2593	            <captureSceneIDREF>CS1</captureSceneIDREF>
2594	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
2595	            <content>
2596	                <sceneViewIDREF>SE1</sceneViewIDREF>
2597	            </content>
2598	            <policy>Soundlevel:2</policy>
2599	            <description lang="en">last but two loudest room segment
2600	            </description>
2601	            <priority>1</priority>
2602	            <lang>it</lang>
2603	            <mobility>static</mobility>
2604	            <view>individual</view>
2605	        </mediaCapture>
2606	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2607	        xsi:type="videoCaptureType" captureID="VC7" mediaType="video">
2608	            <captureSceneIDREF>CS1</captureSceneIDREF>
2609	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
2610	            <content>
2611	                <captureIDREF>VC3</captureIDREF>
2612	                <captureIDREF>VC5</captureIDREF>
2613	                <captureIDREF>VC6</captureIDREF>
2614	            </content>
2615	            <description lang="en">big picture of the current speaker +
2616	            pips about previous speakers</description>
2617	            <priority>1</priority>
2618	            <lang>it</lang>
2619	            <mobility>static</mobility>
2620	            <view>individual</view>
2621	        </mediaCapture>
2622	    </mediaCaptures>
2623	    <encodingGroups>
2624	        <encodingGroup encodingGroupID="EG0">
2625	            <maxGroupBandwidth>600000</maxGroupBandwidth>
2626	            <encodingIDList>
2627	                <encID>ENC1</encID>
2628	                <encID>ENC2</encID>
2629	                <encID>ENC3</encID>
2630	            </encodingIDList>
2631	        </encodingGroup>
2632	        <encodingGroup encodingGroupID="EG1">
2633	            <maxGroupBandwidth>300000</maxGroupBandwidth>
2634	            <encodingIDList>
2635	                <encID>ENC4</encID>
2636	                <encID>ENC5</encID>

2638	            </encodingIDList>
2639	        </encodingGroup>
2640	    </encodingGroups>
2641	    <captureScenes>
2642	        <captureScene scale="unknown" sceneID="CS1">
2643	            <sceneViews>
2644	                <sceneView sceneViewID="SE1">
2645	                    <description lang="en">participants' individual
2646	                    videos</description>
2647	                    <mediaCaptureIDs>
2648	                        <captureIDREF>VC0</captureIDREF>
2649	                        <captureIDREF>VC1</captureIDREF>
2650	                        <captureIDREF>VC2</captureIDREF>
2651	                    </mediaCaptureIDs>
2652	                </sceneView>
2653	                <sceneView sceneViewID="SE2">
2654	                    <description lang="en">loudest segment of the
2655	                    room</description>
2656	                    <mediaCaptureIDs>
2657	                        <captureIDREF>VC3</captureIDREF>
2658	                    </mediaCaptureIDs>
2659	                </sceneView>
2660	                <sceneView sceneViewID="SE5">
2661	                    <description lang="en">loudest segment of the
2662	                    room + pips</description>
2663	                    <mediaCaptureIDs>
2664	                        <captureIDREF>VC7</captureIDREF>
2665	                    </mediaCaptureIDs>
2666	                </sceneView>
2667	                <sceneView sceneViewID="SE4">
2668	                    <description lang="en">room audio</description>
2669	                    <mediaCaptureIDs>
2670	                        <captureIDREF>AC0</captureIDREF>
2671	                    </mediaCaptureIDs>
2672	                </sceneView>
2673	                <sceneView sceneViewID="SE3">
2674	                    <description lang="en">room video</description>
2675	                    <mediaCaptureIDs>
2676	                        <captureIDREF>VC4</captureIDREF>
2677	                    </mediaCaptureIDs>
2678	                </sceneView>
2679	            </sceneViews>
2680	        </captureScene>
2681	    </captureScenes>
2682	    <simultaneousSets>
2683	        <simultaneousSet setID="SS1">
2684	            <mediaCaptureIDREF>VC7</mediaCaptureIDREF>
2685	            <sceneViewIDREF>SE1</sceneViewIDREF>

2687	        </simultaneousSet>
2688	        <simultaneousSet setID="SS2">
2689	            <mediaCaptureIDREF>VC0</mediaCaptureIDREF>
2690	            <mediaCaptureIDREF>VC2</mediaCaptureIDREF>
2691	            <mediaCaptureIDREF>VC4</mediaCaptureIDREF>
2692	            <mediaCaptureIDREF>VC7</mediaCaptureIDREF>
2693	        </simultaneousSet>
2694	    </simultaneousSets>
2695	    <people>
2696	        <person personID="bob">
2697	            <personInfo>
2698	                <ns2:fn>
2699	                    <ns2:text>Bob</ns2:text>
2700	                </ns2:fn>
2701	            </personInfo>
2702	            <personType>minute taker</personType>
2703	        </person>
2704	        <person personID="alice">
2705	            <personInfo>
2706	                <ns2:fn>
2707	                    <ns2:text>Alice</ns2:text>
2708	                </ns2:fn>
2709	            </personInfo>
2710	            <personType>presenter</personType>
2711	        </person>
2712	        <person personID="ciccio">
2713	            <personInfo>
2714	                <ns2:fn>
2715	                    <ns2:text>Ciccio</ns2:text>
2716	                </ns2:fn>
2717	            </personInfo>
2718	            <personType>chairman</personType>
2719	            <personType>timekeeper</personType>
2720	        </person>
2721	    </people>
2722	</clueInfo>

2724	19.  Diff with draft-ietf-clue-data-model-schema-10 version

2726	   Minor modifications have been applied to address nits at page https:/
2727	   /www.ietf.org/tools/idnits?url=https://www.ietf.org/archive/id/
2728	   draft-ietf-clue-data-model-schema-10.txt.

2730	20.  Diff with draft-ietf-clue-data-model-schema-09 version

2732	   o  We have introduced a <captureOrigin> element containing a
2733	      mandatory <capturePoint> and an optional <lineOfCapturePoint> in
2734	      the definition of <spatialInformation> as per Paul's review

2736	   o  A new type definition for switching policies (resembled by
2737	      <policy> element) has been provided in order to have acceptable
2738	      values in the form of "token:index".

2740	   o  Minor modifications suggested in WGLC reviews have been applied.

2742	21.  Diff with draft-ietf-clue-data-model-schema-08 version

2744	   o  Typos correction

2746	22.  Diff with draft-ietf-clue-data-model-schema-07 version

2748	   o  IANA Considerations: text added

2750	   o  maxCaptureEncodings removed

2752	   o  personTypeType values aligned with CLUE framework

2754	   o  allowSubsetChoice added for multiple content captures

2756	   o  embeddedText moved from videoCaptureType definition to
2757	      mediaCaptureType definition

2759	   o  typos removed from section Terminology

2761	23.  Diff with draft-ietf-clue-data-model-schema-06 version

2763	   o  Capture Scene Entry/Entries renamed as Capture Scene View/Views in
2764	      the text, <sceneEntry>/<sceneEntries> renamed as <sceneView>/
2765	      <sceneViews> in the XML schema.

2767	   o  Global Scene Entry/Entries renamed as Global View/Views in the
2768	      text, <globalSceneEntry>/<globalSceneEntries> renamed as
2769	      <globalView>/<globalViews>

2771	   o  Security section added.

2773	   o  Extensibility: a new type is introduced to describe other types of
2774	      media capture (otherCaptureType), text and example added.

2776	   o  Spatial information section updated: capture point optional, text
2777	      now is coherent with the framework one.

2779	   o  Audio capture description: <sensitivityPattern> added,
2780	      <audioChannelFormat> removed, <captureArea> disallowed.

2782	   o  Simultaneous set definition: added <captureSceneIDREF> to refer to
2783	      capture scene identifiers as shortcuts and an optional mediaType
2784	      attribute which is mandatory to use when only capture scene
2785	      identifiers are listed.

2787	   o  Encoding groups: removed the constraint of the same media type.

2789	   o  Updated text about media captures without <encodingGroupIDREF>
2790	      (optional in the XML schema).

2792	   o  "mediaType" attribute removed from homogeneous groups of capture
2793	      (scene views and globlal views)

2795	   o  "mediaType" attribute removed from the global view textual
2796	      description.

2798	   o  "millimeters" scale value changed in "mm"

2800	24.  Diff with draft-ietf-clue-data-model-schema-04 version

2802	      globalCaptureEntries/Entry renamed as globalSceneEntries/Entry;

2804	      sceneInformation added;

2806	      Only capture scene entry identifiers listed within global scene
2807	      entries (media capture identifiers removed);

2809	      <participants> renamed as <people> in the >clueInfo< template

2811	      <vcard> renamed as <personInfo> to synch with the framework
2812	      terminology

2814	      <participantType> renamed as <personType> to synch with the
2815	      framework terminology

2817	      <participantIDs> renamed as <capturedPeople> in the media capture
2818	      type definition to remove ambiguity

2820	      Examples have been updated with the new definitions of
2821	      <globalSceneEntries> and of <people>.

2823	25.  Diff with draft-ietf-clue-data-model-schema-03 version

2825	      encodings section has been removed

2827	      global capture entries have been introduced

2829	      capture scene entry identifiers are used as shortcuts in listing
2830	      the content of MCC (similarly to simultaneous set and global
2831	      capture entries)

2833	      Examples have been updated.  A new example with global capture
2834	      entries has been added.

2836	      <encGroupIDREF> has been made optional.

2838	      <single> has been renamed into <individual>

2840	      Obsolete comments have been removed.

2842	      participants information has been added.

2844	26.  Diff with draft-ietf-clue-data-model-schema-02 version

2846	      captureParameters and encodingParameters have been removed from
2847	      the captureEncodingType

2849	      data model example has been updated and validated according to the
2850	      new schema.  Further description of the represented scenario has
2851	      been provided.

2853	      A multiple content capture example has been added.

2855	      Obsolete comments and references have been removed.

2857	27.  Acknowledgments

2859	   The authors thank all the CLUErs for their precious feedbacks and
2860	   support.

2862	28.  Informative References

2864	   [I-D.ietf-clue-datachannel]       Holmberg, C., "CLUE Protocol data
2865	                                     channel",
2866	                                     draft-ietf-clue-datachannel-10
2867	                                     (work in progress), September 2015.

2869	   [I-D.ietf-clue-framework]         Duckworth, M., Pepperell, A., and
2870	                                     S. Wenger, "Framework for
2871	                                     Telepresence Multi-Streams",
2872	                                     draft-ietf-clue-framework-23 (work
2873	                                     in progress), September 2015.

2875	   [I-D.ietf-clue-protocol]          Presta, R. and S. Romano, "CLUE
2876	                                     protocol",
2877	                                     draft-ietf-clue-protocol-06 (work
2878	                                     in progress), October 2015.

2880	   [I-D.ietf-ecrit-additional-data]  Gellens, R., Rosen, B., Tschofenig,
2881	                                     H., Marshall, R., and J.
2882	                                     Winterbottom, "Additional Data
2883	                                     Related to an Emergency Call",
2884	                                     draft-ietf-ecrit-additional-data-37
2885	                                     (work in progress), October 2015.

2887	   [RFC2119]                         Bradner, S., "Key words for use in
2888	                                     RFCs to Indicate Requirement
2889	                                     Levels", BCP 14, RFC 2119,
2890	                                     DOI 10.17487/RFC2119, March 1997, <
2891	                                     http://www.rfc-editor.org/info/
2892	                                     rfc2119>.

2894	   [RFC3550]                         Schulzrinne, H., Casner, S.,
2895	                                     Frederick, R., and V. Jacobson,
2896	                                     "RTP: A Transport Protocol for
2897	                                     Real-Time Applications", STD 64,
2898	                                     RFC 3550, DOI 10.17487/RFC3550,
2899	                                     July 2003, <http://
2900	                                     www.rfc-editor.org/info/rfc3550>.

2902	   [RFC3688]                         Mealling, M., "The IETF XML
2903	                                     Registry", BCP 81, RFC 3688,
2904	                                     DOI 10.17487/RFC3688, January 2004,
2905	                                     <http://www.rfc-editor.org/info/
2906	                                     rfc3688>.

2908	   [RFC4353]                         Rosenberg, J., "A Framework for
2909	                                     Conferencing with the Session
2910	                                     Initiation Protocol (SIP)",
2911	                                     RFC 4353, DOI 10.17487/RFC4353,
2912	                                     February 2006, <http://
2913	                                     www.rfc-editor.org/info/rfc4353>.

2915	   [RFC5117]                         Westerlund, M. and S. Wenger, "RTP
2916	                                     Topologies", RFC 5117,
2917	                                     DOI 10.17487/RFC5117, January 2008,
2918	                                     <http://www.rfc-editor.org/info/
2919	                                     rfc5117>.

2921	   [RFC6351]                         Perreault, S., "xCard: vCard XML
2922	                                     Representation", RFC 6351,
2923	                                     DOI 10.17487/RFC6351, August 2011,
2924	                                     <http://www.rfc-editor.org/info/
2925	                                     rfc6351>.

2927	   [RFC7303]                         Thompson, H. and C. Lilley, "XML
2928	                                     Media Types", RFC 7303,
2929	                                     DOI 10.17487/RFC7303, July 2014, <h
2930	                                     ttp://www.rfc-editor.org/info/
2931	                                     rfc7303>.

2933	Authors' Addresses

2935	   Roberta Presta
2936	   University of Napoli
2937	   Via Claudio 21
2938	   Napoli  80125
2939	   Italy

2941	   EMail: roberta.presta@unina.it

2943	   Simon Pietro Romano
2944	   University of Napoli
2945	   Via Claudio 21
2946	   Napoli  80125
2947	   Italy

2949	   EMail: spromano@unina.it