idnits 2.17.1 

draft-ietf-clue-data-model-schema-07.txt:

  Checking boilerplate required by RFC 5378 and the IETF Trust (see
  https://trustee.ietf.org/license-info):
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/1id-guidelines.txt:
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/checklist :
  ----------------------------------------------------------------------------

  ** There are 48 instances of too long lines in the document, the longest
     one being 18 characters in excess of 72.

  ** The document seems to lack a both a reference to RFC 2119 and the
     recommended RFC 2119 boilerplate, even if it appears to use RFC 2119
     keywords. 

     RFC 2119 keyword, line 180: '....  Capture Scene MAY correspond to a p...'
     RFC 2119 keyword, line 181: '...presence room or MAY focus only on the...'
     RFC 2119 keyword, line 962: '...   MUST be included in spatially defin...'
     RFC 2119 keyword, line 1001: '... line of capture MUST NOT be identical...'
     RFC 2119 keyword, line 1003: '...of capture is provided, it MUST belong...'
     (9 more instances...)


  Miscellaneous warnings:
  ----------------------------------------------------------------------------

  == The copyright year in the IETF Trust and authors Copyright Line does not
     match the current year

  == Line 127 has weird spacing: '...ff with  draft...'

  == Line 2596 has weird spacing: '...ff with  draft...'

  -- The document date (September 29, 2014) is 3497 days in the past.  Is
     this intentional?


  Checking references for intended status: Proposed Standard
  ----------------------------------------------------------------------------

     (See RFCs 3967 and 4897 for information about using normative references
     to lower-maturity documents in RFCs)

  == Unused Reference: 'RFC4796' is defined on line 2634, but no explicit
     reference was found in the text

  == Outdated reference: A later version (-18) exists of
     draft-ietf-clue-datachannel-01

  == Outdated reference: A later version (-25) exists of
     draft-ietf-clue-framework-17

  == Outdated reference: A later version (-19) exists of
     draft-ietf-clue-protocol-01

  == Outdated reference: A later version (-38) exists of
     draft-ietf-ecrit-additional-data-22


     Summary: 2 errors (**), 0 flaws (~~), 8 warnings (==), 1 comment (--).

     Run idnits with the --verbose option for more detailed information about
     the items above.

--------------------------------------------------------------------------------


2	CLUE Working Group                                             R. Presta
3	Internet-Draft                                               S P. Romano
4	Intended status: Standards Track                    University of Napoli
5	Expires: April 2, 2015                                September 29, 2014

7	                 An XML Schema for the CLUE data model
8	                  draft-ietf-clue-data-model-schema-07

10	Abstract

12	   This document provides an XML schema file for the definition of CLUE
13	   data model types.

15	Status of This Memo

17	   This Internet-Draft is submitted in full conformance with the
18	   provisions of BCP 78 and BCP 79.

20	   Internet-Drafts are working documents of the Internet Engineering
21	   Task Force (IETF).  Note that other groups may also distribute
22	   working documents as Internet-Drafts.  The list of current Internet-
23	   Drafts is at http://datatracker.ietf.org/drafts/current/.

25	   Internet-Drafts are draft documents valid for a maximum of six months
26	   and may be updated, replaced, or obsoleted by other documents at any
27	   time.  It is inappropriate to use Internet-Drafts as reference
28	   material or to cite them other than as "work in progress."

30	   This Internet-Draft will expire on April 2, 2015.

32	Copyright Notice

34	   Copyright (c) 2014 IETF Trust and the persons identified as the
35	   document authors.  All rights reserved.

37	   This document is subject to BCP 78 and the IETF Trust's Legal
38	   Provisions Relating to IETF Documents
39	   (http://trustee.ietf.org/license-info) in effect on the date of
40	   publication of this document.  Please review these documents
41	   carefully, as they describe your rights and restrictions with respect
42	   to this document.  Code Components extracted from this document must
43	   include Simplified BSD License text as described in Section 4.e of
44	   the Trust Legal Provisions and are provided without warranty as
45	   described in the Simplified BSD License.

47	Table of Contents
48	   1.  Introduction . . . . . . . . . . . . . . . . . . . . . . . . .  4
49	   2.  Terminology  . . . . . . . . . . . . . . . . . . . . . . . . .  4
50	   3.  XML Schema . . . . . . . . . . . . . . . . . . . . . . . . . .  7
51	   4.  <mediaCaptures>  . . . . . . . . . . . . . . . . . . . . . . . 17
52	   5.  <encodingGroups> . . . . . . . . . . . . . . . . . . . . . . . 18
53	   6.  <captureScenes>  . . . . . . . . . . . . . . . . . . . . . . . 18
54	   7.  <simultaneousSets> . . . . . . . . . . . . . . . . . . . . . . 18
55	   8.  <globalViews>  . . . . . . . . . . . . . . . . . . . . . . . . 18
56	   9.  <captureEncodings> . . . . . . . . . . . . . . . . . . . . . . 18
57	   10. <mediaCapture> . . . . . . . . . . . . . . . . . . . . . . . . 18
58	     10.1.  captureID attribute . . . . . . . . . . . . . . . . . . . 19
59	     10.2.  mediaType attribute . . . . . . . . . . . . . . . . . . . 20
60	     10.3.  <captureSceneIDREF> . . . . . . . . . . . . . . . . . . . 20
61	     10.4.  <encGroupIDREF> . . . . . . . . . . . . . . . . . . . . . 20
62	     10.5.  <spatialInformation>  . . . . . . . . . . . . . . . . . . 20
63	       10.5.1.  <capturePoint>  . . . . . . . . . . . . . . . . . . . 21
64	       10.5.2.  <captureArea> . . . . . . . . . . . . . . . . . . . . 22
65	     10.6.  <nonSpatiallyDefinable> . . . . . . . . . . . . . . . . . 23
66	     10.7.  <content> . . . . . . . . . . . . . . . . . . . . . . . . 23
67	     10.8.  <synchronizationID> . . . . . . . . . . . . . . . . . . . 24
68	     10.9.  <policy>  . . . . . . . . . . . . . . . . . . . . . . . . 24
69	     10.10. <maxCaptures> . . . . . . . . . . . . . . . . . . . . . . 24
70	     10.11. <individual>  . . . . . . . . . . . . . . . . . . . . . . 25
71	     10.12. <description> . . . . . . . . . . . . . . . . . . . . . . 25
72	     10.13. <priority>  . . . . . . . . . . . . . . . . . . . . . . . 26
73	     10.14. <lang>  . . . . . . . . . . . . . . . . . . . . . . . . . 26
74	     10.15. <mobility>  . . . . . . . . . . . . . . . . . . . . . . . 26
75	     10.16. <maxCaptureEncodings> . . . . . . . . . . . . . . . . . . 26
76	     10.17. <relatedTo> . . . . . . . . . . . . . . . . . . . . . . . 26
77	     10.18. <view>  . . . . . . . . . . . . . . . . . . . . . . . . . 26
78	     10.19. <presentation>  . . . . . . . . . . . . . . . . . . . . . 27
79	     10.20. <capturedPeople>  . . . . . . . . . . . . . . . . . . . . 27
80	       10.20.1. <personIDREF> . . . . . . . . . . . . . . . . . . . . 27
81	   11. Audio captures . . . . . . . . . . . . . . . . . . . . . . . . 27
82	     11.1.  <sensitivityPattern>  . . . . . . . . . . . . . . . . . . 28
83	   12. Video captures . . . . . . . . . . . . . . . . . . . . . . . . 28
84	     12.1.  <embeddedText>  . . . . . . . . . . . . . . . . . . . . . 29
85	   13. Text captures  . . . . . . . . . . . . . . . . . . . . . . . . 29
86	   14. Other capture types  . . . . . . . . . . . . . . . . . . . . . 30
87	   15. <captureScene> . . . . . . . . . . . . . . . . . . . . . . . . 30
88	     15.1.  <sceneInformation>  . . . . . . . . . . . . . . . . . . . 31
89	     15.2.  <sceneViews>  . . . . . . . . . . . . . . . . . . . . . . 31
90	     15.3.  sceneID attribute . . . . . . . . . . . . . . . . . . . . 32
91	     15.4.  scale attribute . . . . . . . . . . . . . . . . . . . . . 32
92	   16. <sceneView>  . . . . . . . . . . . . . . . . . . . . . . . . . 32
93	     16.1.  <mediaCaptureIDs> . . . . . . . . . . . . . . . . . . . . 33
94	     16.2.  sceneViewID attribute . . . . . . . . . . . . . . . . . . 33
95	   17. <encodingGroup>  . . . . . . . . . . . . . . . . . . . . . . . 33
96	     17.1.  <maxGroupBandwidth> . . . . . . . . . . . . . . . . . . . 34
97	     17.2.  <encodingIDList>  . . . . . . . . . . . . . . . . . . . . 34
98	     17.3.  encodingGroupID attribute . . . . . . . . . . . . . . . . 34
99	   18. <simultaneousSet>  . . . . . . . . . . . . . . . . . . . . . . 35
100	     18.1.  setID attribute . . . . . . . . . . . . . . . . . . . . . 35
101	     18.2.  mediaType attribute . . . . . . . . . . . . . . . . . . . 35
102	     18.3.  <mediaCaptureIDREF> . . . . . . . . . . . . . . . . . . . 36
103	     18.4.  <sceneViewIDREF>  . . . . . . . . . . . . . . . . . . . . 36
104	     18.5.  <captureSceneIDREF> . . . . . . . . . . . . . . . . . . . 36
105	   19. <globalView> . . . . . . . . . . . . . . . . . . . . . . . . . 36
106	   20. <people> . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
107	     20.1.  <person>  . . . . . . . . . . . . . . . . . . . . . . . . 37
108	       20.1.1.  personID attribute  . . . . . . . . . . . . . . . . . 38
109	       20.1.2.  <personInfo>  . . . . . . . . . . . . . . . . . . . . 38
110	       20.1.3.  <personType>  . . . . . . . . . . . . . . . . . . . . 38
111	   21. <captureEncoding>  . . . . . . . . . . . . . . . . . . . . . . 38
112	     21.1.  <captureID> . . . . . . . . . . . . . . . . . . . . . . . 39
113	     21.2.  <encodingID>  . . . . . . . . . . . . . . . . . . . . . . 39
114	     21.3.  <configuredContent> . . . . . . . . . . . . . . . . . . . 39
115	   22. <clueInfo> . . . . . . . . . . . . . . . . . . . . . . . . . . 39
116	   23. XML Schema extensibility . . . . . . . . . . . . . . . . . . . 40
117	     23.1.  Example of extension  . . . . . . . . . . . . . . . . . . 40
118	   24. Security considerations  . . . . . . . . . . . . . . . . . . . 42
119	   25. IANA considerations  . . . . . . . . . . . . . . . . . . . . . 43
120	     25.1.  XML Schema registration . . . . . . . . . . . . . . . . . 43
121	     25.2.  XML namespace registration  . . . . . . . . . . . . . . . 43
122	   26. Sample XML file  . . . . . . . . . . . . . . . . . . . . . . . 43
123	   27. MCC example  . . . . . . . . . . . . . . . . . . . . . . . . . 50
124	   28. Diff with draft-ietf-clue-data-model-schema-06 version . . . . 56
125	   29. Diff with draft-ietf-clue-data-model-schema-04 version . . . . 57
126	   30. Diff with draft-ietf-clue-data-model-schema-03 version . . . . 58
127	   31. Diff with  draft-ietf-clue-data-model-schema-02 version  . . . 58
128	   32. Informative References . . . . . . . . . . . . . . . . . . . . 58

130	1.  Introduction

132	   This document provides an XML schema file for the definition of CLUE
133	   data model types.

135	   The schema is based on information contained in
136	   [I-D.ietf-clue-framework].  It encodes information and constraints
137	   defined in the aforementioned document in order to provide a formal
138	   representation of the concepts therein presented.  The schema
139	   definition is intended to be modified according to changes applied to
140	   the above mentioned CLUE document.

142	   The document aims at the definition of a coherent structure for all
143	   the information associated with the description of a telepresence
144	   scenario.  Such information is used within the CLUE protocol messages
145	   ([I-D.ietf-clue-protocol]) enabling the dialogue between a Media
146	   Provider and a Media Consumer.  CLUE protocol messages, indeed, are
147	   XML messages allowing (i) a Media Provider to advertise its
148	   telepresence capabilities in terms of media captures, capture scenes,
149	   and other features envisioned in the CLUE framework, according to the
150	   format herein defined and (ii) a Media Consumer to request the
151	   desired telepresence options in the form of capture encodings,
152	   represented as described in this document.

154	2.  Terminology

156	   This document refers to the same terminology used in
157	   [I-D.ietf-clue-framework], except for the "CLUE Participant"
158	   definition (which is still under discussion).  We briefly recall
159	   herein some of the main terms exploited in the document.

161	   Audio Capture:  Media Capture for audio.  Denoted as ACn in the
162	      example cases in this document.

164	   Camera-Left and Right:  For Media Captures, Camera-Left and Camera-
165	      Right are from the point of view of a person observing the
166	      rendered media.  They are the opposite of Stage-Left and Stage-
167	      Right.

169	   Capture:  Same as Media Capture.

171	   Capture Device:  A device that converts audio and video input into an
172	      electrical signal, in most cases to be fed into a media encoder.

174	   Capture Encoding:  A specific encoding of a Media Capture, to be sent
175	      by a Media Provider to a Media Consumer via RTP.

177	   Capture Scene:  An abstraction grouping semantically-coupled Media
178	      Captures available at the Media Provider's side, representing a
179	      precise portion of the local scene that can be transmitted
180	      remotely.  Capture Scene MAY correspond to a part of the
181	      telepresence room or MAY focus only on the presentation media.  A
182	      Capture Scene is characterized by a set of attributes and by a set
183	      of Capture Scene Views.

185	   Capture Scene View:  A list of Media Captures of the same media type
186	      that constitute a possible representation of a Capture Scene.
187	      Media Capture belonging to the same Capture Scene View can be sent
188	      simultaneously by the Media Provider.

190	   CLUE Participant:  This term is not imported from the framework
191	      terminology and should be considered temporary since it is under
192	      review.  We introduced it for the sake of simplicity in order to
193	      identify a generic entity (either an Endpoint or a MCU) making use
194	      of the CLUE protocol.

196	   Consumer:  Same as Media Consumer.

198	   Encoding or Individual Encoding:  The representation of an encoding
199	      technology.  In the CLUE datamodel, for each encoding it is
200	      provided a set of parameters representing the encoding
201	      constraints, like for example the maximum bandwidth of the Media
202	      Provider the encoding can consume. s

204	   Encoding Group:  The representation of a group of encodings.  For
205	      each group, it is provided a set of parameters representing the
206	      constraints to be applied to the group as a whole.  An example is
207	      the maximum bandwidth that can be consumed when using the
208	      contained encodings together simultaneously.

210	   Endpoint  The logical point of final termination through receiving,
211	      decoding and rendering, and/or initiation through capturing,
212	      encoding, and sending of media streams.  An endpoint consists of
213	      one or more physical devices which source and sink media streams,
214	      and exactly one SIP Conferencing Framework Participant (which, in
215	      turn, includes exactly one SIP User Agent).  Endpoints can be
216	      anything from multiscreen/multicamera room controllers to handheld
217	      devices.

219	   MCU:  Multipoint Control Unit (MCU) - a device that connects two or
220	      more endpoints together into one single multimedia conference.  An
221	      MCU may include a Mixer.

223	   Media:  Any data that, after suitable encoding, can be conveyed over
224	      RTP, including audio, video or timed text.

226	   Media Capture:  A "Media Capture", or simply "Capture", is a source
227	      of Media of a single type (i.e., audio or video or text).

229	   Media Stream:  The term "Media Stream", or simply "Stream", is used
230	      as a synonymous of Capture Encoding.

232	   Media Provider:  A CLUE participant (i.e., an Endpoint or a MCU) able
233	      to send Media Streams.

235	   Media Consumer:  A CLUE participant (i.e., an Endpoint or a MCU) able
236	      to receive Media Streams.

238	   Scene:  Same as Capture Scene.

240	   Scene View:  Same as Capture Scene View.

242	   Stream:  Same of Media Stream.

244	   Multiple Content Capture:  A Capture that can contain different Media
245	      Captures of the same media type.  It is denoted as MCC in this
246	      document.  In the Stream resulting from the MCC, the Stream coming
247	      from the encoding of the composing Media Captures can appear
248	      simultaneously, if the MCC is the result of a mixing operation, or
249	      can appear alternatively over the time, according to a certain
250	      switching policy.

252	   Plane of Interest:  The spatial plane containing the most relevant
253	      subject matter.

255	   Provider:  Same as Media Provider.

257	   Render:

259	   Simultaneous Transmission Set:  a set of Media Captures of the same
260	      media type that can be transmitted simultaneously from a Media
261	      Provider.

263	   Single Media Capture:  A Capture representing the Media coming from a
264	      single-source Capture Device.

266	   Spatial Information:  Data about the spatial position of a Capture
267	      Device that generate a Single Media Capture within the context of
268	      a Capture Scene representing a phisical portion of a Telepresence
269	      Room.

271	   Stream Characteristics:  The union of the features used to describe a
272	      Stream in the CLUE environment and in the SIP-SDP environment.

274	   Video Capture:  A Media Capture for video.

276	3.  XML Schema

278	   This section contains the CLUE data model schema definition.

280	   The element and attribute definitions are formal representation of
281	   the concepts needed to describe the capabilities of a Media Provider
282	   and the streams that are requested by a Media Consumer given the
283	   Media Provider's ADVERTISEMENT ([I-D.ietf-clue-protocol]).

285	   The main groups of information are:

287	      <mediaCaptures>: the list of media captures available (Section 4)

289	      <encodingGroups>: the list of encodings groups (Section 5)

291	      <captureScenes>: the list of capture scenes (Section 6)

293	      <simultaneousSets>: the list of simultaneous transmission sets
294	      (Section 7)

296	      <globalViews>: the list of global views sets (Section 8)

298	      <people>: meta data about the participants represented in the
299	      telepresence session (Section 20).

301	      <captureEncodings>: the list of instantiated capture encodings
302	      (Section 9)

304	   All of the above refers to concepts that have been introduced in
305	   [I-D.ietf-clue-framework] and further detailed in the following of
306	   this document.

308	<?xml version="1.0" encoding="UTF-8" ?>
309	<xs:schema
310	   targetNamespace="urn:ietf:params:xml:ns:clue-info"
311	   xmlns:tns="urn:ietf:params:xml:ns:clue-info"
312	   xmlns:xs="http://www.w3.org/2001/XMLSchema"
313	   xmlns="urn:ietf:params:xml:ns:clue-info"
314	   xmlns:xcard="urn:ietf:params:xml:ns:vcard-4.0"
315	   elementFormDefault="qualified"
316	   attributeFormDefault="unqualified"
317	   version="0.7">

319	<!-- Import xcard XML schema -->
320	<xs:import namespace="urn:ietf:params:xml:ns:vcard-4.0"
321	schemaLocation="xcard.xsd"/>

323	<!-- ELEMENT DEFINITIONS -->
324	<xs:element name="mediaCaptures" type="mediaCapturesType"/>
325	<xs:element name="encodingGroups" type="encodingGroupsType"/>
326	<xs:element name="captureScenes" type="captureScenesType"/>
327	<xs:element name="simultaneousSets" type="simultaneousSetsType"/>
328	<xs:element name="globalViews" type="globalViewsType"/>
329	<xs:element name="people" type="peopleType"/>

331	<xs:element name="captureEncodings" type="captureEncodingsType"/>

333	<!-- MEDIA CAPTURES TYPE -->
334	<!-- envelope of media captures -->
335	<xs:complexType name="mediaCapturesType">
336	 <xs:sequence>
337	   <xs:element name="mediaCapture" type="mediaCaptureType"
338	   maxOccurs="unbounded"/>
339	 </xs:sequence>
340	</xs:complexType>

342	<!-- DESCRIPTION element -->
343	<xs:element name="description">
344	 <xs:complexType>
345	  <xs:simpleContent>
346	   <xs:extension base="xs:string">
347	     <xs:attribute name="lang" type="xs:language"/>
348	   </xs:extension>
349	  </xs:simpleContent>
350	 </xs:complexType>
351	</xs:element>

353	<!-- MEDIA CAPTURE TYPE -->
354	<xs:complexType name="mediaCaptureType" abstract="true">
355	  <xs:sequence>
356	    <!-- mandatory fields -->
357	    <xs:element name="captureSceneIDREF" type="xs:IDREF"/>
358	    <xs:choice>
359	      <xs:sequence>
360	        <xs:element name="spatialInformation"
361	                    type="tns:spatialInformationType"/>
362	      </xs:sequence>
363	      <xs:element name="nonSpatiallyDefinable" type="xs:boolean" fixed="true"/>

365	    </xs:choice>
366	    <!-- for handling multi-content captures: -->
367	    <xs:choice>
368	      <xs:sequence>
369	        <xs:element name="synchronizationID" type="xs:ID" minOccurs="0"/>
370	        <xs:element name="content" type="contentType" minOccurs="0"/>
371	        <xs:element name="policy" type="xs:string" minOccurs="0"/>
372	        <xs:element name="maxCaptures" type="maxCapturesType" minOccurs="0"/>
373	      </xs:sequence>
374	      <xs:element name="individual" type="xs:boolean" fixed="true"/>
375	    </xs:choice>
376	    <!-- optional fields -->
377	    <xs:element name="encGroupIDREF" type="xs:IDREF" minOccurs="0"/>
378	    <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
379	    <xs:element name="priority" type="xs:unsignedInt" minOccurs="0"/>
380	    <xs:element name="lang" type="xs:language" minOccurs="0"/>
381	    <xs:element name="mobility" type="mobilityType" minOccurs="0"/>
382	    <xs:element name="presentation" type="presentationType" minOccurs="0"/>
383	    <xs:element name="view" type="viewType" minOccurs="0"/>
384	    <xs:element name="capturedPeople" type="capturedPeopleType" minOccurs="0"/>
385	    <xs:element name="maxCaptureEncodings" type="xs:unsignedInt"
386	    minOccurs="0"/>
387	    <xs:element name="relatedTo" type="xs:IDREF" minOccurs="0"/>
388	  </xs:sequence>
389	  <xs:attribute name="captureID" type="xs:ID" use="required"/>
390	  <xs:attribute name="mediaType" type="xs:string" use="required"/>

392	</xs:complexType>

394	<!-- CONTENT TYPE -->
395	<xs:complexType name="contentType">
396	 <xs:sequence>
397	   <xs:element name="captureIDREF" type="xs:IDREF"
398	   minOccurs="0" maxOccurs="unbounded"/>
399	   <xs:element name="sceneViewIDREF" type="xs:IDREF"
400	   minOccurs="0" maxOccurs="unbounded"/>
401	   <xs:any namespace="##other" processContents="lax" minOccurs="0"
402	   maxOccurs="unbounded"/>
403	 </xs:sequence>
404	 <xs:anyAttribute namespace="##other" processContents="lax"/>
405	</xs:complexType>

407	<!-- MAX CAPTURES TYPE -->
408	<xs:complexType name="maxCapturesType">
409	  <xs:simpleContent>
410	    <xs:extension base="xs:unsignedInt">
411	      <xs:attribute name="exactNumber" type="xs:boolean"/>

413	    </xs:extension>
414	  </xs:simpleContent>
415	</xs:complexType>

417	<!-- CAPTURED PEOPLE TYPE -->
418	<xs:complexType name="capturedPeopleType">
419	<xs:sequence>
420	<xs:element name="personIDREF" type="xs:IDREF" maxOccurs="unbounded"/>
421	</xs:sequence>
422	</xs:complexType>

424	<!-- PEOPLE TYPE -->
425	<xs:complexType name="peopleType">
426	<xs:sequence>
427	<xs:element name="person" type="personType"
428	                    maxOccurs="unbounded"/>
429	</xs:sequence>
430	</xs:complexType>

432	<!-- PERSON TYPE -->
433	<xs:complexType name="personType">
434	  <xs:sequence>
435	     <xs:element name="personInfo" type="xcard:vcardType" maxOccurs="1"
436	                         minOccurs="0"/>
437	     <xs:element name="personType" type="personTypeType"
438	                         minOccurs="0"
439	                 maxOccurs="unbounded"/>
440	     <xs:any namespace="##other" processContents="lax" minOccurs="0"
441	                 maxOccurs="unbounded"/>
442	  </xs:sequence>
443	  <xs:attribute name="personID" type="xs:ID" use="required"/>
444	  <xs:anyAttribute namespace="##other" processContents="lax"/>
445	</xs:complexType>

447	<!-- PERSON TYPE TYPE -->
448	<xs:simpleType name="personTypeType">
449	        <xs:restriction base="xs:string">
450	        <xs:enumeration value="chairman"/>
451	        <xs:enumeration value="vice-chairman"/>
452	        <xs:enumeration value="minute taker"/>
453	        <xs:enumeration value="presenter"/>
454	        <xs:enumeration value="translator"/>
455	        <xs:enumeration value="timekeeper"/>
456	        <xs:enumeration value="attendee"/>
457	        </xs:restriction>
458	</xs:simpleType>
459	<!-- VIEW TYPE -->
460	<xs:simpleType name="viewType">
461	 <xs:restriction base="xs:string">
462	  <xs:enumeration value="room"/>
463	  <xs:enumeration value="table"/>
464	  <xs:enumeration value="lectern"/>
465	  <xs:enumeration value="individual"/>
466	  <xs:enumeration value="audience"/>
467	 </xs:restriction>
468	</xs:simpleType>

470	<!-- PRESENTATION TYPE -->
471	<xs:simpleType name="presentationType">
472	 <xs:restriction base="xs:string">
473	  <xs:enumeration value="slides"/>
474	  <xs:enumeration value="image"/>
475	  <xs:enumeration value=""/>
476	 </xs:restriction>
477	</xs:simpleType>

479	<!-- SPATIAL INFORMATION TYPE -->
480	<xs:complexType name="spatialInformationType">
481	 <xs:sequence>
482	  <xs:element name="capturePoint" type="capturePointType" minOccurs="0"/>
483	  <xs:element name="captureArea" type="captureAreaType" minOccurs="0"/>
484	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
485	  maxOccurs="unbounded"/>
486	 </xs:sequence>
487	 <xs:anyAttribute namespace="##other" processContents="lax"/>
488	</xs:complexType>

490	<!-- MOBILITY TYPE -->
491	<xs:simpleType name="mobilityType">
492	 <xs:restriction base="xs:string">
493	  <xs:enumeration value="static"/>
494	  <xs:enumeration value="dynamic"/>
495	  <xs:enumeration value="highly-dynamic"/>
496	 </xs:restriction>
497	</xs:simpleType>

499	<!-- TEXT CAPTURE TYPE -->
500	<xs:complexType name="textCaptureType">
501	 <xs:complexContent>
502	  <xs:extension base="tns:mediaCaptureType">
503	  <xs:sequence>
504	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
505	    maxOccurs="unbounded"/>
506	  </xs:sequence>
507	  <xs:anyAttribute namespace="##other" processContents="lax"/>
508	  </xs:extension>
509	 </xs:complexContent>
510	</xs:complexType>

512	<!-- OTHER CAPTURE TYPE -->
513	<xs:complexType name="otherCaptureType">
514	 <xs:complexContent>
515	  <xs:extension base="tns:mediaCaptureType">
516	  <xs:sequence>
517	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
518	    maxOccurs="unbounded"/>
519	  </xs:sequence>
520	  </xs:extension>
521	 </xs:complexContent>
522	</xs:complexType>

524	<!-- AUDIO CAPTURE TYPE -->
525	<xs:complexType name="audioCaptureType">
526	 <xs:complexContent>
527	  <xs:extension base="tns:mediaCaptureType">
528	   <xs:sequence>
529	    <xs:element name="sensitivyPattern" type="sensitivyPatternType"
530	    minOccurs="0"/>
531	    <xs:any namespace="##other" processContents="lax" minOccurs="0"
532	    maxOccurs="unbounded"/>
533	   </xs:sequence>
534	   <xs:anyAttribute namespace="##other" processContents="lax"/>
535	  </xs:extension>
536	 </xs:complexContent>
537	</xs:complexType>

539	<!-- SENSITIVITY PATTERN TYPE -->
540	<xs:simpleType name="sensitivyPatternType">
541	 <xs:restriction base="xs:string">
542	  <xs:enumeration value="uni"/>
543	  <xs:enumeration value="shotgun"/>
544	  <xs:enumeration value="omni"/>
545	  <xs:enumeration value="figure8"/>
546	  <xs:enumeration value="cardioid"/>
547	  <xs:enumeration value="hyper-cardioid"/>
548	 </xs:restriction>
549	</xs:simpleType>

551	<!-- VIDEO CAPTURE TYPE -->
552	<xs:complexType name="videoCaptureType">
553	 <xs:complexContent>
554	  <xs:extension base="tns:mediaCaptureType">
555	   <xs:sequence>
556	    <xs:element ref="embeddedText" minOccurs="0"/>
557	    <xs:any namespace="##other" processContents="lax" minOccurs="0"
558	    maxOccurs="unbounded"/>
559	   </xs:sequence>
560	   <xs:anyAttribute namespace="##other" processContents="lax"/>
561	  </xs:extension>
562	 </xs:complexContent>
563	</xs:complexType>

565	<!-- EMBEDDED TEXT ELEMENT -->
566	<xs:element name="embeddedText">
567	 <xs:complexType>
568	  <xs:simpleContent>
569	   <xs:extension base="xs:boolean">
570	    <xs:attribute name="lang" type="xs:language"/>
571	   </xs:extension>
572	  </xs:simpleContent>
573	 </xs:complexType>
574	</xs:element>

576	<!-- CAPTURE SCENES TYPE -->
577	<!-- envelope of capture scenes -->
578	<xs:complexType name="captureScenesType">
579	 <xs:sequence>
580	  <xs:element name="captureScene" type="captureSceneType"
581	  maxOccurs="unbounded"/>
582	 </xs:sequence>
583	</xs:complexType>

585	<!-- CAPTURE SCENE TYPE -->
586	<xs:complexType name="captureSceneType">
587	 <xs:sequence>
588	  <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
589	  <xs:element name="sceneInformation" type="xcard:vcardType" minOccurs="0"/>
590	  <xs:element name="sceneViews" type="sceneViewsType" minOccurs="0"/>
591	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
592	  maxOccurs="unbounded"/>
593	 </xs:sequence>
594	 <xs:attribute name="sceneID" type="xs:ID" use="required"/>
595	 <xs:attribute name="scale" type="scaleType" use="required"/>
596	 <xs:anyAttribute namespace="##other" processContents="lax"/>
597	</xs:complexType>

599	<!-- SCALE TYPE -->
600	<xs:simpleType name="scaleType">
601	 <xs:restriction base="xs:string">
602	  <xs:enumeration value="mm"/>
603	  <xs:enumeration value="unknown"/>
604	  <xs:enumeration value="noscale"/>
605	 </xs:restriction>
606	</xs:simpleType>

608	<!-- CAPTURE AREA TYPE -->
609	<xs:complexType name="captureAreaType">
610	 <xs:sequence>
611	  <xs:element name="bottomLeft" type="pointType"/>
612	  <xs:element name="bottomRight" type="pointType"/>
613	  <xs:element name="topLeft" type="pointType"/>
614	  <xs:element name="topRight" type="pointType"/>
615	 </xs:sequence>
616	</xs:complexType>

618	<!-- POINT TYPE -->
619	<xs:complexType name="pointType">
620	 <xs:sequence>
621	  <xs:element name="x" type="xs:decimal"/>
622	  <xs:element name="y" type="xs:decimal"/>
623	  <xs:element name="z" type="xs:decimal"/>
624	 </xs:sequence>
625	</xs:complexType>

627	<!-- CAPTURE POINT TYPE -->
628	<xs:complexType name="capturePointType">
629	 <xs:complexContent>
630	  <xs:extension base="pointType">
631	   <xs:sequence>
632	    <xs:element name="lineOfCapturePoint" type="tns:pointType"
633	    minOccurs="0"/>
634	   </xs:sequence>
635	   <xs:anyAttribute namespace="##any" processContents="lax"/>
636	  </xs:extension>
637	 </xs:complexContent>
638	</xs:complexType>

640	<!-- SCENE VIEWS TYPE -->
641	<!-- envelope of scene views of a capture scene -->
642	<xs:complexType name="sceneViewsType">
643	 <xs:sequence>
644	  <xs:element name="sceneView" type="sceneViewType"
645	  maxOccurs="unbounded"/>
646	 </xs:sequence>

648	</xs:complexType>

650	<!-- SCENE VIEW TYPE -->
651	<xs:complexType name="sceneViewType">
652	 <xs:sequence>
653	  <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
654	  <xs:element name="mediaCaptureIDs" type="captureIDListType"/>
655	 </xs:sequence>
656	 <xs:attribute name="sceneViewID" type="xs:ID" use="required"/>
657	</xs:complexType>

659	<!-- CAPTURE ID LIST TYPE -->
660	<xs:complexType name="captureIDListType">
661	 <xs:sequence>
662	  <xs:element name="captureIDREF" type="xs:IDREF"
663	  maxOccurs="unbounded"/>
664	 </xs:sequence>
665	</xs:complexType>

667	<!-- ENCODING GROUPS TYPE -->
668	<xs:complexType name="encodingGroupsType">
669	 <xs:sequence>
670	  <xs:element name="encodingGroup" type="tns:encodingGroupType"
671	  maxOccurs="unbounded"/>
672	 </xs:sequence>
673	</xs:complexType>

675	<!-- ENCODING GROUP TYPE -->
676	<xs:complexType name="encodingGroupType">
677	 <xs:sequence>
678	  <xs:element name="maxGroupBandwidth" type="xs:unsignedLong"/>
679	  <xs:element name="encodingIDList" type="encodingIDListType"/>
680	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
681	  maxOccurs="unbounded"/>
682	 </xs:sequence>
683	 <xs:attribute name="encodingGroupID" type="xs:ID" use="required"/>
684	 <xs:anyAttribute namespace="##any" processContents="lax"/>
685	</xs:complexType>

687	<!-- ENCODING ID LIST TYPE -->
688	<xs:complexType name="encodingIDListType">
689	 <xs:sequence>
690	  <xs:element name="encID" type="xs:string" maxOccurs="unbounded"/>
691	 </xs:sequence>
692	</xs:complexType>

694	<!-- SIMULTANEOUS SETS TYPE -->
695	<xs:complexType name="simultaneousSetsType">
696	 <xs:sequence>
697	  <xs:element name="simultaneousSet" type="simultaneousSetType"
698	  maxOccurs="unbounded"/>
699	 </xs:sequence>
700	</xs:complexType>

702	<!-- SIMULTANEOUS SET TYPE -->
703	<xs:complexType name="simultaneousSetType">
704	 <xs:sequence>
705	   <xs:element name="mediaCaptureIDREF" type="xs:IDREF"
706	   minOccurs="0" maxOccurs="unbounded"/>
707	   <xs:element name="sceneViewIDREF" type="xs:IDREF"
708	   minOccurs="0" maxOccurs="unbounded"/>
709	   <xs:element name="captureSceneIDREF" type="xs:IDREF"
710	   minOccurs="0" maxOccurs="unbounded"/>
711	   <xs:any namespace="##other" processContents="lax" minOccurs="0"
712	   maxOccurs="unbounded"/>
713	 </xs:sequence>
714	 <xs:attribute name="setID" type="xs:ID" use="required"/>
715	 <xs:attribute name="mediaType" type="xs:string"/>
716	 <xs:anyAttribute namespace="##any" processContents="lax"/>
717	</xs:complexType>

719	<!-- GLOBAL VIEWS TYPE -->
720	<xs:complexType name="globalViewsType">
721	 <xs:sequence>
722	  <xs:element name="globalView" type="globalViewType"
723	  maxOccurs="unbounded"/>
724	 </xs:sequence>
725	</xs:complexType>

727	<!-- GLOBAL VIEW TYPE -->
728	<xs:complexType name="globalViewType">
729	 <xs:sequence>
730	   <xs:element name="sceneViewIDREF" type="xs:IDREF"
731	    maxOccurs="unbounded"/>
732	   <xs:any namespace="##other" processContents="lax" minOccurs="0"
733	   maxOccurs="unbounded"/>
734	 </xs:sequence>
735	 <xs:attribute name="globalViewID" type="xs:ID"/>
736	 <xs:anyAttribute namespace="##any" processContents="lax"/>
737	</xs:complexType>

739	<!-- CAPTURE ENCODINGS TYPE -->
740	<xs:complexType name="captureEncodingsType">
741	 <xs:sequence>
742	  <xs:element name="captureEncoding" type="captureEncodingType"
743	  maxOccurs="unbounded"/>
744	 </xs:sequence>
745	</xs:complexType>

747	<!-- CAPTURE ENCODING TYPE -->
748	<xs:complexType name="captureEncodingType">
749	 <xs:sequence>
750	  <xs:element name="captureID" type="xs:string"/>
751	  <xs:element name="encodingID" type="xs:string"/>
752	  <xs:element name="configuredContent" type="contentType" minOccurs="0"/>
753	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
754	   maxOccurs="unbounded"/>
755	 </xs:sequence>
756	 <xs:attribute name="ID" type="xs:ID"/>
757	 <xs:anyAttribute namespace="##any" processContents="lax"/>
758	</xs:complexType>

760	<!-- CLUE INFO ELEMENT -->
761	<!-- the <clueInfo> envelope can be seen
762	     as the ancestor of an <advertisement> envelope -->
763	<xs:element name="clueInfo" type="clueInfoType"/>

765	<!-- CLUE INFO TYPE -->
766	<xs:complexType name="clueInfoType">
767	  <xs:sequence>
768	   <xs:element ref="mediaCaptures"/>
769	   <xs:element ref="encodingGroups"/>
770	   <xs:element ref="captureScenes"/>
771	   <xs:element ref="simultaneousSets" minOccurs="0"/>
772	   <xs:element ref="globalViews" minOccurs="0"/>
773	   <xs:element ref="people" minOccurs="0"/>
774	   <xs:any namespace="##other" processContents="lax" minOccurs="0"
775	   maxOccurs="unbounded"/>
776	  </xs:sequence>
777	  <xs:attribute name="clueInfoID" type="xs:ID" use="required"/>
778	  <xs:anyAttribute namespace="##other" processContents="lax"/>
779	</xs:complexType>
780	</xs:schema>

782	   Following sections describe the XML schema in more detail.

784	4.  <mediaCaptures>

786	   <mediaCaptures> represents the list of one ore more media captures
787	   available on the Media Provider's side.  Each media capture is
788	   represented by a <mediaCapture> element (Section 10).

790	5.  <encodingGroups>

792	   <encodingGroups> represents the list of the encoding groups organized
793	   on the Media Provider's side.  Each encoding group is represented by
794	   a <encodingGroup> element (Section 17).

796	6.  <captureScenes>

798	   <captureScenes> represents the list of the capture scenes organized
799	   on the Media Provider's side.  Each capture scene is represented by a
800	   <captureScene> element.  (Section 15).

802	7.  <simultaneousSets>

804	   <simultaneousSets> contains the simultaneous sets indicated by the
805	   Media Provider.  Each simultaneous set is represented by a
806	   <simultaneousSet> element.  (Section 18).

808	8.  <globalViews>

810	   <globalViews> contains a set of alternative representations of all
811	   the scenes that are offered by a Media Provider to a Media Consumer.
812	   Each alternative is named "global view" and it is represented by a
813	   <globalView> element.  (Section 19).

815	9.  <captureEncodings>

817	   <captureEncodings> is a list of capture encodings.  It can represent
818	   the list of the desired capture encodings indicated by the Media
819	   Consumer or the list of instantiated captures on the provider's side.
820	   Each capture encoding is represented by a <captureEncoding> element.
821	   (Section 21).

823	10.  <mediaCapture>

825	   According to the CLUE framework, a media capture is the fundamental
826	   representation of a media flow that is available on the provider's
827	   side.  Media captures are characterized (i) by a set of features that
828	   are independent from the specific type of medium, and (ii) by a set
829	   of features that are media-specific.  The features that are common to
830	   all media types appear within the media capture type, that has been
831	   designed as an abstract complex type.  Media-specific captures, such
832	   as video captures, audio captures and others, are specialization of
833	   that abstract media capture type, as in a typical generalization-
834	   specialization hierarchy.

836	   The following is the XML Schema definition of the media capture type:

838	<!-- MEDIA CAPTURE TYPE -->
839	<xs:complexType name="mediaCaptureType" abstract="true">
840	  <xs:sequence>
841	    <!-- mandatory fields -->
842	    <xs:element name="captureSceneIDREF" type="xs:IDREF"/>
843	    <xs:choice>
844	      <xs:sequence>
845	        <xs:element name="spatialInformation"
846	                    type="tns:spatialInformationType"/>
847	      </xs:sequence>
848	      <xs:element name="nonSpatiallyDefinable" type="xs:boolean" fixed="true"/>
849	    </xs:choice>
850	    <!-- for handling multi-content captures: -->
851	    <xs:choice>
852	      <xs:sequence>
853	        <xs:element name="synchronizationID" type="xs:ID" minOccurs="0"/>
854	        <xs:element name="content" type="contentType" minOccurs="0"/>
855	        <xs:element name="policy" type="xs:string" minOccurs="0"/>
856	        <xs:element name="maxCaptures" type="maxCapturesType" minOccurs="0"/>
857	      </xs:sequence>
858	      <xs:element name="individual" type="xs:boolean" fixed="true"/>
859	    </xs:choice>
860	    <!-- optional fields -->
861	    <xs:element name="encGroupIDREF" type="xs:IDREF" minOccurs="0"/>
862	    <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
863	    <xs:element name="priority" type="xs:unsignedInt" minOccurs="0"/>
864	    <xs:element name="lang" type="xs:language" minOccurs="0"/>
865	    <xs:element name="mobility" type="mobilityType" minOccurs="0"/>
866	    <xs:element name="presentation" type="presentationType" minOccurs="0"/>
867	    <xs:element name="view" type="viewType" minOccurs="0"/>
868	    <xs:element name="capturedPeople" type="capturedPeopleType" minOccurs="0"/>
869	    <xs:element name="maxCaptureEncodings" type="xs:unsignedInt"
870	    minOccurs="0"/>
871	    <xs:element name="relatedTo" type="xs:IDREF" minOccurs="0"/>
872	  </xs:sequence>
873	  <xs:attribute name="captureID" type="xs:ID" use="required"/>
874	  <xs:attribute name="mediaType" type="xs:string" use="required"/>

876	</xs:complexType>

878	10.1.  captureID attribute

880	   The "captureID" attribute is a mandatory field containing the
881	   identifier of the media capture.

883	10.2.  mediaType attribute

885	   The "mediaType" attribute is a mandatory attribute specifying the
886	   media type of the capture ("audio", "video", "text",...).

888	10.3.  <captureSceneIDREF>

890	   <captureSceneIDREF> is a mandatory field containing the identifier of
891	   the capture scene the media capture is defined in.  Indeed, each
892	   media capture must be defined within one and only one capture scene.
893	   When a media capture is spatially definable, some spatial information
894	   is provided along with it in the form of point coordinates (see
895	   Section 10.5).  Such coordinates refers to the space of coordinates
896	   defined for the capture scene containing the capture.

898	10.4.  <encGroupIDREF>

900	   <encGroupIDREF> is an optional field containing the identifier of the
901	   encoding group the media capture is associated with.  Media captures
902	   that are not associated with any encoding group can not be
903	   instantiated as media streams.

905	10.5.  <spatialInformation>

907	   Media captures are divided into two categories: (i) non spatially
908	   definable captures and (ii) spatially definable captures.

910	   Captures are spatially definable when at least (i) it is possible to
911	   provide the coordinates of the device position within the
912	   telepresence room of origin (capture point) together with its
913	   capturing direction specified by a second point (point on line of
914	   capture), or (ii) it is possible to provide the represented area
915	   within the telepresence room, by listing the coordinates of the four
916	   co-planar points identifying the plane of interest (area of capture).
917	   The coordinates of the abovementioned points must be expressed
918	   according to the coordinate space of the capture scene the media
919	   captures belongs to.

921	   Non spatially definable captures cannot be characterized within the
922	   physical space of the telepresence room of origin.  Capture of this
923	   kind are for example those related to recordings, text captures,
924	   DVDs, registered presentation, or external streams that are played in
925	   the telepresence room and transmitted to remote sites.

927	   Spatially definable captures represent a part of the telepresence
928	   room.  The captured part of the telepresence room is described by
929	   means of the <spatialInformation> element.  By comparing the
930	   <spatialInformation> element of different media captures within the
931	   same capture scene, a consumer can better determine the spatial
932	   relationships between them and render them correctly.  Non spatially
933	   definable captures do not show in their XML description such element:
934	   they are instead characterized by having the <nonSpatiallyDefinable>
935	   tag set to "true" (see Section 10.6).

937	   The definition of the spatial information type is the following:

939	<!-- SPATIAL INFORMATION TYPE -->
940	<xs:complexType name="spatialInformationType">
941	 <xs:sequence>
942	  <xs:element name="capturePoint" type="capturePointType" minOccurs="0"/>
943	  <xs:element name="captureArea" type="captureAreaType" minOccurs="0"/>
944	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
945	  maxOccurs="unbounded"/>
946	 </xs:sequence>
947	 <xs:anyAttribute namespace="##other" processContents="lax"/>
948	</xs:complexType>

950	   The <capturePoint> contains the coordinates of the capture device
951	   that is taking the capture, as well as, optionally, the pointing
952	   direction (see Section 10.5.1).

954	   The <captureArea> is an optional field containing four points
955	   defining the captured area covered by the capture (see
956	   Section 10.5.2).

958	10.5.1.  <capturePoint>

960	   The <capturePoint> element is used to represent the position and
961	   optionally the line of capture of a capture device. <capturePoint>
962	   MUST be included in spatially definable audio captures, while is
963	   optional for spatially definable video captures.

965	   The XML Schema definition of the <capturePoint> element type is the
966	   following:

968	   <!-- POINT TYPE -->
969	   <xs:complexType name="pointType">
970	    <xs:sequence>
971	     <xs:element name="x" type="xs:decimal"/>
972	     <xs:element name="y" type="xs:decimal"/>
973	     <xs:element name="z" type="xs:decimal"/>
974	    </xs:sequence>
975	   </xs:complexType>

977	   <!-- CAPTURE POINT TYPE -->
978	   <xs:complexType name="capturePointType">
979	    <xs:complexContent>
980	     <xs:extension base="pointType">
981	      <xs:sequence>
982	       <xs:element name="lineOfCapturePoint" type="tns:pointType"
983	       minOccurs="0"/>
984	      </xs:sequence>
985	      <xs:anyAttribute namespace="##any" processContents="lax"/>
986	     </xs:extension>
987	    </xs:complexContent>
988	   </xs:complexType>

990	   The point type contains three spatial coordinates (x,y,z)
991	   representing a point in the space associated with a certain capture
992	   scene.

994	   The capture point type extends the point type, i.e., it is
995	   represented by three coordinates identifying the position of the
996	   capture device, but can add further information.  Such further
997	   information is conveyed by the <lineOfCapturePoint>, which is another
998	   point-type element representing the "point on line of capture", that
999	   gives the pointing direction of the capture device.

1001	   The coordinates of the point on line of capture MUST NOT be identical
1002	   to the capture point coordinates.  For a spatially definable video
1003	   capture, if the point on line of capture is provided, it MUST belong
1004	   to the region between the point of capture and the capture area.  For
1005	   a spatially definable audio capture, if the point on line of capture
1006	   is not provided, the sensitivity pattern should be considered
1007	   omnidirectional.

1009	10.5.2.  <captureArea>

1011	   <captureArea> is an optional element that can be contained within the
1012	   spatial information associated with a media capture.  It represents
1013	   the spatial area captured by the media capture. <captureArea> MUST be
1014	   included in the spatial information of spatially definable video
1015	   capture, while MUST NOT be associated to audio captures.

1017	   The XML representation of that area is provided through a set of four
1018	   point-type element, <bottomLeft>, <bottomRight>, <topLeft>, and
1019	   <topRight>, as it can be seen from the following definition:

1021	   <!-- CAPTURE AREA TYPE -->
1022	   <xs:complexType name="captureAreaType">
1023	    <xs:sequence>
1024	     <xs:element name="bottomLeft" type="pointType"/>
1025	     <xs:element name="bottomRight" type="pointType"/>
1026	     <xs:element name="topLeft" type="pointType"/>
1027	     <xs:element name="topRight" type="pointType"/>
1028	    </xs:sequence>
1029	   </xs:complexType>

1031	   <bottomLeft>, <bottomRight>, <topLeft>, and <topRight> MUST be co-
1032	   planar.

1034	10.6.  <nonSpatiallyDefinable>

1036	   When media captures are non spatially definable, they are marked with
1037	   the boolean <nonSpatiallyDefinable> element set to "true" and no
1038	   <spatialInformation> is provided.  Indeed, <nonSpatiallyDefinable>
1039	   and <spatialInformation> are mutually exclusive tag, according to the
1040	   <choice> section within the XML Schema definition of the media
1041	   capture type.

1043	10.7.  <content>

1045	   A media capture can be (i) an individual media capture or (ii) a
1046	   multiple content capture (MCC).  A multiple content capture is made
1047	   by different captures that can be arranged spatially (by a
1048	   composition operation), or temporally (by a switching operation), or
1049	   that can result from the orchestration of both the techniques.  If a
1050	   media capture is a MCC, then it can show in its XML data model
1051	   representation the <content> element.  It is composed by a list of
1052	   media capture identifiers ("captureIDREF") and capture scene view
1053	   identifiers ("sceneViewIDREF"), where the last ones are used as
1054	   shortcuts to refer to multiple capture identifiers.  The referenced
1055	   captures are used to create the MCC according to a certain strategy.
1056	   If the <content> element does not appear in a MCC, or it has no child
1057	   elements, then the MCC is assumed to be made by multiple sources but
1058	   no information regarding those sources is given.

1060	   <!-- CONTENT TYPE -->
1061	   <xs:complexType name="contentType">
1062	    <xs:sequence>
1063	      <xs:element name="captureIDREF" type="xs:IDREF"
1064	      minOccurs="0" maxOccurs="unbounded"/>
1065	      <xs:element name="sceneViewIDREF" type="xs:IDREF"
1066	      minOccurs="0" maxOccurs="unbounded"/>
1067	      <xs:any namespace="##other" processContents="lax" minOccurs="0"
1068	      maxOccurs="unbounded"/>
1069	    </xs:sequence>
1070	    <xs:anyAttribute namespace="##other" processContents="lax"/>
1071	   </xs:complexType>

1073	10.8.  <synchronizationID>

1075	   <synchronizationID> is an optional element for multiple content
1076	   captures that contains a numeric identifier.  Multiple content
1077	   captures marked with the same identifier in the <synchronizationID>
1078	   contain at each time captures coming from the same source.  It is the
1079	   MP that determines what the source for the captures is.  By this way,
1080	   the MP can choose how to group together single captures for the
1081	   purpose of keeping them synchronized according to the
1082	   SynchronisationID attribute.

1084	10.9.  <policy>

1086	   <policy> is an optional element that can be used only for multiple
1087	   content captures.  It indicates the criteria applied to build the
1088	   multiple content capture using the media captures referenced in
1089	   <contentCaptureIDs>.  Such element can assume a list of pre-defined
1090	   values ([todo]).

1092	10.10.  <maxCaptures>

1094	   <maxCaptures> is an optional element that can be used only for
1095	   multiple content captures.  It provides information about the number
1096	   of media captures that can be represented in the multiple content
1097	   capture at a time.  The type definition is provided below.

1099	   <!-- MAX CAPTURES TYPE -->
1100	   <xs:complexType name="maxCapturesType">
1101	     <xs:simpleContent>
1102	       <xs:extension base="xs:unsignedInt">
1103	         <xs:attribute name="exactNumber" type="xs:boolean"/>
1104	       </xs:extension>

1106	     </xs:simpleContent>
1107	   </xs:complexType>

1109	   When the "exactNumber" attribute is set to "1", it means the
1110	   <maxCaptures> element carries the exact number of the media captures
1111	   appearing at a time.  Otherwise, the number of the represented media
1112	   captures MUST be considered "<=" of the <maxCaptures> value.

1114	10.11.  <individual>

1116	   <individual> is a boolean element that MUST be used for single-
1117	   content captures.  Its value is fixed and set to "true".  Such
1118	   element indicates the capture that is being described is not a
1119	   multiple content capture.  Indeed, <individual> and the
1120	   aforementioned tags related to MCC attributes (from Section 10.7 to
1121	   Section 10.10) are mutually exclusive, according to the <choice>
1122	   section within the XML Schema definition of the media capture type.

1124	10.12.  <description>

1126	   <description> is used to provide optionally human-readable textual
1127	   information about a media capture.  The same element is exploited to
1128	   describe, besides media captures, capture scenes and capture scene
1129	   views, as it is included in their XML representation.  A media
1130	   capture can be described by using multiple <description> elements,
1131	   each one providing information in a different language.  The
1132	   <description> element definition is the following:

1134	   <!-- DESCRIPTION element -->
1135	   <xs:element name="description">
1136	    <xs:complexType>
1137	     <xs:simpleContent>
1138	      <xs:extension base="xs:string">
1139	        <xs:attribute name="lang" type="xs:language"/>
1140	      </xs:extension>
1141	     </xs:simpleContent>
1142	    </xs:complexType>
1143	   </xs:element>

1145	   As it can be seen, <description> is a string element with an
1146	   attribute ("lang") indicating the language used in the textual
1147	   description.

1149	10.13.  <priority>

1151	   <priority> is an optional unsigned integer field indicating the
1152	   importance of a media capture according to the Media Provider's
1153	   perspective.  It can be used on the receiver's side to automatically
1154	   identify the most relevant contribution from the Media Provider.  The
1155	   higher the importance, the lower the contained value.  When media
1156	   captures are marked with a "0" priority value, it means that they are
1157	   "not subject to priority".

1159	10.14.  <lang>

1161	   <lang> is an optional element containing the language used in the
1162	   capture, if any.

1164	10.15.  <mobility>

1166	   <mobility> is an optional element indicating whether or not the
1167	   capture device originating the capture may move during the
1168	   telepresence session.  That optional element can assume one of the
1169	   three following values: (i) static, (ii) dynamic or (iii) highly
1170	   dynamic.

1172	10.16.  <maxCaptureEncodings>

1174	   The optional <maxCaptureEncodings> contains an unsigned integer
1175	   indicating the maximum number of capture encodings that can be
1176	   simultaneously active for the media capture.  If absent, this
1177	   parameter defaults to 1.  The minimum value for this attribute is 1.
1178	   The number of simultaneous capture encodings is also limited by the
1179	   restrictions of the encoding group the media capture refers to by
1180	   means of the <encGroupIDREF> element.

1182	10.17.  <relatedTo>

1184	   The optional <relatedTo> element contains the value of the ID
1185	   attribute of the media capture it refers to.  The media capture
1186	   marked with a <relatedTo> element can be for example the translation
1187	   of a main media capture in a different language.

1189	10.18.  <view>

1191	   The <view> element is an optional tag describing what is represented
1192	   in the spatial area covered by a media capture.  The current possible
1193	   values are: "table", "lectern", "individual", and "audience", as
1194	   listed in the enumerative view type in the following.

1196	10.19.  <presentation>

1198	   The <presentation> element is an optional tag used for media captures
1199	   conveing information about presentations within the telepresence
1200	   session.  The current possible values are "slides" and "images", as
1201	   listed in the enumerative presentation type in the following.

1203	10.20.  <capturedPeople>

1205	   This optional element is used to indicate which telepresence session
1206	   participants are represented within the media captures.  For each
1207	   participant, a <personIDREF> element is provided.

1209	10.20.1.  <personIDREF>

1211	   <personIDREF> contains the identifier of the represented person.
1212	   Metadata about the represented participant can be retrieved by
1213	   accessing the <people> list (Section 20).

1215	11.  Audio captures

1217	   Audio captures inherit all the features of a generic media capture
1218	   and present further audio-specific characteristics.  The XML Schema
1219	   definition of the audio capture type is reported below:

1221	   <!-- AUDIO CAPTURE TYPE -->
1222	   <xs:complexType name="audioCaptureType">
1223	    <xs:complexContent>
1224	     <xs:extension base="tns:mediaCaptureType">
1225	      <xs:sequence>
1226	       <xs:element name="sensitivyPattern" type="sensitivyPatternType"
1227	       minOccurs="0"/>
1228	       <xs:any namespace="##other" processContents="lax" minOccurs="0"
1229	       maxOccurs="unbounded"/>
1230	      </xs:sequence>
1231	      <xs:anyAttribute namespace="##other" processContents="lax"/>
1232	     </xs:extension>
1233	    </xs:complexContent>
1234	   </xs:complexType>

1236	   An example of audio-specific information that can be included is
1237	   represented by the <sensitivyPattern> element.  (Section 11.1).

1239	11.1.  <sensitivityPattern>

1241	   The <sensitivityPattern> element is an optional field describing the
1242	   characteristic of the nominal sensitivity pattern of the microphone
1243	   capturing the audio signal.

1245	   The XML Schema definition is provided below:

1247	   <xs:simpleType name="sensitivyPatternType">
1248	    <xs:restriction base="xs:string">
1249	     <xs:enumeration value="uni"/>
1250	     <xs:enumeration value="shotgun"/>
1251	     <xs:enumeration value="omni"/>
1252	     <xs:enumeration value="figure8"/>
1253	     <xs:enumeration value="cardioid"/>
1254	     <xs:enumeration value="hyper-cardioid"/>
1255	    </xs:restriction>
1256	   </xs:simpleType>

1258	12.  Video captures

1260	   Video captures, similarly to audio captures, extend the information
1261	   of a generic media capture with video-specific features, such as
1262	   <embeddedText> (Section 12.1).

1264	   The XML Schema representation of the video capture type is provided
1265	   in the following:

1267	   <!-- VIDEO CAPTURE TYPE -->
1268	   <xs:complexType name="videoCaptureType">
1269	    <xs:complexContent>
1270	     <xs:extension base="tns:mediaCaptureType">
1271	      <xs:sequence>
1272	       <xs:element ref="embeddedText" minOccurs="0"/>
1273	       <xs:any namespace="##other" processContents="lax" minOccurs="0"
1274	       maxOccurs="unbounded"/>
1275	      </xs:sequence>
1276	      <xs:anyAttribute namespace="##other" processContents="lax"/>
1277	     </xs:extension>
1278	    </xs:complexContent>
1279	   </xs:complexType>

1281	12.1.  <embeddedText>

1283	   The <embeddedText> element is a boolean element indicating that there
1284	   is text embedded in the video capture.  The language used in such
1285	   embedded textual description is reported in <embeddedText> "lang"
1286	   attribute.

1288	   The XML Schema definition of the <embeddedText> element is:

1290	   <!-- EMBEDDED TEXT ELEMENT -->
1291	   <xs:element name="embeddedText">
1292	    <xs:complexType>
1293	     <xs:simpleContent>
1294	      <xs:extension base="xs:boolean">
1295	       <xs:attribute name="lang" type="xs:language"/>
1296	      </xs:extension>
1297	     </xs:simpleContent>
1298	    </xs:complexType>
1299	   </xs:element>

1301	13.  Text captures

1303	   Also text captures can be described by extending the generic media
1304	   capture information, similarly to audio captures and video captures.

1306	   The XML Schema representation of the text capture type is currently
1307	   lacking text-specific information, as it can be seen by looking at
1308	   the definition below:

1310	   <!-- TEXT CAPTURE TYPE -->
1311	   <xs:complexType name="textCaptureType">
1312	    <xs:complexContent>
1313	     <xs:extension base="tns:mediaCaptureType">
1314	     <xs:sequence>
1315	     <xs:any namespace="##other" processContents="lax" minOccurs="0"
1316	       maxOccurs="unbounded"/>
1317	     </xs:sequence>
1318	     <xs:anyAttribute namespace="##other" processContents="lax"/>
1319	     </xs:extension>
1320	    </xs:complexContent>
1321	   </xs:complexType>

1323	14.  Other capture types

1325	   Other media capture types can be described by using the CLUE data
1326	   model.  They can be represented by exploiting "otherCaptureType"
1327	   type.  This media capture type is conceived to be filled with
1328	   elements defined within extensions of the current schema, i.e., with
1329	   elements defined in other XML schemas (see Section 23 for an
1330	   example).  The otherCaptureType inherits all the features envisioned
1331	   for the abstract mediaCaptureType.

1333	   The XML Schema representation of the otherCaptureType is the
1334	   following:

1336	   <!-- OTHER CAPTURE TYPE -->
1337	   <xs:complexType name="otherCaptureType">
1338	    <xs:complexContent>
1339	     <xs:extension base="tns:mediaCaptureType">
1340	     <xs:sequence>
1341	     <xs:any namespace="##other" processContents="lax" minOccurs="0"
1342	       maxOccurs="unbounded"/>
1343	     </xs:sequence>
1344	     <xs:anyAttribute namespace="##other" processContents="lax"/>
1345	     </xs:extension>
1346	    </xs:complexContent>
1347	   </xs:complexType>

1349	15.  <captureScene>

1351	   A Media Provider organizes the available capture in capture scenes in
1352	   order to help the receiver both in the rendering and in the selection
1353	   of the group of captures.  Capture scenes are made of media captures
1354	   and capture scene views, that are set of media captures of the same
1355	   media type.  Each capture scene view is an alternative to represent
1356	   completely a capture scene for a fixed media type.

1358	   The XML Schema representation of a <captureScene> element is the
1359	   following:

1361	<!-- CAPTURE SCENE TYPE -->
1362	<xs:complexType name="captureSceneType">
1363	 <xs:sequence>
1364	  <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
1365	  <xs:element name="sceneInformation" type="xcard:vcardType" minOccurs="0"/>
1366	  <xs:element name="sceneViews" type="sceneViewsType" minOccurs="0"/>
1367	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
1368	  maxOccurs="unbounded"/>
1369	 </xs:sequence>
1370	 <xs:attribute name="sceneID" type="xs:ID" use="required"/>
1371	 <xs:attribute name="scale" type="scaleType" use="required"/>
1372	 <xs:anyAttribute namespace="##other" processContents="lax"/>
1373	</xs:complexType>

1375	   Each capture scene is identified by a "sceneID" attribute.  The
1376	   <captureScene> element can contain zero or more textual <description>
1377	   elements, defined as in Section 10.12.  Besides <description>, there
1378	   is the optional <sceneInformation> element (Section 15.1), which
1379	   contains structured information about the scene in the vcard format,
1380	   and the optional <sceneViews> element (Section 15.2), which is the
1381	   list of the capture scene views.  When no <sceneViews> is provided,
1382	   the capture scene is assumed to be made by all the media captures
1383	   showing the value of its sceneID attribute in their mandatory
1384	   captureSceneIDREF attribute.

1386	15.1.  <sceneInformation>

1388	   The <sceneInformation> element contains optional information about
1389	   the capture scene according to the vcard format.

1391	15.2.  <sceneViews>

1393	   The <sceneViews> element is a mandatory field of a capture scene
1394	   containing the list of scene views.  Each scene view is represented
1395	   by a <sceneView> element (Section 16).

1397	   <!-- SCENE VIEWS TYPE -->
1398	   <!-- envelope of scene views of a capture scene -->
1399	   <xs:complexType name="sceneViewsType">
1400	    <xs:sequence>
1401	     <xs:element name="sceneView" type="sceneViewType"
1402	     maxOccurs="unbounded"/>
1403	    </xs:sequence>
1404	   </xs:complexType>

1406	15.3.  sceneID attribute

1408	   The sceneID attribute is a mandatory attribute containing the
1409	   identifier of the capture scene.

1411	15.4.  scale attribute

1413	   The scale attribute is a mandatory attribute that specifies the scale
1414	   of the coordinates provided in the spatial information of the media
1415	   capture belonging to the considered capture scene.  The scale
1416	   attribute can assume three different values:

1418	      "mm" - the scale is in millimeters.  Systems which know their
1419	      physical dimensions (for example professionally installed
1420	      telepresence room systems) should always provide those real-world
1421	      measurements.

1423	      "unknown" - the scale is not necessarily millimeters, but the
1424	      scale is the same for every media capture in the capture scene.
1425	      Systems which don't know specific physical dimensions but still
1426	      know relative distances should select "unknown" in the scale
1427	      attribute of the capture scene to be described.

1429	      "noscale" - there is no a common physical scale among the media
1430	      captures of the capture scene.  That means the scale could be
1431	      different for each media capture.

1433	   <!-- SCALE TYPE -->
1434	   <xs:simpleType name="scaleType">
1435	    <xs:restriction base="xs:string">
1436	     <xs:enumeration value="mm"/>
1437	     <xs:enumeration value="unknown"/>
1438	     <xs:enumeration value="noscale"/>
1439	    </xs:restriction>
1440	   </xs:simpleType>

1442	16.  <sceneView>

1444	   A <sceneView> element represents a capture scene view, which contains
1445	   a set of media capture of the same media type describing a capture
1446	   scene.

1448	   A <sceneView> element is characterized as follows.

1450	   <!-- SCENE VIEW TYPE -->
1451	   <xs:complexType name="sceneViewType">
1452	    <xs:sequence>
1453	     <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
1454	     <xs:element name="mediaCaptureIDs" type="captureIDListType"/>
1455	    </xs:sequence>
1456	    <xs:attribute name="sceneViewID" type="xs:ID" use="required"/>
1457	   </xs:complexType>

1459	   One or more optional <description> elements provide human-readable
1460	   information about what the scene view contains. <description> is
1461	   defined as already seen in Section 10.12.

1463	   The remaining child elements are described in the following
1464	   subsections.

1466	16.1.  <mediaCaptureIDs>

1468	   The <mediaCaptureIDs> is the list of the identifiers of the media
1469	   captures included in the scene view.  It is an element of the
1470	   captureIDListType type, which is defined as a sequence of
1471	   <captureIDREF> each one containing the identifier of a media capture
1472	   listed within the <mediaCaptures> element:

1474	   <!-- CAPTURE ID LIST TYPE -->
1475	   <xs:complexType name="captureIDListType">
1476	    <xs:sequence>
1477	     <xs:element name="captureIDREF" type="xs:IDREF"
1478	     maxOccurs="unbounded"/>
1479	    </xs:sequence>
1480	   </xs:complexType>

1482	16.2.  sceneViewID attribute

1484	   The sceneViewID attribute is a mandatory attribute containing the
1485	   identifier of the capture scene view represented by the <sceneView>
1486	   element.

1488	17.  <encodingGroup>

1490	   The <encodingGroup> element represents an encoding group, which is
1491	   made by a set of one or more individual encodings and some parameters
1492	   that apply to the group as a whole.  Encoding groups contain
1493	   references to individual encodings that can be applied to media
1494	   captures.  The definition of the <encodingGroup> element is the
1495	   following:

1497	   <!-- ENCODING GROUP TYPE -->
1498	   <xs:complexType name="encodingGroupType">
1499	    <xs:sequence>
1500	     <xs:element name="maxGroupBandwidth" type="xs:unsignedLong"/>
1501	     <xs:element name="encodingIDList" type="encodingIDListType"/>
1502	     <xs:any namespace="##other" processContents="lax" minOccurs="0"
1503	     maxOccurs="unbounded"/>
1504	    </xs:sequence>
1505	    <xs:attribute name="encodingGroupID" type="xs:ID" use="required"/>
1506	    <xs:anyAttribute namespace="##any" processContents="lax"/>
1507	   </xs:complexType>

1509	   In the following, the contained elements are further described.

1511	17.1.  <maxGroupBandwidth>

1513	   <maxGroupBandwidth> is an optional field containing the maximum
1514	   bitrate expressed in bit per second that can be shared by the
1515	   individual encodings included in the encoding group.

1517	17.2.  <encodingIDList>

1519	   <encodingIDList> is the list of the individual encoding grouped
1520	   together in the encoding group.  Each individual encoding is
1521	   represented through its identifier contained within an <encID>
1522	   element.

1524	   <!-- ENCODING ID LIST TYPE -->
1525	   <xs:complexType name="encodingIDListType">
1526	    <xs:sequence>
1527	     <xs:element name="encID" type="xs:IDREF" maxOccurs="unbounded"/>
1528	    </xs:sequence>
1529	   </xs:complexType>

1531	17.3.  encodingGroupID attribute

1533	   The encodingGroupID attribute contains the identifier of the encoding
1534	   group.

1536	18.  <simultaneousSet>

1538	   <simultaneousSet> represents a simultaneous transmission set, i.e., a
1539	   list of captures of the same media type that can be transmitted at
1540	   the same time by a Media Provider.  There are different simultaneous
1541	   transmission sets for each media type.

1543	   <!-- SIMULTANEOUS SET TYPE -->
1544	   <xs:complexType name="simultaneousSetType">
1545	    <xs:sequence>
1546	      <xs:element name="mediaCaptureIDREF" type="xs:IDREF"
1547	      minOccurs="0" maxOccurs="unbounded"/>
1548	      <xs:element name="sceneViewIDREF" type="xs:IDREF"
1549	      minOccurs="0" maxOccurs="unbounded"/>
1550	      <xs:element name="captureSceneIDREF" type="xs:IDREF"
1551	      minOccurs="0" maxOccurs="unbounded"/>
1552	      <xs:any namespace="##other" processContents="lax" minOccurs="0"
1553	      maxOccurs="unbounded"/>
1554	    </xs:sequence>
1555	    <xs:attribute name="setID" type="xs:ID" use="required"/>
1556	    <xs:attribute name="mediaType" type="xs:string"/>
1557	    <xs:anyAttribute namespace="##any" processContents="lax"/>
1558	   </xs:complexType>

1560	   Besides the identifiers of the captures (<mediaCaptureIDREF>
1561	   elements), also the identifiers of capture scene views and of capture
1562	   scene can be exploited, as shortcuts (<sceneViewIDREF> and
1563	   <captureSceneIDREF> elements).

1565	18.1.  setID attribute

1567	   The "setID" attribute is a mandatory field containing the identifier
1568	   of the simultaneous set.

1570	   When only capture scene identifiers are listed within a simultaneous
1571	   set, the media type attribute MUST be used in order to determine
1572	   which media captures can be simultaneously sent together.

1574	18.2.  mediaType attribute

1576	   The "mediaType" attribute is an optional attribute containing the
1577	   media type of the captures referenced by the simultaneous set.

1579	   When only capture scene identifiers are listed within a simultaneous
1580	   set, the media type attribute MUST appear in the XML description in
1581	   order to determine which media captures can be simultaneously sent
1582	   together.

1584	18.3.  <mediaCaptureIDREF>

1586	   <mediaCaptureIDREF> contains the identifier of the media capture that
1587	   belongs to the simultanous set.

1589	18.4.  <sceneViewIDREF>

1591	   <sceneViewIDREF> contains the identifier of the scene view containing
1592	   a group of capture that are able to be sent simultaneously with the
1593	   other captures of the simultaneous set.

1595	18.5.  <captureSceneIDREF>

1597	   <captureSceneIDREF> contains the identifier of the capture scene
1598	   where all the included captures of a certain media type are able to
1599	   be sent together with the other captures of the simultaneous set.

1601	19.  <globalView>

1603	   <globalView> is a set of captures of the same media type representing
1604	   a summary of the complete Media Provider's offer.  The content of a
1605	   global view is expressed by leveraging only scene view identifiers,
1606	   put within <sceneViewIDREF> elements.  Each global view is identified
1607	   by a unique identifier within the "globalViewID" attribute.

1609	   <!-- GLOBAL VIEW TYPE -->
1610	   <xs:complexType name="globalViewType">
1611	    <xs:sequence>
1612	      <xs:element name="sceneViewIDREF" type="xs:IDREF"
1613	      minOccurs="0" maxOccurs="unbounded"/>
1614	      <xs:any namespace="##other" processContents="lax" minOccurs="0"
1615	      maxOccurs="unbounded"/>
1616	    </xs:sequence>
1617	    <xs:attribute name="globalViewID" type="xs:ID"/>
1618	    <xs:anyAttribute namespace="##any" processContents="lax"/>
1619	   </xs:complexType>

1621	20.  <people>

1623	   Information about the participants that are represented in the media
1624	   captures is conveyed via the <people> element.  As it can be seen
1625	   from the XML Schema depicted below, for each participant, a <person>
1626	   element is provided.

1628	<!-- PEOPLE TYPE -->
1629	<xs:complexType name="peopleType">
1630	<xs:sequence>
1631	<xs:element name="person" type="personType"
1632	                    maxOccurs="unbounded"/>
1633	</xs:sequence>
1634	</xs:complexType>

1636	        <!-- PERSON TYPE -->
1637	        <xs:complexType name="personType">
1638	          <xs:sequence>
1639	             <xs:element name="personInfo" type="xcard:vcardType" maxOccurs="1"
1640	                                 minOccurs="0"/>
1641	             <xs:element name="personType" type="personTypeType"
1642	                                 minOccurs="0"
1643	                         maxOccurs="unbounded"/>
1644	             <xs:any namespace="##other" processContents="lax" minOccurs="0"
1645	                         maxOccurs="unbounded"/>
1646	          </xs:sequence>
1647	          <xs:attribute name="personID" type="xs:ID" use="required"/>
1648	          <xs:anyAttribute namespace="##other" processContents="lax"/>
1649	        </xs:complexType>

1651	<!-- PERSON TYPE TYPE -->
1652	<xs:simpleType name="personTypeType">
1653	        <xs:restriction base="xs:string">
1654	        <xs:enumeration value="chairman"/>
1655	        <xs:enumeration value="vice-chairman"/>
1656	        <xs:enumeration value="minute taker"/>
1657	        <xs:enumeration value="presenter"/>
1658	        <xs:enumeration value="translator"/>
1659	        <xs:enumeration value="timekeeper"/>
1660	        <xs:enumeration value="attendee"/>
1661	        </xs:restriction>
1662	</xs:simpleType>

1664	20.1.  <person>

1666	   <person> includes all the metadata related to a person represented
1667	   within one or more media captures.  Such element provides the vcard
1668	   of the subject (via the <personInfo> element, see Section 20.1.2) and
1669	   his conference role(s) (via one or more <personType> elements, see
1670	   Section 20.1.3).  Furthermore, it has a mandatory "personID"
1671	   attribute (Section 20.1.1).

1673	20.1.1.  personID attribute

1675	   The "personID" attribute carries the identifier of a represented
1676	   person.  Such identifier can be used to refer to the participant, as
1677	   in the <capturedPeople> element in media captures representation
1678	   (Section 10.20).

1680	20.1.2.  <personInfo>

1682	   The <personInfo> element is the XML representation of all the fields
1683	   composing a vcard as specified in the Xcard RFC [RFC6351].  The
1684	   vcardType is imported by the Xcard XML Schema provided by
1685	   [I-D.ietf-ecrit-additional-data].  As such schema specifies, the <fn>
1686	   element within <vcard> is mandatory.

1688	20.1.3.  <personType>

1690	   The value of the <personType> element determines the role of the
1691	   represented participant within the telepresence session organization.
1692	   It can be one of the following terms, that are defined in the
1693	   framework document: "presenter", "timekeeper","attendee", "minute
1694	   taker", "translator", "chairman", "vice-chairman".

1696	   A participant can have more than one conference role.  In that case,
1697	   more than one <personType> element will appear in his description.

1699	21.  <captureEncoding>

1701	   A <captureEncoding> is given from the association of a media capture
1702	   and an individual encoding, to form a capture stream as defined in
1703	   [I-D.ietf-clue-framework].  The model of such an entity is provided
1704	   in the following.

1706	<!-- CAPTURE ENCODING TYPE -->
1707	<xs:complexType name="captureEncodingType">
1708	 <xs:sequence>
1709	  <xs:element name="captureID" type="xs:string"/>
1710	  <xs:element name="encodingID" type="xs:string"/>
1711	  <xs:element name="configuredContent" type="contentType" minOccurs="0"/>
1712	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
1713	   maxOccurs="unbounded"/>
1714	 </xs:sequence>
1715	 <xs:attribute name="ID" type="xs:ID" use="required"/>
1716	 <xs:anyAttribute namespace="##any" processContents="lax"/>

1718	</xs:complexType>

1720	21.1.  <captureID>

1722	   <captureID> is the mandatory element containing the identifier of the
1723	   media capture that has been encoded to form the capture encoding.

1725	21.2.  <encodingID>

1727	   <encodingID> is the mandatory element containing the identifier of
1728	   the applied individual encoding.

1730	21.3.  <configuredContent>

1732	   <configuredContent> is an optional element to be used in case of
1733	   configuration of MCCs.  It contains the list of capture identifiers
1734	   and capture scene view identifiers the Media Consumer wants within
1735	   the MCC.  That element is structured as the <content> element used to
1736	   describe the content of a MCC, i.e., it contains The total number of
1737	   the media captures listed in the <configuredContent> must be lower
1738	   than or equal to the value carried within the <maxCaptures> attribute
1739	   of the MCC.

1741	22.  <clueInfo>

1743	   The <clueInfo> element has been left within the XML Schema for
1744	   representing a drafty version of the body of an ADVERTISEMENT message
1745	   (see the example section).

1747	   <!-- CLUE INFO ELEMENT -->
1748	   <!-- the <clueInfo> envelope can be seen
1749	        as the ancestor of an <advertisement> envelope -->
1750	   <xs:element name="clueInfo" type="clueInfoType"/>

1752	   <!-- CLUE INFO TYPE -->
1753	   <xs:complexType name="clueInfoType">
1754	     <xs:sequence>
1755	      <xs:element ref="mediaCaptures"/>
1756	      <xs:element ref="encodingGroups"/>
1757	      <xs:element ref="captureScenes"/>
1758	      <xs:element ref="simultaneousSets" minOccurs="0"/>
1759	      <xs:element ref="globalViews" minOccurs="0"/>
1760	      <xs:element ref="people" minOccurs="0"/>
1761	      <xs:any namespace="##other" processContents="lax" minOccurs="0"
1762	      maxOccurs="unbounded"/>
1763	     </xs:sequence>
1764	     <xs:attribute name="clueInfoID" type="xs:ID" use="required"/>
1765	     <xs:anyAttribute namespace="##other" processContents="lax"/>
1766	   </xs:complexType>

1768	23.  XML Schema extensibility

1770	   The telepresence data model defined in this document is meant to be
1771	   extensible.  Extensions are accomplished by defining elements or
1772	   attributes qualified by namespaces other than
1773	   "urn:ietf:params:xml:ns:clue-info" and
1774	   "urn:ietf:params:xml:ns:vcard-4.0" for use wherever the schema allows
1775	   such extensions (i.e., where the XML Schema definition specifies
1776	   "anyAttribute" or "anyElement").  Elements or attributes from unknown
1777	   namespaces MUST be ignored.

1779	23.1.  Example of extension

1781	   When extending the CLUE data model, a new schema with a new namespace
1782	   associated with it needs to be specified.

1784	   In the following, an example of extension is provided.  The extension
1785	   defines a new audio capture attribute ("newAudioFeature") and an
1786	   attribute for characterizing the captures belonging to an
1787	   "otherCaptureType" defined by the user.  An XML document compliant
1788	   with the extension is also included.  The XML file results validated
1789	   against the current CLUE data model schema.

1791	   <?xml version="1.0" encoding="UTF-8" ?>
1792	   <xs:schema
1793	      targetNamespace="urn:ietf:params:xml:ns:clue-info-ext"
1794	      xmlns:tns="urn:ietf:params:xml:ns:clue-info-ext"
1795	      xmlns:clue-ext="urn:ietf:params:xml:ns:clue-info-ext"
1796	      xmlns:xs="http://www.w3.org/2001/XMLSchema"
1797	      xmlns="urn:ietf:params:xml:ns:clue-info-ext"
1798	      xmlns:xcard="urn:ietf:params:xml:ns:vcard-4.0"
1799	      xmlns:info="urn:ietf:params:xml:ns:clue-info"
1800	      elementFormDefault="qualified"
1801	      attributeFormDefault="unqualified">

1803	   <!-- Import xcard XML schema -->
1804	   <xs:import namespace="urn:ietf:params:xml:ns:vcard-4.0"
1805	   schemaLocation="xcard.xsd"/>

1807	   <!-- Import CLUE XML schema -->
1808	   <xs:import namespace="urn:ietf:params:xml:ns:clue-info"
1809	   schemaLocation="data-model-schema-07.xsd"/>

1811	   <!-- ELEMENT DEFINITIONS -->
1812	   <xs:element name="newAudioFeature" type="xs:string"/>
1813	   <xs:element name="otherMediaCaptureTypeFeature" type="xs:string"/>

1815	   </xs:schema>

1817	<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
1818	<clueInfo xmlns="urn:ietf:params:xml:ns:clue-info"
1819	xmlns:ns2="urn:ietf:params:xml:ns:vcard-4.0"
1820	xmlns:ns3="urn:ietf:params:xml:ns:clue-info-ext"
1821	clueInfoID="NapoliRoom">
1822	    <mediaCaptures>
1823	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1824	        xsi:type="audioCaptureType"
1825	        captureID="AC0"
1826	        mediaType="audio">
1827	            <captureSceneIDREF>CS1</captureSceneIDREF>
1828	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
1829	            <individual>true</individual>
1830	            <encGroupIDREF>EG1</encGroupIDREF>
1831	            <ns3:newAudioFeature>newAudioFeatureValue</ns3:newAudioFeature>
1832	        </mediaCapture>
1833	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1834	        xsi:type="otherCaptureType"
1835	        captureID="OMC0"
1836	        mediaType="other media type">
1837	            <captureSceneIDREF>CS1</captureSceneIDREF>
1838	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
1839	            <encGroupIDREF>EG1</encGroupIDREF>
1840	            <ns3:otherMediaCaptureTypeFeature>OtherValue
1841	            </ns3:otherMediaCaptureTypeFeature>
1842	        </mediaCapture>
1843	    </mediaCaptures>
1844	    <encodingGroups>
1845	        <encodingGroup encodingGroupID="EG1">
1846	            <maxGroupBandwidth>300000</maxGroupBandwidth>
1847	            <encodingIDList>
1848	                <encID>ENC4</encID>
1849	                <encID>ENC5</encID>
1850	            </encodingIDList>
1851	        </encodingGroup>
1852	    </encodingGroups>
1853	    <captureScenes>
1854	        <captureScene scale="unknown" sceneID="CS1"/>
1855	    </captureScenes>
1856	</clueInfo>

1858	24.  Security considerations

1860	   This document defines an XML Schema data model for telepresence
1861	   scenarios.  The modeled information is identified in the CLUE
1862	   framework as the needed one in order to enable a full-optional media
1863	   stream negotiation and rendering.  Indeed, the XML elements herein
1864	   defined are used within CLUE protocol messages to describe both the
1865	   media streams representing the MP's telepresence offer and the
1866	   desired selection requested by the MC.  Security concerns described
1867	   in [I-D.ietf-clue-framework], Section 15, apply to this document.

1869	   Data model information carried within CLUE messages SHOULD be
1870	   accessed only by authenticated endpoints.  Indeed, some information
1871	   published by the MP might reveal sensitive data about who and what is
1872	   represented in the transmitted streams.  The vCards included in the
1873	   <personInfo> elements (Section 20.1) mandatorily contains the
1874	   identity of the represented person.  Optionally vCards can also carry
1875	   the person's contact addresses, together with his/her photo and other
1876	   personal data.  Similar privacy-critical information can be conveyed
1877	   by means of <sceneInformation> elements (Section 15.1) describing the
1878	   capture scenes.  The <description> elements also can specify details
1879	   that should be protected about the content of media captures
1880	   (Section 10.12), capture scenes (Section 15), scene views
1881	   (Section 16).

1883	   Integrity attacks to the data model information incapsulated in CLUE
1884	   messages can invalidate the success of the telepresence session's
1885	   setup by misleading the MC's and MP's interpretation of the offered
1886	   and desired media streams.

1888	   The assurance of the authenticated access and of the integrity of the
1889	   data model information is up to the involved transport mechanisms,
1890	   namely the CLUE protocol [I-D.ietf-clue-protocol] and the CLUE data
1891	   channel [I-D.ietf-clue-datachannel].

1893	25.  IANA considerations

1895	   ToDo.

1897	25.1.  XML Schema registration

1899	   ToDo.

1901	25.2.  XML namespace registration

1903	   ToDo.

1905	26.  Sample XML file

1907	   The following XML document represents a schema compliant example of a
1908	   CLUE telepresence scenario.  Taking inspiration from the examples
1909	   described in the framework draft ([I-D.ietf-clue-framework]), it is
1910	   provided the XML representation of an endpoint-style Media Provider's
1911	   offer.

1913	   There are three cameras, where the central one is also able of
1914	   capturing a zoomed-out view of the overall telepresence room.
1915	   Besides the three video captures coming from such cameras, the MP
1916	   makes available a further multi-content capture about the loudest
1917	   segment of the room, obtained by switching the video source across
1918	   the three cameras.  For the sake of simplicity, only one audio
1919	   capture is advertised for the audio of the whole room.

1921	   The three cameras are placed in front of three participants (Alice,
1922	   Bob and Ciccio), whose vcard and conference roles details are also
1923	   provided.

1925	   Media captures are arranged into four capture scene views:

1927	   1.  (VC0, VC1, VC2) - left, center and right camera video captures

1929	   2.  (VC3) - video capture associated with loudest room segment

1931	   3.  (VC4) - video capture zoomed out view of all people in the room

1933	   4.  (AC0) - main audio

1935	   There are two encoding groups: (i) EG0, for video encodings, and (ii)
1936	   EG1, for audio encodings.

1938	   As to the simultaneous sets, only VC1 and VC4 cannot be transmitted
1939	   simultaneously since they are captured by the same device, i.e., the
1940	   central camera (VC4 is a zoomed-out view while VC1 is a focused view
1941	   of the front participant).  The simultaneous sets would then be the
1942	   following:

1944	   SS1  made by VC3 and all the captures in the first capture scene view
1945	      (VC0,VC1,VC2);

1947	   SS2  made by VC3, VC0, VC2, VC4

1949	<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
1950	<clueInfo xmlns="urn:ietf:params:xml:ns:clue-info"
1951	          xmlns:ns2="urn:ietf:params:xml:ns:vcard-4.0" clueInfoID="NapoliRoom">
1952	    <mediaCaptures>
1953	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1954	         xsi:type="audioCaptureType" captureID="AC0" mediaType="audio">
1955	            <captureSceneIDREF>CS1</captureSceneIDREF>
1956	            <encGroupIDREF>EG1</encGroupIDREF>
1957	            <spatialInformation>
1958	                <capturePoint>
1959	                    <x>0.5</x>
1960	                    <y>1.0</y>
1961	                    <z>0.5</z>
1962	                    <lineOfCapturePoint>
1963	                        <x>0.5</x>
1964	                        <y>0.0</y>
1965	                        <z>0.5</z>
1966	                    </lineOfCapturePoint>
1967	                </capturePoint>
1968	            </spatialInformation>
1969	            <individual>true</individual>
1970	            <description lang="en">main audio from the room</description>
1971	            <priority>1</priority>
1972	            <lang>it</lang>
1973	            <mobility>static</mobility>
1974	            <view>room</view>
1975	            <capturedPeople>
1976	                <personIDREF>alice</personIDREF>
1977	                <personIDREF>bob</personIDREF>
1978	                <personIDREF>ciccio</personIDREF>
1979	            </capturedPeople>
1980	            <maxCaptureEncodings>1</maxCaptureEncodings>
1981	        </mediaCapture>
1982	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1983	         xsi:type="videoCaptureType" captureID="VC0" mediaType="video">
1984	            <captureSceneIDREF>CS1</captureSceneIDREF>
1985	            <encGroupIDREF>EG0</encGroupIDREF>
1986	            <spatialInformation>
1987	                <capturePoint>
1988	                    <x>0.5</x>
1989	                    <y>1.0</y>
1990	                    <z>0.5</z>
1991	                    <lineOfCapturePoint>
1992	                        <x>0.5</x>
1993	                        <y>0.0</y>
1994	                        <z>0.5</z>
1995	                    </lineOfCapturePoint>
1996	                </capturePoint>
1997	            </spatialInformation>
1998	            <individual>true</individual>
1999	            <description lang="en">left camera video capture</description>
2000	            <priority>1</priority>
2001	            <lang>it</lang>
2002	            <mobility>static</mobility>
2003	            <view>individual</view>
2004	            <capturedPeople>
2005	                <personIDREF>ciccio</personIDREF>

2007	            </capturedPeople>
2008	            <maxCaptureEncodings>2</maxCaptureEncodings>
2009	        </mediaCapture>
2010	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2011	        xsi:type="videoCaptureType" captureID="VC1" mediaType="video">
2012	            <captureSceneIDREF>CS1</captureSceneIDREF>
2013	            <encGroupIDREF>EG0</encGroupIDREF>
2014	            <spatialInformation>
2015	                <capturePoint>
2016	                    <x>0.5</x>
2017	                    <y>1.0</y>
2018	                    <z>0.5</z>
2019	                    <lineOfCapturePoint>
2020	                        <x>0.5</x>
2021	                        <y>0.0</y>
2022	                        <z>0.5</z>
2023	                    </lineOfCapturePoint>
2024	                </capturePoint>
2025	            </spatialInformation>
2026	            <individual>true</individual>
2027	            <description lang="en">central camera video capture</description>
2028	            <priority>1</priority>
2029	            <lang>it</lang>
2030	            <mobility>static</mobility>
2031	            <view>individual</view>
2032	            <capturedPeople>
2033	                <personIDREF>alice</personIDREF>
2034	            </capturedPeople>
2035	            <maxCaptureEncodings>2</maxCaptureEncodings>
2036	        </mediaCapture>
2037	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2038	        xsi:type="videoCaptureType" captureID="VC2" mediaType="video">
2039	            <captureSceneIDREF>CS1</captureSceneIDREF>
2040	            <encGroupIDREF>EG0</encGroupIDREF>
2041	            <spatialInformation>
2042	                <capturePoint>
2043	                    <x>0.5</x>
2044	                    <y>1.0</y>
2045	                    <z>0.5</z>
2046	                    <lineOfCapturePoint>
2047	                        <x>0.5</x>
2048	                        <y>0.0</y>
2049	                        <z>0.5</z>
2050	                    </lineOfCapturePoint>
2051	                </capturePoint>
2052	            </spatialInformation>
2053	            <individual>true</individual>
2054	            <description lang="en">right camera video capture</description>
2055	            <priority>1</priority>
2056	            <lang>it</lang>
2057	            <mobility>static</mobility>
2058	            <view>individual</view>
2059	            <capturedPeople>
2060	                <personIDREF>bob</personIDREF>
2061	            </capturedPeople>
2062	            <maxCaptureEncodings>2</maxCaptureEncodings>
2063	        </mediaCapture>
2064	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2065	        xsi:type="videoCaptureType" captureID="VC3" mediaType="video">
2066	            <captureSceneIDREF>CS1</captureSceneIDREF>
2067	            <encGroupIDREF>EG0</encGroupIDREF>
2068	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
2069	            <policy>Soundlevel:0</policy>
2070	            <description lang="en">loudest room segment</description>
2071	            <priority>1</priority>
2072	            <lang>it</lang>
2073	            <mobility>static</mobility>
2074	            <view>individual</view>
2075	            <maxCaptureEncodings>1</maxCaptureEncodings>
2076	        </mediaCapture>
2077	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2078	        xsi:type="videoCaptureType" captureID="VC4" mediaType="video">
2079	            <captureSceneIDREF>CS1</captureSceneIDREF>
2080	            <encGroupIDREF>EG0</encGroupIDREF>
2081	            <spatialInformation>
2082	                <capturePoint>
2083	                    <x>0.5</x>
2084	                    <y>1.0</y>
2085	                    <z>0.5</z>
2086	                    <lineOfCapturePoint>
2087	                        <x>0.5</x>
2088	                        <y>0.0</y>
2089	                        <z>0.5</z>
2090	                    </lineOfCapturePoint>
2091	                </capturePoint>
2092	            </spatialInformation>
2093	            <individual>true</individual>
2094	            <description lang="en">zoomed out view of all people in the
2095	            room</description>
2096	            <priority>1</priority>
2097	            <lang>it</lang>
2098	            <mobility>static</mobility>
2099	            <view>room</view>
2100	            <capturedPeople>
2101	                <personIDREF>alice</personIDREF>
2102	                <personIDREF>bob</personIDREF>
2103	                <personIDREF>ciccio</personIDREF>
2104	            </capturedPeople>
2105	            <maxCaptureEncodings>1</maxCaptureEncodings>
2106	        </mediaCapture>
2107	    </mediaCaptures>
2108	    <encodingGroups>
2109	        <encodingGroup encodingGroupID="EG0">
2110	            <maxGroupBandwidth>600000</maxGroupBandwidth>
2111	            <encodingIDList>
2112	                <encID>ENC1</encID>
2113	                <encID>ENC2</encID>
2114	                <encID>ENC3</encID>
2115	            </encodingIDList>
2116	        </encodingGroup>
2117	        <encodingGroup encodingGroupID="EG1">
2118	            <maxGroupBandwidth>300000</maxGroupBandwidth>
2119	            <encodingIDList>
2120	                <encID>ENC4</encID>
2121	                <encID>ENC5</encID>
2122	            </encodingIDList>
2123	        </encodingGroup>
2124	    </encodingGroups>
2125	    <captureScenes>
2126	        <captureScene scale="unknown" sceneID="CS1">
2127	            <sceneViews>
2128	                <sceneView sceneViewID="SE1">
2129	                    <mediaCaptureIDs>
2130	                        <captureIDREF>VC0</captureIDREF>
2131	                        <captureIDREF>VC1</captureIDREF>
2132	                        <captureIDREF>VC2</captureIDREF>
2133	                    </mediaCaptureIDs>
2134	                </sceneView>
2135	                <sceneView sceneViewID="SE2">
2136	                    <mediaCaptureIDs>
2137	                        <captureIDREF>VC3</captureIDREF>
2138	                    </mediaCaptureIDs>
2139	                </sceneView>
2140	                <sceneView sceneViewID="SE3">
2141	                    <mediaCaptureIDs>
2142	                        <captureIDREF>VC4</captureIDREF>
2143	                    </mediaCaptureIDs>
2144	                </sceneView>
2145	                <sceneView sceneViewID="SE4">
2146	                    <mediaCaptureIDs>
2147	                        <captureIDREF>VC4</captureIDREF>
2148	                    </mediaCaptureIDs>
2149	                </sceneView>
2150	            </sceneViews>

2152	        </captureScene>
2153	    </captureScenes>
2154	    <simultaneousSets>
2155	        <simultaneousSet setID="SS1">
2156	            <mediaCaptureIDREF>VC3</mediaCaptureIDREF>
2157	            <sceneViewIDREF>SE1</sceneViewIDREF>
2158	        </simultaneousSet>
2159	        <simultaneousSet setID="SS2">
2160	            <mediaCaptureIDREF>VC0</mediaCaptureIDREF>
2161	            <mediaCaptureIDREF>VC2</mediaCaptureIDREF>
2162	            <mediaCaptureIDREF>VC4</mediaCaptureIDREF>
2163	            <mediaCaptureIDREF>VC3</mediaCaptureIDREF>
2164	        </simultaneousSet>
2165	    </simultaneousSets>
2166	    <people>
2167	        <person personID="bob">
2168	            <personInfo>
2169	                <ns2:fn>
2170	                    <ns2:text>Bob</ns2:text>
2171	                </ns2:fn>
2172	            </personInfo>
2173	            <personType>minute taker</personType>
2174	        </person>
2175	        <person personID="alice">
2176	            <personInfo>
2177	                <ns2:fn>
2178	                    <ns2:text>Alice</ns2:text>
2179	                </ns2:fn>
2180	            </personInfo>
2181	            <personType>presenter</personType>
2182	        </person>
2183	        <person personID="ciccio">
2184	            <personInfo>
2185	                <ns2:fn>
2186	                    <ns2:text>Ciccio</ns2:text>
2187	                </ns2:fn>
2188	            </personInfo>
2189	            <personType>chairman</personType>
2190	            <personType>timekeeper</personType>
2191	        </person>
2192	    </people>
2193	</clueInfo>
2194	27.  MCC example

2196	   Enhancing the scenario presented in the previous example, the Media
2197	   Provider is able to advertise a composed capture VC7 made by a big
2198	   picture representing the current speaker (VC3) and two picture-in-
2199	   picture boxes representing the previous speakers (the previous one
2200	   -VC5- and the oldest one -VC6).  The provider does not want to
2201	   instantiate and send VC5 and VC6, so it does not associate any
2202	   encoding group with them.  Their XML representations are provided for
2203	   enabling the description of VC7.

2205	   A possible description for that scenario could be the following:

2207	<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
2208	<clueInfo xmlns="urn:ietf:params:xml:ns:clue-info"
2209	xmlns:ns2="urn:ietf:params:xml:ns:vcard-4.0" clueInfoID="NapoliRoom">
2210	    <mediaCaptures>
2211	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2212	        xsi:type="audioCaptureType" captureID="AC0" mediaType="audio">
2213	            <captureSceneIDREF>CS1</captureSceneIDREF>
2214	            <encGroupIDREF>EG1</encGroupIDREF>
2215	            <spatialInformation>
2216	                <capturePoint>
2217	                    <x>0.5</x>
2218	                    <y>1.0</y>
2219	                    <z>0.5</z>
2220	                    <lineOfCapturePoint>
2221	                        <x>0.5</x>
2222	                        <y>0.0</y>
2223	                        <z>0.5</z>
2224	                    </lineOfCapturePoint>
2225	                </capturePoint>
2226	            </spatialInformation>
2227	            <individual>true</individual>
2228	            <description lang="en">main audio from the room</description>
2229	            <priority>1</priority>
2230	            <lang>it</lang>
2231	            <mobility>static</mobility>
2232	            <view>room</view>
2233	            <capturedPeople>
2234	                <personIDREF>alice</personIDREF>
2235	                <personIDREF>bob</personIDREF>
2236	                <personIDREF>ciccio</personIDREF>
2237	            </capturedPeople>
2238	            <maxCaptureEncodings>1</maxCaptureEncodings>
2239	        </mediaCapture>
2240	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2241	        xsi:type="videoCaptureType" captureID="VC0" mediaType="video">
2242	            <captureSceneIDREF>CS1</captureSceneIDREF>
2243	            <encGroupIDREF>EG0</encGroupIDREF>
2244	            <spatialInformation>
2245	                <capturePoint>
2246	                    <x>0.5</x>
2247	                    <y>1.0</y>
2248	                    <z>0.5</z>
2249	                    <lineOfCapturePoint>
2250	                        <x>0.5</x>
2251	                        <y>0.0</y>
2252	                        <z>0.5</z>
2253	                    </lineOfCapturePoint>
2254	                </capturePoint>
2255	            </spatialInformation>
2256	            <individual>true</individual>
2257	            <description lang="en">left camera video capture</description>
2258	            <priority>1</priority>
2259	            <lang>it</lang>
2260	            <mobility>static</mobility>
2261	            <view>individual</view>
2262	            <capturedPeople>
2263	                <personIDREF>ciccio</personIDREF>
2264	            </capturedPeople>
2265	            <maxCaptureEncodings>2</maxCaptureEncodings>
2266	        </mediaCapture>
2267	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2268	        xsi:type="videoCaptureType" captureID="VC1" mediaType="video">
2269	            <captureSceneIDREF>CS1</captureSceneIDREF>
2270	            <encGroupIDREF>EG0</encGroupIDREF>
2271	            <spatialInformation>
2272	                <capturePoint>
2273	                    <x>0.5</x>
2274	                    <y>1.0</y>
2275	                    <z>0.5</z>
2276	                    <lineOfCapturePoint>
2277	                        <x>0.5</x>
2278	                        <y>0.0</y>
2279	                        <z>0.5</z>
2280	                    </lineOfCapturePoint>
2281	                </capturePoint>
2282	            </spatialInformation>
2283	            <individual>true</individual>
2284	            <description lang="en">central camera video capture</description>
2285	            <priority>1</priority>
2286	            <lang>it</lang>
2287	            <mobility>static</mobility>
2288	            <view>individual</view>
2289	            <capturedPeople>
2290	                <personIDREF>alice</personIDREF>
2291	            </capturedPeople>
2292	            <maxCaptureEncodings>2</maxCaptureEncodings>
2293	        </mediaCapture>
2294	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2295	        xsi:type="videoCaptureType" captureID="VC2" mediaType="video">
2296	            <captureSceneIDREF>CS1</captureSceneIDREF>
2297	            <encGroupIDREF>EG0</encGroupIDREF>
2298	            <spatialInformation>
2299	                <capturePoint>
2300	                    <x>0.5</x>
2301	                    <y>1.0</y>
2302	                    <z>0.5</z>
2303	                    <lineOfCapturePoint>
2304	                        <x>0.5</x>
2305	                        <y>0.0</y>
2306	                        <z>0.5</z>
2307	                    </lineOfCapturePoint>
2308	                </capturePoint>
2309	            </spatialInformation>
2310	            <individual>true</individual>
2311	            <description lang="en">right camera video capture</description>
2312	            <priority>1</priority>
2313	            <lang>it</lang>
2314	            <mobility>static</mobility>
2315	            <view>individual</view>
2316	            <capturedPeople>
2317	                <personIDREF>bob</personIDREF>
2318	            </capturedPeople>
2319	            <maxCaptureEncodings>2</maxCaptureEncodings>
2320	        </mediaCapture>
2321	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2322	        xsi:type="videoCaptureType" captureID="VC3" mediaType="video">
2323	            <captureSceneIDREF>CS1</captureSceneIDREF>
2324	            <encGroupIDREF>EG0</encGroupIDREF>
2325	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
2326	            <content>
2327	                <sceneViewIDREF>SE1</sceneViewIDREF>
2328	            </content>
2329	            <policy>Soundlevel:0</policy>
2330	            <description lang="en">loudest room segment</description>
2331	            <priority>1</priority>
2332	            <lang>it</lang>
2333	            <mobility>static</mobility>
2334	            <view>individual</view>
2335	            <maxCaptureEncodings>1</maxCaptureEncodings>

2337	        </mediaCapture>
2338	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2339	        xsi:type="videoCaptureType" captureID="VC4" mediaType="video">
2340	            <captureSceneIDREF>CS1</captureSceneIDREF>
2341	            <encGroupIDREF>EG0</encGroupIDREF>
2342	            <spatialInformation>
2343	                <capturePoint>
2344	                    <x>0.5</x>
2345	                    <y>1.0</y>
2346	                    <z>0.5</z>
2347	                    <lineOfCapturePoint>
2348	                        <x>0.5</x>
2349	                        <y>0.0</y>
2350	                        <z>0.5</z>
2351	                    </lineOfCapturePoint>
2352	                </capturePoint>
2353	            </spatialInformation>
2354	            <individual>true</individual>
2355	            <description lang="en">zoomed out view of all people in the room</description>
2356	            <priority>1</priority>
2357	            <lang>it</lang>
2358	            <mobility>static</mobility>
2359	            <view>room</view>
2360	            <capturedPeople>
2361	                <personIDREF>alice</personIDREF>
2362	                <personIDREF>bob</personIDREF>
2363	                <personIDREF>ciccio</personIDREF>
2364	            </capturedPeople>
2365	            <maxCaptureEncodings>1</maxCaptureEncodings>
2366	        </mediaCapture>
2367	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2368	        xsi:type="videoCaptureType" captureID="VC5" mediaType="video">
2369	            <captureSceneIDREF>CS1</captureSceneIDREF>
2370	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
2371	            <content>
2372	                <sceneViewIDREF>SE1</sceneViewIDREF>
2373	            </content>
2374	            <policy>Soundlevel:1</policy>
2375	            <description lang="en">penultimate loudest room segment</description>
2376	            <priority>1</priority>
2377	            <lang>it</lang>
2378	            <mobility>static</mobility>
2379	            <view>individual</view>
2380	            <maxCaptureEncodings>1</maxCaptureEncodings>
2381	        </mediaCapture>
2382	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2383	        xsi:type="videoCaptureType" captureID="VC6" mediaType="video">
2384	            <captureSceneIDREF>CS1</captureSceneIDREF>
2385	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
2386	            <content>
2387	                <sceneViewIDREF>SE1</sceneViewIDREF>
2388	            </content>
2389	            <policy>Soundlevel:2</policy>
2390	            <description lang="en">last but two loudest room segment</description>
2391	            <priority>1</priority>
2392	            <lang>it</lang>
2393	            <mobility>static</mobility>
2394	            <view>individual</view>
2395	            <maxCaptureEncodings>1</maxCaptureEncodings>
2396	        </mediaCapture>
2397	        <mediaCapture xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
2398	        xsi:type="videoCaptureType" captureID="VC7" mediaType="video">
2399	            <captureSceneIDREF>CS1</captureSceneIDREF>
2400	            <nonSpatiallyDefinable>true</nonSpatiallyDefinable>
2401	            <content>
2402	                <captureIDREF>VC3</captureIDREF>
2403	                <captureIDREF>VC5</captureIDREF>
2404	                <captureIDREF>VC6</captureIDREF>
2405	            </content>
2406	            <description lang="en">big picture of the current speaker +
2407	            pips about previous speakers</description>
2408	            <priority>1</priority>
2409	            <lang>it</lang>
2410	            <mobility>static</mobility>
2411	            <view>individual</view>
2412	            <maxCaptureEncodings>1</maxCaptureEncodings>
2413	        </mediaCapture>
2414	    </mediaCaptures>
2415	    <encodingGroups>
2416	        <encodingGroup encodingGroupID="EG0">
2417	            <maxGroupBandwidth>600000</maxGroupBandwidth>
2418	            <encodingIDList>
2419	                <encID>ENC1</encID>
2420	                <encID>ENC2</encID>
2421	                <encID>ENC3</encID>
2422	            </encodingIDList>
2423	        </encodingGroup>
2424	        <encodingGroup encodingGroupID="EG1">
2425	            <maxGroupBandwidth>300000</maxGroupBandwidth>
2426	            <encodingIDList>
2427	                <encID>ENC4</encID>
2428	                <encID>ENC5</encID>
2429	            </encodingIDList>
2430	        </encodingGroup>
2431	    </encodingGroups>
2432	    <captureScenes>
2433	        <captureScene scale="unknown" sceneID="CS1">
2434	            <sceneViews>
2435	                <sceneView sceneViewID="SE1">
2436	                    <description lang="en">participants' individual
2437	                    videos</description>
2438	                    <mediaCaptureIDs>
2439	                        <captureIDREF>VC0</captureIDREF>
2440	                        <captureIDREF>VC1</captureIDREF>
2441	                        <captureIDREF>VC2</captureIDREF>
2442	                    </mediaCaptureIDs>
2443	                </sceneView>
2444	                <sceneView sceneViewID="SE2">
2445	                    <description lang="en">loudest segment of the
2446	                    room</description>
2447	                    <mediaCaptureIDs>
2448	                        <captureIDREF>VC3</captureIDREF>
2449	                    </mediaCaptureIDs>
2450	                </sceneView>
2451	                <sceneView sceneViewID="SE5">
2452	                    <description lang="en">loudest segment of the
2453	                    room + pips</description>
2454	                    <mediaCaptureIDs>
2455	                        <captureIDREF>VC7</captureIDREF>
2456	                    </mediaCaptureIDs>
2457	                </sceneView>
2458	                <sceneView sceneViewID="SE4">
2459	                    <description lang="en">room audio</description>
2460	                    <mediaCaptureIDs>
2461	                        <captureIDREF>AC0</captureIDREF>
2462	                    </mediaCaptureIDs>
2463	                </sceneView>
2464	                <sceneView sceneViewID="SE3">
2465	                    <description lang="en">room video</description>
2466	                    <mediaCaptureIDs>
2467	                        <captureIDREF>VC4</captureIDREF>
2468	                    </mediaCaptureIDs>
2469	                </sceneView>
2470	            </sceneViews>
2471	        </captureScene>
2472	    </captureScenes>
2473	    <simultaneousSets>
2474	        <simultaneousSet setID="SS1">
2475	            <mediaCaptureIDREF>VC7</mediaCaptureIDREF>
2476	            <sceneViewIDREF>SE1</sceneViewIDREF>
2477	        </simultaneousSet>
2478	        <simultaneousSet setID="SS2">
2479	            <mediaCaptureIDREF>VC0</mediaCaptureIDREF>
2480	            <mediaCaptureIDREF>VC2</mediaCaptureIDREF>
2481	            <mediaCaptureIDREF>VC4</mediaCaptureIDREF>
2482	            <mediaCaptureIDREF>VC7</mediaCaptureIDREF>
2483	        </simultaneousSet>
2484	    </simultaneousSets>
2485	    <people>
2486	        <person personID="bob">
2487	            <personInfo>
2488	                <ns2:fn>
2489	                    <ns2:text>Bob</ns2:text>
2490	                </ns2:fn>
2491	            </personInfo>
2492	            <personType>minute taker</personType>
2493	        </person>
2494	        <person personID="alice">
2495	            <personInfo>
2496	                <ns2:fn>
2497	                    <ns2:text>Alice</ns2:text>
2498	                </ns2:fn>
2499	            </personInfo>
2500	            <personType>presenter</personType>
2501	        </person>
2502	        <person personID="ciccio">
2503	            <personInfo>
2504	                <ns2:fn>
2505	                    <ns2:text>Ciccio</ns2:text>
2506	                </ns2:fn>
2507	            </personInfo>
2508	            <personType>chairman</personType>
2509	            <personType>timekeeper</personType>
2510	        </person>
2511	    </people>
2512	</clueInfo>

2514	28.  Diff with draft-ietf-clue-data-model-schema-06 version

2516	   o  Capture Scene Entry/Entries renamed as Capture Scene View/Views in
2517	      the text, <sceneEntry>/<sceneEntries> renamed as <sceneView>/
2518	      <sceneViews> in the XML schema.

2520	   o  Global Scene Entry/Entries renamed as Global View/Views in the
2521	      text, <globalSceneEntry>/<globalSceneEntries> renamed as
2522	      <globalView>/<globalViews>

2524	   o  Security section added.

2526	   o  Extensibility: a new type is introduced to describe other types of
2527	      media capture (otherCaptureType), text and example added.

2529	   o  Spatial information section updated: capture point optional, text
2530	      now is coherent with the framework one.

2532	   o  Audio capture description: <sensitivityPattern> added,
2533	      <audioChannelFormat> removed, <captureArea> disallowed.

2535	   o  Simultaneous set definition: added <captureSceneIDREF> to refer to
2536	      capture scene identifiers as shortcuts and an optional mediaType
2537	      attribute which is mandatory to use when only capture scene
2538	      identifiers are listed.

2540	   o  Encoding groups: removed the constraint of the same media type.

2542	   o  Updated text about media captures without <encodingGroupIDREF>
2543	      (optional in the XML schema).

2545	   o  "mediaType" attribute removed from homogeneous groups of capture
2546	      (scene views and globlal views)

2548	   o  "mediaType" attribute removed from the global view textual
2549	      description.

2551	   o  "millimeters" scale value changed in "mm"

2553	29.  Diff with draft-ietf-clue-data-model-schema-04 version

2555	      globalCaptureEntries/Entry renamed as globalSceneEntries/Entry;

2557	      sceneInformation added;

2559	      Only capture scene entry identifiers listed within global scene
2560	      entries (media capture identifiers removed);

2562	      <participants> renamed as <people> in the >clueInfo< template

2564	      <vcard> renamed as <personInfo> to synch with the framework
2565	      terminology

2567	      <participantType> renamed as <personType> to synch with the
2568	      framework terminology

2570	      <participantIDs> renamed as <capturedPeople> in the media capture
2571	      type definition to remove ambiguity
2572	      Examples have been updated with the new definitions of
2573	      <globalSceneEntries> and of <people>.

2575	30.  Diff with draft-ietf-clue-data-model-schema-03 version

2577	      encodings section has been removed

2579	      global capture entries have been introduced

2581	      capture scene entry identifiers are used as shortcuts in listing
2582	      the content of MCC (similarly to simultaneous set and global
2583	      capture entries)

2585	      Examples have been updated.  A new example with global capture
2586	      entries has been added.

2588	      <encGroupIDREF> has been made optional.

2590	      <single> has been renamed into <individual>

2592	      Obsolete comments have been removed.

2594	      participants information has been added.

2596	31.  Diff with  draft-ietf-clue-data-model-schema-02 version

2598	      captureParameters and encodingParameters have been removed from
2599	      the captureEncodingType

2601	      data model example has been updated and validated according to the
2602	      new schema.  Further description of the represented scenario has
2603	      been provided.

2605	      A multiple content capture example has been added.

2607	      Obsolete comments and references have been removed.

2609	32.  Informative References

2611	   [I-D.ietf-clue-datachannel]       Holmberg, C., "CLUE Protocol Data
2612	                                     Channel",
2613	                                     draft-ietf-clue-datachannel-01
2614	                                     (work in progress), September 2014.

2616	   [I-D.ietf-clue-framework]         Duckworth, M., Pepperell, A., and
2617	                                     S. Wenger, "Framework for
2618	                                     Telepresence Multi-Streams",
2619	                                     draft-ietf-clue-framework-17 (work
2620	                                     in progress), September 2014.

2622	   [I-D.ietf-clue-protocol]          Presta, R. and S. Romano, "CLUE
2623	                                     protocol",
2624	                                     draft-ietf-clue-protocol-01 (work
2625	                                     in progress), June 2014.

2627	   [I-D.ietf-ecrit-additional-data]  Rosen, B., Tschofenig, H.,
2628	                                     Marshall, R., Randy, R., and J.
2629	                                     Winterbottom, "Additional Data
2630	                                     related to an Emergency Call",
2631	                                     draft-ietf-ecrit-additional-data-22
2632	                                     (work in progress), April 2014.

2634	   [RFC4796]                         Hautakorpi, J. and G. Camarillo,
2635	                                     "The Session Description Protocol
2636	                                     (SDP) Content Attribute", RFC 4796,
2637	                                     February 2007.

2639	   [RFC6351]                         Perreault, S., "xCard: vCard XML
2640	                                     Representation", RFC 6351,
2641	                                     August 2011.

2643	Authors' Addresses

2645	   Roberta Presta
2646	   University of Napoli
2647	   Via Claudio 21
2648	   Napoli  80125
2649	   Italy

2651	   EMail: roberta.presta@unina.it

2653	   Simon Pietro Romano
2654	   University of Napoli
2655	   Via Claudio 21
2656	   Napoli  80125
2657	   Italy

2659	   EMail: spromano@unina.it