idnits 2.17.1 

draft-presta-clue-data-model-schema-03.txt:

  Checking boilerplate required by RFC 5378 and the IETF Trust (see
  https://trustee.ietf.org/license-info):
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/1id-guidelines.txt:
  ----------------------------------------------------------------------------

     No issues found here.

  Checking nits according to https://www.ietf.org/id-info/checklist :
  ----------------------------------------------------------------------------

  ** The document seems to lack a Security Considerations section.

  ** The document seems to lack an IANA Considerations section.  (See Section
     2.2 of https://www.ietf.org/id-info/checklist for how to handle the case
     when there are no actions for IANA.)

  ** There are 4 instances of too long lines in the document, the longest one
     being 7 characters in excess of 72.

  ** The document seems to lack a both a reference to RFC 2119 and the
     recommended RFC 2119 boilerplate, even if it appears to use RFC 2119
     keywords. 

     RFC 2119 keyword, line 769: '... line of capture MUST NOT be identical...'


  Miscellaneous warnings:
  ----------------------------------------------------------------------------

  == The copyright year in the IETF Trust and authors Copyright Line does not
     match the current year

  == Line 2045 has weird spacing: '...ncoding  to be...'

  -- The document date (March 8, 2013) is 4066 days in the past.  Is this
     intentional?


  Checking references for intended status: Informational
  ----------------------------------------------------------------------------

  == Missing Reference: 'TBD' is mentioned on line 139, but not defined

  == Outdated reference: A later version (-25) exists of
     draft-ietf-clue-framework-09


     Summary: 4 errors (**), 0 flaws (~~), 4 warnings (==), 1 comment (--).

     Run idnits with the --verbose option for more detailed information about
     the items above.

--------------------------------------------------------------------------------


2	CLUE Working Group                                             R. Presta
3	Internet-Draft                                               S P. Romano
4	Intended status: Informational                      University of Napoli
5	Expires: September 9, 2013                                 March 8, 2013

7	                 An XML Schema for the CLUE data model
8	                 draft-presta-clue-data-model-schema-03

10	Abstract

12	   This document provides an XML schema file for the definition of CLUE
13	   data model types.

15	Status of This Memo

17	   This Internet-Draft is submitted in full conformance with the
18	   provisions of BCP 78 and BCP 79.

20	   Internet-Drafts are working documents of the Internet Engineering
21	   Task Force (IETF).  Note that other groups may also distribute
22	   working documents as Internet-Drafts.  The list of current Internet-
23	   Drafts is at http://datatracker.ietf.org/drafts/current/.

25	   Internet-Drafts are draft documents valid for a maximum of six months
26	   and may be updated, replaced, or obsoleted by other documents at any
27	   time.  It is inappropriate to use Internet-Drafts as reference
28	   material or to cite them other than as "work in progress."

30	   This Internet-Draft will expire on September 9, 2013.

32	Copyright Notice

34	   Copyright (c) 2013 IETF Trust and the persons identified as the
35	   document authors.  All rights reserved.

37	   This document is subject to BCP 78 and the IETF Trust's Legal
38	   Provisions Relating to IETF Documents
39	   (http://trustee.ietf.org/license-info) in effect on the date of
40	   publication of this document.  Please review these documents
41	   carefully, as they describe your rights and restrictions with respect
42	   to this document.  Code Components extracted from this document must
43	   include Simplified BSD License text as described in Section 4.e of
44	   the Trust Legal Provisions and are provided without warranty as
45	   described in the Simplified BSD License.

47	Table of Contents
48	   1.  Introduction . . . . . . . . . . . . . . . . . . . . . . . . .  4
49	   2.  Terminology  . . . . . . . . . . . . . . . . . . . . . . . . .  4
50	   3.  XML Schema . . . . . . . . . . . . . . . . . . . . . . . . . .  4
51	   4.  <mediaCaptures>  . . . . . . . . . . . . . . . . . . . . . . . 13
52	   5.  <encodings>  . . . . . . . . . . . . . . . . . . . . . . . . . 13
53	   6.  <encodingGroups> . . . . . . . . . . . . . . . . . . . . . . . 13
54	   7.  <captureScenes>  . . . . . . . . . . . . . . . . . . . . . . . 14
55	   8.  <simultaneousSets> . . . . . . . . . . . . . . . . . . . . . . 14
56	   9.  <captureEncodings> . . . . . . . . . . . . . . . . . . . . . . 14
57	   10. <mediaCapture> . . . . . . . . . . . . . . . . . . . . . . . . 14
58	     10.1.  <capturedMedia> . . . . . . . . . . . . . . . . . . . . . 15
59	     10.2.  <captureSceneIDREF> . . . . . . . . . . . . . . . . . . . 15
60	     10.3.  <encGroupIDREF> . . . . . . . . . . . . . . . . . . . . . 16
61	     10.4.  <spatialInformation>  . . . . . . . . . . . . . . . . . . 16
62	       10.4.1.  <capturePoint>  . . . . . . . . . . . . . . . . . . . 17
63	       10.4.2.  <captureArea> . . . . . . . . . . . . . . . . . . . . 18
64	     10.5.  <nonSpatiallyDefinible> . . . . . . . . . . . . . . . . . 18
65	     10.6.  <description> . . . . . . . . . . . . . . . . . . . . . . 19
66	     10.7.  <priority>  . . . . . . . . . . . . . . . . . . . . . . . 19
67	     10.8.  <lang>  . . . . . . . . . . . . . . . . . . . . . . . . . 19
68	     10.9.  <content> . . . . . . . . . . . . . . . . . . . . . . . . 19
69	     10.10. <switched>  . . . . . . . . . . . . . . . . . . . . . . . 20
70	     10.11. <dynamic> . . . . . . . . . . . . . . . . . . . . . . . . 20
71	     10.12. <composed>  . . . . . . . . . . . . . . . . . . . . . . . 20
72	     10.13. <maxCaptureEncodings> . . . . . . . . . . . . . . . . . . 20
73	     10.14. <relatedTo> . . . . . . . . . . . . . . . . . . . . . . . 20
74	     10.15. captureID attribute . . . . . . . . . . . . . . . . . . . 21
75	   11. Audio captures . . . . . . . . . . . . . . . . . . . . . . . . 21
76	     11.1.  <audioChannelFormat>  . . . . . . . . . . . . . . . . . . 21
77	     11.2.  <micPattern>  . . . . . . . . . . . . . . . . . . . . . . 22
78	   12. Video captures . . . . . . . . . . . . . . . . . . . . . . . . 22
79	     12.1.  <nativeAspectRatio> . . . . . . . . . . . . . . . . . . . 23
80	     12.2.  <embeddedText>  . . . . . . . . . . . . . . . . . . . . . 23
81	   13. Text captures  . . . . . . . . . . . . . . . . . . . . . . . . 24
82	   14. <captureScene> . . . . . . . . . . . . . . . . . . . . . . . . 24
83	     14.1.  <sceneSpace> (was:<sceneArea>)  . . . . . . . . . . . . . 25
84	     14.2.  <sceneEntries>  . . . . . . . . . . . . . . . . . . . . . 26
85	     14.3.  sceneID attribute . . . . . . . . . . . . . . . . . . . . 26
86	     14.4.  scale attribute . . . . . . . . . . . . . . . . . . . . . 26
87	   15. <sceneEntry> . . . . . . . . . . . . . . . . . . . . . . . . . 27
88	     15.1.  <switchingPolicies> . . . . . . . . . . . . . . . . . . . 27
89	     15.2.  <mediaCaptureIDs> . . . . . . . . . . . . . . . . . . . . 28
90	     15.3.  sceneEntryID attribute  . . . . . . . . . . . . . . . . . 29
91	     15.4.  mediaType attribute . . . . . . . . . . . . . . . . . . . 29
92	   16. <encoding> . . . . . . . . . . . . . . . . . . . . . . . . . . 29
93	     16.1.  <encodingName>  . . . . . . . . . . . . . . . . . . . . . 29
94	     16.2.  <maxBandwidth>  . . . . . . . . . . . . . . . . . . . . . 29
95	     16.3.  encodingID attribute  . . . . . . . . . . . . . . . . . . 30

97	   17. Audio encodings  . . . . . . . . . . . . . . . . . . . . . . . 30
98	   18. Video encodings  . . . . . . . . . . . . . . . . . . . . . . . 30
99	     18.1.  <maxWidth>  . . . . . . . . . . . . . . . . . . . . . . . 31
100	     18.2.  <maxHeight> . . . . . . . . . . . . . . . . . . . . . . . 31
101	     18.3.  <maxFrameRate>  . . . . . . . . . . . . . . . . . . . . . 31
102	   19. H26X encodings . . . . . . . . . . . . . . . . . . . . . . . . 31
103	   20. <encodingGroup>  . . . . . . . . . . . . . . . . . . . . . . . 32
104	     20.1.  <maxGroupBandwidth> . . . . . . . . . . . . . . . . . . . 32
105	     20.2.  <maxGroupPps> . . . . . . . . . . . . . . . . . . . . . . 33
106	     20.3.  <encodingIDList>  . . . . . . . . . . . . . . . . . . . . 33
107	     20.4.  encodingGroupID attribute . . . . . . . . . . . . . . . . 33
108	   21. <simultaneousSet>  . . . . . . . . . . . . . . . . . . . . . . 33
109	     21.1.  <captureIDREF>  . . . . . . . . . . . . . . . . . . . . . 34
110	     21.2.  <sceneEntryIDREF> . . . . . . . . . . . . . . . . . . . . 34
111	   22. <captureEncoding>  . . . . . . . . . . . . . . . . . . . . . . 34
112	     22.1.  <mediaCaptureID>  . . . . . . . . . . . . . . . . . . . . 34
113	     22.2.  <encodingID>  . . . . . . . . . . . . . . . . . . . . . . 34
114	   23. <clueInfo> . . . . . . . . . . . . . . . . . . . . . . . . . . 34
115	   24. Sample XML file  . . . . . . . . . . . . . . . . . . . . . . . 35
116	   25. Diff with unofficial -02 version . . . . . . . . . . . . . . . 44
117	   26. Diff with -02 version  . . . . . . . . . . . . . . . . . . . . 46
118	   27. Informative References . . . . . . . . . . . . . . . . . . . . 46

120	1.  Introduction

122	   This document provides an XML schema file for the definition of CLUE
123	   data model types.

125	   The schema is based on information contained in
126	   [I-D.ietf-clue-framework] and also relates to the data model sketched
127	   in [I-D.romanow-clue-data-model].  It encodes information and
128	   constraints defined in the aforementioned documents in order to
129	   provide a formal representation of the concepts therein presented.
130	   The schema definition is intended to be modified according to changes
131	   applied to the above mentioned CLUE documents.

133	   The document actually represents a strawman proposal aiming at the
134	   definition of a coherent structure for all the information associated
135	   with the description of a telepresence scenario.

137	2.  Terminology

139	   [TBD] Copy text from the framework document.

141	3.  XML Schema

143	   This section contains the proposed CLUE data model schema definition.

145	   The element and attribute definitions are formal representation of
146	   the concepts needed to describe the capabilities of a media provider
147	   and the current streams it is transmitting within a telepresence
148	   session.

150	   The main groups of information are:

152	      <mediaCaptures>: the list of media captures available (Section 4)

154	      <encodings>: the list of individual encodings (Section 5)

156	      <encodingGroups>: the list of encodings groups (Section 6)

158	      <captureScenes>: the list of capture scenes (Section 7)

160	      <simultaneousSets>: the list of simultaneous capture
161	      sets(Section 8)

163	      <captureEncodings>: the list of instantiated capture encodings
164	      (Section 9)

166	   All of the above refers to concepts that have been introduced in
167	   [I-D.ietf-clue-framework] and [I-D.romanow-clue-data-model] and
168	   further detailed in threads on the mailing list as well as in the
169	   following of this document.

171	<?xml version="1.0" encoding="UTF-8" ?>
172	<xs:schema
173	   targetNamespace="urn:ietf:params:xml:ns:clue-info"
174	   xmlns:tns="urn:ietf:params:xml:ns:clue-info"
175	   xmlns:xs="http://www.w3.org/2001/XMLSchema"
176	   xmlns="urn:ietf:params:xml:ns:clue-info"
177	   elementFormDefault="qualified"
178	   attributeFormDefault="unqualified">

180	<!-- ELEMENT DEFINITIONS -->
181	<xs:element name="mediaCaptures" type="mediaCapturesType"/>
182	<xs:element name="encodings" type="encodingsType"/>
183	<xs:element name="encodingGroups" type="encodingGroupsType"/>
184	<xs:element name="captureScenes" type="captureScenesType"/>
185	<xs:element name="simultaneousSets" type="simultaneousSetsType"/>
186	<xs:element name="captureEncodings" type="captureEncodingsType"/>

188	<!-- MEDIA CAPTURES TYPE -->
189	<!-- envelope of media captures -->
190	<xs:complexType name="mediaCapturesType">
191	 <xs:sequence>
192	   <xs:element name="mediaCapture" type="mediaCaptureType"
193	   maxOccurs="unbounded"/>
194	 </xs:sequence>
195	</xs:complexType>

197	<!-- DESCRIPTION element -->
198	<xs:element name="description">
199	 <xs:complexType>
200	  <xs:simpleContent>
201	   <xs:extension base="xs:string">
202	     <xs:attribute name="lang" type="xs:language"/>
203	   </xs:extension>
204	  </xs:simpleContent>
205	 </xs:complexType>
206	</xs:element>

208	<!-- MEDIA CAPTURE TYPE -->
209	<xs:complexType name="mediaCaptureType" abstract="true">
210	  <xs:sequence>
211	    <!-- mandatory fields -->
212	    <xs:element name="capturedMedia" type="xs:string"/>
213	    <xs:element name="captureSceneIDREF" type="xs:IDREF"/>
214	    <xs:element name="encGroupIDREF" type="xs:IDREF"/>
215	    <xs:choice>
216	      <xs:sequence>
217	        <xs:element name="spatialInformation" type="tns:spatialInformationType"
218	        maxOccurs="unbounded"/>
219	      </xs:sequence>
220	      <xs:element name="nonSpatiallyDefinible" type="xs:boolean" fixed="true"/>
221	    </xs:choice>
222	    <!-- optional fields -->
223	    <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
224	    <xs:element name="priority" type="xs:integer" minOccurs="0"/>
225	    <xs:element name="lang" type="xs:language" minOccurs="0"/>
226	    <xs:element name="content" type="xs:string" minOccurs="0"/>
227	    <xs:element name="switched" type="xs:boolean" minOccurs="0"/>
228	    <xs:element name="dynamic" type="xs:boolean" minOccurs="0"/>
229	    <xs:element name="composed" type="xs:boolean" minOccurs="0"/>
230	    <xs:element name="maxCaptureEncodings" type="xs:unsignedInt"
231	    minOccurs="0"/>
232	    <!-- this is in place of "supplementary info": -->
233	    <xs:element name="relatedTo" type="xs:IDREF" minOccurs="0"/>
234	    <xs:any namespace="##other" processContents="lax" minOccurs="0"
235	    maxOccurs="unbounded"/>
236	  </xs:sequence>
237	  <xs:attribute name="captureID" type="xs:ID" use="required"/>
238	  <xs:anyAttribute namespace="##other" processContents="lax"/>
239	</xs:complexType>

241	<!-- SPATIAL INFORMATION TYPE -->
242	<xs:complexType name="spatialInformationType">
243	 <xs:sequence>
244	  <xs:element name="capturePoint" type="capturePointType"/>
245	  <xs:element name="captureArea" type="captureAreaType"
246	  minOccurs="0"/>
247	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
248	  maxOccurs="unbounded"/>
249	 </xs:sequence>
250	 <xs:anyAttribute namespace="##other" processContents="lax"/>
251	</xs:complexType>

253	<!-- TEXT CAPTURE TYPE -->
254	<xs:complexType name="textCaptureType">
255	 <xs:complexContent>
256	  <xs:extension base="tns:mediaCaptureType">
257	  </xs:extension>
258	 </xs:complexContent>
259	</xs:complexType>

261	<!-- AUDIO CAPTURE TYPE -->
262	<xs:complexType name="audioCaptureType">
263	 <xs:complexContent>
264	  <xs:extension base="tns:mediaCaptureType">
265	   <xs:sequence>
266	    <xs:element name="audioChannelFormat" type="audioChannelFormatType"
267	    minOccurs="0"/>
268	    <xs:element name="micPattern" type="tns:micPatternType"
269	    minOccurs="0"/>
270	   </xs:sequence>
271	  </xs:extension>
272	 </xs:complexContent>
273	</xs:complexType>

275	<!-- MIC PATTERN TYPE -->
276	<xs:simpleType name="micPatternType">
277	 <xs:restriction base="xs:string">
278	  <xs:enumeration value="uni"/>
279	  <xs:enumeration value="shotgun"/>
280	  <xs:enumeration value="omni"/>
281	  <xs:enumeration value="figure8"/>
282	  <xs:enumeration value="cardioid"/>
283	  <xs:enumeration value="hyper-cardioid"/>
284	 </xs:restriction>
285	</xs:simpleType>

287	<!-- AUDIO CHANNEL FORMAT TYPE -->
288	<xs:simpleType name="audioChannelFormatType">
289	 <xs:restriction base="xs:string">
290	  <xs:enumeration value="mono"/>
291	  <xs:enumeration value="stereo"/>
292	 </xs:restriction>
293	</xs:simpleType>

295	<!-- VIDEO CAPTURE TYPE -->
296	<xs:complexType name="videoCaptureType">
297	 <xs:complexContent>
298	  <xs:extension base="tns:mediaCaptureType">
299	   <xs:sequence>
300	    <xs:element name="nativeAspectRatio" type="xs:string"
301	    minOccurs="0"/>
302	    <xs:element ref="embeddedText" minOccurs="0"/>
303	   </xs:sequence>
304	  </xs:extension>
305	 </xs:complexContent>
306	</xs:complexType>

308	<!-- EMBEDDED TEXT ELEMENT -->
309	<xs:element name="embeddedText">
310	 <xs:complexType>
311	  <xs:simpleContent>
312	   <xs:extension base="xs:boolean">
313	    <xs:attribute name="lang" type="xs:language"/>
314	   </xs:extension>
315	  </xs:simpleContent>
316	 </xs:complexType>
317	</xs:element>

319	<!-- CAPTURE SCENES TYPE -->
320	<!-- envelope of capture scenes -->
321	<xs:complexType name="captureScenesType">
322	 <xs:sequence>
323	  <xs:element name="captureScene" type="captureSceneType"
324	  maxOccurs="unbounded"/>
325	 </xs:sequence>
326	</xs:complexType>

328	<!-- CAPTURE SCENE TYPE -->
329	<xs:complexType name="captureSceneType">
330	 <xs:sequence>
331	  <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
332	  <xs:element name="sceneSpace" type="captureSpaceType" minOccurs="0"/>
333	  <xs:element name="sceneEntries" type="sceneEntriesType"/>
334	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
335	  maxOccurs="unbounded"/>
336	 </xs:sequence>
337	 <xs:attribute name="sceneID" type="xs:ID" use="required"/>
338	 <xs:attribute name="scale" type="scaleType" use="required"/>
339	 <xs:anyAttribute namespace="##other" processContents="lax"/>
340	</xs:complexType>

342	<!-- SCALE TYPE -->
343	<xs:simpleType name="scaleType">
344	 <xs:restriction base="xs:string">
345	  <xs:enumeration value="millimeters"/>
346	  <xs:enumeration value="unknown"/>
347	  <xs:enumeration value="noscale"/>
348	 </xs:restriction>
349	</xs:simpleType>

351	<!-- CAPTURE AREA TYPE -->
352	<xs:complexType name="captureAreaType">
353	 <xs:sequence>
354	  <xs:element name="bottomLeft" type="pointType"/>
355	  <xs:element name="bottomRight" type="pointType"/>
356	  <xs:element name="topLeft" type="pointType"/>
357	  <xs:element name="topRight" type="pointType"/>

359	 </xs:sequence>
360	</xs:complexType>

362	<!-- CAPTURE SPACE TYPE -->
363	<xs:complexType name="captureSpaceType">
364	 <xs:sequence>
365	  <xs:element name="bottomLeftFront" type="pointType"/>
366	  <xs:element name="bottomRightFront" type="pointType"/>
367	  <xs:element name="topLeftFront" type="pointType"/>
368	  <xs:element name="topRightFront" type="pointType"/>
369	  <xs:element name="bottomLeftBack" type="pointType"/>
370	  <xs:element name="bottomRightBack" type="pointType"/>
371	  <xs:element name="topLeftBack" type="pointType"/>
372	  <xs:element name="topRightBack" type="pointType"/>
373	 </xs:sequence>
374	</xs:complexType>

376	<!-- POINT TYPE -->
377	<xs:complexType name="pointType">
378	 <xs:sequence>
379	  <xs:element name="x" type="xs:decimal"/>
380	  <xs:element name="y" type="xs:decimal"/>
381	  <xs:element name="z" type="xs:decimal"/>
382	 </xs:sequence>
383	</xs:complexType>

385	<!-- CAPTURE POINT TYPE -->
386	<xs:complexType name="capturePointType">
387	 <xs:complexContent>
388	  <xs:extension base="pointType">
389	   <xs:sequence>
390	    <xs:element name="lineOfCapturePoint" type="tns:pointType"
391	    minOccurs="0"/>
392	   </xs:sequence>
393	   <xs:attribute name="pointID" type="xs:ID"/>
394	  </xs:extension>
395	 </xs:complexContent>
396	</xs:complexType>

398	<!-- SCENE ENTRIES TYPE -->
399	<!-- envelope of scene entries of a capture scene -->
400	<xs:complexType name="sceneEntriesType">
401	 <xs:sequence>
402	  <xs:element name="sceneEntry" type="sceneEntryType"
403	  maxOccurs="unbounded"/>
404	 </xs:sequence>
405	</xs:complexType>
406	<!-- SCENE ENTRY TYPE -->
407	<xs:complexType name="sceneEntryType">
408	 <xs:sequence>
409	  <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
410	  <xs:element name="switchingPolicies" type="switchingPoliciesType"
411	  minOccurs="0"/>
412	  <xs:element name="mediaCaptureIDs" type="captureIDListType"/>
413	 </xs:sequence>
414	 <xs:attribute name="sceneEntryID" type="xs:ID" use="required"/>
415	 <xs:attribute name="mediaType" type="xs:string" use="required"/>
416	</xs:complexType>

418	<!-- SWITCHING POLICIES TYPE -->
419	<xs:complexType name="switchingPoliciesType">
420	 <xs:sequence>
421	  <xs:element name="siteSwitching" type="xs:boolean" minOccurs="0"/>
422	  <xs:element name="segmentSwitching" type="xs:boolean"
423	  minOccurs="0"/>
424	 </xs:sequence>
425	</xs:complexType>

427	<!-- CAPTURE ID LIST TYPE -->
428	<xs:complexType name="captureIDListType">
429	 <xs:sequence>
430	  <xs:element name="captureIDREF" type="xs:IDREF"
431	  maxOccurs="unbounded"/>
432	 </xs:sequence>
433	</xs:complexType>

435	<!-- ENCODINGS TYPE -->
436	<xs:complexType name="encodingsType">
437	 <xs:sequence>
438	  <xs:element name="encoding" type="encodingType"
439	  maxOccurs="unbounded"/>
440	 </xs:sequence>
441	</xs:complexType>

443	<!-- ENCODING TYPE -->
444	<xs:complexType name="encodingType" abstract="true">
445	 <xs:sequence>
446	  <xs:element name="encodingName" type="xs:string"/>
447	  <xs:element name="maxBandwidth" type="xs:integer"/>
448	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
449	  maxOccurs="unbounded"/>
450	 </xs:sequence>
451	 <xs:attribute name="encodingID" type="xs:ID" use="required"/>
452	 <xs:anyAttribute namespace="##any" processContents="lax"/>
453	</xs:complexType>
454	<!-- AUDIO ENCODING TYPE -->
455	<xs:complexType name="audioEncodingType">
456	 <xs:complexContent>
457	  <xs:extension base="tns:encodingType">
458	   <xs:sequence>
459	    <xs:element name="encodedMedia" type="xs:string" fixed="audio"
460	    minOccurs="0"/>
461	   </xs:sequence>
462	  </xs:extension>
463	 </xs:complexContent>
464	</xs:complexType>

466	<!-- VIDEO ENCODING TYPE -->
467	<xs:complexType name="videoEncodingType">
468	 <xs:complexContent>
469	  <xs:extension base="tns:encodingType">
470	   <xs:sequence>
471	    <xs:element name="encodedMedia" type="xs:string" fixed="video"
472	    minOccurs="0"/>
473	    <xs:element name="maxWidth" type="xs:integer" minOccurs="0"/>
474	    <xs:element name="maxHeight" type="xs:integer" minOccurs="0"/>
475	    <xs:element name="maxFrameRate" type="xs:integer" minOccurs="0"/>
476	   </xs:sequence>
477	  </xs:extension>
478	 </xs:complexContent>
479	</xs:complexType>

481	<!-- H26X ENCODING TYPE -->
482	<xs:complexType name="h26XEncodingType">
483	 <xs:complexContent>
484	  <xs:extension base="tns:videoEncodingType">
485	   <xs:sequence>
486	    <!-- max number of pixels to be processed per second -->
487	    <xs:element name="maxH26Xpps" type="xs:integer"
488	    minOccurs="0"/>
489	   </xs:sequence>
490	  </xs:extension>
491	 </xs:complexContent>
492	</xs:complexType>

494	<!-- ENCODING GROUPS TYPE -->
495	<xs:complexType name="encodingGroupsType">
496	 <xs:sequence>
497	  <xs:element name="encodingGroup" type="tns:encodingGroupType"
498	  maxOccurs="unbounded"/>
499	 </xs:sequence>
500	</xs:complexType>
501	<!-- ENCODING GROUP TYPE -->
502	<xs:complexType name="encodingGroupType">
503	 <xs:sequence>
504	  <xs:element name="maxGroupBandwidth" type="xs:integer"/>
505	  <xs:element name="maxGroupPps" type="xs:integer"
506	  minOccurs="0"/>
507	  <xs:element name="encodingIDList" type="encodingIDListType"/>
508	  <xs:any namespace="##other" processContents="lax" minOccurs="0"
509	  maxOccurs="unbounded"/>
510	 </xs:sequence>
511	 <xs:attribute name="encodingGroupID" type="xs:ID" use="required"/>
512	 <xs:anyAttribute namespace="##any" processContents="lax"/>
513	</xs:complexType>

515	<!-- ENCODING ID LIST TYPE -->
516	<xs:complexType name="encodingIDListType">
517	 <xs:sequence>
518	  <xs:element name="encIDREF" type="xs:IDREF" maxOccurs="unbounded"/>
519	 </xs:sequence>
520	</xs:complexType>

522	<!-- SIMULTANEOUS SETS TYPE -->
523	<xs:complexType name="simultaneousSetsType">
524	 <xs:sequence>
525	  <xs:element name="simultaneousSet" type="simultaneousSetType"
526	  maxOccurs="unbounded"/>
527	 </xs:sequence>
528	</xs:complexType>

530	<!-- SIMULTANEOUS SET TYPE -->
531	<xs:complexType name="simultaneousSetType">
532	 <xs:sequence>
533	   <xs:element name="captureIDREF" type="xs:IDREF"
534	   minOccurs="0" maxOccurs="unbounded"/>
535	   <xs:element name="sceneEntryIDREF" type="xs:IDREF"
536	   minOccurs="0" maxOccurs="unbounded"/>
537	 </xs:sequence>
538	</xs:complexType>

540	<!-- CAPTURE ENCODING TYPE -->
541	<xs:complexType name="captureEncodingType">
542	 <xs:sequence>
543	  <xs:element name="mediaCaptureID" type="xs:string"/>
544	  <xs:element name="encodingID" type="xs:string"/>
545	 </xs:sequence>
546	</xs:complexType>

548	<!-- CAPTURE ENCODINGS TYPE -->
549	<xs:complexType name="captureEncodingsType">
550	 <xs:sequence>
551	  <xs:element name="captureEncoding" type="captureEncodingType"
552	  maxOccurs="unbounded"/>
553	 </xs:sequence>
554	</xs:complexType>

556	<!-- CLUE INFO ELEMENT -->
557	<!-- the <clueInfo> envelope can be seen
558	     as the ancestor of an <advertisement> envelope -->
559	<xs:element name="clueInfo" type="clueInfoType"/>

561	<!-- CLUE INFO TYPE -->
562	<xs:complexType name="clueInfoType">
563	  <xs:sequence>
564	   <xs:element ref="mediaCaptures"/>
565	   <xs:element ref="encodings"/>
566	   <xs:element ref="encodingGroups"/>
567	   <xs:element ref="captureScenes"/>
568	   <xs:element ref="simultaneousSets"/>
569	   <xs:any namespace="##other" processContents="lax" minOccurs="0"
570	   maxOccurs="unbounded"/>
571	  </xs:sequence>
572	  <xs:attribute name="clueInfoID" type="xs:ID" use="required"/>
573	  <xs:anyAttribute namespace="##other" processContents="lax"/>
574	</xs:complexType>

576	</xs:schema>

578	   Following sections describe the XML schema in more detail.

580	4.  <mediaCaptures>

582	   <mediaCaptures> represents the list of one ore more media captures
583	   available on the media provider's side.  Each media capture is
584	   represented by a <mediaCapture> element (Section 10).

586	5.  <encodings>

588	   <encodings> represents the list of individual encodings available on
589	   the media provider's side.  Each individual encoding is represented
590	   by an <encoding> element (Section 16).

592	6.  <encodingGroups>

594	   <encodingGroups> represents the list of the encoding groups organized
595	   on the media provider's side.  Each encoding group is represented by
596	   a <encodingGroup> element (Section 20).

598	7.  <captureScenes>

600	   <captureScenes> represents the list of the capture scenes organized
601	   on the media provider's side.  Each capture scene is represented by a
602	   <captureScene> element.  (Section 14).

604	8.  <simultaneousSets>

606	   <simultaneousSets> contains the simultaneous sets indicated by the
607	   media provider.  Each simultaneous set is represented by a
608	   <simultaneousSet> element.  (Section 21).

610	9.  <captureEncodings>

612	   <captureEncodings> is a list of capture encodings.  It can represents
613	   the list of the desired capture encodings indicated by the media
614	   consumer or the list of instantiated captures on the provider's side.
615	   Each capture encoding is represented by a <captureEncoding> element.
616	   (Section 22).

618	10.  <mediaCapture>

620	   According to the CLUE framework, a media capture is the fundamental
621	   representation of a media flow that is available on the provider's
622	   side.  Media captures are characterized with a set of features that
623	   are independent from the specific type of medium, and with a set of
624	   feature that are media-specific.  We design the media capture type as
625	   an abstract type, providing all the features that can be common to
626	   all media types.  Media-specific captures, such as video captures,
627	   audio captures and others, are specialization of that media capture
628	   type, as in a typical generalization-specialization hierarchy.

630	   The following is the XML Schema definition of the media capture type:

632	<!-- MEDIA CAPTURE TYPE -->
633	<xs:complexType name="mediaCaptureType" abstract="true">
634	  <xs:sequence>
635	    <!-- mandatory fields -->
636	    <xs:element name="capturedMedia" type="xs:string"/>
637	    <xs:element name="captureSceneIDREF" type="xs:IDREF"/>
638	    <xs:element name="encGroupIDREF" type="xs:IDREF"/>
639	    <xs:choice>
640	      <xs:sequence>
641	        <xs:element name="spatialInformation" type="tns:spatialInformationType"
642	        maxOccurs="unbounded"/>
643	      </xs:sequence>
644	      <xs:element name="nonSpatiallyDefinible" type="xs:boolean" fixed="true"/>
645	    </xs:choice>
646	    <!-- optional fields -->
647	    <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
648	    <xs:element name="priority" type="xs:integer" minOccurs="0"/>
649	    <xs:element name="lang" type="xs:language" minOccurs="0"/>
650	    <xs:element name="content" type="xs:string" minOccurs="0"/>
651	    <xs:element name="switched" type="xs:boolean" minOccurs="0"/>
652	    <xs:element name="dynamic" type="xs:boolean" minOccurs="0"/>
653	    <xs:element name="composed" type="xs:boolean" minOccurs="0"/>
654	    <xs:element name="maxCaptureEncodings" type="xs:unsignedInt"
655	    minOccurs="0"/>
656	    <!-- this is in place of "supplementary info": -->
657	    <xs:element name="relatedTo" type="xs:IDREF" minOccurs="0"/>
658	    <xs:any namespace="##other" processContents="lax" minOccurs="0"
659	    maxOccurs="unbounded"/>
660	  </xs:sequence>
661	  <xs:attribute name="captureID" type="xs:ID" use="required"/>
662	  <xs:anyAttribute namespace="##other" processContents="lax"/>
663	</xs:complexType>

665	10.1.  <capturedMedia>

667	   <capturedMedia> is a mandatory field specifying the media type of the
668	   capture ("audio", "video", "text",...).

670	10.2.  <captureSceneIDREF>

672	   <captureSceneIDREF> is a mandatory field containing the identifier of
673	   the capture scene the media capture belongs to.  Indeed, each media
674	   capture must be associated with one and only capture scene.  When a
675	   media capture is spatially definible, some spatial information is
676	   provided along with it in the form of point coordinates (see
677	   Section 10.4).  Such coordinates refers to the space of coordinates
678	   defined for the capture scene containing the capture.

680	10.3.  <encGroupIDREF>

682	   <encGroupIDREF> is a mandatory field containing the identifier of the
683	   encoding group the media capture is associated with.

685	10.4.  <spatialInformation>

687	   Media captures are divided into two categories: non spatially
688	   definible captures and spatially definible captures.

690	   Non spatially definible captures are those that do not capture parts
691	   of the telepresence room.  Capture of this case are for example those
692	   related to registrations, text captures, DVDs, registered
693	   presentation, or external streams, that are played in the
694	   telepresence room and transmitted to remote sites.

696	   Spatially definible captures are those that capture part of the
697	   telepresence room.  The captured part of the telepresence room is
698	   described by means of the <spatialInformation> element.

700	   This is the definition of the spatial information type:

702	   <!-- SPATIAL INFORMATION TYPE -->
703	   <xs:complexType name="spatialInformationType">
704	    <xs:sequence>
705	     <xs:element name="capturePoint" type="capturePointType"/>
706	     <xs:element name="captureArea" type="captureAreaType"
707	     minOccurs="0"/>
708	     <xs:any namespace="##other" processContents="lax"
709	     minOccurs="0" maxOccurs="unbounded"/>
710	    </xs:sequence>
711	    <xs:anyAttribute namespace="##other"
712	    processContents="lax"/>
713	   </xs:complexType>

715	   The <capturePoint> contains the coordinates of the capture device
716	   that is taking the capture, as well as, optionally, the pointing
717	   direction (see Section 10.4.1).  It is a mandatory field when the
718	   media capture is spatially definible, independently from the media
719	   type.

721	   The <captureArea> is an optional field containing four points
722	   defining the captured area represented by the capture (see
723	   Section 10.4.2).

725	10.4.1.  <capturePoint>

727	   The <capturePoint> element is used to represent the position and the
728	   line of capture of a capture device.  The XML Schema definition of
729	   the <capturePoint> element type is the following:

731	   <!-- CAPTURE POINT TYPE -->
732	   <xs:complexType name="capturePointType">
733	    <xs:complexContent>
734	     <xs:extension base="pointType">
735	      <xs:sequence>
736	       <xs:element name="lineOfCapturePoint"
737	       type="tns:pointType"
738	       minOccurs="0"/>
739	      </xs:sequence>
740	      <xs:attribute name="pointID" type="xs:ID"/>
741	     </xs:extension>
742	    </xs:complexContent>
743	   </xs:complexType>

745	   <!-- POINT TYPE -->
746	   <xs:complexType name="pointType">
747	    <xs:sequence>
748	     <xs:element name="x" type="xs:decimal"/>
749	     <xs:element name="y" type="xs:decimal"/>
750	     <xs:element name="z" type="xs:decimal"/>
751	    </xs:sequence>
752	   </xs:complexType>

754	   The point type contains three spatial coordinates ("x","y","z")
755	   representing a point in the space associated with a certain capture
756	   scene.

758	   The capture point type extends the point type, i.e., it is
759	   represented by three coordinates identifying the position of the
760	   capture device, but can add further information.  Such further
761	   information is conveyed by the <lineOfCapturePoint>, which is another
762	   point-type element representing the "point on line of capture", that
763	   gives the pointing direction of the capture device.

765	   If the point of capture is not specified, it means the consumer
766	   should not assume anything about the spatial location of the
767	   capturing device.

769	   The coordinates of the point on line of capture MUST NOT be identical
770	   to the capture point coordinates.  If the point on line of capture is
771	   not specified, no assumptions are made about the axis of the
772	   capturing device.

774	10.4.2.  <captureArea>

776	   <captureArea> is an optional element that can be contained within the
777	   spatial information associated with a media capture.  It represents
778	   the spatial area captured by the media capture.

780	   The XML representation of that area is provided through a set of four
781	   point-type element, <bottomLeft>, <bottomRight>, <topLeft>, and
782	   <topRight>, as it can be seen from the following definition:

784	   <!-- CAPTURE AREA TYPE -->
785	   <xs:complexType name="captureAreaType">
786	    <xs:sequence>
787	     <xs:element name="bottomLeft" type="pointType"/>
788	     <xs:element name="bottomRight" type="pointType"/>
789	     <xs:element name="topLeft" type="pointType"/>
790	     <xs:element name="topRight" type="pointType"/>
791	    </xs:sequence>
792	   </xs:complexType>

794	   <bottomLeft>, <bottomRight>, <topLeft>, and <topRight> should be co-
795	   planar.

797	   For a switched capture that switches between different sections
798	   within a larger area, the area of capture should use coordinates for
799	   the larger potential area.

801	   By comparing the capture area of different media captures within the
802	   same capture scene, a consumer can determine the spatial
803	   relationships between them and render them correctly.  If the area of
804	   capture is not specified, it means the Media Capture is not spatially
805	   related to any other media capture.

807	10.5.  <nonSpatiallyDefinible>

809	   When media captures are non spatially definible, they are marked with
810	   the boolean <nonSpatiallyDefinible> element set to "true".

812	10.6.  <description>

814	   <description> is used to provide optionally human-readable textual
815	   information.  It is used to describe media captures, capture scenes
816	   and capture scene entries.  A media capture can be described by using
817	   multiple <description> elements, each one providing information in a
818	   different language.  Indeed, the <description> element definition is
819	   the following:

821	   <!-- DESCRIPTION element -->
822	   <xs:element name="description">
823	    <xs:complexType>
824	     <xs:simpleContent>
825	      <xs:extension base="xs:string">
826	        <xs:attribute name="lang" type="xs:language"/>
827	      </xs:extension>
828	     </xs:simpleContent>
829	    </xs:complexType>
830	   </xs:element>

832	   As it can be seen, <description> is a string element with an
833	   attribute ("lang") indicating the language used in the textual
834	   description.

836	10.7.  <priority>

838	   <priority> ([I-D.groves-clue-capture-attr]) is an optional integer
839	   field indicating the importance of a media capture according to the
840	   media provider's perspective.  It can be used on the receiver's side
841	   to automatically identify the most "important" contribution available
842	   from the media provider.

844	   [edt note: no final consensus has been reached on the adoption of
845	   such media capture attribute.]

847	10.8.  <lang>

849	   <lang> is an optional element containing the language used in the
850	   capture, if any.  The purpose of the element could match the one of
851	   the "language" attribute proposed in [I-D.groves-clue-capture-attr].

853	10.9.  <content>

855	   <content> is an optional string element.  It contains enumerated
856	   values describing the "role" of the media capture according to what
857	   is envisionend in [RFC4796] ("slides", "speaker", "sl", "main",
858	   "alt").  The values for this attribute are the same as the mediacnt
859	   values for the content attribute in [RFC4796].  This attribute can
860	   list multiple values, for example "main, speaker".

862	   [edt note: a better XML Schema definition for that element will soon
863	   be defined.]

865	10.10.  <switched>

867	   <switched> is a boolean element which indicates whether or not the
868	   media capture represents the most appropriate subset of a "whole".
869	   What is "most appropriate" is up to the provider and could be the
870	   active speaker, a lecturer or a VIP.

872	   [edt note: :(]

874	10.11.  <dynamic>

876	   <dynamic> is an optional boolean element indicating wheter or not the
877	   capture device originating the capture moves during the telepresence
878	   session.  That optional boolean element has the same purpose of the
879	   dynamic attribute proposed in [I-D.groves-clue-capture-attr].

881	   [edt note: There isn't yet final consensus about that element.]

883	10.12.  <composed>

885	   <composed> is an optional boolean element indicating wheter or not
886	   the media capture is a mix (audio) or composition (video) of streams.
887	   This attribute is useful for a media consumer for example to avoid
888	   nesting a composed video capture into another composed capture or
889	   rendering.

891	10.13.  <maxCaptureEncodings>

893	   The optional <maxCaptureEncodings> contains an unsigned integer
894	   indicating the maximum number of capture encodings that can be
895	   simultaneously active for the media capture.  If absent, this
896	   parameter defaults to 1.  The minimum value for this attribute is 1.
897	   The number of simultaneous capture encodings is also limited by the
898	   restrictions of the encoding group the media capture refers to my
899	   means of the <encGroupIDREF> element.

901	10.14.  <relatedTo>

903	   The optional <relatedTo> element contains the value of the ID
904	   attribute of the media capture it refers to.  The media capture
905	   marked with a <relatedTo> element can be for example the translation
906	   of a main media capture in a different language.  The <relatedTo>
907	   element could be interpreted the same manner of the supplementary
908	   information attribute proposed in [I-D.groves-clue-capture-attr] and
909	   further discussed in
910	   http://www.ietf.org/mail-archive/web/clue/current/msg02238.html.

912	   [edt note: There isn't yet final consensus about that element.]

914	10.15.  captureID attribute

916	   The "captureID" attribute is a mandatory field containing the
917	   identifier of the media capture.

919	11.  Audio captures

921	   Audio captures inherit all the features of a generic media capture
922	   and present further audio-specific characteristics.  The XML Schema
923	   definition of the audio capture type is reported below:

925	 <!-- AUDIO CAPTURE TYPE -->
926	 <xs:complexType name="audioCaptureType">
927	  <xs:complexContent>
928	   <xs:extension base="tns:mediaCaptureType">
929	    <xs:sequence>
930	     <xs:element name="audioChannelFormat" type="audioChannelFormatType"
931	     minOccurs="0"/>
932	     <xs:element name="micPattern" type="tns:micPatternType"
933	     minOccurs="0"/>
934	    </xs:sequence>
935	   </xs:extension>
936	  </xs:complexContent>
937	 </xs:complexType>

939	   Audio-specific information about the audio capture is contained in
940	   <audioChannelFormat> (Section 11.1) and in <micPattern>
941	   (Section 11.2).

943	11.1.  <audioChannelFormat>

945	   The optional <audioChannelFormat> element is a field with enumerated
946	   values ("mono" and "stereo") which describes the method of encoding
947	   used for audio.  A value of "mono" means the audio capture has one
948	   channel.  A value of "stereo" means the audio capture has two audio
949	   channels, left and right.  A single stereo capture is different from
950	   two mono captures that have a left-right spatial relationship.  A
951	   stereo capture maps to a single RTP stream, while each mono audio
952	   capture maps to a separate RTP stream.

954	   The XML Schema definition of the <audioChannelFormat> element type is
955	   provided below:

957	   <!-- AUDIO CHANNEL FORMAT TYPE -->
958	   <xs:simpleType name="audioChannelFormatType">
959	    <xs:restriction base="xs:string">
960	     <xs:enumeration value="mono"/>
961	     <xs:enumeration value="stereo"/>
962	    </xs:restriction>
963	   </xs:simpleType>

965	11.2.  <micPattern>

967	   The <micPattern> element is an optional field describing the
968	   characteristic of the mic capturing the audio signal.  It can
969	   contains the enumerated values listed below:

971	   <!-- MIC PATTERN TYPE -->
972	   <xs:simpleType name="micPatternType">
973	    <xs:restriction base="xs:string">
974	     <xs:enumeration value="uni"/>
975	     <xs:enumeration value="shotgun"/>
976	     <xs:enumeration value="omni"/>
977	     <xs:enumeration value="figure8"/>
978	     <xs:enumeration value="cardioid"/>
979	     <xs:enumeration value="hyper-cardioid"/>
980	    </xs:restriction>
981	   </xs:simpleType>

983	12.  Video captures

985	   Video captures, similarly to audio captures, extend the information
986	   of a generic media capture with video-specific features, such as
987	   <nativeAspectRatio> (Section 12.1) and <embeddedText> (Section 12.2).

989	   The XML Schema representation of the video capture type is provided
990	   in the following:

992	   <!-- VIDEO CAPTURE TYPE -->
993	   <xs:complexType name="videoCaptureType">
994	    <xs:complexContent>
995	     <xs:extension base="tns:mediaCaptureType">
996	      <xs:sequence>
997	       <xs:element name="nativeAspectRatio" type="xs:string"
998	       minOccurs="0"/>
999	       <xs:element ref="embeddedText" minOccurs="0"/>
1000	      </xs:sequence>
1001	     </xs:extension>
1002	    </xs:complexContent>
1003	   </xs:complexType>

1005	12.1.  <nativeAspectRatio>

1007	   If a video capture has a native aspect ratio (for instance, it
1008	   corresponds to a camera that generates 4:3 video), then it can be
1009	   supplied as a value of the <nativeAspectRatio> element, in order to
1010	   help rendering.

1012	12.2.  <embeddedText>

1014	   The <embeddedText> element is a boolean element indicating that there
1015	   is text embedded in the video capture.  The language used in such
1016	   embedded textual description is reported in <embeddedText> "lang"
1017	   attribute.

1019	   The XML Schema definition of the <embeddedText> element is:

1021	   <!-- EMBEDDED TEXT ELEMENT -->
1022	   <xs:element name="embeddedText">
1023	    <xs:complexType>
1024	     <xs:simpleContent>
1025	      <xs:extension base="xs:boolean">
1026	       <xs:attribute name="lang" type="xs:language"/>
1027	      </xs:extension>
1028	     </xs:simpleContent>
1029	    </xs:complexType>

1031	   </xs:element>

1033	   The <embeddedText> element could correspond to the embedded-text
1034	   attribute introduced in [I-D.groves-clue-capture-attr]

1036	   [edt note: no final consensus has been reached yet about the adoption
1037	   of such element]

1039	13.  Text captures

1041	   Also text captures can be described by extending the generic media
1042	   capture information, similarly to audio captures and video captures.

1044	   The XML Schema representation of the text capture type is currently
1045	   lacking text-specific information, as it can be seen by looking at
1046	   the definition below:

1048	   <!-- TEXT CAPTURE TYPE -->
1049	   <xs:complexType name="textCaptureType">
1050	    <xs:complexContent>
1051	     <xs:extension base="tns:mediaCaptureType">
1052	     </xs:extension>
1053	    </xs:complexContent>
1054	   </xs:complexType>

1056	14.  <captureScene>

1058	   A media provider organizes the available capture in capture scenes in
1059	   order to help the receiver both in the rendering and in the selection
1060	   of the group of captures.  Capture scenes are made of capture scene
1061	   entries, that are set of media captures of the same media type.  Each
1062	   capture scene entry represents an alternative to represent completely
1063	   a capture scene for a fixed media type.

1065	   The XML Schema representation of a <captureScene> element is the
1066	   following:

1068	 <!-- CAPTURE SCENE TYPE -->
1069	 <xs:complexType name="captureSceneType">
1070	  <xs:sequence>
1071	   <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
1072	   <xs:element name="sceneSpace" type="captureSpaceType" minOccurs="0"/>
1073	   <xs:element name="sceneEntries" type="sceneEntriesType"/>
1074	   <xs:any namespace="##other" processContents="lax" minOccurs="0"
1075	   maxOccurs="unbounded"/>
1076	  </xs:sequence>
1077	  <xs:attribute name="sceneID" type="xs:ID" use="required"/>
1078	  <xs:attribute name="scale" type="scaleType" use="required"/>
1079	  <xs:anyAttribute namespace="##other" processContents="lax"/>
1080	 </xs:complexType>

1082	   The <captureScene> element can contain zero or more textual
1083	   <description> elements, defined as in Section 10.6.  Besides
1084	   <description>, there are two other fields: <sceneSpace>
1085	   (Section 14.1), describing the coordinate space which the media
1086	   captures of the capture scene refer to, and <sceneEntries>
1087	   (Section 14.2), the list of the capture scene entries.

1089	14.1.  <sceneSpace> (was:<sceneArea>)

1091	   The <sceneSpace> describes a bounding volume for the spatial
1092	   information provided alongside spatially-definible media capture
1093	   associated with the considered capture scene.  Such volume is
1094	   described as an arbitrary hexahedrons with eight points
1095	   (<bottomLeftFront>, <bottomRightFront>, <topLeftFront>,
1096	   <topRightFront>, <bottomLeftBack>, <bottomRightBack>, <topLeftBack>,
1097	   and <topRightBack>).  The coordinate system is Cartesian X, Y, Z with
1098	   the origin at a spatial location of the media provider's choosing.
1099	   The media provider must use the same coordinate system with same
1100	   scale and origin for all media capture coordinates within the same
1101	   capture scene.

1103	   <!-- CAPTURE SPACE TYPE -->
1104	   <xs:complexType name="captureSpaceType">
1105	    <xs:sequence>
1106	     <xs:element name="bottomLeftFront" type="pointType"/>
1107	     <xs:element name="bottomRightFront" type="pointType"/>
1108	     <xs:element name="topLeftFront" type="pointType"/>
1109	     <xs:element name="topRightFront" type="pointType"/>
1110	     <xs:element name="bottomLeftBack" type="pointType"/>
1111	     <xs:element name="bottomRightBack" type="pointType"/>
1112	     <xs:element name="topLeftBack" type="pointType"/>
1113	     <xs:element name="topRightBack" type="pointType"/>
1114	    </xs:sequence>
1115	   </xs:complexType>

1117	   [edt note: this is just a place holder, the definition of the
1118	   bounding volume has to be discussed]

1120	14.2.  <sceneEntries>

1122	   The <sceneEntries> element is a mandatory field of a capture scene
1123	   containing the list of scene entries.  Each scene entry is
1124	   represented by a <sceneEntry> element (Section 15).

1126	   <!-- SCENE ENTRIES TYPE -->
1127	   <!-- envelope of scene entries of a capture scene -->
1128	   <xs:complexType name="sceneEntriesType">
1129	    <xs:sequence>
1130	     <xs:element name="sceneEntry" type="sceneEntryType"
1131	     maxOccurs="unbounded"/>
1132	    </xs:sequence>
1133	   </xs:complexType>

1135	14.3.  sceneID attribute

1137	   The sceneID attribute is a mandatory attribute containing the
1138	   identifier of the capture scene.

1140	14.4.  scale attribute

1142	   The scale attribute is a mandatory attribute that specifies the scale
1143	   of the coordinates provided in the capture space and in the spatial
1144	   information of the media capture belonging to the considered capture
1145	   scene.  The scale attribute can assume three different values:

1147	      "millimeters" - the scale is in millimeters.  Systems which know
1148	      their physical dimensions (for example professionally installed
1149	      telepresence room systems) should always provide those real-world
1150	      measurements.

1152	      "unknown" - the scale is not necessarily millimeters, but the
1153	      scale is the same for every media capture in the capture scene.
1154	      Systems which don't know specific physical dimensions but still
1155	      know relative distances should select "unknown" in the scale
1156	      attribute of the capture scene to be described.

1158	      "noscale" - there is no a common physical scale among the media
1159	      captures of the capture scene.  That means the scale could be
1160	      different for each media capture.

1162	   <!-- SCALE TYPE -->
1163	   <xs:simpleType name="scaleType">
1164	    <xs:restriction base="xs:string">
1165	     <xs:enumeration value="millimeters"/>
1166	     <xs:enumeration value="unknown"/>
1167	     <xs:enumeration value="noscale"/>
1168	    </xs:restriction>
1169	   </xs:simpleType>

1171	15.  <sceneEntry>

1173	   A <sceneEntry> element represents a capture scene entry, which
1174	   contains a set of media capture of the same media type describing a
1175	   capture scene.

1177	   A <sceneEntry> element is characterized as follows.

1179	   <!-- SCENE ENTRY TYPE -->
1180	   <xs:complexType name="sceneEntryType">
1181	    <xs:sequence>
1182	     <xs:element ref="description" minOccurs="0" maxOccurs="unbounded"/>
1183	     <xs:element name="switchingPolicies" type="switchingPoliciesType"
1184	     minOccurs="0"/>
1185	     <xs:element name="mediaCaptureIDs" type="captureIDListType"/>
1186	    </xs:sequence>
1187	    <xs:attribute name="sceneEntryID" type="xs:ID" use="required"/>
1188	    <xs:attribute name="mediaType" type="xs:string" use="required"/>
1189	   </xs:complexType>

1191	   One or more optional <description> elements provide human-readable
1192	   information about what the scene entry contains. <description> is
1193	   defined as already seen in Section 10.6.

1195	   The remaining child elements are described in the following
1196	   subsections.

1198	15.1.  <switchingPolicies>

1200	   <switchingPolicies> represents the switching policies the media
1201	   provider support for the media captures contained inside a scene
1202	   entry.  The <switchingPolicies> element contains two boolean
1203	   elements:

1205	      <siteSwitching>: if set to "true", it means that the media
1206	      provider supports the site switching policy for the included media
1207	      captures;

1209	      <segmentSwitching>: if set to "true", it means that the media
1210	      provider supports the segment switching policy for the included
1211	      media captures.

1213	   The "site-switch" policy means all captures are switched at the same
1214	   time to keep captures from the same endpoint site together.

1216	   The "segment-switch" policy means different captures can switch at
1217	   different times, and can be coming from different endpoints.

1219	   <!-- SWITCHING POLICIES TYPE -->
1220	   <xs:complexType name="switchingPoliciesType">
1221	    <xs:sequence>
1222	     <xs:element name="siteSwitching" type="xs:boolean" minOccurs="0"/>
1223	     <xs:element name="segmentSwitching" type="xs:boolean"
1224	     minOccurs="0"/>
1225	    </xs:sequence>
1226	   </xs:complexType>

1228	15.2.  <mediaCaptureIDs>

1230	   The <mediaCaptureIDs> is the list of the identifiers of the media
1231	   captures included in the scene entry.  It is an element of the
1232	   captureIDListType type, which is defined as a sequence of
1233	   <captureIDREF> each one containing the identifier of a media capture
1234	   listed within the <mediaCaptures> element:

1236	   <!-- CAPTURE ID LIST TYPE -->
1237	   <xs:complexType name="captureIDListType">
1238	    <xs:sequence>
1239	     <xs:element name="captureIDREF" type="xs:IDREF"
1240	     maxOccurs="unbounded"/>
1241	    </xs:sequence>
1242	   </xs:complexType>

1244	15.3.  sceneEntryID attribute

1246	   The sceneEntryID attribute is a mandatory attribute containing the
1247	   identifier of the capture scene entry represented by the <sceneEntry>
1248	   element.

1250	15.4.  mediaType attribute

1252	   The mediaType attribute contains the media type of the media captures
1253	   included in the scene entry.

1255	16.  <encoding>

1257	   The <encoding> element represents an individual encoding, i.e., a way
1258	   to encode a media capture.  Individual encodings can be characterized
1259	   with features that are independent from the specific type of medium,
1260	   and with features that are media-specific.  We design the individual
1261	   encoding type as an abstract type, providing all the features that
1262	   can be common to all media types.  Media-specific individual
1263	   encodings, such as video encodings, audio encodings and others, are
1264	   specialization of that type, as in a typical generalization-
1265	   specialization hierarchy.

1267	   <!-- ENCODING TYPE -->
1268	   <xs:complexType name="encodingType" abstract="true">
1269	    <xs:sequence>
1270	     <xs:element name="encodingName" type="xs:string"/>
1271	     <xs:element name="maxBandwidth" type="xs:integer"/>
1272	     <xs:any namespace="##other" processContents="lax" minOccurs="0"
1273	     maxOccurs="unbounded"/>
1274	    </xs:sequence>
1275	    <xs:attribute name="encodingID" type="xs:ID" use="required"/>
1276	    <xs:anyAttribute namespace="##any" processContents="lax"/>
1277	   </xs:complexType>

1279	16.1.  <encodingName>

1281	   <encodingName> is a mandatory field containing the name of the
1282	   encoding (e.g., G711, H264, ...).

1284	16.2.  <maxBandwidth>

1286	   <maxBandwidth> represent the maximum bitrate the media provider can
1287	   instantiate for that encoding.

1289	16.3.  encodingID attribute

1291	   The encodingID attribute is a mandatory attribute containing the
1292	   identifier of the individual encoding.

1294	17.  Audio encodings

1296	   Audio encodings inherit all the features of a generic individual
1297	   encoding and can present further audio-specific encoding
1298	   characteristics.  The XML Schema definition of the audio encoding
1299	   type is reported below:

1301	   <!-- AUDIO ENCODING TYPE -->
1302	   <xs:complexType name="audioEncodingType">
1303	    <xs:complexContent>
1304	     <xs:extension base="tns:encodingType">
1305	      <xs:sequence>
1306	       <xs:element name="encodedMedia" type="xs:string" fixed="audio"
1307	       minOccurs="0"/>
1308	      </xs:sequence>
1309	     </xs:extension>
1310	    </xs:complexContent>
1311	   </xs:complexType>

1313	   Up to now the only audio-specific information is the <encodedMedia>
1314	   element containing the media type of the media captures that can be
1315	   encoded with the considered individual encoding.  In the case of
1316	   audio encoding, that element is forced to "audio".

1318	18.  Video encodings

1320	   Similarly to audio encodings, video encodings can extend the
1321	   information of a generic individual encoding with video-specific
1322	   encoding features, such as <maxWidth>, <maxHeight> and
1323	   <maxFrameRate>.

1325	   The <encodedMedia> element contains the media type of the media
1326	   captures that can be encoded with the considered individual encoding.
1327	   In the case of video encoding, that element is forced to "video".

1329	   <!-- VIDEO ENCODING TYPE -->
1330	   <xs:complexType name="videoEncodingType">
1331	    <xs:complexContent>
1332	     <xs:extension base="tns:encodingType">
1333	      <xs:sequence>
1334	       <xs:element name="encodedMedia" type="xs:string" fixed="video"
1335	       minOccurs="0"/>
1336	       <xs:element name="maxWidth" type="xs:integer" minOccurs="0"/>
1337	       <xs:element name="maxHeight" type="xs:integer" minOccurs="0"/>
1338	       <xs:element name="maxFrameRate" type="xs:integer" minOccurs="0"/>
1339	      </xs:sequence>
1340	     </xs:extension>
1341	    </xs:complexContent>
1342	   </xs:complexType>

1344	18.1.  <maxWidth>

1346	   <maxWidth> represents the video resolution's maximum width supported
1347	   by the video encoding, expressed in pixels.

1349	   [edt note: not present in -09 version of the framework doc]

1351	18.2.  <maxHeight>

1353	   <maxHeight> representd the video resolution's maximum heith supported
1354	   by the video encoding, expressed in pixels.

1356	   [edt note: not present in -09 version of the framework doc]

1358	18.3.  <maxFrameRate>

1360	   <maxFrameRate> provides the maximum frame rate supported by the video
1361	   encoding for the video capture to be encoded.

1363	   [edt note: not present in -09 version of the framework doc]

1365	19.  H26X encodings

1367	   This is an example of how it is possible to further specialize the
1368	   definition of a video individual encoding in order to cover encoding
1369	   specific information.  A H26X video encoding can be represented
1370	   through an element inheriting the video encoding characteristics
1371	   described above (Section 18) and by adding other information such as
1372	   <maxH26Xpps>, which represent the maximum number of pixels to be
1373	   processed per second;.

1375	   <!-- H26X ENCODING TYPE -->
1376	   <xs:complexType name="h26XEncodingType">
1377	    <xs:complexContent>
1378	     <xs:extension base="tns:videoEncodingType">
1379	      <xs:sequence>
1380	       <!-- max number of pixels to be processed per second -->
1381	       <xs:element name="maxH26Xpps" type="xs:integer"
1382	       minOccurs="0"/>
1383	      </xs:sequence>
1384	     </xs:extension>
1385	    </xs:complexContent>
1386	   </xs:complexType>

1388	   [edt note: Need to be checked]

1390	20.  <encodingGroup>

1392	   The <encodingGroup> element represents an encoding group, which is a
1393	   set of one or more individual encodings, and parameters that apply to
1394	   the group as a whole.  The definition of the <encodingGroup> element
1395	   is the following:

1397	   <!-- ENCODING GROUP TYPE -->
1398	   <xs:complexType name="encodingGroupType">
1399	    <xs:sequence>
1400	     <xs:element name="maxGroupBandwidth" type="xs:integer"/>
1401	     <xs:element name="maxGroupPps" type="xs:integer"
1402	     minOccurs="0"/>
1403	     <xs:element name="encodingIDList" type="encodingIDListType"/>
1404	     <xs:any namespace="##other" processContents="lax" minOccurs="0"
1405	     maxOccurs="unbounded"/>
1406	    </xs:sequence>
1407	    <xs:attribute name="encodingGroupID" type="xs:ID" use="required"/>
1408	    <xs:anyAttribute namespace="##any" processContents="lax"/>
1409	   </xs:complexType>

1411	   In the following, the contained elements are further described.

1413	20.1.  <maxGroupBandwidth>

1415	   <maxGroupBandwidth> is an optional field containing the maximum
1416	   bitrate supported for all the individual encodings included in the
1417	   encoding group.

1419	20.2.  <maxGroupPps>

1421	   <maxGroupPps> is an optional field containing the maximum number of
1422	   pixel per second for all the individual encodings included in the
1423	   encoding group.

1425	   [edt note: Need to be checked]

1427	20.3.  <encodingIDList>

1429	   <maxGroupBandwidth> is the list of the individual encoding grouped
1430	   together.  Each individual encoding is represented through its
1431	   identifier contained within an <encIDREF> element.

1433	   <!-- ENCODING ID LIST TYPE -->
1434	   <xs:complexType name="encodingIDListType">
1435	    <xs:sequence>
1436	     <xs:element name="encIDREF" type="xs:IDREF" maxOccurs="unbounded"/>
1437	    </xs:sequence>
1438	   </xs:complexType>

1440	20.4.  encodingGroupID attribute

1442	   The encodingGroupID attribute contains the identifier of the encoding
1443	   group.

1445	21.  <simultaneousSet>

1447	   <simultaneousSet> represents a simultaneous set, i.e. a list of
1448	   capture of the same type that cab be transmitted at the same time by
1449	   a media provider.  There are different simultaneous transmission sets
1450	   for each media type.

1452	   <!-- SIMULTANEOUS SET TYPE -->
1453	   <xs:complexType name="simultaneousSetType">
1454	    <xs:sequence>
1455	      <xs:element name="captureIDREF" type="xs:IDREF"
1456	      minOccurs="0" maxOccurs="unbounded"/>
1457	      <xs:element name="sceneEntryIDREF" type="xs:IDREF"
1458	      minOccurs="0" maxOccurs="unbounded"/>
1459	    </xs:sequence>
1460	   </xs:complexType>

1462	   [edt note: need to be checked]

1464	21.1.  <captureIDREF>

1466	   <captureIDREF> contains the identifier of the media capture that
1467	   belongs to the simultanous set.

1469	21.2.  <sceneEntryIDREF>

1471	   <captureIDREF> contains the identifier of the scene entry containing
1472	   a group of capture that are able to be sent simultaneously with the
1473	   other capture of the simultaneous set.

1475	22.  <captureEncoding>

1477	   A <captureEncoding> is given from the association of a media capture
1478	   and an individual encoding, to form a capture stream.  It is defined
1479	   as en element of the following type:

1481	   <!-- CAPTURE ENCODING TYPE -->
1482	   <xs:complexType name="captureEncodingType">
1483	    <xs:sequence>
1484	     <xs:element name="mediaCaptureID" type="xs:string"/>
1485	     <xs:element name="encodingID" type="xs:string"/>
1486	    </xs:sequence>
1487	   </xs:complexType>

1489	22.1.  <mediaCaptureID>

1491	   <mediaCaptureID> contains the identifier of the media capture that
1492	   has been encoded to form the capture encoding.

1494	22.2.  <encodingID>

1496	   <encodingID> contains the identifier of the applied individual
1497	   encoding.

1499	23.  <clueInfo>

1501	   The <clueInfo> element has been left within the XML Schema for the
1502	   sake of convenience when representing a prototype of ADVERTISEMENT
1503	   message (see the example section).

1505	   <!-- CLUE INFO ELEMENT -->
1506	   <!-- the <clueInfo> envelope can be seen
1507	        as the ancestor of an <advertisement> envelope -->
1508	   <xs:element name="clueInfo" type="clueInfoType"/>

1510	   <!-- CLUE INFO TYPE -->
1511	   <xs:complexType name="clueInfoType">
1512	     <xs:sequence>
1513	      <xs:element ref="mediaCaptures"/>
1514	      <xs:element ref="encodings"/>
1515	      <xs:element ref="encodingGroups"/>
1516	      <xs:element ref="captureScenes"/>
1517	      <xs:element ref="simultaneousSets"/>
1518	      <xs:any namespace="##other" processContents="lax" minOccurs="0"
1519	      maxOccurs="unbounded"/>
1520	     </xs:sequence>
1521	     <xs:attribute name="clueInfoID" type="xs:ID" use="required"/>
1522	     <xs:anyAttribute namespace="##other" processContents="lax"/>
1523	   </xs:complexType>

1525	24.  Sample XML file

1527	   The following XML document represents a schema compliant example of a
1528	   CLUE telepresence scenario.

1530	   There are 5 video captures:

1532	   VC0:  the video from the left camera

1534	   VC1:  the video from the central camera

1536	   VC2:  the video from the right camera

1538	   VC3:  the overall view of the telepresence room taken from the
1539	      central camera

1541	   VC4:  the video associated with the slide stream

1543	   There are 2 audio captures:

1545	   AC0:  the overall room audio taken from the central camera

1547	   AC1:  the audio associated with the slide stream presentation

1549	   The captures are organized into two capture scenes:

1551	   CS1:  this scene contains captures associated with the participants
1552	      that are in the telepresence room.

1554	   CS2:  this scene contains captures associated with the slide
1555	      presentation, which is a pre-registered presentation played within
1556	      the context of the telepresence session.

1558	   Within the capture scene CS1, there are three scene entries
1559	   available:

1561	   CS1_SE1:  this entry contains the partipants' video captures taken
1562	      from the three cameras (VC0, VC1, VC2).

1564	   CS1_SE2:  this entry contains the zoomed-out view of the overall
1565	      telepresence room (VC3)

1567	   CS1_SE3:  this entry contains the overall telepresence room audio
1568	      (AC0)

1570	   On the other hand, capture scene CS2 presents two scene entries:

1572	   CS2_SE1:  this entry contains the presentation audio stream (AC1)

1574	   CS2_SE2:  this entry contains the presentation video stream (VC4)

1576	   There are two encoding groups:

1578	   EG0  This encoding groups involves video encodings ENC0, ENC1, ENC2

1580	   EG1  This encoding groups involves audio encodings ENC3, ENC4

1582	   As to the simultaneous sets, only VC1 and VC3 cannot be transmitted
1583	   simultaneously since they are captured by the same device. i.e. the
1584	   central camera (VC3 is a zoomed-out view while VC1 is a focused view
1585	   of the front participants).  The simultaneous sets would then be the
1586	   following:

1588	   SS1  made by VC0, VC1, VC2, VC4, AC0, AC1

1590	   SS2  made by VC0, VC3, VC2, VC4, AC0, AC1

1592	 <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
1593	 <clueInfo xmlns="urn:ietf:params:xml:ns:clue-info" clueInfoID="prova">
1594	     <mediaCaptures>
1595	         <mediaCapture
1596	         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1597	         xsi:type="audioCaptureType" captureID="AC1">
1598	             <capturedMedia>audio</capturedMedia>
1599	             <captureSceneIDREF>CS2</captureSceneIDREF>
1600	             <encGroupIDREF>EG1</encGroupIDREF>
1601	             <nonSpatiallyDefinible>true</nonSpatiallyDefinible>
1602	             <description lang="en">presentation audio</description>
1603	             <content>slide</content>
1604	             <audioChannelFormat>mono</audioChannelFormat>
1605	         </mediaCapture>
1606	         <mediaCapture
1607	         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1608	         xsi:type="videoCaptureType" captureID="VC4">
1609	             <capturedMedia>video</capturedMedia>
1610	             <captureSceneIDREF>CS2</captureSceneIDREF>
1611	             <encGroupIDREF>EG0</encGroupIDREF>
1612	             <nonSpatiallyDefinible>true</nonSpatiallyDefinible>
1613	             <description lang="en">presentation video</description>
1614	             <content>slides</content>
1615	         </mediaCapture>
1616	         <mediaCapture
1617	         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1618	         xsi:type="audioCaptureType" captureID="AC0">
1619	             <capturedMedia>audio</capturedMedia>
1620	             <captureSceneIDREF>CS1</captureSceneIDREF>
1621	             <encGroupIDREF>EG1</encGroupIDREF>
1622	             <spatialInformation>
1623	                 <capturePoint>
1624	                     <x>0.5</x>
1625	                     <y>1.0</y>
1626	                     <z>0.5</z>
1627	                     <lineOfCapturePoint>
1628	                         <x>0.5</x>
1629	                         <y>0.0</y>
1630	                         <z>0.5</z>
1631	                     </lineOfCapturePoint>
1632	                 </capturePoint>
1633	             </spatialInformation>
1634	             <description lang="en">
1635	             audio from the central camera mic</description>
1636	             <audioChannelFormat>mono</audioChannelFormat>
1637	             <micPattern>figure8</micPattern>
1638	         </mediaCapture>
1639	         <mediaCapture
1640	         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1641	         xsi:type="videoCaptureType" captureID="VC3">
1642	             <capturedMedia>video</capturedMedia>
1643	             <captureSceneIDREF>CS1</captureSceneIDREF>
1644	             <encGroupIDREF>EG0</encGroupIDREF>
1645	             <spatialInformation>
1646	                 <capturePoint>
1647	                     <x>1.5</x>
1648	                     <y>1.0</y>
1649	                     <z>0.5</z>
1650	                     <lineOfCapturePoint>
1651	                         <x>1.5</x>
1652	                         <y>0.0</y>
1653	                         <z>0.5</z>
1654	                     </lineOfCapturePoint>
1655	                 </capturePoint>
1656	                 <captureArea>
1657	                     <bottomLeft>
1658	                         <x>0.0</x>
1659	                         <y>3.0</y>
1660	                         <z>0.0</z>
1661	                     </bottomLeft>
1662	                     <bottomRight>
1663	                         <x>3.0</x>
1664	                         <y>3.0</y>
1665	                         <z>0.0</z>
1666	                     </bottomRight>
1667	                     <topLeft>
1668	                         <x>0.0</x>
1669	                         <y>3.0</y>
1670	                         <z>3.0</z>
1671	                     </topLeft>
1672	                     <topRight>
1673	                         <x>3.0</x>
1674	                         <y>3.0</y>
1675	                         <z>3.0</z>
1676	                     </topRight>
1677	                 </captureArea>
1678	             </spatialInformation>
1679	             <description lang="en">
1680	             zoomed out view of the room</description>
1681	         </mediaCapture>
1682	         <mediaCapture
1683	         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1684	         xsi:type="videoCaptureType" captureID="VC2">
1685	             <capturedMedia>video</capturedMedia>
1686	             <captureSceneIDREF>CS1</captureSceneIDREF>
1687	             <encGroupIDREF>EG0</encGroupIDREF>
1688	             <spatialInformation>
1689	                 <capturePoint>
1690	                     <x>2.5</x>
1691	                     <y>1.0</y>
1692	                     <z>0.5</z>
1693	                     <lineOfCapturePoint>
1694	                         <x>2.5</x>
1695	                         <y>0.0</y>
1696	                         <z>0.5</z>
1697	                     </lineOfCapturePoint>
1698	                 </capturePoint>
1699	                 <captureArea>
1700	                     <bottomLeft>
1701	                         <x>2.0</x>
1702	                         <y>3.0</y>
1703	                         <z>0.0</z>
1704	                     </bottomLeft>
1705	                     <bottomRight>
1706	                         <x>3.0</x>
1707	                         <y>3.0</y>
1708	                         <z>0.0</z>
1709	                     </bottomRight>
1710	                     <topLeft>
1711	                         <x>2.0</x>
1712	                         <y>3.0</y>
1713	                         <z>3.0</z>
1714	                     </topLeft>
1715	                     <topRight>
1716	                         <x>3.0</x>
1717	                         <y>3.0</y>
1718	                         <z>3.0</z>
1719	                     </topRight>
1720	                 </captureArea>
1721	             </spatialInformation>
1722	             <description lang="en">right camera video</description>
1723	         </mediaCapture>
1724	         <mediaCapture
1725	         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1726	         xsi:type="videoCaptureType" captureID="VC1">
1727	             <capturedMedia>video</capturedMedia>
1728	             <captureSceneIDREF>CS1</captureSceneIDREF>
1729	             <encGroupIDREF>EG0</encGroupIDREF>
1730	             <spatialInformation>
1731	                 <capturePoint>
1732	                     <x>1.5</x>
1733	                     <y>1.0</y>
1734	                     <z>0.5</z>
1735	                     <lineOfCapturePoint>
1736	                         <x>1.5</x>
1737	                         <y>0.0</y>
1738	                         <z>0.5</z>
1739	                     </lineOfCapturePoint>
1740	                 </capturePoint>
1741	                 <captureArea>
1742	                     <bottomLeft>
1743	                         <x>1.0</x>
1744	                         <y>3.0</y>
1745	                         <z>0.0</z>
1746	                     </bottomLeft>
1747	                     <bottomRight>
1748	                         <x>2.0</x>
1749	                         <y>3.0</y>
1750	                         <z>0.0</z>
1751	                     </bottomRight>
1752	                     <topLeft>
1753	                         <x>1.0</x>
1754	                         <y>3.0</y>
1755	                         <z>3.0</z>
1756	                     </topLeft>
1757	                     <topRight>
1758	                         <x>2.0</x>
1759	                         <y>3.0</y>
1760	                         <z>3.0</z>
1761	                     </topRight>
1762	                 </captureArea>
1763	             </spatialInformation>
1764	             <description lang="en">central camera video</description>
1765	         </mediaCapture>
1766	         <mediaCapture
1767	         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1768	         xsi:type="videoCaptureType" captureID="VC0">
1769	             <capturedMedia>video</capturedMedia>
1770	             <captureSceneIDREF>CS1</captureSceneIDREF>
1771	             <encGroupIDREF>EG0</encGroupIDREF>
1772	             <spatialInformation>
1773	                 <capturePoint>
1774	                     <x>0.5</x>
1775	                     <y>1.0</y>
1776	                     <z>0.5</z>
1777	                     <lineOfCapturePoint>
1778	                         <x>0.5</x>
1779	                         <y>0.0</y>
1780	                         <z>0.5</z>
1781	                     </lineOfCapturePoint>
1782	                 </capturePoint>
1783	                 <captureArea>
1784	                     <bottomLeft>
1785	                         <x>0.0</x>
1786	                         <y>3.0</y>
1787	                         <z>0.0</z>
1788	                     </bottomLeft>
1789	                     <bottomRight>
1790	                         <x>1.0</x>
1791	                         <y>3.0</y>
1792	                         <z>0.0</z>
1793	                     </bottomRight>
1794	                     <topLeft>
1795	                         <x>0.0</x>
1796	                         <y>3.0</y>
1797	                         <z>3.0</z>
1798	                     </topLeft>
1799	                     <topRight>
1800	                         <x>1.0</x>
1801	                         <y>3.0</y>
1802	                         <z>3.0</z>
1803	                     </topRight>
1804	                 </captureArea>
1805	             </spatialInformation>
1806	             <description lang="en">left camera video</description>
1807	         </mediaCapture>
1808	     </mediaCaptures>
1809	     <encodings>
1810	         <encoding xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1811	         xsi:type="videoEncodingType" encodingID="ENC0">
1812	             <encodingName>h263</encodingName>
1813	             <maxBandwidth>4000000</maxBandwidth>
1814	             <encodedMedia>video</encodedMedia>
1815	             <maxWidth>1920</maxWidth>
1816	             <maxHeight>1088</maxHeight>
1817	         </encoding>
1818	         <encoding xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1819	         xsi:type="videoEncodingType" encodingID="ENC1">
1820	             <encodingName>h263</encodingName>
1821	             <maxBandwidth>4000000</maxBandwidth>
1822	             <encodedMedia>video</encodedMedia>
1823	             <maxWidth>1920</maxWidth>
1824	             <maxHeight>1088</maxHeight>
1825	         </encoding>
1826	         <encoding xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1827	         xsi:type="videoEncodingType" encodingID="ENC2">
1828	             <encodingName>h263</encodingName>
1829	             <maxBandwidth>4000000</maxBandwidth>
1830	             <encodedMedia>video</encodedMedia>
1831	             <maxWidth>1920</maxWidth>
1832	             <maxHeight>1088</maxHeight>
1833	         </encoding>
1834	         <encoding xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1835	         xsi:type="audioEncodingType" encodingID="ENC3">
1836	             <encodingName>g711</encodingName>
1837	             <maxBandwidth>64000</maxBandwidth>
1838	             <encodedMedia>audio</encodedMedia>
1839	         </encoding>
1840	         <encoding xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
1841	         xsi:type="audioEncodingType" encodingID="ENC4">
1842	             <encodingName>g711</encodingName>
1843	             <maxBandwidth>64000</maxBandwidth>
1844	             <encodedMedia>audio</encodedMedia>
1845	         </encoding>
1846	     </encodings>
1847	     <encodingGroups>
1848	         <encodingGroup encodingGroupID="EG0">
1849	             <maxGroupBandwidth>12000000</maxGroupBandwidth>
1850	             <encodingIDList>
1851	                 <encIDREF>ENC0</encIDREF>
1852	                 <encIDREF>ENC1</encIDREF>
1853	                 <encIDREF>ENC2</encIDREF>
1854	             </encodingIDList>
1855	         </encodingGroup>
1856	         <encodingGroup encodingGroupID="EG1">
1857	             <maxGroupBandwidth>12000000</maxGroupBandwidth>
1858	             <encodingIDList>
1859	                 <encIDREF>ENC3</encIDREF>
1860	                 <encIDREF>ENC4</encIDREF>
1861	             </encodingIDList>
1862	         </encodingGroup>
1863	     </encodingGroups>
1864	     <captureScenes>
1865	         <captureScene scale="unknown" sceneID="CS1">
1866	             <description lang="en">main scene</description>
1867	             <sceneSpace>
1868	                 <bottomLeftFront>
1869	                     <x>0.0</x>
1870	                     <y>3.0</y>
1871	                     <z>0.0</z>
1872	                 </bottomLeftFront>
1873	                 <bottomRightFront>
1874	                     <x>3.0</x>
1875	                     <y>3.0</y>
1876	                     <z>0.0</z>
1877	                 </bottomRightFront>
1878	                 <topLeftFront>
1879	                     <x>0.0</x>
1880	                     <y>3.0</y>
1881	                     <z>2.0</z>
1882	                 </topLeftFront>
1883	                 <topRightFront>
1884	                     <x>3.0</x>
1885	                     <y>3.0</y>
1886	                     <z>2.0</z>
1887	                 </topRightFront>
1888	                 <bottomLeftBack>
1889	                     <x>0.0</x>
1890	                     <y>3.0</y>
1891	                     <z>0.0</z>
1892	                 </bottomLeftBack>
1893	                 <bottomRightBack>
1894	                     <x>3.0</x>
1895	                     <y>3.0</y>
1896	                     <z>0.0</z>
1897	                 </bottomRightBack>
1898	                 <topLeftBack>
1899	                     <x>0.0</x>
1900	                     <y>3.0</y>
1901	                     <z>2.0</z>
1902	                 </topLeftBack>
1903	                 <topRightBack>
1904	                     <x>3.0</x>
1905	                     <y>3.0</y>
1906	                     <z>2.0</z>
1907	                 </topRightBack>
1908	             </sceneSpace>
1909	             <sceneEntries>
1910	                 <sceneEntry mediaType="video" sceneEntryID="SE1">
1911	                     <description lang="en">
1912	                     participants streams</description>
1913	                     <mediaCaptureIDs>
1914	                         <captureIDREF>VC0</captureIDREF>
1915	                         <captureIDREF>VC1</captureIDREF>
1916	                         <captureIDREF>VC2</captureIDREF>
1917	                     </mediaCaptureIDs>
1918	                 </sceneEntry>
1919	                 <sceneEntry mediaType="video" sceneEntryID="SE2">
1920	                     <description lang="en">room stream</description>
1921	                     <mediaCaptureIDs>
1922	                         <captureIDREF>VC3</captureIDREF>
1923	                     </mediaCaptureIDs>
1924	                 </sceneEntry>
1925	                 <sceneEntry mediaType="audio" sceneEntryID="SE3">
1926	                     <description lang="en">room audio</description>
1927	                     <mediaCaptureIDs>
1928	                         <captureIDREF>AC0</captureIDREF>
1929	                     </mediaCaptureIDs>
1930	                 </sceneEntry>
1931	             </sceneEntries>
1932	         </captureScene>
1933	         <captureScene scale="noscale" sceneID="CS2">
1934	             <description lang="en">presentation</description>
1935	             <sceneEntries>
1936	                 <sceneEntry mediaType="video" sceneEntryID="CS2_SE1">
1937	                     <description lang="en">
1938	                     presentation video</description>
1939	                     <mediaCaptureIDs>
1940	                         <captureIDREF>VC4</captureIDREF>
1941	                     </mediaCaptureIDs>
1942	                 </sceneEntry>
1943	                 <sceneEntry mediaType="audio" sceneEntryID="CS2_SE2">
1944	                     <description lang="en">
1945	                     presentation audio</description>
1946	                     <mediaCaptureIDs>
1947	                         <captureIDREF>AC1</captureIDREF>
1948	                     </mediaCaptureIDs>
1949	                 </sceneEntry>
1950	             </sceneEntries>
1951	         </captureScene>
1952	     </captureScenes>
1953	     <simultaneousSets>
1954	         <simultaneousSet setID="SS1">
1955	             <captureIDREF>VC0</captureIDREF>
1956	             <captureIDREF>VC1</captureIDREF>
1957	             <captureIDREF>VC2</captureIDREF>
1958	             <captureIDREF>VC4</captureIDREF>
1959	             <captureIDREF>AC0</captureIDREF>
1960	             <captureIDREF>AC1</captureIDREF>
1961	         </simultaneousSet>
1962	         <simultaneousSet setID="SS2">
1963	             <captureIDREF>VC0</captureIDREF>
1964	             <captureIDREF>VC3</captureIDREF>
1965	             <captureIDREF>VC2</captureIDREF>
1966	             <captureIDREF>VC4</captureIDREF>
1967	             <captureIDREF>AC0</captureIDREF>
1968	             <captureIDREF>AC1</captureIDREF>
1969	         </simultaneousSet>
1970	     </simultaneousSets>
1971	 </clueInfo>

1973	25.  Diff with unofficial -02 version

1975	   Here the link to the unofficial -02 version:
1976	   http://www.grid.unina.it/Didattica/RetiDiCalcolatori
1977	   /inf/draft-presta-clue-data-model-schema-02.html
1978	   <mediaCaptures> moved from <sceneEntry> to <clueInfo> elements.
1979	      <mediaCaptures> have been moved out from the <captureScene> blob
1980	      again.  Media captures should have identifiers that are valid out
1981	      of the local scope of capture scenes, since a consumer should be
1982	      able to require also single captures in the CONFIGURE message.
1983	      This design choice reflects a bottom up approach where captures
1984	      are the basis of the data model.  In each media capture a
1985	      reference to the capture scene containing it is provided.  It
1986	      identifies the space the spatial information of the media capture
1987	      refers to.

1989	   XML document example updated  A new example, compliant with the
1990	      updated schema, has been provided.

1992	   language attribute added to <mediaCapture>  Such optional attribute
1993	      reflects the language used in the capture, if any.  The purpose of
1994	      the element could match the one of the language attribute proposed
1995	      in [I-D.groves-clue-capture-attr].

1997	   <priority> added to <mediaCapture>  The priority element has an
1998	      integer value helping in specifying a media capture relative
1999	      importance with respect to the other captures.  That element could
2000	      correspond to the priority attribute introduced in
2001	      [I-D.groves-clue-capture-attr].

2003	   <embeddedText> added to <videoCapture>  The element, if present,
2004	      indicates text embedded in the video capture.  The language used
2005	      in such embedded textual description is also envisioned within the
2006	      <embeddedText> element itself.  That element could correspond to
2007	      the priority attribute introduced in
2008	      [I-D.groves-clue-capture-attr]

2010	   <relatedTo> added to <mediaCapture>  That optional element contains
2011	      the ID of a capture the capture refers to.  This is for supporting
2012	      cases where there is the translation of a main capture in a
2013	      different language.  Such translation can be marked with a
2014	      <relatedTo> tag to refer to the main capture.  This could be
2015	      interpreted the same manner of the supplementary information
2016	      attribute proposed in [I-D.groves-clue-capture-attr] and further
2017	      discussed in
2018	      http://www.ietf.org/mail-archive/web/clue/current/msg02238.html.

2020	   <dynamic> added to <mediaCapture>  That optional boolean element has
2021	      the same purpose of the dynamic attribute proposed in
2022	      [I-D.groves-clue-capture-attr].  It indicates if the capture
2023	      device originating the capture moves during the telepresence
2024	      session.

2026	   new element definition for <description>  <description> has a new
2027	      attribute, lang, indicating the language used for the text within
2028	      <description>. <description> is used to provide human readable
2029	      information about captures, scene, and scene entries.  The
2030	      definitions of the corresponding XML elements (i.e.,
2031	      <mediaCapture>, <captureScene>, <sceneEntry>) have been updated to
2032	      make them able to contain more than one <description>.  In that
2033	      way, they can be described in different languages.

2035	   text capture added as new type of capture  The element is just a
2036	      place holder, since it is not characterized with any further
2037	      information up to now.

2039	26.  Diff with -02 version

2041	   <sceneSpace> of capture space type  <sceneSpace> (was:<sceneArea>)
2042	      describes a bounding volume for the space of a capture scene as an
2043	      arbitrary hexahedrons with eight points (placeholder solution).

2045	   H26X encoding  to be checked.

2047	   Simultaneous sets  The XML Schema definition of the simultaneous sets
2048	      has changed.  A simultaneous set is defined as a list of L media
2049	      capture identifiers and M capture scene entrie identifiers, where
2050	      L, M can be 0 or unbounded.

2052	   Capture encoding  A new XML Schema type has been added to describe
2053	      capture encodings as the result of the association of a media
2054	      capture, represented by its identifier, with an individual
2055	      encoding, represented by its identifier as well.

2057	   Clue info  The <clueInfo> element has been left within the XML Schema
2058	      for the sake of convenience when representing a prototype of
2059	      ADVERTISEMENT message (see the example section).

2061	   Data model definitions added  For each element of the datamodel a
2062	      brief description has been reported to foster discussion.

2064	27.  Informative References

2066	   [I-D.groves-clue-capture-attr]  Groves, C., Yang, W., and R. Even,
2067	                                   "CLUE media capture description",
2068	                                   draft-groves-clue-capture-attr-01
2069	                                   (work in progress), February 2013.

2071	   [I-D.ietf-clue-framework]       Duckworth, M., Pepperell, A., and S.
2072	                                   Wenger, "Framework for Telepresence
2073	                                   Multi-Streams",
2074	                                   draft-ietf-clue-framework-09 (work in
2075	                                   progress), February 2013.

2077	   [I-D.romanow-clue-data-model]   Romanow, A. and A. Pepperell, "Data
2078	                                   model for the CLUE Framework",
2079	                                   draft-romanow-clue-data-model-01
2080	                                   (work in progress), June 2012.

2082	   [RFC4796]                       Hautakorpi, J. and G. Camarillo, "The
2083	                                   Session Description Protocol (SDP)
2084	                                   Content Attribute", RFC 4796,
2085	                                   February 2007.

2087	Authors' Addresses

2089	   Roberta Presta
2090	   University of Napoli
2091	   Via Claudio 21
2092	   Napoli  80125
2093	   Italy

2095	   EMail: roberta.presta@unina.it

2097	   Simon Pietro Romano
2098	   University of Napoli
2099	   Via Claudio 21
2100	   Napoli  80125
2101	   Italy

2103	   EMail: spromano@unina.it