Языки описания информации презентация

Содержание

1. Языки описания информации
2. XSD (XML Schema Definition Language) XML Schema: альтернатива использованию DTD
3. Стандарт XML Schema XML Schema это альтернатива
4. Спецификации XML Shema и временной график их принятия
5. Язык XML Schema позволяет описывать структуру
6. Введение в XML Schema Язык XML Schema
7. XML Schema является правопреемником (заменителем) языка
8. Почему нужно использовать XML Schema Язык XML
9. XML Schemas поддерживает типы данных One of
10. Рассмотрим пример простого XML-документа (note.xml)
11. Описание схемы на языке XML Schema
12. Связывание XML-документ note.xml с описанием его
13. Здесь все используемые элементы, с помощью
14. Синтаксис языка XSD Корневым элементом описаний на
15. Первый атрибут xmlns:xs="http://www.w3.org/2001/XMLSchema" указывает, что в
16. Встроенные типы языка XML Schema В
17. Примеры встроенных типов xs:string – произвольные строки
18. xs:boolean – значение true или false
19. Использование такого большого количества встроенных типов
20. Описание простых элементов В состав описываемого конкретного
21. Примеры простых XML элементов XML-документ: Refsnes 36
22. Значения по умолчанию и фиксированные значения Простые
23. Описание атрибутов Простые элементы не могут
24. По умолчанию атрибуты считаются не обязательными.
25. Ограничения на содержание элементов Если простому элементу
26. Типы ограничений на элементы
27. Типы ограничений на элементы (2)
28. Пример ограничения Для элемента age (возраст) возможный
29. Пример ограничения (2) C помощью элемента xs:enumeration
30. Пример с типами автомобилей можно описать
31. Ограничения содержания текстового элемента Для ограничения
32. В каждой строке могут быть заданы
33. Кроме задания символов в квадратных скобках
34. Пример 1 разрешенными значениями являются
35. Пример 2 должно задаваться точно 8
36. Пример 3 может задаваться любой
37. Описание составных элементов Составные элементы могут содержать
38. Примеры составных элементов Пример пустого составного элемента
39. Описание пустых элементов Пустые элементы не могут
40. Создание нового составного типа и указание
41. Описание составных элементов, содержащих только дочерние элементы
42. Также можно задать создать новый тип
43. Описание элементов, имеющих простое содержание При описании
44. Описание элемента simpleContent с использованием ограничение
45. Пример описания простого элемента Например, нужно описать
46. Описание составных типов со смешанным содержанием Составные
47. Данный элемент "letter" можно описать следующим
48. Индикаторы Для задания правил использования в
49. Индикатор Индикатор указывает, что дочерние элементы
50. Индикатор Индикатор указывает, что в составном
51. Индикатор Индикатор указывает, что дочерние элементы
52. Индикаторы и Индикатор указывает на максимальное
53. По умолчанию для всех элементов задаются
54. С помощью индикаторов группирования элементов можно
55. После определения группы с именем persongroup,
56. С помощью индикаторов группирования атрибутов можно
57. Элемент Элемент позволяет включать в XML-документ
58. Теперь в элементе person включать, например,
59. Ниже показан XML-документ (называемый "Myfamily.xml"), использующий
60. Данный XML-документ является действительным (valid), так
62. Спасибо за внимание!
66. XML Schemas использует XML-синтаксис Another great strength
67. XML Schemas Secure Data Communication When sending
68. XML Schemas являются расширяемым XML Schemas are
69. Well-Formed is not Enough A well-formed XML
70. XML документы могут ссылаться на DTD или
71. Описание на языке XML Schema The following
72. Ссылки на схему Ссылка на DTD. This
73. Элемент XSD - The element is
74. The following fragment: xmlns:xs=http://www.w3.org/2001/XMLSchema indicates that
75. Ссылка на схему в XML документе This
76. Простые (simple) XSD элементы Простыми XML элементами называются
77. Пример простых элементов Here are some XML
78. Значение по умолчанию и фиксированное значение для
79. Атрибуты XSD All attributes are declared as
80. Default and Fixed Values for Attributes Attributes
81. XSD ограничения/грани Restrictions are used to define acceptable
82. Restrictions on a Set of Values To
83. Restrictions on a Series of Values To
84. Restrictions on a Series of Values (2)
85. Other Restrictions on a Series of Values
86. The next example defines an element
87. The next example defines an element
88. Restrictions on Whitespace Characters To specify how
89. This example also defines an element
90. Restrictions on Length To limit the length
91. Restrictions for Datatypes
92. XSD Complex Elements A complex element is an
93. Описание сложных элементов Look at this complex
94. If you use the method described
95. Пустые элементы An empty complex element cannot
96. However, it is possible to declare
97. Составные элементы, содержащие только подэлементы ("elements-only") An
98. Другой способ описания Or you can give
99. XSD Text-Only Elements A complex text-only element can
100. Here is an example of an
101. Элементы со смешанным содержанием A mixed complex
102. We could also give the complexType
103. Индикаторы XSD We can control HOW elements
104. Order Indicators Order indicators are used to
105. Choice Indicator The indicator specifies that
106. Occurrence Indicators Occurrence indicators are used to
107. Пример XML-документа Myfamily.xml
108. Пример описания схемы family.xsd The XML file
109. Индикаторы группы Group indicators are used to
110. After you have defined a group,
111. Attribute Groups Attribute groups are defined with
112. XSD The Element The element enables us to
113. The XML file below (called "Myfamily.xml"),
114. XSD The Element The element enables us to
115. XSD Element Substitution With XML Schemas, one element
116. A valid XML document (according to
117. Blocking Element Substitution To prevent other elements
118. Using substitutionGroup The type of the
119. XML document Let's have a look at
120. Create an XML Schema Now we want
121. Create an XML Schema Next, we have
122. Then we have to define the
123. We start by defining the "shipto"
124. Now we can define the "item"
125. We have specified this by setting
126. Полное описание схемы (shiporder.xsd)
127. Разделение схемы
128. Использование именованных типов (Named Types)
129. The restriction element indicates that the
130. Типы данных XML Schema
131. XSD String Data Types String data types are used
132. NormalizedString Data Type The normalizedString data type
133. Token Data Type The token data type
134. String Data Types Note that all of
135. Restrictions on String Data Types Restrictions that
136. XSD Date and Time Data Types Date and time
137. Time Data Type The time data type
138. Time Zones To specify a time zone,
139. DateTime Data Type The dateTime data type
140. Time Zones To specify a time zone,
141. Duration Data Type The duration data type
142. Negative Duration To specify a negative duration,
143. Date and Time Data Types …
144. Restrictions on Date Data Types Restrictions that
145. XSD Numeric Data Types - Decimal Data Type Decimal
146. Integer Data Type The integer data type
147. Numeric Data Types Note that all of
148. Restrictions on Numeric Data Types Restrictions that
149. XSD Miscellaneous Data Types Other miscellaneous data types are
150. AnyURI Data Type The anyURI data type
151. Miscellaneous Data Types …
152. Restrictions on Miscellaneous Data Types Restrictions that
153. XSD Elements
154. XSD Elements (2)
155. XSD Elements (3)
156. XSD ограничения/грани для типов данных
157. Синтаксис DTD устарел и недостаточно гибок,
158. В отличие от синтаксиса DTD, синтаксис
159. Пример XML документа переписи 738
160. Описание схемы документа переписи
162. Объявление элементов и атрибутов В первой строке
163. Сложные и простые типы элементов В следующей
164. Ограничение модели содержимого Элементы составных типов могут
165. Типы данных В каждом объявлении элемента
166. Предопределенные типы данных В следующем списке перечислены
167. Грани Грани (facets) представляют собой свойства, используемые
168. Шаблоны Объявление сложного типа Address знакомит нас
169. XML документ Abiteboul Buneman Suciu
170. DTD (Document Type Definition)
171. XML Schema

Слайд 1Языки описания информации
Лекция 3.
Кафедра «ОСУ»
Весенний семестр 2010/2011 уч. года
XML Schema и

XLink

Слайд 2XSD (XML Schema Definition Language) XML Schema: альтернатива использованию DTD

Слайд 3Стандарт XML Schema
XML Schema это альтернатива языку DTD, основанная на синтаксисе

языка XML.
XML 1.0 поддерживает DTD для определения структуры документов.
Язык XML Schema имеет лучшие возможности поддержки для приложений, структуры документов, атрибутов т типов данных.
Будущие версии XML будут основываться на XML Schema для определения типов XML документов.
Стандарт XML Schema включает
XML Schema Structures specifies the XML Schema definition language.
XML Schema Data types specifies extensible data types for XML.

Слайд 4Спецификации XML Shema и временной график их принятия

Слайд 5
Язык XML Schema позволяет описывать структуру XML документа.
Пример XML Schema:

xmlns:xs="http://www.w3.org/2001/XMLSchema">

Слайд 6Введение в XML Schema
Язык XML Schema предназначен для описания структуры XML

документов.
Язык XML Schema также называется XML Schema Definition (XSD).
Целью создания языка XML Schema является описание правильного синтаксиса конкретного языка разметки (также, как и язык DTD).
XML Schema определяет:
элементы, которые могут появляться в документах;
атрибуты, которые могут использоваться с элементами;
как элементы могут вкладываться друг в друга;
порядок следования дочерних элементов;
может ли элемент быть пустым или может ли он включать текст;
типы данных для элементов и атрибутов;
значения по умолчанию и фиксированные значения для элементов и атрибутов.

Слайд 7
XML Schema является правопреемником (заменителем) языка DTD.
Считается, что скоро XML Schema

будет использоваться в большинстве Web приложений в качестве замены для DTDs.
Причинами этого является:
XML Schema является расширяемым для будущих дополнений;
XML Schema более выразительный и более мощный, чем DTD;
XML Schema использует семантику метаязыка XML;
XML Schema поддерживает типы данных;
XML Schema поддерживает пространства име (namespaces).

Слайд 8Почему нужно использовать XML Schema
Язык XML Schemas является более мощным по

сравнению с языком DTD.
XML Schemas Support Data Types.
XML Schemas use XML Syntax.
XML Schemas Secure Data Communication.
XML Schemas are Extensible.
Well-Formed is not Enough.

Слайд 9XML Schemas поддерживает типы данных
One of the greatest strength of XML

Schemas is the support for data types.
With support for data types:
It is easier to describe allowable document content
It is easier to validate the correctness of data
It is easier to work with data from a database
It is easier to define data facets (restrictions on data)
It is easier to define data patterns (data formats)
It is easier to convert data between different data types

Слайд 10Рассмотрим пример простого XML-документа (note.xml)

Луценко
Фролов
Напоминание

Не забудь о собрании лаборатории

в субботу!

Слайд 11Описание схемы на языке XML Schema
На языке XML Schema язык

разметки использованный для составления XML-документа note.xml может быть описан следующим образом (файл note.xsd):

targetNamespace="http://www.w3schools.com"
xmlns="http://www.w3schools.com"
elementFormDefault="qualified">

Слайд 12
Связывание XML-документ note.xml с описанием его синтаксиса на языке XML Schema

можно выполнить следующим образом:

< сообщение xmlns="http://www.w3schools.com"
xmlns:xsl="http://www.w3.org/2001/XMLSchema-instance"
xsl:schemaLocation="http://www.w3schools.com note.xsd">
<для_кого>Луценко
<от_кого>Фролов
<заголовок>Напоминание
<содержание>
Не забудь о собрании лаборатории в субботу!

ЛуценкоФроловНапоминаниеНе забудь о собрании лаборатории в субботу!">

Связывание XML-документ note.xml с описанием его синтаксиса на языке XML Schema можно выполнить следующим образом:<

Слайд 13
Здесь все используемые элементы, с помощью первого атрибута объявлены в пространстве

имен "http://www.w3schools.com".
Во втором атрибуте (xmlns:xsl="http://www.w3.org/2001/XMLSchema-instance") создается префикс для пространства имен «XML Schema Instance», которое используется для связывания с экземплярами документов описания схем на языке XSD.
Из данного пространства имен можно использовать атрибут schemaLocation, которому задается два значения, разделенные пробелом.
первое значение (http://www.w3schools.com) – это используемое пространство имен,
второе значение (note.xsd) – местоположение документа с описанием XML схемы для данного пространства имен.
Например:
xsl:schemaLocation="http://www.w3schools.com note.xsd"

Слайд 14Синтаксис языка XSD
Корневым элементом описаний на языке XML Schema является элемент

:

. . .
. . .

Элемент может включать некоторые атрибуты. Например:
targetNamespace="http://www.w3schools.com"
xmlns="http://www.w3schools.com"
elementFormDefault="qualified">

Слайд 15
Первый атрибут xmlns:xs="http://www.w3.org/2001/XMLSchema" указывает, что в данном документе используется пространство имен

"http://www.w3.org/2001/XMLSchema".
Второй атрибут targetNamespace= "http://www.w3schools.com" указывает на то, что для описываемых в документе новых элементов (сообщение, для_кого, от_кого, заголовок, содержание) задается пространство имен "http://www.w3schools.com".
Третий атрибут xmlns="http://www. w3schools.com" указывает, что пространством имен по умолчанию является "http://www.w3schools.com".
Четвертый атрибут elementFormDefault= "qualified" указывает, что для всех элементов данного XML-документа должно быть указано пространство имен.
Описания всех элементов и атрибутов конкретного языка могут использовать атрибут type.

Слайд 16Встроенные типы языка XML Schema
В языке XML Schema определен набор из

19 встроенных (предопределенных) типов данных, которые могут использоваться при описании элементов и атрибутов.

Слайд 17Примеры встроенных типов
xs:string – произвольные строки текста;
xs:byte, xs:float, xs:long – числовые

форматы:
xs:byte – любое 8-разрядное число со знаком;
xs:long – любое 32-разрядное число со знаком;
xs:float – число с плавающей точкой, например, 15,6 или 5.032E-6.
xs:time, xs:date, xs:timeinstant, xs:timeduration – шаблоны для обозначения времени, даты и длительности;

Слайд 18
xs:boolean – значение true или false (допускаются также числовые эквиваленты: 0

или 1);
xs:binary – шаблон для двоичных чисел, например, 00101110;
xs:language – код языка, например, en-US или RUS;
xs:uri-reference – шаблон для любого URI, например, http://www.donut.org/cruller.xml#ingredients;
xs:ID, xs:IDREF, xs:IDREFS, xs:NMTOKEN, xs:NMTOKENS – типы атрибутов, действующие так же, как соответствующие типы в DTD.

Слайд 19
Использование такого большого количества встроенных типов делает язык XSD очень полезным

для документов, которые связаны со специальными приложениями для обработки данных, например, базами данных и бланками ввода заказов.
При его использовании не требуется писать программу, проверяющую типы данных, т.к. эту работу выполняет сам XML-процессор.

Слайд 20Описание простых элементов
В состав описываемого конкретного языка могут входить простые элементы

и составные элементы.
Простые элементы могут включать только текстовое содержание и не могут включать другие элементы или атрибуты.
Составные элементы могут содержать текст, атрибуты и дочерние элементы.
Для описания простого элемента используется следующий синтаксис:
,
где "имя" это имя элемента, а "тип" – тип данных, которые описываемый элемент может иметь.

Слайд 21Примеры простых XML элементов
XML-документ:
Refsnes
36
1970-03-27
Эти элементы могут быть следующим образом описаны с

помощью языка XSD:

Слайд 22Значения по умолчанию и фиксированные значения
Простые элементы могут иметь значения по

умолчанию или фиксированные значения.
Значение по умолчанию автоматически назначается элементу, когда никакого значения ему не задано.
Например:

Если для простого элемента задано фиксированное значение, то данному элементу не могут быть заданы другие значения.
Например:

Слайд 23Описание атрибутов
Простые элементы не могут иметь атрибуты. Если элемент имеет

атрибуты, то он считается составным типом.
Однако, сами атрибуты описываются, как простые типы, следующим образом:
,
"имя" – это имя атрибуты,
"тип" – тип данных, который может быть задан данному атрибуту.
Например:

Атрибуты, также, как и простые типы могут иметь значения по умолчанию или фиксированные значения.
Например:

Слайд 24
По умолчанию атрибуты считаются не обязательными.
Для задания обязательности описываемых атрибутов

используется атрибут "use".
Например:

Слайд 25Ограничения на содержание элементов
Если простому элементу или атрибуту задан тип, то

он налагает ограничения на их содержание.
Например, если элемент типа xs:date будет содержать строку типа "Hello World", то он не будет считаться верным.
Кроме использования типов на значения простых элементов и атрибутов можно также накладывать дополнительные ограничения, которые еще называются «гранями» (facets).
Ограничения можно задать с помощью элементов xs:restriction, в который вкладываются разные типы ограничений,

Слайд 26Типы ограничений на элементы

Слайд 27Типы ограничений на элементы (2)

Слайд 28Пример ограничения
Для элемента age (возраст) возможный интервал значений от 0 до

120 можно задать следующим образом:

Слайд 29Пример ограничения (2)
C помощью элемента xs:enumeration можно ограничить содержание элемента списком

возможных значений.
Например, с помощью элемента xs:enumeration можно установит условие, что элементу car могут задаваться только значения из списка (Audi, Golf, BMW):

Слайд 30
Пример с типами автомобилей можно описать и по-другому:
вначале создать новый

тип carType,
затем использовать его для описания элемента car:

Новый тип carType может использоваться и для описания и других элементов, т.к. не является частью элемента car.

Слайд 31Ограничения содержания текстового элемента
Для ограничения содержания элемента, имеющего тип xs:string,

можно использовать шаблоны, которые задаются с помощью элемента xs:pattern и атрибута value.
Шаблоны используют язык регулярных выражений и задают наборы правил формирования последовательности допустимых символов.
Они могут задаваться в виде перечня разрешенных строк разделенных знаком "|".
Например: "male | female".

Слайд 32
В каждой строке могут быть заданы конкретные символы или не определенные

символы, описываемые в квадратных скобках […], в которых перечисляются возможные их значения,
например, [xyz] – символ может получать значение x или y или z.
Вместо перечисления значений символа можно задавать интервалы возможных значений: a-z (только маленькие буквы); A-Z (только большие буквы); 0-9 (только цифры).
Интервалы можно комбинировать, например, a-zA-Z – возможна маленькая или большая буква.

Слайд 33
Кроме задания символов в квадратных скобках можно использовать следующие указатели:
\d

– одна цифра;
\s – пробельный символ;
\w – буква или цифра.
С помощью круглых скобок возможные символы можно объединять в группы, после которой можно указать сколько раз данная группа может повториться.
Для этого после закрывающейся скобки группы можно указать символы:
* – ноль или более;
+ – 1 или более;
{n} – точно n раз.

$Кроме задания символов в квадратных скобках можно использовать следующие указатели: \d – одна цифра; \s$

Слайд 34Пример 1

разрешенными значениями являются одна или несколько пар символов,

каждая пара состоит из маленькой и большой буквы
верно: "sToP";
неверно: "Stop" или "STOP".

Слайд 35Пример 2

должно задаваться точно 8 символов, каждый из которых может

быть
маленькой (a-z) или большой буквой (A-Z), или
цифрой от 0 до 9 (0-9).

Слайд 36Пример 3

может задаваться
любой символ алфавита ([A-Z]),
за которым следует

дефис (-) и
три цифры (\d{3}).

$Пример 3 может задаваться любой символ алфавита ([A-Z]), за которым следует дефис (-) и три цифры (\d{3}).$

Слайд 37Описание составных элементов
Составные элементы могут содержать текст, другие элементы и/или атрибуты.

Имеются 4 вида составных элементов:
пустые элементы пустые элементы, которые не могут иметь содержание, но могут иметь атрибуты;
элементы, содержащие другие элементы;
элементы, содержащие только текст;
элементы, которые содержат другие элементы и текст (смешанное содержание).
Каждый из этих видов элементов также может содержать и атрибуты.

Слайд 38Примеры составных элементов
Пример пустого составного элемента "product":

Пример составного

элемента "employee", содержащего только дочерние элементы :

John
Smith

Слайд 39Описание пустых элементов
Пустые элементы не могут иметь содержания, но могут иметь

атрибуты.
Их нужно описывать, как составные элементы, но не включать в них никакие дочерние элементы, а определять только их атрибуты.
Например: элемент, использующий атрибут pid, которому могут задаваться только целые положительные значения (например, ), может описываться на языке XML Schema следующими способами:

Слайд 40
Создание нового составного типа и указание его в описании элемента product:

name="product" type="prodtype"/>

Непосредственное описание элемента без содержания и с одним атрибутом с помощью элементов xs:complexType и xs:restriction:

Слайд 41Описание составных элементов, содержащих только дочерние элементы
Составные элементы, содержащие только дочерние

элементы, как например элемент person:

John
Smith

могут определяться на языке XSD следующим образом:

Элемент xs:sequence указывает, что дочерние элементы (firstname и lastname) должны в XML-документе появляться в заданном порядке.

Слайд 42
Также можно задать создать новый тип составного элемента persontype и использовать

его при описании элемента person:

Слайд 43Описание элементов, имеющих простое содержание
При описании элемента, включающего только простое содержимое

(текст и атрибуты), нужно использовать элемент simpleContent.
В элементе simpleContent можно описать расширение (xs:extension) или ограничение (xs:restriction):

. . .

Слайд 44
Описание элемента simpleContent с использованием ограничение (xs:restriction):

. . .

Слайд 45Пример описания простого элемента
Например, нужно описать элемент "shoesize" (размер обуви), которые

записываются следующим образом:
35
Данный элемент может быть описан, как составной элемент, содержащий только целые значения и использующий атрибут с именем country:

Также можно вначале создать новый составной тип shoetype, а затем использовать его при описании элемента shoesize.

Слайд 46Описание составных типов со смешанным содержанием
Составные типы со смешанным содержанием могут

включать атрибуты, дочерние элементы и текст.
Например, рассмотрим элемент letter, который может содержать, как текст, так и другие элементы:

Уважаемый господин Луценко Александр.
Ваш заказ 1032
Был отправлен 2012-03-13.

Слайд 47
Данный элемент "letter" можно описать следующим образом:

name="orderid" type="xs:positiveInteger"/>

Отметим, чтобы разрешить записывать символьные данные между дочерними элементами, требуется задать атрибуту mixed значение true.

Слайд 48Индикаторы
Для задания правил использования в XML-документе дочерних элементов применяются индикаторы.

Имеется три вида индикаторов:
Индикаторы порядка – используются для определения порядка элементов (all, choice, sequence).
Индикаторы кратности – используется для определения того, сколько раз могут записываться элементы: (maxOccurs, minOccurs).
Индикаторы группирования – используются для задания взаимосвязанных наборов элементов (Group, attributeGroup).

Слайд 49Индикатор
Индикатор указывает, что дочерние элементы могут появляться в любом

порядке и каждый элемент должен быть появится только один раз.
Например:

Отметим, что при использовании индикатора можно
индикатору задавать значение 0 или 1,
индикатору может задаваться только значение 1.

Слайд 50Индикатор
Индикатор указывает, что в составном элементе должен быть задан

один из перечисленных дочерних элементов.
Например:

Слайд 51Индикатор
Индикатор указывает, что дочерние элементы должны записываться в том

порядке, как они описаны.
Например:

Слайд 52Индикаторы и
Индикатор указывает на максимальное количество, а

индикатор указывает на минимальное количество дочерних элементов. Например, для задания ограничения детей от 0 до 10 можно записать

maxOccurs="10" minOccurs="0"/>

Слайд 53
По умолчанию для всех элементов задаются индикаторы maxOccurs=1 и minOccurs=1.
Для

того, что не было ограничения на максимальное количество дочернего элемента нужно задать: maxOccurs="unbounded".

Слайд 54
С помощью индикаторов группирования элементов можно описывать группы элементов, задавать им

имена и использовать их в других описаниях.
При создании группы элементов нужно указать индикатор порядка элементов внутри данной группы (all, choice или sequence).
Например: описание группы элементов с именем persongroup, которые должны появляться в заданном порядке:

Слайд 55
После определения группы с именем persongroup, на нее можно ссылаться в

других описаниях с помощью атрибута ref.
Например, при описании составного элемента:

Слайд 56
С помощью индикаторов группирования атрибутов можно описывать группы атрибутов, задавать им

имена и использовать их в других описаниях.
Например: описание группы атрибутов с именем personattrgroup:

После создания данной группы атрибутов на нее можно ссылаться в других описаниях.
Например:

Слайд 57Элемент
Элемент позволяет включать в XML-документ элементы, которые не

указаны в описании схемы конкретного языка разметки.
Пример, показанный ниже, является фрагментом из схемы конкретного языка разметки, хранящегося в файле family.xsd.
В нем содержится описание элемента person.
С помощью элемента выполняется расширение содержания типа элементов person.
В этом случае в элементе person после элемента lastname могут записываться любые не описанные в схеме элементы:

Слайд 58
Теперь в элементе person включать, например, элемент children, хотя в самой

схеме возможность использования данного элемента не была описана.
Рассмотрим файл, называемый "children.xsd", который содержит описание схему (описание синтаксиса конкретного языка разметки):

targetNamespace="http://www.w3schools.com"
xmlns="http://www.w3schools.com"
elementFormDefault="qualified">

Слайд 59
Ниже показан XML-документ (называемый "Myfamily.xml"), использующий элементы из двух разных схем

"family.xsd" и "children.xsd":

xmlns:xsl="http://www.w3.org/2001/XMLSchema-instance"
xsl:schemaLocation="http://www.microsoft.com family.xsd
http://www.w3schools.com children.xsd">

Hege
Refsnes

Cecilie

Stale
Refsnes

Слайд 60
Данный XML-документ является действительным (valid), так как схема "family.xsd" позволяет расширять

элемент "person" необязательным элементом после элемента "lastname".
Элементы и используются для того, чтобы сделать конкретный язык разметки расширяемым!
Они позволяют документу содержать дополнительные элементы, которые не объявлены в основной XML схеме.

Слайд 61

Слайд 62Спасибо за внимание!

Слайд 63

Слайд 64

Слайд 65

Слайд 66XML Schemas использует XML-синтаксис
Another great strength about XML Schema is that

they are written in XML.
Some benefits of that XML Schemas are written in XML:
You don't have to learn a new language
You can use your XML editor to edit your Schema files
You can use your XML parser to parse your Schema files
You can manipulate your Schema with the XML DOM
You can transform your Schema with XSLT

Слайд 67XML Schemas Secure Data Communication
When sending data from a sender to

a receiver, it is essential that both parts have the same "expectations" about the content.
With XML Schemas, the sender can describe the data in a way that the receiver will understand.
A date like: "03-11-2004" will, in some countries, be interpreted as 3.November and in other countries as 11.March.
However, an XML element with a data type like this:
2004-03-11
ensures a mutual understanding of the content, because the XML data type "date" requires the format "YYYY-MM-DD".

Слайд 68XML Schemas являются расширяемым
XML Schemas are extensible, because they are written

in XML.
With an extensible Schema definition you can:
Reuse your Schema in other Schemas
Create your own data types derived from the standard types
Reference multiple schemas in the same document

Слайд 69Well-Formed is not Enough
A well-formed XML document is a document that

conforms to the XML syntax rules, like:
it must begin with the XML declaration
it must have one unique root element
start-tags must have matching end-tags
elements are case sensitive
all elements must be closed
all elements must be properly nested
all attribute values must be quoted
entities must be used for special characters
Even if documents are well-formed they can still contain errors, and those errors can have serious consequences.
Think of the following situation:
you order 5 gross of laser printers, instead of 5 laser printers (???).
With XML Schemas, most of these errors can be caught by your validating software.

Слайд 70XML документы могут ссылаться на DTD или XML Schema
Look at this

simple XML document called "note.xml":

Tove
Jani
Reminder
Don't forget me this weekend!

DTD файл. The following example is a DTD file called "note.dtd" that defines the elements of the XML document above ("note.xml"):

The first line defines the note element to have four child elements: "to, from, heading, body".
Line 2-5 defines the to, from, heading, body elements to be of type "#PCDATA".

Слайд 71Описание на языке XML Schema
The following example is an XML Schema

file called "note.xsd" that defines the elements of the XML document above ("note.xml"):






The note element is a complex type because it contains other elements. The other elements (to, from, heading, body) are simple types because they do not contain other elements. You will learn more about simple and complex types in the following chapters.

Слайд 72Ссылки на схему
Ссылка на DTD. This XML document has a reference

to a DTD:

Tove
Jani
Reminder
Don't forget me this weekend!

Ссылка на XML Schema. This XML document has a reference to an XML Schema:

xmlns:xsi=http://www.w3.org/2001/XMLSchema-instance
xsi:schemaLocation="http://www.w3schools.com note.xsd">
Tove
Jani
Reminder
Don't forget me this weekend!

Слайд 73Элемент XSD -
The element is the root element of

every XML Schema.
... ...

The element may contain some attributes.
A schema declaration often looks something like this:
targetNamespace=http://www.w3schools.com
xmlns=http://www.w3schools.com
elementFormDefault="qualified">
…
…

Слайд 74
The following fragment: xmlns:xs=http://www.w3.org/2001/XMLSchema indicates that the elements and data types

used in the schema come from the "http://www.w3.org/2001/XMLSchema" namespace.
It also specifies that the elements and data types that come from the "http://www.w3.org/2001/XMLSchema" namespace should be prefixed with xs:
This fragment: targetNamespace="http://www.w3schools.com"indicates that the elements defined by this schema (note, to, from, heading, body.) come from the "http://www.w3schools.com" namespace.
This fragment: xmlns="http://www.w3schools.com"indicates that the default namespace is "http://www.w3schools.com".
This fragment: elementFormDefault="qualified"indicates that any elements used by the XML instance document which were declared in this schema must be namespace qualified.

Слайд 75Ссылка на схему в XML документе
This XML document has a reference

to an XML Schema:
xmlns:xsi=http://www.w3.org/2001/XMLSchema-instance
xsi:schemaLocation="http://www.w3schools.com note.xsd“>
Tove
Jani
Reminder
Don't forget me this weekend!

The following fragment: xmlns="http://www.w3schools.com"specifies the default namespace declaration.
This declaration tells the schema-validator that all the elements used in this XML document are declared in the "http://www.w3schools.com" namespace.
Once you have the XML Schema Instance namespace available: xmlns:xsi=http://www.w3.org/2001/XMLSchema-instance you can use the schemaLocation attribute. This attribute has two values.
The first value is the namespace to use.
The second value is the location of the XML schema to use for that namespace: xsi:schemaLocation="http://www.w3schools.com note.xsd"

Слайд 76Простые (simple) XSD элементы
Простыми XML элементами называются элементы, которые могут содержать только

текст. Они не могут включать другие элементы или атрибуты.
Текст может представлять собой данные разных типов:
типов включенных в определение XML Schema (boolean, string, date, etc.),
пользовательский тип, который он описывает сам.
You can also add restrictions (facets) to a data type in order to limit its content, or you can require the data to match a specific pattern.
The syntax for defining a simple element is:
, где
xxx - name of the element and
yyy - data type of the element.
XML Schema has a lot of built-in data types. The most common types are:
xs:string
xs:decimal
xs:integer
xs:boolean
xs:date
xs:time

Слайд 77Пример простых элементов
Here are some XML elements:
Refsnes
36
1970-03-27
And here are the corresponding

simple element definitions:

Слайд 78Значение по умолчанию и фиксированное значение для простых типов
Simple elements may

have a default value OR a fixed value specified.
A default value is automatically assigned to the element when no other value is specified.
In the following example the default value is "red":

A fixed value is also automatically assigned to the element, and you cannot specify another value.
In the following example the fixed value is "red":

Слайд 79Атрибуты XSD
All attributes are declared as simple types.
Simple elements cannot have

attributes. If an element has attributes, it is considered to be of a complex type. But the attribute itself is always declared as a simple type.
The syntax for defining an attribute is:
, где
xxx is the name of the attribute
yyy specifies the data type of the attribute.
XML Schema has a lot of built-in data types. The most common types are:
xs:string
xs:decimal
xs:integer
xs:boolean
xs:date
xs:time
Пример XML element with an attribute:
Smith
And here is the corresponding attribute definition:

Слайд 80Default and Fixed Values for Attributes
Attributes may have a default value

OR a fixed value specified.
A default value is automatically assigned to the attribute when no other value is specified.
Например, default value is "EN":

A fixed value is also automatically assigned to the attribute, and you cannot specify another value.
Например, fixed value is "EN":

Optional and Required Attributes
Attributes are optional by default. To specify that the attribute is required, use the "use" attribute:

Restrictions on Content.
When an XML element or attribute has a data type defined, it puts restrictions on the element's or attribute's content.
If an XML element is of type "xs:date" and contains a string like "Hello World", the element will not validate.
With XML Schemas, you can also add your own restrictions to your XML elements and attributes. Такие ограничения называются гранями (facets).

Слайд 81XSD ограничения/грани
Restrictions are used to define acceptable values for XML elements or

attributes. Restrictions on XML elements are called facets.
Restrictions on Values
The following example defines an element called "age" with a restriction. The value of age cannot be lower than 0 or greater than 120:

Слайд 82Restrictions on a Set of Values
To limit the content of an

XML element to a set of acceptable values, we would use the enumeration constraint.
The example below defines an element called "car" with a restriction. The only acceptable values are: Audi, Golf, BMW:

The example above could also have been written like this:




Note: In this case the type "carType" can be used by other elements because it is not a part of the "car" element.

Слайд 83Restrictions on a Series of Values
To limit the content of an

XML element to define a series of numbers or letters that can be used, we would use the pattern constraint.
The example below defines an element called "letter" with a restriction. The only acceptable value is ONE of the LOWERCASE letters from a to z:

The next example defines an element called "initials" with a restriction. The only acceptable value is THREE of the UPPERCASE letters from a to z:

Слайд 84Restrictions on a Series of Values (2)
The next example also defines

an element called "initials" with a restriction. The only acceptable value is THREE of the LOWERCASE OR UPPERCASE letters from a to z:

The next example defines an element called "choice" with a restriction. The only acceptable value is ONE of the following letters: x, y, OR z:

Слайд 85Other Restrictions on a Series of Values
The example below defines an

element called "letter" with a restriction. The acceptable value is zero or more occurrences of lowercase letters from a to z:

The next example also defines an element called "letter" with a restriction. The acceptable value is one or more pairs of letters, each pair consisting of a lower case letter followed by an upper case letter. For example, "sToP" will be validated by this pattern, but not "Stop" or "STOP" or "stop":

Слайд 86
The next example defines an element called "gender" with a restriction.

The only acceptable value is male OR female:

The next example defines an element called "password" with a restriction. There must be exactly eight characters in a row and those characters must be lowercase or uppercase letters from a to z, or a number from 0 to 9:

Слайд 87
The next example defines an element called "prodid" with a restriction.

The only acceptable value is FIVE digits in a sequence, and each digit must be in a range from 0 to 9:

Слайд 88Restrictions on Whitespace Characters
To specify how whitespace characters should be handled,

we would use the whiteSpace constraint.
This example defines an element called "address" with a restriction. The whiteSpace constraint is set to "preserve", which means that the XML processor WILL NOT remove any white space characters:

This example also defines an element called "address" with a restriction. The whiteSpace constraint is set to "replace", which means that the XML processor WILL REPLACE all white space characters (line feeds, tabs, spaces, and carriage returns) with spaces:

Слайд 89
This example also defines an element called "address" with a restriction.

The whiteSpace constraint is set to "collapse", which means that the XML processor WILL REMOVE all white space characters (line feeds, tabs, spaces, carriage returns are replaced with spaces, leading and trailing spaces are removed, and multiple spaces are reduced to a single space):

Слайд 90Restrictions on Length
To limit the length of a value in an

element, we would use the length, maxLength, and minLength constraints.
This example defines an element called "password" with a restriction. The value must be exactly eight characters:

This example defines another element called "password" with a restriction. The value must be minimum five characters and maximum eight characters:

Слайд 91Restrictions for Datatypes

Слайд 92XSD Complex Elements
A complex element is an XML element that contains other

elements and/or attributes.
There are four kinds of complex elements:
empty elements
elements that contain only other elements
elements that contain only text
elements that contain both other elements and text
Note: Each of these elements may contain attributes as well!
Examples of Complex Elements
A complex XML element, "product", which is empty:

A complex XML element, "employee", which contains only other elements:

John
Smith

A complex XML element, "food", which contains only text:
Ice cream
A complex XML element, "description", which contains both elements and text:

It happened on 03.03.99 ....

Слайд 93Описание сложных элементов
Look at this complex XML element, "employee", which contains

only other elements:

John
Smith

We can define a complex element in an XML Schema two different ways:
1. The "employee" element can be declared directly by naming the element, like this:

If you use the method described above, only the "employee" element can use the specified complex type. Note that the child elements, "firstname" and "lastname", are surrounded by the indicator. This means that the child elements must appear in the same order as they are declared. You will learn more about indicators in the XSD Indicators chapter.
2. The "employee" element can have a type attribute that refers to the name of the complex type to use:

Слайд 94
If you use the method described above, several elements can refer

to the same complex type, like this:

You can also base a complex element on an existing complex element and add some elements, like this:

Слайд 95Пустые элементы
An empty complex element cannot have contents, only attributes.
Complex Empty

Elements
An empty XML element:

The "product" element above has no content at all. To define a type with no content, we must define a type that allows elements in its content, but we do not actually declare any elements, like this:

In the example above, we define a complex type with a complex content. The complexContent element signals that we intend to restrict or extend the content model of a complex type, and the restriction of integer declares one attribute but does not introduce any element content.

Слайд 96
However, it is possible to declare the "product" element more compactly,

like this:

Or you can give the complexType element a name, and let the "product" element have a type attribute that refers to the name of the complexType (if you use this method, several elements can refer to the same complex type):

Слайд 97Составные элементы, содержащие только подэлементы ("elements-only")
An "elements-only" complex type contains an

element that contains only other elements.
An XML element, "person", that contains only other elements:
John Smith
В схеме элемент "person" может быть описан следующим образом:

Notice the tag. It means that the elements defined ("firstname" and "lastname") must appear in that order inside a "person" element.

Слайд 98Другой способ описания
Or you can give the complexType element a name,

and let the "person" element have a type attribute that refers to the name of the complexType (if you use this method, several elements can refer to the same complex type):

Слайд 99XSD Text-Only Elements
A complex text-only element can contain text and attributes.
Complex Text-Only

Elements.
This type contains only simple content (text and attributes), therefore we add a simpleContent element around the content.
When using simple content, you must define an extension OR a restriction within the simpleContent element, like this:
.... .... или .... ....

Tip: Use the extension/restriction element to expand or to limit the base simple type for the element.

Слайд 100
Here is an example of an XML element, "shoesize", that contains

text-only:
35The following example declares a complexType, "shoesize". The content is defined as an integer value, and the "shoesize" element also contains an attribute named "country":
We could also give the complexType element a name, and let the "shoesize" element have a type attribute that refers to the name of the complexType (if you use this method, several elements can refer to the same complex type):

Слайд 101Элементы со смешанным содержанием
A mixed complex type element can contain attributes,

elements, and text.
Complex Types with Mixed Content
An XML element, "letter", that contains both text and other elements:
Dear Mr.John Smith. Your order 1032 will be shipped on 2001-07-13. The following schema declares the "letter" element:

Note: To enable character data to appear between the child-elements of "letter", the mixed attribute must be set to "true". The tag means that the elements defined (name, orderid and shipdate) must appear in that order inside a "letter" element.

Слайд 102
We could also give the complexType element a name, and let

the "letter" element have a type attribute that refers to the name of the complexType (if you use this method, several elements can refer to the same complex type):

Слайд 103Индикаторы XSD
We can control HOW elements are to be used in

documents with indicators.
Indicators
There are seven indicators:
Order indicators:
All
Choice
Sequence
Occurrence indicators:
maxOccurs
minOccurs
Group indicators:
Group name
attributeGroup name

Слайд 104Order Indicators
Order indicators are used to define the order of the

elements.
All Indicator
The indicator specifies that the child elements can appear in any order, and that each child element must occur only once:

Note: When using the indicator you can set the indicator to 0 or 1 and the indicator can only be set to 1 (the and are described later).

Слайд 105
Choice Indicator
The indicator specifies that either one child element or

another can occur:
Sequence Indicator
The indicator specifies that the child elements must appear in a specific order:

Слайд 106Occurrence Indicators
Occurrence indicators are used to define how often an element

can occur.
Note: For all "Order" and "Group" indicators (any, all, choice, sequence, group name, and group reference) the default value for maxOccurs and minOccurs is 1.
maxOccurs Indicator
The indicator specifies the maximum number of times an element can occur:

The example above indicates that the "child_name" element can occur a minimum of one time (the default value for minOccurs is 1) and a maximum of ten times in the "person" element.
minOccurs Indicator
The indicator specifies the minimum number of times an element can occur:
The example above indicates that the "child_name" element can occur a minimum of zero times and a maximum of ten times in the "person" element.
Tip: To allow an element to appear an unlimited number of times, use the maxOccurs="unbounded" statement:

Слайд 107Пример XML-документа Myfamily.xml
Hege Refsnes Cecilie Tove Refsnes

Hege Stale Jim Borge Stale Refsnes

Слайд 108Пример описания схемы family.xsd
The XML file above contains a root element

named "persons".
Inside this root element we have defined three "person" elements.
Each "person" element must contain a "full_name" element and it can contain up to five "child_name" elements.

Слайд 109Индикаторы группы
Group indicators are used to define related sets of elements.
Element

Groups
Element groups are defined with the group declaration, like this:
...
You must define an all, choice, or sequence element inside the group declaration.
The following example defines a group named "persongroup", that defines a group of elements that must occur in an exact sequence:

Слайд 110
After you have defined a group, you can reference it in

another definition, like this:

Слайд 111Attribute Groups
Attribute groups are defined with the attributeGroup declaration, like this:

name="groupname"> ... The following example defines an attribute group named "personattrgroup":
After you have defined an attribute group, you can reference it in another definition, like this:

Слайд 112XSD The Element
The element enables us to extend the XML document

with elements not specified by the schema!
The following example is a fragment from an XML schema called "family.xsd". It shows a declaration for the "person" element. By using the element we can extend (after ) the content of "person" with any element:

Now we want to extend the "person" element with a "children" element. In this case we can do so, even if the author of the schema above never declared any "children" element.
Look at this schema file, called "children.xsd":

Слайд 113
The XML file below (called "Myfamily.xml"), uses components from two different

schemas; "family.xsd" and "children.xsd":
Hege Refsnes Cecilie Stale Refsnes
The XML file above is valid because the schema "family.xsd" allows us to extend the "person" element with an optional element after the "lastname" element.
The and elements are used to make EXTENSIBLE documents! They allow documents to contain additional elements that are not declared in the main XML schema.

Слайд 114XSD The Element
The element enables us to extend the XML document

with attributes not specified by the schema!
The following example is a fragment from an XML schema called "family.xsd". It shows a declaration for the "person" element.
By using the element we can add any number of attributes to the "person" element:

Now we want to extend the "person" element with a "gender" attribute. In this case we can do so, even if the author of the schema above never declared any "gender" attribute.
Look at this schema file, called "attribute.xsd":

Слайд 115XSD Element Substitution
With XML Schemas, one element can substitute another element.
Let's say

that we have users from two different countries: England and Norway. We would like the ability to let the user choose whether he or she would like to use the Norwegian element names or the English element names in the XML document.
To solve this problem, we could define a substitutionGroup in the XML schema. First, we declare a head element and then we declare the other elements which state that they are substitutable for the head element.
In the example above, the "name" element is the head element and the "navn" element is substitutable for "name".
Look at this fragment of an XML schema:

Слайд 116
A valid XML document (according to the schema above) could look

like this:
John Smith or like this:
John Smith

Слайд 117Blocking Element Substitution
To prevent other elements from substituting with a specified

element, use the block attribute:
Look at this fragment of an XML schema:
A valid XML document (according to the schema above) looks like this:
John Smith BUT THIS IS NO LONGER VALID:
John Smith

Слайд 118
Using substitutionGroup
The type of the substitutable elements must be the same

as, or derived from, the type of the head element. If the type of the substitutable element is the same as the type of the head element you will not have to specify the type of the substitutable element.
Note that all elements in the substitutionGroup (the head element and the substitutable elements) must be declared as global elements, otherwise it will not work!
What are Global Elements?
Global elements are elements that are immediate children of the "schema" element! Local elements are elements nested within other elements.

Слайд 119XML document
Let's have a look at this XML document called "shiporder.xml":

version="1.0" encoding="ISO-8859-1"?> John Smith Ola Nordmann

Langgt 23

4000 Stavanger Norway Empire Burlesque Special Edition 1 10.90 Hide your heart 1 9.90
The XML document above consists of a root element, "shiporder", that contains a required attribute called "orderid".
The "shiporder" element contains three different child elements: "orderperson", "shipto" and "item".
The "item" element appears twice, and it contains a "title", an optional "note" element, a "quantity", and a "price" element.
The line above: xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" tells the XML parser that this document should be validated against a schema.
The line: xsi:noNamespaceSchemaLocation="shiporder.xsd" specifies WHERE the schema resides (here it is in the same folder as "shiporder.xml").

Слайд 120Create an XML Schema
Now we want to create a schema for

the XML document above.
We start by opening a new file that we will call "shiporder.xsd".
To create the schema we could simply follow the structure in the XML document and define each element as we find it.
We will start with the standard XML declaration followed by the xs:schema element that defines a schema:
...
In the schema above we use the standard namespace (xs), and the URI associated with this namespace is the Schema language definition, which has the standard value of http://www.w3.org/2001/XMLSchema.

Слайд 121Create an XML Schema
Next, we have to define the "shiporder" element.

This element has an attribute and it contains other elements, therefore we consider it as a complex type.
The child elements of the "shiporder" element is surrounded by a xs:sequence element that defines an ordered sequence of sub elements:
...

Слайд 122
Then we have to define the "orderperson" element as a simple

type (because it does not contain any attributes or other elements).
The type (xs:string) is prefixed with the namespace prefix associated with XML Schema that indicates a predefined schema data type:

Next, we have to define two elements that are of the complex type: "shipto" and "item".

Слайд 123
We start by defining the "shipto" element:

name="name" type="xs:string"/>
With schemas we can define the number of possible occurrences for an element with the maxOccurs and minOccurs attributes.
maxOccurs specifies the maximum number of occurrences for an element and minOccurs specifies the minimum number of occurrences for an element.
The default value for both maxOccurs and minOccurs is 1!

Слайд 124
Now we can define the "item" element.
This element can appear

multiple times inside a "shiporder" element.
This is specified by setting the maxOccurs attribute of the "item" element to "unbounded" which means that there can be as many occurrences of the "item" element as the author wishes.
Notice that the "note" element is optional.

Слайд 125
We have specified this by setting the minOccurs attribute to zero:

name="item" maxOccurs="unbounded">
We can now declare the attribute of the "shiporder" element.
Since this is a required attribute we specify use="required".
Note: The attribute declarations must always come last:

Слайд 126Полное описание схемы (shiporder.xsd)

Слайд 127Разделение схемы

name="orderperson" type="xs:string"/>

The previous design method is very simple, but can be difficult to read and maintain when documents are complex.
The next design method is based on defining all elements and attributes first, and then referring to them using the ref attribute.
Here is the new design of the schema file ("shiporder.xsd"):

Слайд 128Использование именованных типов (Named Types)

base="xs:string"/>

The third design method defines classes or types, that enables us to reuse element definitions.
This is done by naming the simpleTypes and complexTypes elements, and then point to them through the type attribute of the element.
Here is the third design of the schema file ("shiporder.xsd"):

Слайд 129
The restriction element indicates that the datatype is derived from a

W3C XML Schema namespace datatype. So, the following fragment means that the value of the element or attribute must be a string value:
The restriction element is more often used to apply restrictions to elements. Look at the following lines from the schema above:
This indicates that the value of the element or attribute must be a string, it must be exactly six characters in a row, and those characters must be a number from 0 to 9.

Слайд 130Типы данных XML Schema

Слайд 131XSD String Data Types
String data types are used for values that contains character

strings.
The string data type can contain characters, line feeds, carriage returns, and tab characters.
The following is an example of a string declaration in a schema:

An element in your document might look like this:
John Smith
Or it might look like this:
John Smith
Note: The XML processor will not modify the value if you use the string data type.

Слайд 132NormalizedString Data Type
The normalizedString data type is derived from the String

data type.
The normalizedString data type also contains characters, but the XML processor will remove line feeds, carriage returns, and tab characters.
The following is an example of a normalizedString declaration in a schema:

An element in your document might look like this:
John Smith
Or it might look like this:
John Smith
Note: In the example above the XML processor will replace the tabs with spaces.

Слайд 133Token Data Type
The token data type is also derived from the

String data type.
The token data type also contains characters, but the XML processor will remove line feeds, carriage returns, tabs, leading and trailing spaces, and multiple spaces.
The following is an example of a token declaration in a schema:

An element in your document might look like this:
John Smith
Or it might look like this:
John Smith
Note: In the example above the XML processor will remove the tabs.

Слайд 134String Data Types
Note that all of the data types below derive

from the String data type (except for string itself)!

Слайд 135Restrictions on String Data Types
Restrictions that can be used with String

data types:
enumeration
length
maxLength
minLength
pattern (NMTOKENS, IDREFS, and ENTITIES cannot use this constraint)
whiteSpace

Слайд 136XSD Date and Time Data Types
Date and time data types are used for

values that contain date and time.
The date data type is used to specify a date.
The date is specified in the following form "YYYY-MM-DD" where:
YYYY indicates the year
MM indicates the month
DD indicates the day
Note: All components are required!
The following is an example of a date declaration in a schema:

An element in your document might look like this:
2002-09-24Time Zones
To specify a time zone, you can either enter a date in UTC time by adding a "Z" behind the date - like this:
2002-09-24Z
or you can specify an offset from the UTC time by adding a positive or negative time behind the date - like this:
2002-09-24-06:00
Or
2002-09-24+06:00

Слайд 137Time Data Type
The time data type is used to specify a

time.
The time is specified in the following form "hh:mm:ss" where:
hh indicates the hour
mm indicates the minute
ss indicates the second
Note: All components are required!
The following is an example of a time declaration in a schema:

An element in your document might look like this:
09:00:00
Or it might look like this:
09:30:10.5

Слайд 138Time Zones
To specify a time zone, you can either enter a

time in UTC time by adding a "Z" behind the time - like this:
09:30:10Z
or you can specify an offset from the UTC time by adding a positive or negative time behind the time - like this:
09:30:10-06:00 or
09:30:10+06:00

Слайд 139DateTime Data Type
The dateTime data type is used to specify a

date and a time.
The dateTime is specified in the following form "YYYY-MM-DDThh:mm:ss" where:
YYYY indicates the year
MM indicates the month
DD indicates the day
T indicates the start of the required time section
hh indicates the hour
mm indicates the minute
ss indicates the second
Note: All components are required!
The following is an example of a dateTime declaration in a schema:

An element in your document might look like this:
2002-05-30T09:00:00
Or it might look like this:
2002-05-30T09:30:10.5

Слайд 140Time Zones
To specify a time zone, you can either enter a

dateTime in UTC time by adding a "Z" behind the time - like this:
2002-05-30T09:30:10Z
or you can specify an offset from the UTC time by adding a positive or negative time behind the time - like this:
2002-05-30T09:30:10-06:00 or
2002-05-30T09:30:10+06:00

Слайд 141Duration Data Type
The duration data type is used to specify a

time interval.
The time interval is specified in the following form "PnYnMnDTnHnMnS" where:
P indicates the period (required)
nY indicates the number of years
nM indicates the number of months
nD indicates the number of days
T indicates the start of a time section (required if you are going to specify hours, minutes, or seconds)
nH indicates the number of hours
nM indicates the number of minutes
nS indicates the number of seconds
The following is an example of a duration declaration in a schema:

An element in your document might look like this (period of five years):
P5Y
Or it might look like this (period of five years, two months, and 10 days):
P5Y2M10D
Or it might look like this (period of five years, two months, 10 days, and 15 hours):
P5Y2M10DT15H
Or it might look like this (period of 15 hours):
PT15H

Слайд 142Negative Duration
To specify a negative duration, enter a minus sign before

the P:
-P10DThe example above indicates a period of minus 10 days.

Слайд 143Date and Time Data Types
…

Слайд 144Restrictions on Date Data Types
Restrictions that can be used with Date

data types:
enumeration
maxExclusive
maxInclusive
minExclusive
minInclusive
pattern
whiteSpace

Слайд 145XSD Numeric Data Types - Decimal Data Type
Decimal data types are used for

numeric values.
The decimal data type is used to specify a numeric value.
The following is an example of a decimal declaration in a schema:

An element in your document might look like this:
999.50
Or it might look like this:
+999.5450
Or it might look like this:
-999.5230
Or it might look like this:
0
Or it might look like this:
14
Note: The maximum number of decimal digits you can specify is 18.

Слайд 146Integer Data Type
The integer data type is used to specify a

numeric value without a fractional component.
The following is an example of an integer declaration in a schema:

An element in your document might look like this:
999
Or it might look like this:
+999
Or it might look like this:
-999
Или it might look like this:
0

Слайд 147Numeric Data Types
Note that all of the data types below derive

from the Decimal data type (except for decimal itself)!

Слайд 148Restrictions on Numeric Data Types
Restrictions that can be used with Numeric

data types:
enumeration
fractionDigits
maxExclusive
maxInclusive
minExclusive
minInclusive
pattern
totalDigits
whiteSpace

$Restrictions on Numeric Data TypesRestrictions that can be used with Numeric data types:enumerationfractionDigitsmaxExclusivemaxInclusiveminExclusiveminInclusivepatterntotalDigitswhiteSpace$

Слайд 149XSD Miscellaneous Data Types
Other miscellaneous data types are boolean, base64Binary, hexBinary, float, double,

anyURI, QName, and NOTATION.
Boolean Data Type
The boolean data type is used to specify a true or false value.
The following is an example of a boolean declaration in a schema:

An element in your document might look like this:
999
Note: Legal values for boolean are true, false, 1 (which indicates true), and 0 (which indicates false).
Binary Data Types
Binary data types are used to express binary-formatted data. We have two binary data types:
base64Binary (Base64-encoded binary data)
hexBinary (hexadecimal-encoded binary data)
The following is an example of a hexBinary declaration in a schema:

Слайд 150AnyURI Data Type
The anyURI data type is used to specify a

URI.
The following is an example of an anyURI declaration in a schema:

An element in your document might look like this:

Note: If a URI has spaces, replace them with %20.

Слайд 151Miscellaneous Data Types
…

Слайд 152Restrictions on Miscellaneous Data Types
Restrictions that can be used with the

other data types:
enumeration (a Boolean data type cannot use this constraint)
length (a Boolean data type cannot use this constraint)
maxLength (a Boolean data type cannot use this constraint)
minLength (a Boolean data type cannot use this constraint)
pattern
whiteSpace

Слайд 153XSD Elements

Слайд 154XSD Elements (2)

Слайд 155XSD Elements (3)

Слайд 156XSD ограничения/грани для типов данных

Слайд 157
Синтаксис DTD устарел и недостаточно гибок, из-за чего для некоторых задач

DTD недостаточно выразительны.
В документах применяется один синтаксис, а в DTD – другой. Модели содержимого и объявления списков атрибутов тяжело читать и понимать, невозможно задать шаблонов данных в элементах и атрибутах.
В связи с этим в 2001 году был разработан стандарт XML Schema (XSchema).
Вторая версия XSchema был опубликован в 2004 (http://www.w3.org/TR/xmlschema11-1/).

Слайд 158
В отличие от синтаксиса DTD, синтаксис XML Schema является корректным (well-formed)

XML, что позволяет использовать при ее редактировании излюбленные инструменты XML.
Она также предоставляет значительно больший контроль над типами данных и шаблонами, образуя более привлекательный язык для соблюдения строгих требований ввода данных.
Пример описания схемы бланк для переписи населения.
Данная схема должна способствовать поддержке организации данных путем контроля типов данных в случае, если что-то будет введено в неверное поле.
Вот как может выглядеть в XML экземпляр документа.

Слайд 159Пример XML документа переписи

738

510
Yellowbrick Road
Munchkinville
Negbo

Floyd
Fleegle
61

Phylis
Fleegle
52

Filbert
Fleegle
22

Слайд 160Описание схемы документа переписи

Census form for the Republic of Oz
Department

of Paperwork, Emerald City

Слайд 161

type="xsd:string"/>

Слайд 162Объявление элементов и атрибутов
В первой строке документ идентифицируется как схема и

связывается с пространством имен XML Schema.
Для удобства в дальнейшем обсуждении мы будем отбрасывать префикс пространства имен xsd:.
Следующая структура, , служит местом для документирования назначения схемы и других деталей.
Следующим в нашем примере идет первое объявление типа элемента.
Атрибут name присваивает родовой идентификатор, а атрибут type устанавливает тип элемента.
Есть два типа элементов: простые и сложные.
Объявление простого (simple) элемента не содержит атрибутов или элементов в содержимом.
Поскольку данный конкретный элемент является корневым, он должен иметь иной тип – сложный.
В данном случае сложному типу в действительности дается имя, CensusType, которое нам позднее пригодится для его описания.
Хотя имена не являются обязательными, все же лучше использовать их для вашего же собственного блага.

Слайд 163Сложные и простые типы элементов
В следующей части схемы CensusType определяется как

элемент .
В нем содержатся объявления еще трех элементов и объявление атрибута.
Здесь не только объявляются три элемента: (счетчик),

(адрес) и (жильцы), но и определяется модель содержимого для CensusType.
Поэтому элемент должен содержать все три элемента в указанном порядке и может иметь необязательный атрибут date.
Это значительно отличается от стиля DTD, в котором модель содержимого состоит из строки внутри объявления элемента, а атрибуты объявляются отдельно в объявлении списка атрибутов.
Ограничения модели содержимого
Если последовательность отдельных элементов не предоставляет достаточной информации, в XML Schema есть другие возможности.
Атрибуты minOccurs и maxOccurs устанавливают (минимальное и максимальное) число появлений чеголибо в документе. minOccurs="0" отменяет значение по умолчанию 1 и делает элемент необязательным. maxOccurs="*" снимает верхнюю границу, и элемент может появляться любое число раз.

Слайд 164Ограничение модели содержимого
Элементы составных типов могут иметь разные модели содержимого.
Модели

содержимого элементов, которые могут включать дочерние узлы, могут быть одного из трех видов:
,
или
.
При определении нового составного типа путем наложения ограничения на другой тип необходимо гарантировать, что модели содержимого обоих типов удовлетворяют ряду условий.
Эти условия необходимы для гарантии того, что любое корректное содержимое производного типа является корректным по отношению и к базовому типу.

Слайд 165Типы данных
В каждом объявлении элемента и атрибута есть атрибут type, как

мы видели в объявлении первого элемента.
Некоторые типы являются предопределенными в XML Schema, например, string и decimal.
Тип string представляет обычные символьные данные, как тип CDATA в языке DTD.
Тип decimal представляет собой число.
Ниже мы объявляем элемент как имеющий тип positive-integer и ограничиваем сверху его величину значением 200.

В DTD это сделать нельзя, в нем нет способа наложить на символьные данные шаблон ограничений, а в XML Schema таких способов несколько.

Слайд 166Предопределенные типы данных
В следующем списке перечислены еще некоторые предопределенные типы:
byte, float,

long - Числовые форматы. Формат byte является любым 8-разрядным числом со знаком, а long - любым 32-разрядным числом со знаком. Тип float представляет число с плавающей точкой, например, 5.032E-6. Другие числовые величины представляют скорее абстракции, а не числа, например, INF (бесконечность), –INF (отрицательная бесконечность) и NaN (not a number – не число, категория, определенная IEEE для операций с плавающей точкой).
time, date, timeinstant, timeduration - Шаблоны для обозначения времени, даты и длительности.
boolean - Значение true или false. Допускаются также числовые эквиваленты: 0 или 1.
binary - Шаблон для двоичных чисел, например, 00101110.
language - Код языка, скажем, en-US.
uri-reference - Шаблон для любого URI, например, http://|www.donut.org/cruller.xml#ingredients.
ID, IDREF, IDREFS, NMTOKEN, NMTOKENS - Типы атрибутов, действующие так же, как соответствующие типы в DTD.

Слайд 167Грани
Грани (facets) представляют собой свойства, используемые для задания типов данных, устанавливая

пределы и ограничения на величину данных.
Например, элементу , тип данных которого positive-integer, было разрешено максимальное значение 200, называемое гранью max-inclusive.
В XSchema есть еще 13 граней, в том числе precision, scale, encoding, pattern, enumeration, max-length (точность, масштаб, кодировка, шаблон, перечисление, максимальная длина) и другие.

Слайд 168Шаблоны
Объявление сложного типа Address знакомит нас с другим типом ограничения по

шаблону.
Он имеет атрибут postalcode с типом PCode, который определен посредством объявления .
Если среди предопределенных типов нет нужного шаблона, с помощью этого элемента можно создать свой собственный.
В рассматриваемой схеме PCode определен строкой шаблона [A-Z]-\d{3}, которую нужно читать следующим образом: «любой символ алфавита, за которым следует дефис и три цифры».

Слайд 169XML документ

Abiteboul
Buneman
Suciu
Data on the Web
A fine book.

Buneman
XML in

Scotland
The best ever!

Слайд 170DTD (Document Type Definition)

YEAR CDATA #OPTIONAL>

Слайд 171XML Schema

name="AUTHOR" type="xsd:string" minOccurs="1«
maxOccurs="unbounded"/>

Слайд 172

name="NONEMPTY-INTEGER-LIST">

Скачать презентацию

Языки описания информации презентация

Содержание

Слайд 1Языки описания информацииЛекция 3.Кафедра «ОСУ»Весенний семестр 2010/2011 уч. годаXML Schema и

Слайд 2XSD (XML Schema Definition Language) XML Schema: альтернатива использованию DTD

Слайд 3Стандарт XML SchemaXML Schema это альтернатива языку DTD, основанная на синтаксисе

Слайд 4Спецификации XML Shema и временной график их принятия

Слайд 5Язык XML Schema позволяет описывать структуру XML документа.Пример XML Schema:

Слайд 6Введение в XML SchemaЯзык XML Schema предназначен для описания структуры XML

Слайд 7XML Schema является правопреемником (заменителем) языка DTD.Считается, что скоро XML Schema

Слайд 8Почему нужно использовать XML SchemaЯзык XML Schemas является более мощным по

Слайд 9XML Schemas поддерживает типы данныхOne of the greatest strength of XML

Слайд 10Рассмотрим пример простого XML-документа (note.xml) ЛуценкоФроловНапоминаниеНе забудь о собрании лаборатории

Слайд 11Описание схемы на языке XML Schema На языке XML Schema язык

Слайд 12Связывание XML-документ note.xml с описанием его синтаксиса на языке XML Schema

Слайд 13Здесь все используемые элементы, с помощью первого атрибута объявлены в пространстве

Слайд 14Синтаксис языка XSDКорневым элементом описаний на языке XML Schema является элемент

Слайд 15Первый атрибут xmlns:xs="http://www.w3.org/2001/XMLSchema" указывает, что в данном документе используется пространство имен

Слайд 16Встроенные типы языка XML SchemaВ языке XML Schema определен набор из

Слайд 17Примеры встроенных типовxs:string – произвольные строки текста;xs:byte, xs:float, xs:long – числовые

Слайд 18xs:boolean – значение true или false (допускаются также числовые эквиваленты: 0

Слайд 19Использование такого большого количества встроенных типов делает язык XSD очень полезным

Слайд 20Описание простых элементовВ состав описываемого конкретного языка могут входить простые элементы

Слайд 21Примеры простых XML элементовXML-документ:Refsnes361970-03-27Эти элементы могут быть следующим образом описаны с

Слайд 22Значения по умолчанию и фиксированные значенияПростые элементы могут иметь значения по

Слайд 23Описание атрибутов Простые элементы не могут иметь атрибуты. Если элемент имеет

Слайд 24По умолчанию атрибуты считаются не обязательными. Для задания обязательности описываемых атрибутов

Слайд 25Ограничения на содержание элементовЕсли простому элементу или атрибуту задан тип, то

Слайд 26Типы ограничений на элементы

Слайд 27Типы ограничений на элементы (2)

Слайд 28Пример ограниченияДля элемента age (возраст) возможный интервал значений от 0 до

Слайд 29Пример ограничения (2)C помощью элемента xs:enumeration можно ограничить содержание элемента списком

Слайд 30Пример с типами автомобилей можно описать и по-другому: вначале создать новый

Слайд 31Ограничения содержания текстового элемента Для ограничения содержания элемента, имеющего тип xs:string,

Слайд 32В каждой строке могут быть заданы конкретные символы или не определенные

Слайд 33Кроме задания символов в квадратных скобках можно использовать следующие указатели: \d

Слайд 34Пример 1 разрешенными значениями являются одна или несколько пар символов,

Слайд 35Пример 2 должно задаваться точно 8 символов, каждый из которых может

Слайд 36Пример 3 может задаваться любой символ алфавита ([A-Z]), за которым следует

Слайд 37Описание составных элементовСоставные элементы могут содержать текст, другие элементы и/или атрибуты.

Слайд 38Примеры составных элементовПример пустого составного элемента "product": Пример составного

Слайд 39Описание пустых элементовПустые элементы не могут иметь содержания, но могут иметь

Слайд 40Создание нового составного типа и указание его в описании элемента product:

Слайд 41Описание составных элементов, содержащих только дочерние элементыСоставные элементы, содержащие только дочерние

Слайд 42Также можно задать создать новый тип составного элемента persontype и использовать

Слайд 43Описание элементов, имеющих простое содержаниеПри описании элемента, включающего только простое содержимое

Слайд 44Описание элемента simpleContent с использованием ограничение (xs:restriction):. . .

Слайд 45Пример описания простого элементаНапример, нужно описать элемент "shoesize" (размер обуви), которые

Слайд 46Описание составных типов со смешанным содержаниемСоставные типы со смешанным содержанием могут

Слайд 47Данный элемент "letter" можно описать следующим образом:

Слайд 48Индикаторы Для задания правил использования в XML-документе дочерних элементов применяются индикаторы.

Слайд 49Индикатор Индикатор указывает, что дочерние элементы могут появляться в любом

Слайд 50Индикатор Индикатор указывает, что в составном элементе должен быть задан

Слайд 51Индикатор Индикатор указывает, что дочерние элементы должны записываться в том

Слайд 52Индикаторы и Индикатор указывает на максимальное количество, а

Слайд 53По умолчанию для всех элементов задаются индикаторы maxOccurs=1 и minOccurs=1. Для

Слайд 54С помощью индикаторов группирования элементов можно описывать группы элементов, задавать им

Слайд 55После определения группы с именем persongroup, на нее можно ссылаться в

Слайд 56С помощью индикаторов группирования атрибутов можно описывать группы атрибутов, задавать им

Слайд 57Элемент Элемент позволяет включать в XML-документ элементы, которые не

Слайд 58Теперь в элементе person включать, например, элемент children, хотя в самой

Слайд 59Ниже показан XML-документ (называемый "Myfamily.xml"), использующий элементы из двух разных схем

Слайд 60Данный XML-документ является действительным (valid), так как схема "family.xsd" позволяет расширять