Data Definition Language

1 Introduction

This document is the specification of the Data Definition Language. Programs of this language are sequences of Unicode code points and describe structured data for the purpose of for exchanging that data between entities (humans and machines alike). The programs describes data in terms of typed value and this specificiation describes two aspects; language provides scalar types (boolean type, number type, string type, and void type) as well as aggregate types (map values and list values). This specification describes the translation from Unicode code points to typed values.

2 Translation

A program of the DDL language is translated into values. This translation happens in three phases: The lexical translation translates Unicode code points to words. The syntactical translation filters the resulting sequence of words and then translates these words into a sentence. The semantical translation translates sentences into values t.

3 Lexical Translation

The lexical translation translates a sequence of Unicode code points provided as input a sequence of words.

The lexical translation of the Data Definition Language is based on the Common Lexical Translations (see https://michaelheilmann.com/specifications/common-lexical-translations for more information).

The lexical grammar consists of

a set of non-terminals \(\textit{DDL.Lexical.NonTerminals}\) and the set of terminals \(\textit{DDL.Lexical.Terminals}\) which are disjoint
a set of production rules \(\textit{DDL.Lexical.ProductionRules}\), which are layed down in this section, and
a starting symbol \(\text{DDL.Lexical.Words}\) which is element of \(\textit{DDL.Lexical.NonTerminals}\).

The two rules, defined in terms of the Common Lexical translations, are defined as follows:

\[\begin{array}{ll} \text{DDL.Lexical.Words} &: \text{DDL.Lexical.Word}^*\\ \text{DDL.Lexical.Word} &:\;\text{Lexical.Boolean}\\ &|\;\text{Lexical.Number}\\ &|\;\text{Lexical.String}\\ &|\;\text{Lexical.Void}\\ &|\;\text{Lexical.Name}\\ &|\;\text{Lexical.LeftCurlyBracket}\\ &|\;\text{Lexical.RightCurlyBracket}\\ &|\;\text{Lexical.LeftSquareBracket}\\ &|\;\text{Lexical.RightSquareBracket}\\ &|\;\text{Lexical.Comma}\\ &|\;\text{Lexical.Whitespace}\\ &|\;\text{Lexical.Newline}\\ &|\;\text{Lexical.Comment}\\ \end{array}\]

The lexical translation translates a sequence of Unicode code points into words. This resulting sequence of words is then consumed by the syntactical translation.

4 Syntactical Translation

The syntactical translation translates a sequence of words provided by the lexical translation into a sentence.

The syntactical grammar consists of

a set of non-terminals \(\textit{DDL.Syntactical.NonTerminals}\) and the set of terminals \(\textit{DDL.Syntactical.Terminals}\) which are disjoint
a set of production rules \(\textit{DDL.Syntactical.ProductionRules}\), which are layed down in this section, and
a starting symbol \(\textit{DDL.Syntactical.Sentence}\)> which is element of \(\textit{DDL.Syntactical.NonTerminals}\).

Important:The following words are removed from the sequence of words before its translation into a sentence:

\(\text{DDL.Lexical.Whitespace}\),
\(\text{DDL.Lexical.Newline}\), and
\(\text{DDL.Lexical.Comment}\)

\[\begin{aligned} &\text{DDL.Syntactical.Sentence} : \text{DDL.Syntactical.Value} \end{aligned}\]

4.1 DDL.Syntactical.Value

The sentence \(\text{DDL.Syntactical.Value}\) is defined by

\[\begin{aligned} &\text{DDL.Syntactical.Value} : \text{DDL.Syntactical.Map}\\ &\text{DDL.Syntactical.Value} : \text{DDL.Syntactical.List}\\ &\text{DDL.Syntactical.Value} : \text{DDL.Syntactical.String}\\ &\text{DDL.Syntactical.Value} : \text{DDL.Syntactical.Number}\\ &\text{DDL.Syntactical.Value} : \text{DDL.Syntactical.Boolean}\\ &\text{DDL.Syntactical.Value} : \text{DDL.Syntactical.Void} \end{aligned}\]

4.2 DDL.Syntactical.String

The sentence \(\text{DDL.Syntactical.String}\) is defined by

\[\begin{aligned} &\text{DDL.Syntactical.String} : \text{Lexical.String} \end{aligned}\]

4.2 DDL.Syntactical.Number

The sentence \(\text{DDL.Syntactical.Number}\) is

\[\begin{aligned} &\text{DDL.Syntactical.Number} : \text{Lexical.Number} \end{aligned}\]

4.3 DDL.Syntactical.Boolean

The sentence \(\text{DDL.Syntactical.Boolean}\) is

\[\begin{aligned} &\text{DDL.Syntactical.Boolean} : \text{Lexical.Boolean} \end{aligned}\]

4.4 DDL.Syntactical.Void

The sentence \(\text{DDL.Syntactical.Void}\) is

\[\begin{aligned} &\text{DDL.Syntactical.Void} : \text{Lexical.Void} \end{aligned}\]

4.5 DDL.Syntactical.Map

The sentence \(\text{DDL.Syntactical.Map}\) is

\[\begin{aligned} &\text{DDL.Syntactical.Map} :\text{Lexical.LeftCurlyBracket}\;\text{DDL.Syntactical.MapBody}\;\text{Lexical.RightCurlyBracket}\\ \\ &\text{DDL.Syntactical.MapBody} : \text{DDL.Syntactical.MapBodyElement}\;\text{DDL.Syntactical.MapBodyRest}\\ &\text{DDL.Syntactical.MapBody} : \epsilon\\ \\ &\text{DDL.Syntactical.MapBodyRest} : \text{Lexical.Comma}\;\text{DDL.Syntactical.MapBodyElement}\;\text{DDL.Syntactical.MapBodyRest}\\ &\text{DDL.Syntactical.MapBodyRest} : \text{Lexical.Comma}\\ &\text{DDL.Syntactical.MapBodyRest} : \epsilon\\ &\text{DDL.Syntactical.MapBodyElement} : \text{Lexical.Name}\;\text{Lexical.Colon}\;\text{DDL.Syntactical.Value} \end{aligned}\]

4.6 DDL.Syntactical.List

The sentence \(\text{DDL.Syntactical.List}\) is

\[\begin{aligned} &\text{DDL.Syntactical.List} : \text{Lexical.LeftSquareBracket}\; \text{DDL.Syntactical.ListBody}\; \text{Lexical.RightSquareBracket}\\ \\ &\text{DDL.Syntactical.ListBody} : \text{DDL.Syntactical.ListBodyElement}\; \text{DDL.Syntactical.ListBodyRest}\\ &\text{DDL.Syntactical.ListBody} : \epsilon\\ \\ &\text{DDL.Syntactical.ListBodyRest} : \text{Lexical.Comma}\; \text{DDL.Syntactical.ListBodyElement}\; \text{DDL.Syntactical.ListBodyRest}\\ &\text{DDL.Syntactical.ListBodyRest} : \text{Lexical.Comma}\\ &\text{DDL.Syntactical.ListBodyRest} : \epsilon\\ \\ &\text{DDL.Syntactical.ListBodyElement} : \text{DDL.Syntactical.Value} \end{aligned}\]

The syntatical translation translates a sequence of words into one sentence. This resulting sntence is then consumed by the semantical translation.

5 Semantical Translation

The semantical translation a sentence provided by the syntactical translation into a typed value. The Data Definition Language knows six basic types \(\textit{List}\) and \(\textit{Map}\), which are the so called aggregate types, and \(\textit{Boolean}\), \(\textit{Number}\), \(\textit{String}\), and \(\textit{Void}\), which are the so called scalar types.

The type \(\textit{Value}\) is defined as the union of all the types above. \[\begin{aligned} \textit{Value} =&\;\textit{List}\\ \cup&\;\textit{Map}\\ \cup&\;\textit{Boolean}\\ \cup&\;\textit{Number}\\ \cup&\;\textit{String}\\ \cup&\;\textit{Void} \end{aligned}\]

The translation of a sentence into values is described by syntax-directed translations (see Aho, Seti, Ullman: Compilers, Principles, Techniques, and Tools; 1st; pp. 305 for more information).

At the end of a translation, the input syntactic form \(x\) has a variable \(x.\text{value}\) which is either a value of type \textit{Value}. Furthermore, for each syntactic form, we define two attributes: \(\text{value}\) is the computed value of the syntactic form. \(\textit{codePoints}\) is the sequence of code points associated with the syntactic form.

4.1 Scalar Types

This section defines \(\sigma\) for the translation of scalar types.

4.1.1 Boolean Type

The type \(\textit{Boolean}\) type has two values \(\textit{true}\) and \(\textit{false}\) which are expressed in the language by the words \(\texttt{DDL.Lexical.true}\) and \(\texttt{DDL.Lexical.false}\), respectively (as defined in the syntactical grammar).