RDF 1.2 Turtle

The Resource Description Framework (RDF) is a general-purpose language for representing information in the Web.

This document defines a textual syntax for RDF called Turtle that allows an RDF graph to be completely written in a compact and natural text form, with abbreviations for common usage patterns and datatypes. Turtle provides levels of compatibility with the N-Triples [[RDF12-N-TRIPLES]] format as well as the triple pattern syntax of [[[SPARQL12-QUERY]]] W3C Recommendation.

RDF 1.2 Turtle introduces triple terms as a fourth kind of RDF term which can be used as the object of another triple, making it possible to make statements about other statements. Triple terms are typically not used explicitly as the reifiedTriple construction is generally preferred. RDF 1.2 Turtle also adds support for directional language-tagged strings.

In addition, RDF 1.2 Turtle introduces an annotation syntax which allows triple terms to also be asserted.

Turtle Language

A Turtle document allows writing down an RDF graph in a compact textual form. An RDF graph is made up of triples consisting of a subject, predicate and object.

Comments may follow a # that is not part of another lexical token, and continue to the end of the line.

Simple Triples

The simplest triple statement is a sequence of (subject, predicate, and object) terms, forming an RDF triple, terminated by a period (.). White space (spaces, tabs, LFs, and/or CRs) may surround terms, except where significant as noted in the grammar.
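As a minimal sketch, a single triple with all three terms written as full IRIs could look like this. The `#spiderman` IRI is an illustrative assumption; the other two IRIs appear elsewhere in this document.

```turtle
# One triple: subject, predicate, object, then the terminating period.
# <http://example.org/#spiderman> is an illustrative IRI.
<http://example.org/#spiderman> <http://www.perceive.net/schemas/relationship/enemyOf> <http://example.org/#green-goblin> .
```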

Predicate Lists

Often the same subject will be referenced by a number of predicates. The predicateObjectList production matches a series of predicates and objects, separated by ;, following a subject. This expresses a series of RDF Triples with that subject and each predicate and object allocated to one triple. Thus, the ; is used to repeat the subject of triples that vary only in predicate and object RDF terms.

These two examples are equivalent ways of writing the triples about Spiderman.
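A sketch of the two equivalent forms, assuming an illustrative subject IRI and the FOAF `name` property:

```turtle
# Form 1: predicate list. The subject is written once and ";" repeats it.
<http://example.org/#spiderman>
    <http://www.perceive.net/schemas/relationship/enemyOf> <http://example.org/#green-goblin> ;
    <http://xmlns.com/foaf/0.1/name> "Spiderman" .

# Form 2: the same two triples written out in full.
<http://example.org/#spiderman> <http://www.perceive.net/schemas/relationship/enemyOf> <http://example.org/#green-goblin> .
<http://example.org/#spiderman> <http://xmlns.com/foaf/0.1/name> "Spiderman" .
```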

Object Lists

As with predicates, the same subject and predicate often repeat with different objects. The objectList production matches a series of objects separated by , following a predicate. This expresses a series of RDF Triples with the same subject and predicate and each object allocated to one triple. Thus, the , is used to repeat the subject and predicate of triples that only differ in the object RDF term.

These two examples are equivalent ways of writing Spiderman's name in two languages.
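A sketch of the two equivalent forms. The Russian rendering of the name and the FOAF `name` property are illustrative assumptions.

```turtle
# Form 1: object list. "," repeats both the subject and the predicate.
<http://example.org/#spiderman> <http://xmlns.com/foaf/0.1/name> "Spiderman", "Человек-паук"@ru .

# Form 2: the same two triples written out in full.
<http://example.org/#spiderman> <http://xmlns.com/foaf/0.1/name> "Spiderman" .
<http://example.org/#spiderman> <http://xmlns.com/foaf/0.1/name> "Человек-паук"@ru .
```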

There are four kinds of RDF terms defined in RDF Concepts: IRIs (Internationalized Resource Identifiers), literals, blank nodes, and triple terms; Turtle provides a number of ways of writing each.

Version Announcement

The Turtle language has evolved since its origin, and RDF 1.2 adds new syntax. RDF 1.2 Turtle introduces the VERSION and @version directives along with an optional `version` Media Type parameter. When serializing Turtle with new features such as initial text direction or triple terms, authors can announce the use of the new syntax forms using these directives.

Alternatively, the old-style directive can be used (note that there is a trailing "."):
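Both directive forms can be sketched as follows, assuming the version label `"1.2"`. A real document would normally use only one of them, near the top.

```turtle
# SPARQL-style directive: no trailing period.
VERSION "1.2"

# Original-style directive: the trailing period is required.
@version "1.2" .
```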

When providing content over HTTP, servers can announce the version using the optional `version` Media Type parameter, for example `Content-Type: text/turtle; version=1.2`.

When using RDF 1.2-specific features, such as initial text direction, the specific RDF version should be announced using either VERSION or @version directives early in the document. This allows parsers that do not support these features to detect the presence of such features early, and potentially inform the user, giving them an opportunity to stop the job or otherwise act on the fact that some amount of the input data will not be processed as desired.

Multiple version directives may appear in a Turtle document. The directive applies to the part of the document following the directive, until another directive is encountered or the end of the document is reached.

In the absence of a current version directive, any version specified as part of the Media Type is considered. In the absence of any version information, a sensible default is `"1.1"`, to maximize compatibility with the previous version of Turtle [[TURTLE]]. In any case, the version announcement is merely a hint; parsers are not required to reject features that are outside the announced version (but could signal them with a warning).

Version labels are defined in [[[RDF12-CONCEPTS]]] section 2.1 Version Labels. Processors may treat unrecognized labels as an error or as a warning.

While `"1.1"` is an acceptable version label, its use in a `VERSION` or `@version` directive is discouraged, as it would needlessly cause Turtle 1.1 parsers to fail.

IRIs

IRIs may be written as resolved IRIs, Relative IRI references, or prefixed names. Relative and resolved IRIs are preceded by <, followed by >, and may contain numeric escape sequences (described below). For example, <http://example.org/#green-goblin>.

Relative IRI references like <#green-goblin> are resolved relative to the current base IRI. A new base IRI can be defined using the @base or BASE directive. Specifics of this operation are defined in IRI References.

The token a in the predicate position of a Turtle triple represents the IRI http://www.w3.org/1999/02/22-rdf-syntax-ns#type .

A prefixed name is a prefix label and a local part, separated by a :. A prefixed name is turned into an IRI by concatenating the IRI associated with the prefix and the local part. The @prefix or PREFIX directive associates a prefix label with an IRI. Subsequent @prefix or PREFIX directives may re-map the same prefix label.

The Turtle language originally permitted only the syntax including the @ character for writing prefix and base directives. The case-insensitive PREFIX, BASE, and VERSION forms were added to align Turtle's syntax with that of SPARQL. The original directives @prefix, @base, and @version may also be used. Using PREFIX, BASE, and VERSION may make it easier to copy declarations to SPARQL queries.

To write http://www.perceive.net/schemas/relationship/enemyOf using a prefixed name:

Define a prefix label for the vocabulary IRI http://www.perceive.net/schemas/relationship/ as somePrefix
Then write somePrefix:enemyOf which is equivalent to writing <http://www.perceive.net/schemas/relationship/enemyOf>

This can be written using either the SPARQL-style syntax for prefix declarations:

or original Turtle syntax for prefix declarations:
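The two declaration styles can be sketched side by side. The final triple is an illustrative use of the prefix and is not taken from the text above.

```turtle
# SPARQL-style declaration (no trailing period):
PREFIX somePrefix: <http://www.perceive.net/schemas/relationship/>

# Original Turtle declaration (trailing period required):
@prefix somePrefix: <http://www.perceive.net/schemas/relationship/> .

# Illustrative triple using the prefixed name as the predicate:
<http://example.org/#green-goblin> somePrefix:enemyOf <http://example.org/#spiderman> .
```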

Prefixed names are a superset of XML QNames. They differ in that the local part of prefixed names may include:

leading digits, e.g., leg:3032571 or isbn13:9780136019701
non-leading colons, e.g., og:video:height
reserved character escape sequences, e.g., wgs:lat\-long

The following Turtle document contains examples of all the different ways of writing IRIs in Turtle.
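A sketch of such a document follows. All IRIs in it (`one.example`, `two.example`, `another.example`) are illustrative assumptions.

```turtle
# A triple with all resolved IRIs
<http://one.example/subject1> <http://one.example/predicate1> <http://one.example/object1> .

BASE <http://one.example/>
<subject2> <predicate2> <object2> .     # relative IRI references, e.g. http://one.example/subject2

PREFIX p: <http://two.example/>
p:subject3 p:predicate3 p:object3 .     # prefixed names, e.g. http://two.example/subject3

PREFIX p: <path/>                       # re-maps p: to the relative IRI, i.e. http://one.example/path/
p:subject4 p:predicate4 p:object4 .     # e.g. http://one.example/path/subject4

PREFIX : <http://another.example/>      # empty prefix label
:subject5 :predicate5 :object5 .        # e.g. http://another.example/subject5

:subject6 a :subject7 .                 # "a" in the predicate position stands for rdf:type
```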

While the @prefix and @base directives require a trailing . after the IRI, the equivalent PREFIX and BASE do not have a trailing . after the IRI part of the directive. The PREFIX and BASE are case-insensitive and can be written as prefix and base or as any blend of mixed case.

RDF Literals

Literals are used to identify values such as strings, numbers, and dates.

Quoted Literals

Quoted Literals (Grammar production RDFLiteral) have a lexical form followed by a language tag (possibly including an initial text direction), a datatype IRI, or neither.

The representation of the lexical form consists of an initial delimiter (e.g., ", ', """, or '''); a sequence of permitted characters, numeric escape sequences, and/or string escape sequences; and a final delimiter matching the initial delimiter. The corresponding RDF lexical form is the characters between the delimiters, after processing any escape sequences.

If present, the language tag is preceded by an @ and may be followed by an initial text direction separated from the language tag by --.

If there is no language tag, there may be a datatype IRI, preceded by ^^. A datatype IRI in Turtle may be written using either a resolved IRI, a relative IRI reference, or a prefixed name.

If there is no datatype IRI and no language tag, the datatype is xsd:string.

A \ may not appear in any quoted literal except as part of an escape sequence. Other restrictions depend on the delimiter:

Literals delimited by ' may not contain unescaped ', LF, nor CR characters.
Literals delimited by " may not contain unescaped ", LF, nor CR characters.
Literals delimited by ''' may not contain the sequence '''.
Literals delimited by """ may not contain the sequence """.
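The forms described above can be sketched as follows. The `show:` vocabulary and all literal values are illustrative assumptions.

```turtle
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX show: <http://example.org/vocab/show/>
PREFIX xsd:  <http://www.w3.org/2001/XMLSchema#>

show:218 rdfs:label "That Seventies Show"^^xsd:string .             # explicit datatype IRI
show:218 rdfs:label "That Seventies Show" .                         # same literal: datatype defaults to xsd:string
show:218 show:localName "That Seventies Show"@en .                  # language tag
show:218 show:localName 'Cette Série des Années Soixante-dix'@fr .  # delimited by single quotes
show:218 show:greeting "שלום"@he--rtl .                             # language tag with initial text direction
show:218 show:blurb '''This is a multi-line
literal with many quotes (""""")
and up to two sequential apostrophes ('').''' .
```

Note that a comment cannot be placed inside the long literal: everything between the delimiters, including `#`, is part of the lexical form.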

Numbers

Numbers can be written like other literals with lexical form and datatype (e.g., "-5.0"^^xsd:decimal). Turtle has a shorthand syntax for writing integer values, arbitrary precision decimal values, and double precision floating point values.

Data Type	Abbreviated	Lexical	Description
xsd:integer	`-5`	`"-5"^^xsd:integer`	Integer values may be written as an optional sign and a series of digits. Integers match the regular expression "`[+-]?[0-9]+`".
xsd:decimal	`-5.0`	`"-5.0"^^xsd:decimal`	Arbitrary-precision decimals may be written as an optional sign, zero or more digits, a decimal point, and one or more digits. Decimals match the regular expression "`[+-]?[0-9]*\.[0-9]+`".
xsd:double	`4.2E9`	`"4.2E9"^^xsd:double`	Double-precision floating point values may be written as an optionally signed mantissa with an optional decimal point, the letter `e` or the letter `E`, and an optionally signed integer exponent. The exponent matches the regular expression "`[+-]?[0-9]+`" and the mantissa matches one of these regular expressions: "`[+-]?[0-9]+\.[0-9]+`", "`[+-]?\.[0-9]+`", or "`[+-]?[0-9]+`".
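A short sketch of the three shorthand forms next to a fully written literal. The property names and values are illustrative assumptions.

```turtle
PREFIX :    <http://example.org/elements#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>

<http://en.wikipedia.org/wiki/Helium>
    :atomicNumber 2 ;                        # xsd:integer
    :atomicMass 4.002602 ;                   # xsd:decimal
    :specificGravity 1.663E-4 ;              # xsd:double
    :boilingPoint "-268.9"^^xsd:decimal .    # lexical form plus datatype, no shorthand
```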

Booleans

Boolean values may be written as either true or false (case-sensitive) and represent RDF literals with the datatype xsd:boolean [[XMLSCHEMA11-2]].
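For illustration, the shorthand and the fully written form below denote the same literal. The subject and property are assumed names.

```turtle
PREFIX :    <http://example.org/stats#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>

<http://somecountry.example/census2007> :isLandlocked false .                  # shorthand
<http://somecountry.example/census2007> :isLandlocked "false"^^xsd:boolean .   # same literal, written in full
```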

RDF Blank Nodes

RDF blank nodes in Turtle are expressed as _: followed by a blank node identifier which is a series of characters. The characters in the identifier are built upon PN_CHARS_BASE, liberalized as follows:

The character _ and the digit characters 0–9 may appear anywhere in a blank node identifier.
The character . may appear anywhere except the first or last character.
The characters -, ·, ‿, ⁀, and the combining diacritical marks (U+0300 to U+036F) are permitted anywhere except the first character.

A fresh RDF blank node is allocated for each unique blank node identifier in a document. Repeated use of the same blank node identifier identifies the same blank node.
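A minimal sketch with two labelled blank nodes. The labels `alice` and `bob` and the FOAF property are illustrative choices.

```turtle
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

# _:alice and _:bob each denote one blank node throughout the document.
_:alice foaf:knows _:bob .
_:bob   foaf:knows _:alice .
```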

Nesting of Blank Nodes without Blank Node Identifiers

In Turtle, fresh RDF blank nodes are also allocated when matching the production blankNodePropertyList and the terminal ANON. Both of these may appear in the subject or object position of a triple (see the Turtle Grammar). That subject or object is a fresh RDF blank node. This blank node also serves as the subject of the triples produced by matching the predicateObjectList production embedded in a blankNodePropertyList. The generation of these triples is described in Predicate Lists. Blank nodes are also allocated for collections described below.

The Turtle grammar allows blankNodePropertyLists to be nested. In this case, each inner [ establishes a new subject blank node which reverts to the outer node at the ], and serves as the current subject for predicate object lists.

The use of predicateObjectList within a blankNodePropertyList is a common idiom for representing a series of properties of a node.

Abbreviated:	Corresponding simple triples:
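A sketch of nested blankNodePropertyLists and the simple triples they stand for. The names and the mailbox IRI are illustrative, and the labels `_:a`, `_:b`, and `_:c` stand in for the fresh blank nodes a parser would allocate.

```turtle
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

# Abbreviated form:
[ foaf:name "Alice" ] foaf:knows [
    foaf:name "Bob" ;
    foaf:knows [ foaf:name "Eve" ] ;
    foaf:mbox <mailto:bob@example.com>
] .

# Corresponding simple triples (shown as comments so that the document
# above is not stated twice):
#   _:a foaf:name  "Alice" .
#   _:a foaf:knows _:b .
#   _:b foaf:name  "Bob" .
#   _:b foaf:knows _:c .
#   _:c foaf:name  "Eve" .
#   _:b foaf:mbox  <mailto:bob@example.com> .
```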

Collections

RDF provides a Collection [[RDF12-SEMANTICS]] structure for lists of RDF nodes. The Turtle syntax for Collections is a possibly empty list of RDF terms enclosed by (). This collection represents an rdf:first/rdf:rest list structure with the sequence of objects of the rdf:first statements being the order of the terms enclosed by ().

The (…) syntax MUST appear in the subject or object position of a triple (see the Turtle Grammar). The blank node at the head of the list is the subject or object of the containing triple.
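A brief sketch of a non-empty and an empty collection in the object position, using an assumed namespace:

```turtle
PREFIX : <http://example.org/foo#>

# The object of this triple is the blank node at the head of the list.
:subject :predicate ( :a :b :c ) .

# An empty collection is the IRI rdf:nil.
:subject :predicate2 () .
```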

Triple Terms

A triple term may be the object of an RDF triple.

A triple term is represented as a tripleTerm with ttSubject, predicate, and ttObject, all preceded by <<(, and all followed by )>>. Note that triple terms may be nested.
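As a sketch, a triple term used as the object of a triple might be written as follows. The blank node labels and the `:jobTitle` and `:accordingTo` properties are illustrative assumptions.

```turtle
VERSION "1.2"
PREFIX :    <http://example.com/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>

_:e38  :familyName "Smith" .
# The triple term is only the object here; the triple it contains is not asserted.
_:anno rdf:reifies <<( _:e38 :jobTitle "Designer" )>> .
_:anno :accordingTo _:e22 .
```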

Reifying Triples

In RDF, triple terms are rarely used directly, as they are generally restricted to be used only as the object of a triple using the `rdf:reifies` predicate. Such a triple is called a reifying triple. Turtle provides a shorthand notation for writing reifying triples using the reifiedTriple production.

A reifiedTriple is syntactic sugar representing a reifying triple, which defines a specific relationship between an identifier (reifier) and a triple term. The identifier becomes a way to indirectly refer to a triple term, which may or may not be asserted within the graph corresponding to this input document.

Reification in RDF 1.2 is a concept distinct from the Reification vocabulary originally defined in RDF Semantics. While both terms describe a representation of an RDF triple using components, RDF 1.2 uses the term to identify a triple term using the `rdf:reifies` predicate.

A reifying triple is represented using the reifiedTriple production starting with <<, followed by a rtSubject, a predicate, and a rtObject, followed by an optional reifier, composed of a ~ followed by an optional iri production or BlankNode production, and ending with >>. For example, `<< :subject :predicate :object ~ :IRIREF >>`. If no reifiers are present, or a reifier is not immediately followed by an iri or BlankNode, a fresh RDF blank node is allocated, as with `<< :subject :predicate :object >>`, or `<< :subject :predicate :object ~ >>`.

reifiedTriples may be nested, like
`<< :subject1 :predicate1 << :subject2 :predicate2 :object2 >> ~:IRIREF1 >>` or
`<< :subject4 :predicate4 << :subject3 :predicate3 :object3 ~:IRIREF3 >> >>`.

If a reifiedTriple is not identified by an IRI or blank node, a fresh RDF blank node is allocated and used to identify this relationship.

For instance, the syntactic sugar of the above example is equivalent to:
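The following sketch shows a reifiedTriple and one possible long form of it. The `:accordingTo` property and the namespace are assumptions, and `_:r` stands in for the fresh blank node that would be allocated as the reifier.

```turtle
VERSION "1.2"
PREFIX :    <http://www.example.org/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>

:employee38 :familyName "Smith" .

# Shorthand: a reifiedTriple without an explicit reifier, used as a subject.
<< :employee38 :jobTitle "Assistant Designer" >> :accordingTo :employee22 .

# Equivalent long form (shown as comments, since stating it as well would
# allocate a second reifier):
#   _:r rdf:reifies <<( :employee38 :jobTitle "Assistant Designer" )>> .
#   _:r :accordingTo :employee22 .
```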

Note the difference in syntax between the syntactic sugar of reifiedTriple (i.e., `<< [...] >>`) and the regular tripleTerm (i.e., `<<( [...] )>>`).

After declaring a prefix so that IRIs can be abbreviated, the first triple in this example asserts that `employee38` has a `familyName` of "Smith". Note that this graph does not assert that `employee38` has a `jobTitle` of "Assistant Designer"; it says that `employee22` has made that claim using a reifiedTriple. In other words, the triple "`employee38` has a `jobTitle` of 'Assistant Designer'" is not a member of the graph, itself, as "employee38 has a `familyName` of 'Smith'" is above; rather, it is known as a reifying triple.

A reifiedTriple is syntactic sugar relating a reifier to a tripleTerm using the rdf:reifies predicate.

Annotation Syntax

Turtle also defines an annotation syntax to both reify and assert a triple, which provides a convenient shortcut. An annotation can be used to simultaneously assert a triple, via an explicit or implicit identifier, and have that triple be the subject or object of further triples. If explicitly identified, the same reifier can then be used as the subject or object of additional triples and/or triple terms. As with a reifiedTriple, the annotation syntax allows the definition of one or more reifiers as either IRIs or blank nodes, each preceded by a tilde (~), which precedes the annotation block. If not followed by an annotation block, a reifier is treated like a reifiedTriple without annotations. If an annotation block is not immediately preceded by a reifier, an RDF blank node is allocated to serve as the reifier of the triple term.

The annotation syntax is a syntactic shortcut in Turtle, and the RDF Abstract Syntax [[RDF12-CONCEPTS]] does not distinguish how the triples were written.
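A sketch of an annotated statement with an explicit reifier, using assumed names (`:a`, `:name`, `:statedBy`, `:bob`) and a single annotation property:

```turtle
VERSION "1.2"
PREFIX : <http://example/>

# Asserts :a :name "Alice", names its reifier :t, and annotates :t.
:a :name "Alice" ~ :t {| :statedBy :bob |} .
```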

is the same set of triples as:
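Under the same illustrative names, the annotated statement `:a :name "Alice" ~ :t {| :statedBy :bob |}` might be written with a reifiedTriple like this:

```turtle
VERSION "1.2"
PREFIX : <http://example/>

:a :name "Alice" .                              # the asserted triple
<< :a :name "Alice" ~ :t >> :statedBy :bob .    # the reifying triple plus one annotation
```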

and the graph contains three triples. The reifier is identified by `:t`.

Fully expanding to use triple terms, instead of reifiers, results in the following:
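A sketch of the fully expanded form of the illustrative statement `:a :name "Alice" ~ :t {| :statedBy :bob |}`, which yields three triples:

```turtle
VERSION "1.2"
PREFIX :    <http://example/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>

:a :name "Alice" .                           # 1: the asserted triple
:t rdf:reifies <<( :a :name "Alice" )>> .    # 2: the reifying triple
:t :statedBy :bob .                          # 3: the annotation on the reifier
```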

An annotation can include any number of reifiers.
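For instance, a statement with two reifiers, each followed by its own annotation block, could be sketched as follows. All names are illustrative assumptions.

```turtle
VERSION "1.2"
PREFIX :    <http://example/>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>

:a :name "Alice" ~ :t1 {| :statedBy :bob |}
                 ~ :t2 {| :recorded "2021-07-07"^^xsd:date |} .
```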

Examples

This example is a Turtle translation of example 7 in [[[RDF12-XML]]] (example1.ttl):
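A sketch of what such a translation could look like, based on the familiar RDF/XML example that describes the RDF/XML specification and its editor. The exact contents of example1.ttl are an assumption here.

```turtle
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dc:  <http://purl.org/dc/elements/1.1/>
PREFIX ex:  <http://example.org/stuff/1.0/>

<http://www.w3.org/TR/rdf-syntax-grammar>
    dc:title "RDF/XML Syntax Specification (Revised)" ;
    ex:editor [
        ex:fullname "Dave Beckett" ;
        ex:homePage <http://purl.org/net/dajobe/>
    ] .
```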

An example of an RDF collection of two literals.

which is short for (example2.ttl):
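Both forms can be sketched together, assuming an illustrative namespace and the literals "apple" and "banana":

```turtle
PREFIX :    <http://example.org/stuff/1.0/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>

# Collection shorthand:
:a :b ( "apple" "banana" ) .

# Long form of the same list structure:
:a :b
  [ rdf:first "apple" ;
    rdf:rest [ rdf:first "banana" ;
               rdf:rest rdf:nil ]
  ] .
```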

An example of two identical triples containing literal objects containing newlines, written in plain and long literal forms. The line breaks in this example are LF. (example3.ttl):
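A sketch of the two forms, with assumed names and text. Both statements denote the same triple because the `\n` escapes and the literal line breaks produce the same lexical form.

```turtle
PREFIX : <http://example.org/stuff/1.0/>

# Plain form: newlines written as \n escapes.
:a :b "The first line\nThe second line\n  more" .

# Long form: newlines written literally.
:a :b """The first line
The second line
  more""" .
```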

As indicated by the grammar, a collection can be either a subject or an object. This subject or object will be the novel blank node for the first object, if the collection has one or more objects, or rdf:nil if the collection is empty.

For example,

is syntactic sugar for (noting that the blank nodes b0, b1, and b2 do not occur anywhere else in the RDF graph):
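A sketch of a collection in the subject position and its long form, using an assumed namespace and property:

```turtle
PREFIX :    <http://example.org/stuff/1.0/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>

# Shorthand:
(1 2.0 3E1) :p "w" .

# Long form (as comments; b0, b1 and b2 stand in for fresh blank nodes):
#   _:b0 rdf:first 1   ; rdf:rest _:b1 .
#   _:b1 rdf:first 2.0 ; rdf:rest _:b2 .
#   _:b2 rdf:first 3E1 ; rdf:rest rdf:nil .
#   _:b0 :p "w" .
```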

RDF collections can be nested and can involve other syntactic forms:

is syntactic sugar for:
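A sketch of a nested collection that also contains a blankNodePropertyList. The names are illustrative, and the labels in the comments stand in for fresh blank nodes.

```turtle
PREFIX :    <http://example.org/stuff/1.0/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>

# Shorthand:
(1 [ :p :q ] ( 2 )) :p2 :q2 .

# Long form (as comments):
#   _:b0 rdf:first 1 ;    rdf:rest _:b1 .
#   _:b1 rdf:first _:b2 ; rdf:rest _:b3 .
#   _:b2 :p :q .
#   _:b3 rdf:first _:b4 ; rdf:rest rdf:nil .
#   _:b4 rdf:first 2 ;    rdf:rest rdf:nil .
#   _:b0 :p2 :q2 .
```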

Turtle compared to SPARQL

The [[[SPARQL12-QUERY]]] (SPARQL) [[SPARQL12-QUERY]] uses a Turtle-style syntax for its TriplesBlock production. This production differs from the Turtle language in that:

SPARQL permits RDF Literals as the subject of Triple Patterns.
SPARQL permits variables (?name or $name) in any part of a triple pattern.
Turtle allows prefix, base, and version declarations anywhere outside of a triple. In SPARQL, they are only allowed in the Prologue (at the start of the SPARQL query).
SPARQL uses case-insensitive keywords, except for a. Turtle's @prefix and @base declarations are case sensitive; the SPARQL-derived PREFIX and BASE are case insensitive.
true and false are case insensitive in SPARQL and case sensitive in Turtle. TrUe is not a valid boolean value in Turtle.

For further information see the Syntax for IRIs and SPARQL Grammar sections of the SPARQL query document [[SPARQL12-QUERY]].

Turtle Grammar

A Turtle document is an RDF string encoded in UTF-8 [[!RFC3629]]. Only Unicode scalar values, in the ranges U+0000 to U+D7FF and U+E000 to U+10FFFF, are allowed. This excludes surrogate code points, range U+D800 to U+DFFF.

White Space

White space (production WS) is used to separate two terminals which would otherwise be (mis-)recognized as one terminal. Rule names below in capitals indicate where white space is significant; these form a possible choice of terminals for constructing a Turtle parser.

White space is significant in the production String.

Comments

Comments in Turtle start with a # outside an IRIREF, STRING_LITERAL_SINGLE_QUOTE, STRING_LITERAL_QUOTE, STRING_LITERAL_LONG_SINGLE_QUOTE, or STRING_LITERAL_LONG_QUOTE, and continue to the end of line (marked by LF or CR), or end of file if there is no end of line after the comment marker. Comments are treated as white space.

IRI References

Relative IRI references are resolved with base IRIs as per [[[RFC3986]]] [[RFC3986]] using only the basic algorithm in section 5.2. Neither Syntax-Based Normalization nor Scheme-Based Normalization (described in sections 6.2.2 and 6.2.3 of RFC3986) is performed. Characters additionally allowed in IRI references are treated in the same way that unreserved characters are treated in URI references, per section 6.5 of [[[RFC3987]]] [[RFC3987]].

The @base or BASE directive defines the Base IRI used to resolve relative IRI references per [[RFC3986]] section 5.1.1, "Base URI Embedded in Content". Section 5.1.2, "Base URI from the Encapsulating Entity" defines how the In-Scope Base IRI may come from an encapsulating document, such as a SOAP envelope with an `xml:base` directive or a MIME multipart document with a `Content-Location` header. The "Retrieval URI" identified in section 5.1.3, "Base URI from the Retrieval URI", is the URL from which a particular Turtle document was retrieved. If none of the above specifies the Base URI, the default Base URI (section 5.1.4, "Default Base URI") is used. Each @base or BASE directive sets a new In-Scope Base URI, relative to the previous one.
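A sketch of how successive base directives interact, with illustrative IRIs. The resolved forms in the comments follow the basic algorithm of RFC 3986 section 5.2.

```turtle
BASE <http://example.org/a/b>
<c> <p> <#frag> .    # <c> is http://example.org/a/c, <#frag> is http://example.org/a/b#frag

BASE <d/>            # relative to the previous base: the new base is http://example.org/a/d/
<e> <p> <../f> .     # <e> is http://example.org/a/d/e, <../f> is http://example.org/a/f
```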

Escape Sequences

There are three forms of escapes used in Turtle documents:

A numeric escape sequence represents the value of a Unicode code point.

A numeric escape sequence MUST NOT produce a code point value in the range U+D800 to U+DFFF, which is the range for Unicode surrogates.

Escape sequence	Unicode code point
`\u` `hex` `hex` `hex` `hex`	A Unicode code point in the ranges `U+0000` to `U+D7FF` and `U+E000` to `U+FFFF`, corresponding to the value encoded by the four hexadecimal digits interpreted from most significant to least significant digit.
`\U` `hex` `hex` `hex` `hex` `hex` `hex` `hex` `hex`	A Unicode code point in the ranges `U+0000` to `U+D7FF` and `U+E000` to `U+10FFFF`, corresponding to the value encoded by the eight hexadecimal digits interpreted from most significant to least significant digit.

where hex is a hexadecimal character

HEX ::= [0-9] | [A-F] | [a-f]

A string escape sequence represents a character traditionally escaped in string literals:

Escape sequence	Unicode code point
`\t`	`U+0009`
`\b`	`U+0008`
`\n`	`U+000A`
`\r`	`U+000D`
`\f`	`U+000C`
`\"`	`U+0022`
`\'`	`U+0027`
`\\`	`U+005C`

A reserved character escape sequence consists of a \ followed by one of these characters ~.-!$&'()*+,;=/?#@%_, and represents the character to the right of the \.

Context where each kind of escape sequence can be used
	numeric escapes	string escapes	reserved character escapes
IRIs, used as RDF terms or as in `@prefix`, `PREFIX`, `@base`, or `BASE` declarations	yes	no	no
local names	no	no	yes
Strings	yes	yes	no

%-encoded sequences are in the character range for IRIs and are explicitly allowed in local names. These appear as a % followed by two hex characters and represent that same sequence of three characters. These sequences are not decoded during processing. A term written as <http://a.example/%66oo-bar> in Turtle designates the IRI http://a.example/%66oo-bar and not IRI http://a.example/foo-bar. A term written as ex:%66oo-bar with a prefix PREFIX ex: <http://a.example/> also designates the IRI http://a.example/%66oo-bar.
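The three kinds of escapes, and the treatment of %-encoded sequences, can be sketched in one small document. The `ex:p` and `ex:o` names and the string contents are illustrative assumptions.

```turtle
PREFIX ex: <http://a.example/>

# Numeric escape in an IRI: \u0073 is "s", so the subject is <http://a.example/s>.
# The string mixes numeric escapes (\u, \U) with string escapes (\t, \", \\).
<http://a.example/\u0073> ex:p "numeric: \u00E9 \U0001F600; string: \t \" \\" .

# Reserved character escape in a local name: designates <http://a.example/lat-long>.
ex:lat\-long ex:p ex:o .

# %-encoded sequences are not decoded: designates <http://a.example/%66oo-bar>.
ex:%66oo-bar ex:p ex:o .
```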

Grammar

The EBNF used here is defined in XML 1.0 [[!EBNF-NOTATION]].

Notes:

Keywords in single quotes ('@base', '@prefix', '@version', 'a', 'true', and 'false') are case-sensitive. Keywords in quotation marks ("BASE", "PREFIX", and "VERSION") are case-insensitive.
Escape sequences UCHAR and ECHAR are case sensitive.
When tokenizing the input and choosing grammar rules, the longest match is chosen.
The Turtle grammar is LL(1) and LALR(1) when the rules with uppercased names are used as terminals.
The entry point into the grammar is turtleDoc.
In signed numbers, no white space is allowed between the sign and the number.
The strings '@prefix', '@base', and '@version' match the pattern for LANG_DIR, though none of prefix, base, or version is a registered language subtag. This specification does not define whether a quoted literal followed by any of these tokens (e.g., "A"@base) is in the Turtle language.

A text version of this grammar is available here.

Selected Terminal Literal Strings

This document uses some specific terminal literal strings [[EBNF-NOTATION]]. To clarify the Unicode code points used for these terminal literal strings, the following table describes specific characters and sequences used throughout this document.

Code	Glyph	Description
`U+0009`	`HT`	Horizontal tab
`U+000A`	`LF`	Line feed
`U+000D`	`CR`	Carriage return
`U+0022`	`"`	Quotation mark
`U+0023`	`#`	Number sign
`U+0025`	`%`	Percent sign
`U+0027`	`'`	Apostrophe
`U+0028`	`(`	Left parenthesis
`U+0029`	`)`	Right parenthesis
`U+002C`	`,`	Comma
`U+002D`	`-`	Hyphen
`U+002E`	`.`	Full stop
`U+0030`	`0`	Digit zero
`U+0039`	`9`	Digit nine
`U+003A`	`:`	Colon
`U+003B`	`;`	Semicolon
`U+003C`	`<`	Less-than sign
`U+003E`	`>`	Greater-than sign
`U+0040`	`@`	At sign
`U+0045`	`E`	Latin capital letter E
`U+005B`	`[`	Left square bracket
`U+005C`	`\`	Backslash
`U+005D`	`]`	Right square bracket
`U+005F`	`_`	Underscore
`U+0061`	`a`	Latin small letter A
`U+0065`	`e`	Latin small letter E
`U+007C`	`|`	Vertical line
`U+007E`	`~`	Tilde
`U+00B7`	`·`	Middle dot
`U+203F`	`‿`	Undertie
`U+2040`	`⁀`	Character tie

Other short terminal literal strings are composed of specific sequences of Unicode characters:

space: U+0020
""": three concatenated quotation mark characters, each having the code point U+0022
''': three concatenated apostrophe characters, each having the code point U+0027
<<: two concatenated less-than sign characters, each having the code point U+003C
>>: two concatenated greater-than sign characters, each having the code point U+003E
<<(: two concatenated less-than sign characters, each having the code point U+003C, followed by a left parenthesis character, having the code point U+0028
)>>: a right parenthesis character, having the code point U+0029, followed by two concatenated greater-than sign characters, each having the code point U+003E
^^: two concatenated circumflex accent characters, each having the code point U+005E
{|: { (left curly bracket, code point U+007B) followed by | (vertical line, code point U+007C)
|}: | (vertical line, code point U+007C) followed by } (right curly bracket, code point U+007D)
_:: _ followed by :
--: two concatenated - characters

Parsing

The RDF 1.2 Concepts and Abstract Syntax specification [[!RDF12-CONCEPTS]] defines four types of RDF terms: IRIs, literals, blank nodes, and triple terms. Literals are composed of a lexical form and an optional language tag [[!BCP47]] – possibly including an initial text direction – or datatype IRI. An extra type, prefix, is used during parsing to map string identifiers to namespace IRIs. This section maps a string conforming to the grammar in the Grammar section to a set of triples by mapping strings matching productions and lexical tokens to RDF terms or their components (e.g., language tags, lexical forms of literals). Grammar productions change the parser state and emit triples.

Parser State

Parsing Turtle requires a state of nine items:

IRI |baseURI| — When the base production is reached, the second rule argument, IRIREF, is the base URI used for relative IRI resolution.
Map[prefix -> IRI] |namespaces| — The second and third rule arguments (PNAME_NS and IRIREF) in the prefixID production assign a namespace name (IRIREF) for the prefix (PNAME_NS). Outside of a prefixID or sparqlPrefix production, any PNAME_NS is substituted with the namespace from the current state of the namespaces map. Note that the prefix may be an empty string, per the PNAME_NS production: PN_PREFIX? ':'.
Map[string -> blank node] |bnodeLabels| — A mapping from a string to a blank node.
RDF Term |curSubject| — The |curSubject| is bound to the subject, rtSubject, ttSubject, blankNodePropertyList, collection, and annotationBlock productions.
RDF Term |curPredicate| — The |curPredicate| is bound to the verb production. If token matched was a, |curPredicate| is bound to the IRI http://www.w3.org/1999/02/22-rdf-syntax-ns#type.
RDF Term |curObject| — The |curObject| is bound to the object, rtObject, and ttObject productions.
RDF Term |curReifier| — The |curReifier| is bound to the reifier and annotationBlock productions.
Triple term |curTripleTerm| — The |curTripleTerm| is set in the Annotations constructor.
`xsd:string` |curVersion| – The RDF version used for parsing the document into Triples. If specified as part of a Media Type, the default value for |curVersion| is taken from the `version` parameter. Acceptable values for |curVersion| are defined in 2.1 Version Labels in [[RDF12-CONCEPTS]]. The version announcement is only a hint; this specification does not mandate any parser behavior based on |curVersion|, but a parser MAY signal an error or a warning when it encounters a feature that does not match the value of |curVersion|, or an unacceptable value for |curVersion|.

Term Constructors can create a stack of these values indicated by using language such as "records the |curSubject| and |curPredicate|."

RDF Term Constructors

This table maps productions and lexical tokens to RDF terms or components of RDF terms listed in Parsing:

production	type	procedure
IRIREF	IRI	The characters between `<` and `>` are taken, with the numeric escape sequences unescaped, to form the IRI. Relative IRI reference resolution is performed per Section 6.3. The resulting IRI MUST comply with the syntactic restrictions of generic IRI syntax, and SHOULD conform to section 3.3 of [[RFC3986]] and comply with any narrower restrictions imposed by the corresponding IRI scheme specification.
PNAME_NS	prefix	When used in a `prefixID` or `sparqlPrefix` production, the `prefix` is the potentially empty RDF string matching the first argument of the rule; it is a key into the namespaces map into which the expanded second argument is stored for future lookup.
PNAME_NS	IRI	When used in a `PrefixedName` production, the namespaces map MUST have a corresponding `namespace`, which forms the RDF string of the IRI. The resulting IRI MUST comply with the syntactic restrictions of generic IRI syntax, and SHOULD comply with any narrower restrictions imposed by the corresponding IRI scheme specification.
PNAME_LN	IRI	A potentially empty prefix is identified by the first sequence, `PNAME_NS`. The namespaces map MUST have a corresponding `namespace`. The RDF string of the IRI is formed by unescaping the reserved characters in the second argument, `PN_LOCAL`, and concatenating this onto the `namespace`. The resulting IRI MUST comply with the syntactic restrictions of generic IRI syntax, and SHOULD comply with any narrower restrictions imposed by the corresponding IRI scheme specification.
VersionSpecifier	literal	The |curVersion| is taken from a literal using the matched RDF string lexical form and `xsd:string` datatype.
STRING_LITERAL_SINGLE_QUOTE	lexical form	The characters between the outermost `'`s are taken, after numeric and string escape sequences are replaced with the characters that they represent, to form the RDF string of a lexical form.
STRING_LITERAL_QUOTE	lexical form	The characters between the outermost `"`s are taken, with numeric and string escape sequences unescaped, to form the RDF string of a lexical form.
STRING_LITERAL_LONG_SINGLE_QUOTE	lexical form	The characters between the outermost `'''`s are taken, with numeric and string escape sequences unescaped, to form the RDF string of a lexical form.
STRING_LITERAL_LONG_QUOTE	lexical form	The characters between the outermost `"""`s are taken, after numeric and string escape sequences are replaced with the characters that they represent, to form the RDF string of a lexical form.
LANG_DIR	language tag	The characters following the `@` form the language tag and optionally the initial text direction, if the matched characters include `--`. The language tag MUST be well-formed according to section 2.2.9 of [[!BCP47]]. If present, the initial text direction MUST be either `ltr` or `rtl`.
RDFLiteral	literal	The literal has a lexical form of the first rule argument, `String`. If the `'^^' iri` rule is matched, the datatype IRI is derived from the `iri`, and the literal has no language tag. If the `LANG_DIR` rule is matched, the language tag and initial text direction are taken from `LANG_DIR`. If there is no initial text direction, the datatype is `rdf:langString`. If there is an initial text direction, the datatype is `rdf:dirLangString`. If neither matched, the datatype is `xsd:string`, and the literal has no language tag.
INTEGER	literal	The literal has a lexical form of the input string, and a datatype of `xsd:integer`.
DECIMAL	literal	The literal has a lexical form of the input string, and a datatype of `xsd:decimal`.
DOUBLE	literal	The literal has a lexical form of the input string, and a datatype of `xsd:double`.
BooleanLiteral	literal	The literal has a lexical form of `true` or `false`, depending on which matched the input, and a datatype of `xsd:boolean`.
BLANK_NODE_LABEL	blank node	The string matching the second argument, `PN_LOCAL`, is a key in bnodeLabels. If there is no corresponding blank node in the map, one is allocated.
ANON	blank node	A blank node is generated.
blankNodePropertyList	blank node	A blank node is generated. Note the rules for `blankNodePropertyList` in the next section.
collection	blank node	For non-empty lists, a blank node is generated. Note the rules for `collection` in the next section.
collection	IRI	For empty lists, the resulting IRI is `rdf:nil`. Note the rules for `collection` in the next section.
reifier	IRI | blank node	The |curReifier| is taken from term, which is taken from the matched `iri` production or `BlankNode` production, if any. If no such production is matched, term is taken from a fresh RDF blank node.
tripleTerm	triple term	The triple term is composed of the terms constructed from the `ttSubject`, `predicate`, and `ttObject` productions.
reifiedTriple	IRI | blank node	The term is taken from the matched `reifier`, if any, or from a fresh RDF blank node.
annotationBlock	IRI | blank node	The term is taken from a previously matched reifier, if any, or from a fresh RDF blank node.

As processors which detect errors on input can result in graphs which contain fewer triples than are described in the input (including no triples whatsoever), consumers should consider information of any errors signaled when using the output triples, which may be incomplete and/or include ill-typed or ill-formed terms.

RDF Triples Constructors

Reifiers

Reified Triples

Annotations

Annotation Blocks

Property Lists

Collections

Parsing Example

The following informative example shows the semantic actions performed when parsing this Turtle document with an LALR(1) parser:
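A document consistent with the steps listed in this section can be sketched as follows. The IRI for Dan Brickley is a placeholder, because the steps show it only in elided form (`<...who/dan-brickley>`); everything else is taken from the steps themselves.

```turtle
PREFIX ericFoaf: <http://www.w3.org/People/Eric/ericP-foaf.rdf#>
PREFIX : <http://xmlns.com/foaf/0.1/>

ericFoaf:ericP :givenName "Eric" ;
    :knows <http://example.org/who/dan-brickley> ,   # placeholder IRI, elided in the steps
           [ :mbox <mailto:timbl@w3.org> ] ,         # the blank node shown as _:1
           <http://getopenid.com/amyvdh> .
```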

Map the prefix ericFoaf to the IRI http://www.w3.org/People/Eric/ericP-foaf.rdf#.
Map the empty prefix to the IRI http://xmlns.com/foaf/0.1/.
Assign |curSubject| the IRI http://www.w3.org/People/Eric/ericP-foaf.rdf#ericP.
Assign |curPredicate| the IRI http://xmlns.com/foaf/0.1/givenName.
Emit an RDF triple: <...rdf#ericP> <.../givenName> "Eric" .
Assign |curPredicate| the IRI http://xmlns.com/foaf/0.1/knows.
Emit an RDF triple: <...rdf#ericP> <.../knows> <...who/dan-brickley> .
Emit an RDF triple: <...rdf#ericP> <.../knows> _:1 .
Save |curSubject| and reassign to the blank node _:1.
Save |curPredicate|.
Assign |curPredicate| the IRI http://xmlns.com/foaf/0.1/mbox.
Emit an RDF triple: _:1 <.../mbox> <mailto:timbl@w3.org> .
Restore |curSubject| and |curPredicate| to their saved values (<...rdf#ericP>, <.../knows>).
Emit an RDF triple: <...rdf#ericP> <.../knows> <http://getopenid.com/amyvdh> .
