Type
Description:
The orthographical representation of a word as found in the corpus; this data is case sensitive, i.e. there is a distinction between name and Name. Whitespace is replaced by underscores (Santiago_de_Chile).
Data type:
- String
- case-sensitive
- eq, regex
- no null value
Available in tables:
Also, in the ngrams tables, you can use this filter on any of the components:In the neighbors tables, this filter is the key to obtaining the orthographical neighbors of a certain type:
See also:
In the downcased tables, there is a column and corresponding filter that contains a downcased representation of a type:
Contents
Current version
- 0.3
- New tables: all measures in case-insensitive variant.