DEREKO Corpus Sampler

(250 corpus graphs)

1. General information

Name: DEREKO Corpus Sampler
ID: DEREKOSampler
Format: DEREKO format, Version 1.0
Author: DEREKO project
Date: January 2002
Description: A sampler of 250 sentences from the DEREKO Corpus, included with kind permission of the DEREKO project.

2. Corpus details

Features (T): word, pos
Features (NT): cat
Labelled edges: no
Crossing edges: no
Secondary edges: no

3. Statistical information

Number of corpus graphs: 250
Number of tokens: 3595
Average number of tokens per corpus graph: 14.4
Number of inner nodes: 3426
Number of edges: 6771
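
The figures in this section can be cross-checked against one another. The sketch below is illustrative only and not part of the corpus tooling; the function names are hypothetical. It recomputes the average from the token and graph counts, and checks the edge count under the assumption that every corpus graph is a single-rooted tree (plausible given that no secondary edges are present), in which case every node except the root has exactly one incoming edge.

```python
# Figures copied from the statistics in this section.
graphs = 250
tokens = 3595
inner_nodes = 3426
edges = 6771


def average_tokens(tokens, graphs):
    """Mean number of tokens per corpus graph, rounded to one decimal."""
    return round(tokens / graphs, 1)


def expected_tree_edges(tokens, inner_nodes, graphs):
    """Edge count if each graph is a single-rooted tree (an assumption).

    In a tree, every node except the root has exactly one incoming edge,
    so the total is (all nodes) minus (one root per graph).
    """
    return tokens + inner_nodes - graphs


avg = average_tokens(tokens, graphs)
tree_edges = expected_tree_edges(tokens, inner_nodes, graphs)

print(avg)                  # 3595 / 250 = 14.38, rounded to 14.4
print(tree_edges == edges)  # 3595 + 3426 - 250 = 6771
```

Under that assumption the reported counts are mutually consistent: the average matches the listed 14.4 and the tree-based edge count matches the listed 6771.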

4. Feature documentation

Feature values: pos

Feature values: cat