Name: | Sampler of the Chinese Treebank |
ID: | CHINESETreebankSampler |
Format: | extended bracketing format |
Author: | University of Pennsylvania |
Description: | A sampler of the Chinese Treebank comprising 105 corpus graphs. Distributed with kind permission of the LDC. |
Features (T): | word, pos |
Features (NT): | cat |
Labelled edges: | yes |
Crossing edges: | no |
Secondary edges: | yes |
Number of corpus graphs: | 105 |
Number of tokens: | 3146 |
Average number of tokens per corpus graph: | 30.0 |
Number of inner nodes: | 3150 |
Number of edges: | 6191 |
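
The counts in this table can be cross-checked against each other. The sketch below is only an illustration, not part of the corpus tooling: the function names are hypothetical, and the edge check assumes that every corpus graph is a single rooted tree and that secondary edges are not included in the reported edge total. Under those assumptions a forest has one edge per node minus one per tree.

```python
# Figures copied from the table above.
GRAPHS = 105
TOKENS = 3146
INNER_NODES = 3150
EDGES = 6191


def average_tokens(tokens: int, graphs: int) -> float:
    """Mean number of tokens per corpus graph, rounded to one decimal."""
    return round(tokens / graphs, 1)


def tree_edge_count(tokens: int, inner_nodes: int, graphs: int) -> int:
    """Expected primary-edge count for a forest.

    Assumption (not stated in the table): each corpus graph is one rooted
    tree and secondary edges are excluded, so edges = nodes - trees.
    """
    return tokens + inner_nodes - graphs


if __name__ == "__main__":
    print(average_tokens(TOKENS, GRAPHS))                         # 30.0
    print(tree_edge_count(TOKENS, INNER_NODES, GRAPHS) == EDGES)  # True
```

Both checks agree with the reported values: 3146 / 105 rounds to 30.0, and 3146 + 3150 - 105 = 6191.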