Universal Dadegan Reader
This module includes classes and functions for reading the PerUDT corpus.
PerUDT contains a significant number of labeled sentences with syntactic and morphological information.
UniversalDadeganReader
¶
Bases: DadeganReader
This class includes functions for reading the PerUDT corpus.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
conllu_file
|
str
|
Path to the CoNLL-U corpus file. |
required |
__init__(conllu_file)
¶
Initializes the UniversalDadeganReader.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
conllu_file
|
str
|
Path to the CoNLL-U corpus file. |
required |
conllu2conll(conllu_path)
¶
Converts a CoNLL-U file to the old CoNLL format.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
conllu_path
|
str
|
Path to the CoNLL-U file. |
required |
Returns:
| Type | Description |
|---|---|
str
|
The content of the file converted to CoNLL format as a string. |