Universal Dadegan Reader

This module includes classes and functions for reading the PerUDT corpus.

PerUDT (the Persian Universal Dependency Treebank) contains a large number of sentences annotated with syntactic and morphological information.

UniversalDadeganReader

Bases: DadeganReader

This class provides methods for reading the PerUDT corpus from a CoNLL-U file.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| conllu_file | str | Path to the CoNLL-U corpus file. | required |

__init__(conllu_file)

Initializes the UniversalDadeganReader.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| conllu_file | str | Path to the CoNLL-U corpus file. | required |
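
To make the input format concrete, here is a minimal sketch of how a CoNLL-U file can be split into sentences and tokens. It is not the implementation of UniversalDadeganReader: the function `read_conllu_sentences`, the `FIELDS` tuple, and the two-token sample are hypothetical names and data introduced only for illustration. The ten column names follow the CoNLL-U specification.

```python
from typing import Dict, Iterable, Iterator, List

# The ten tab-separated columns defined by the CoNLL-U format.
FIELDS = ("id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc")

# Hypothetical sample sentence, not taken from PerUDT.
SAMPLE = (
    "# sent_id = 1\n"
    "# text = او آمد\n"
    "1\tاو\tاو\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tآمد\tآمد\tVERB\t_\t_\t0\troot\t_\t_\n"
    "\n"
)


def read_conllu_sentences(lines: Iterable[str]) -> Iterator[List[Dict[str, str]]]:
    """Yield each sentence as a list of token dicts (hypothetical helper)."""
    sentence: List[Dict[str, str]] = []
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip():
            # A blank line ends the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Comment lines carry sentence-level metadata; skipped here.
            continue
        sentence.append(dict(zip(FIELDS, line.split("\t"))))
    if sentence:
        # Handle a final sentence with no trailing blank line.
        yield sentence


sentences = list(read_conllu_sentences(SAMPLE.splitlines()))
for token in sentences[0]:
    print(token["id"], token["form"], token["upos"], token["head"], token["deprel"])
```

With the real reader, the equivalent starting point would be `UniversalDadeganReader(conllu_file=...)`, using the constructor parameter documented above. The methods it exposes for iterating over sentences are not listed in this section.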

conllu2conll(conllu_path)

Converts a CoNLL-U file to the old CoNLL format.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| conllu_path | str | Path to the CoNLL-U file. | required |

Returns:

| Type | Description |
| --- | --- |
| str | The content of the file converted to CoNLL format as a string. |
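
This section does not say which transformations the conversion applies, so the following is only a sketch of what such a conversion might involve. It assumes that the "old CoNLL format" means CoNLL-X. The function `conllu_to_conll_text` and the sample input are hypothetical, and the function works on text already in memory rather than on a path. The sketch drops the CoNLL-U features that CoNLL-X lacks: comment lines, multiword token ranges, empty nodes, and the contents of the last two columns.

```python
# Hypothetical sample: a multiword token (range 1-2) split into two words.
SAMPLE = (
    "# text = کتابم\n"
    "1-2\tکتابم\t_\t_\t_\t_\t_\t_\t_\t_\n"
    "1\tکتاب\tکتاب\tNOUN\t_\t_\t0\troot\t_\tSpaceAfter=No\n"
    "2\tم\tمن\tPRON\t_\t_\t1\tnmod\t_\t_\n"
    "\n"
)


def conllu_to_conll_text(conllu_text: str) -> str:
    """Convert CoNLL-U text to a CoNLL-X-like layout (illustrative only)."""
    out = []
    for line in conllu_text.splitlines():
        if line.startswith("#"):
            # Sentence-level comments are a CoNLL-U feature; drop them.
            continue
        if not line.strip():
            # Keep blank lines: they separate sentences in both formats.
            out.append("")
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():
            # Skip multiword token ranges ("1-2") and empty nodes ("1.1").
            continue
        # Columns 9 and 10 (DEPS, MISC) have a different meaning in
        # CoNLL-X (PHEAD, PDEPREL), so this sketch blanks them.
        cols[8] = "_"
        cols[9] = "_"
        out.append("\t".join(cols))
    return "\n".join(out) + "\n"


converted = conllu_to_conll_text(SAMPLE)
print(converted, end="")
```

Returning a string mirrors the documented return type above. A caller who needs a file can write the result out with an explicit UTF-8 encoding.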