yaset.tools¶
Submodules¶
yaset.tools.conll¶
-
yaset.tools.conll.
check_bioul_labels
(input_file: str = None)¶
-
yaset.tools.conll.
check_labels
(input_file: str = None, label_type: str = None)¶
-
yaset.tools.conll.
convert_labels
(input_file: str = None, output_file: str = None, input_label_type: str = None, output_label_type: str = None)¶ Convert NER tagging schemes
- Args:
- input_file (str): input CoNLL filepath output_file (str): output CoNLL filepath input_label_type (str): source NER tagging scheme output_label_type (str): target NER tagging scheme
- Returns:
- None
-
yaset.tools.conll.
convert_sequence
(input_sequence: list = None, input_label_type: str = None, output_label_type: str = None)¶
-
yaset.tools.conll.
convert_spaces_to_tabulations
(input_file: str = None, output_file: str = None) → None¶ Convert a CoNLL file with spaces as column separators into a CoNLL file with tabulations as column separators
- Args:
- input_file (str): input CoNLL filepath output_file (str): output CoNLL filepath
- Returns:
- None
-
yaset.tools.conll.
extract_entities_iob1
(input_labels: list = None)¶ Extract entity offsets for a CoNLL file encoded in conll 2003
- Args:
- input_labels (list): source labels
- Returns:
- list: entity offsets
-
yaset.tools.conll.
extract_sent_entities
(sentence_buffer: list = None)¶
-
yaset.tools.conll.
extract_tag_cat
(label)¶ Separate tag from category
- Args:
- label (str): NER label to split
- Returns:
- (str, str): tag, category
-
yaset.tools.conll.
load_sentences
(input_file: str = None, debug: bool = False)¶
-
yaset.tools.conll.
split_tag
(tag: str = None)¶