A wrapper over a list of ParallelSentence objects. It offers convenience
functions for features and properties that apply to the entire set of
parallel sentences.

__eq__(self, other)
@todo: comparison doesn't really work

__init__(self, content=[], attributes_list=[], annotations=[])
x.__init__(...) initializes x; see help(type(x)) for signature

__iter__(self)
A DataSet iterates over its basic wrapped objects, the ParallelSentence
instances it contains.

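The iteration behaviour described for __iter__ can be sketched as follows. Both classes are hypothetical stand-ins written for this illustration, not the library's own code, and the attribute name parallelsentences is an assumption.

```python
class ParallelSentence:
    """Hypothetical stand-in that only carries an attribute dictionary."""

    def __init__(self, attributes=None):
        self.attributes = dict(attributes or {})


class DataSet:
    """Hypothetical stand-in wrapper over a list of ParallelSentence objects."""

    def __init__(self, content=None):
        # Copy the input so the wrapper owns its own list.
        self.parallelsentences = list(content or [])

    def __iter__(self):
        # Delegate iteration to the wrapped list.
        return iter(self.parallelsentences)


dataset = DataSet([ParallelSentence({"id": "1"}), ParallelSentence({"id": "2"})])
ids = [ps.attributes["id"] for ps in dataset]
```

With this delegation, a plain for loop over the wrapper visits each wrapped sentence in list order.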
add_attribute_vector(self, att_vector, target='tgt', item=0)

compare(self, other_dataset, start=0, to=None)
Compares this dataset to another by displaying their parallel sentences
in pairs.

confirm_attributes(self, desired_attributes=[], meta_attributes=[])
Convenience function that checks whether the user-requested
attributes (possibly via the config file) exist in the current
dataset's list.

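A minimal sketch of the kind of check confirm_attributes describes. The function name find_missing_attributes and its return value (the list of requested names that are absent) are assumptions made for illustration; this entry does not state what the real method returns or raises.

```python
def find_missing_attributes(available, desired_attributes=(), meta_attributes=()):
    """Return the requested attribute names that are not in `available`.

    Hypothetical helper: how the real method reports a missing name
    is not documented here.
    """
    known = set(available)
    requested = list(desired_attributes) + list(meta_attributes)
    return [name for name in requested if name not in known]


missing = find_missing_attributes(
    ["id", "langsrc", "score"],
    desired_attributes=["score", "bleu"],
    meta_attributes=["id"],
)
```

An empty result would mean that every requested attribute exists in the dataset's list.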
ensure_judgment_ids(self)
Processes the contained parallel sentences one by one and ensures that
each has a judgment id; where one is missing, an incremental value is
added.

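The behaviour described for ensure_judgment_ids might look roughly like this. Sentences are modelled as plain attribute dictionaries, and both the key name "judgment_id" and the use of the sentence position as the incremental value are assumptions.

```python
def fill_judgment_ids(sentences, key="judgment_id"):
    """Add a position-based id to every sentence that lacks one.

    Hypothetical helper: the key name and the choice of counter are
    assumptions, not taken from the library.
    """
    for position, attributes in enumerate(sentences):
        if key not in attributes:
            # Attribute values are kept as strings in this sketch.
            attributes[key] = str(position)
    return sentences


sentences = [{"id": "a"}, {"id": "b", "judgment_id": "7"}, {"id": "c"}]
fill_judgment_ids(sentences)
judgment_ids = [s["judgment_id"] for s in sentences]
```

Existing ids are left untouched, so a position-based counter can collide with an id that is already present; a real implementation may need to guard against that.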
get_discrete_attribute_values(self, discrete_attribute_names)

dict(String, list(sentence.parallelsentence.ParallelSentence))
dict

import_target_attributes_onsystem(self, dataset, target_attribute_names, keep_attributes_general=[], keep_attributes_source=[], keep_attributes_target=[])

merge_dataset(self, dataset_for_merging_with, attribute_replacements={}, merging_attributes=['id'], merge_strict=False, **kwargs)
Takes a dataset that contains the same parallel sentences, but with
different attributes.

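One way to picture a merge keyed on merging_attributes is sketched below. Sentences are modelled as plain attribute dictionaries, the function name merge_by_key is hypothetical, and treating attribute_replacements as a renaming map for incoming attribute names is an assumption. The merge_strict flag is left out because its semantics are not described here.

```python
def merge_by_key(current, incoming, merging_attributes=("id",),
                 attribute_replacements=None):
    """Copy attributes from `incoming` into the matching `current` entries.

    Hypothetical helper: entries match when they agree on all
    `merging_attributes`.
    """
    replacements = attribute_replacements or {}

    def key(attributes):
        return tuple(attributes.get(name) for name in merging_attributes)

    # Index the incoming sentences by their key for constant-time lookup.
    index = {key(attributes): attributes for attributes in incoming}
    for attributes in current:
        match = index.get(key(attributes))
        if match is None:
            continue  # No counterpart: leave this sentence unchanged.
        for name, value in match.items():
            if name not in merging_attributes:
                attributes[replacements.get(name, name)] = value
    return current


current = [{"id": "1", "bleu": "0.3"}, {"id": "2", "bleu": "0.5"}]
incoming = [{"id": "2", "score": "4"}, {"id": "1", "score": "2"}]
merge_by_key(current, incoming, attribute_replacements={"score": "human_score"})
```

Because entries are matched by key rather than by position, the two datasets do not need to list their sentences in the same order.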
merge_dataset_symmetrical(self, dataset_for_merging_with, attribute_replacements={}, confirm_attribute='')
Merge the current dataset in place with another symmetrical dataset
of the same size and the same original content, but possibly with
different attributes per parallel sentence.

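A symmetrical merge pairs the sentences by position instead of by key. The sketch below models sentences as plain attribute dictionaries; the function name merge_symmetrical is hypothetical, and the reading of confirm_attribute (when non-empty, the named attribute must agree in both sentences) is an assumption.

```python
def merge_symmetrical(current, incoming, attribute_replacements=None,
                      confirm_attribute=""):
    """Merge `incoming` into `current` in place, pairing by position.

    Hypothetical helper, not the library's implementation.
    """
    if len(current) != len(incoming):
        raise ValueError("datasets differ in size")
    replacements = attribute_replacements or {}
    for mine, theirs in zip(current, incoming):
        # Assumed meaning: sanity-check that both rows describe the same item.
        if confirm_attribute and mine.get(confirm_attribute) != theirs.get(confirm_attribute):
            raise ValueError("mismatch on %r" % confirm_attribute)
        for name, value in theirs.items():
            mine[replacements.get(name, name)] = value
    return current


first = [{"id": "1", "bleu": "0.3"}, {"id": "2", "bleu": "0.5"}]
second = [{"id": "1", "ter": "0.6"}, {"id": "2", "ter": "0.4"}]
merge_symmetrical(first, second, confirm_attribute="id")
```

The size check fails early when the two datasets cannot be symmetrical.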
merge_references_symmetrical(self, dataset_for_merging_with)

modify_singlesource_strings(self, strings=[])

remove_ties(self)
Modifies the current dataset by removing ranking ties.

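This entry does not define what counts as a tie. The sketch below illustrates one possible reading: each item is a pairwise judgment over two translations, and a tie is a judgment in which both received the same rank. The dictionary layout, the key names and the function name drop_tied_judgments are all assumptions.

```python
def drop_tied_judgments(judgments):
    """Remove, in place, every pairwise judgment whose two ranks are equal.

    Hypothetical helper illustrating one interpretation of a "tie".
    """
    # Slice assignment keeps the same list object, mirroring an in-place change.
    judgments[:] = [j for j in judgments if j["rank_a"] != j["rank_b"]]
    return judgments


judgments = [
    {"system_a": "sys1", "system_b": "sys2", "rank_a": 1, "rank_b": 2},
    {"system_a": "sys1", "system_b": "sys3", "rank_a": 2, "rank_b": 2},  # tie
    {"system_a": "sys2", "system_b": "sys3", "rank_a": 3, "rank_b": 1},
]
drop_tied_judgments(judgments)
remaining_pairs = [(j["system_a"], j["system_b"]) for j in judgments]
```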
select_attribute_names(self, expressions=[])

write_singlesource_strings_file(self, filename=None)

Inherited from object: __delattr__, __format__, __getattribute__, __hash__, __new__, __reduce__, __reduce_ex__, __repr__, __setattr__, __sizeof__, __str__, __subclasshook__