Package featuregenerator :: Package parser :: Package berkeley :: Module berkeleyclient :: Class BerkeleyFeatureGenerator
[hide private]
[frames] | no frames]

Class BerkeleyFeatureGenerator

source code

                                   object --+        
                                            |        
            featuregenerator.FeatureGenerator --+    
                                                |    
languagefeaturegenerator.LanguageFeatureGenerator --+
                                                    |
                                                   BerkeleyFeatureGenerator
Known Subclasses:

Instance Methods [hide private]
 
__init__(self, *args)
In order to initialize a language-specific feature generator, the language needs to be instantiatied as a class variable
source code
 
parse(self, string) source code
list(sentence.parallelsentence.ParallelSentence)
add_features_batch(self, parallelsentences)
Abstract method to be overriden by the particular subclassed feature generator.
source code
 
get_features_simplesentence(self, simplesentence, parallelsentence)
Abstract method to be overriden by the particular subclassed feature generator.
source code
 
get_features_string(self, sent_string) source code
 
prepare_sentence(self, simplesentence) source code

Inherited from languagefeaturegenerator.LanguageFeatureGenerator: add_features_batch_xml, add_features_dataset, get_features_src, get_features_tgt

Inherited from featuregenerator.FeatureGenerator: add_features_parallelsentence, add_features_simplesentence, add_features_src, add_features_tgt, get_annotation_name, get_features_parallelsentence, process_dataset

Inherited from object: __delattr__, __format__, __getattribute__, __hash__, __new__, __reduce__, __reduce_ex__, __repr__, __setattr__, __sizeof__, __str__, __subclasshook__

Instance Variables [hide private]

Inherited from languagefeaturegenerator.LanguageFeatureGenerator: lang

Properties [hide private]

Inherited from object: __class__

Method Details [hide private]

__init__(self, *args)
(Constructor)

source code 

In order to initialize a language-specific feature generator, the language needs to be instantiatied as a class variable

Parameters:
  • lang - the language code of the language that the feature generator is capable of
Overrides: object.__init__
(inherited documentation)

add_features_batch(self, parallelsentences)

source code 

Abstract method to be overriden by the particular subclassed feature generator. It allows the generation of features over many parallelsentences. It is a flexible solution when feature generation doesn't take place item to item (see SAX parsing) but a whole list of parallel sentences needs to be implemented at once. In this case, feature generator may optimize better when the whole dataset is given.

Parameters:
  • parallelsentences - The parallel sentences to be be augmented
Returns: list(sentence.parallelsentence.ParallelSentence)
The given list of ParallelSentence which are now augmented with features generated from the current featuregenerator
Overrides: featuregenerator.FeatureGenerator.add_features_batch
(inherited documentation)

get_features_simplesentence(self, simplesentence, parallelsentence)

source code 

Abstract method to be overriden by the particular subclassed feature generator. It receives a simple sentence of any type and returns a list of features. It should be overriden by a feature generator that doesn't differentiate between source and target features

Overrides: featuregenerator.FeatureGenerator.get_features_simplesentence
(inherited documentation)

get_features_string(self, sent_string)

source code 
Overrides: featuregenerator.FeatureGenerator.get_features_string