neuralmonkey.processors.speech module¶
-
neuralmonkey.processors.speech.
SpeechFeaturesPreprocessor
(feature_type: str = 'mfcc', delta_order: int = 0, delta_window: int = 2, **kwargs) → Callable¶ Calculate speech features.
First, the given type of features (e.g. MFCC) is computed using a window of length winlen and step winstep; for additional keyword arguments (specific to each feature type), see http://python-speech-features.readthedocs.io/. Then, delta features up to delta_order are added.
By default, 13 MFCCs per frame are computed. To add delta and delta-delta features (resulting in 39 coefficients per frame), set delta_order=2.
Parameters: - feature_type – mfcc, fbank, logfbank or ssc (default is mfcc)
- delta_order – maximum order of the delta features (default is 0)
- delta_window – window size for delta features (default is 2)
- **kwargs – keyword arguments for the appropriate function from python_speech_features
Returns: A numpy array of shape [num_frames, num_features].