Class WhitespaceTokenizer
Defined in File text.h
Inheritance Relationships
Base Type
public mindspore::dataset::TensorTransform
(Class TensorTransform)
Class Documentation
-
class WhitespaceTokenizer : public mindspore::dataset::TensorTransform
Tokenize a scalar tensor of UTF-8 string on ICU4C defined whitespaces.
Public Functions
-
explicit WhitespaceTokenizer(bool with_offsets = false)
Constructor.
- Parameters
with_offsets – [in] whether to output offsets of tokens (default=false).
-
~WhitespaceTokenizer() = default
Destructor.
-
explicit WhitespaceTokenizer(bool with_offsets = false)