Multibyte Text Normalizer

This transformer converts all characters in multibyte strings to the same case. Multibyte strings contain characters such as accented letters (é, è, à), emojis (😀, 😉), or characters from non-Roman scripts such as Chinese and Cyrillic.
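
As a rough illustration of the kind of conversion described above, the sketch below changes the case of string values using PHP's `mb_strtolower` and `mb_strtoupper` functions, which require the mbstring extension. The `normalizeCase` helper and the sample values are hypothetical and are not part of the library. This is not necessarily how the transformer is implemented internally.

```php
// Hypothetical helper (not part of Rubix ML): converts every string value
// in a sample to one case and leaves non-string values untouched.
function normalizeCase(array $sample, bool $uppercase = false): array
{
    return array_map(function ($value) use ($uppercase) {
        if (!is_string($value)) {
            return $value;
        }

        // The mb_* functions operate on whole characters rather than
        // single bytes, so accented and Cyrillic letters are handled.
        return $uppercase
            ? mb_strtoupper($value, 'UTF-8')
            : mb_strtolower($value, 'UTF-8');
    }, $sample);
}

// Made-up sample mixing accented Latin text, Cyrillic text, and a number.
$sample = ['ÉCOLE Française', 'МОСКВА', 42];

$lower = normalizeCase($sample);       // lowercase, like uppercase = false
$upper = normalizeCase($sample, true); // uppercase, like uppercase = true
```

Here the `$uppercase` argument plays the same role as the transformer's `uppercase` parameter.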


⚠️ We recommend you install the mbstring extension for best performance.

Interfaces: Transformer

Data Type Compatibility: Categorical


| # | Name | Default | Type | Description |
|---|---|---|---|---|
| 1 | uppercase | false | bool | Should the text be converted to uppercase? |


use Rubix\ML\Transformers\MultibyteTextNormalizer;

$transformer = new MultibyteTextNormalizer(false);

Additional Methods

This transformer does not have any additional methods.