Visible characters28

Best match for what users see as one character, including combined emoji and accents.

Unicode code points32

Counts Unicode scalar values. Emoji modifiers and combining marks may count separately.

UTF-16 units34

JavaScript string length. Some emoji and rare characters count as two units.

Locale-aware words7

Uses locale-aware word segmentation when supported instead of only splitting by spaces.

UTF-8 bytes82

Counts how many bytes the text needs when encoded as UTF-8.

Lines2

Counts text lines separated by line breaks.

Locale-aware counting

Word segmentation uses the current locale (en) when Intl.Segmenter is available, which is important for languages that do not always separate words with spaces.