Boundaries
- Breaks Demonstrates different boundaries within text.
- Enter the sample text.
- Pick the kind of boundaries, or hit Test.
- Regex Shows transformation of (Java) Regex pattern to support Unicode.
- Enter the regex pattern
- Change the sample text if desired.
- Click Show Modified Regex Pattern
You'll then see the modified pattern.
It will often be much larger, but any reasonable Regex engine will compile character classes reasonably.
Below that, you'll see a sample of how the expression works, using it to find substrings of the sample text and underline them.
|
- Unicode Property Demo window
- Enter a character code in the right side, and hit Show. You'll see the properties
for that character (where they have non-default values).
- If you click on any property (like
Age), you'll see
a list of all the properties and their values in the Unicode Property List window
- If you click on any property value in either of these two windows, like
4.0.0.0 for Age, you'll see the characters with that property in the UnicodeSets
Demo
window
- UnicodeSet Demo window
- You can put in arbitrary UnicodeSets, allowing boolean combinations
of any of the property+value combinations in the Unicode Property List window
- If you click on Compare at the top, you can compare any two UnicodeSets.
|
Transforms
- Transform Demonstrates different boundaries within text.
- Enter the Transform Rules
- Enter Sample Text
- Hit Show Transform
- Examples:
The rules can either be IDs (simple or compound)
or general rules. To see a list of all the IDs, see
ID List.
The sample can either be a piece of text or a UnicodeSet. In the latter case, only characters that are affected by the
transform are shown. They are listed alphabetically by the result of the transform, with multiple entries shown in a UnicodeSet.
|