Commit Graph

3 Commits

Author SHA1 Message Date
dff0f68b39 feat(strings): add complete character mappings from golden test data
Adds missing character mappings to ensure behavioral equivalence with
original Utf8ToAsciiConverter implementation. Creates extended-mappings.json
with 1,213 additional characters covering punctuation, symbols, extended
Latin, Greek, and other Unicode blocks.

Also fixes 8 Cyrillic character mappings to match original behavior.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-12-13 00:31:26 +00:00
e7ac544a2f fix(strings): correct Cyrillic hard/soft sign mappings to match original behavior
The Cyrillic hard and soft signs (Ъ, ъ, Ь, ь) were incorrectly mapped to empty strings in cyrillic.json.
This fix restores the correct mappings from the original Utf8ToAsciiConverter implementation:
- Ъ (hard sign uppercase) → " (double quote)
- ъ (hard sign lowercase) → " (double quote)
- Ь (soft sign uppercase) → ' (single quote)
- ь (soft sign lowercase) → ' (single quote)

These mappings now match the golden-mappings.json reference file extracted from the original implementation.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-12-12 23:42:58 +00:00
486aa6be81 feat(strings): add character mapping JSON files and golden test data
- Extract 1,308 character mappings from original Utf8ToAsciiConverter.cs switch statement
- Create golden-mappings.json test data file with complete mappings for regression testing
- Create ligatures.json (14 mappings: Æ, Œ, IJ, ß, ff, fi, fl, ffi, ffl, st ligatures)
- Create special-latin.json (14 mappings: Ð, Đ, Ħ, Ł, Ŀ, Ø, Þ, Ŧ and lowercase variants)
- Create cyrillic.json (66 mappings: Russian Cyrillic alphabet transliteration)
- Update Umbraco.Core.csproj to embed JSON files as resources
- Verified embedded resources in compiled DLL

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-12-12 23:38:33 +00:00