
scripts.modify_improve.multiformat.CharacterRepertoireManipulator.taskScript
Character Repertoire Manipulator
Performs Unicode character-to-replacement-string substitution using one or more external tables, optionally combined with non-spacing mark removal.
Input file
Select input file
Output directory
Select output directory
Substitution table(s)
One or several substitution tables
Exclude
Name of a character set whose characters should not be substituted.
Fallback to non-spacing mark removal
Fall back to non-spacing mark removal (disaccentuation) if no replacement text was found in the substitution table(s)
Fallback to Latin
Fall back to a transliteration to Latin if no replacement text was found in the substitution table(s)
Fallback to UCD names
Fall back to the character names in the UCD (Unicode Character Database) table if no replacement text was found in the substitution table(s)
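The lookup-then-fallback behaviour described by these parameters can be sketched roughly as follows. This is an illustrative sketch in Python, not the transformer's actual implementation: the function names (`substitute`, `strip_nonspacing_marks`, `in_charset`), the order in which the fallbacks are tried, the bracketed format for UCD names, and the reading of "Exclude" as "leave any character encodable in that character set untouched" are all assumptions. The fallback to Latin transliteration is omitted because the Python standard library has no equivalent.

```python
import unicodedata


def strip_nonspacing_marks(ch: str) -> str:
    """Decompose a character (NFD) and drop its non-spacing marks (category Mn)."""
    decomposed = unicodedata.normalize("NFD", ch)
    return "".join(c for c in decomposed if unicodedata.category(c) != "Mn")


def in_charset(ch: str, charset: str) -> bool:
    """Return True if the character can be encoded in the given character set."""
    try:
        ch.encode(charset)
        return True
    except UnicodeEncodeError:
        return False


def substitute(text, tables, exclude_charset=None,
               fallback_marks=False, fallback_ucd=False):
    """Hypothetical substitution loop; the fallback order is an assumption."""
    out = []
    for ch in text:
        # Assumed meaning of "Exclude": characters of this set are never substituted.
        if exclude_charset and in_charset(ch, exclude_charset):
            out.append(ch)
            continue
        # Look the character up in each table in turn; the first hit wins.
        replacement = next((t[ch] for t in tables if ch in t), None)
        if replacement is None and fallback_marks:
            stripped = strip_nonspacing_marks(ch)
            if stripped != ch:
                replacement = stripped
        if replacement is None and fallback_ucd:
            name = unicodedata.name(ch, None)
            if name:
                replacement = f"[{name}]"  # bracket format is made up for this sketch
        out.append(ch if replacement is None else replacement)
    return "".join(out)
```

For example, `substitute("café €", [{"€": "EUR"}], exclude_charset="ascii", fallback_marks=True)` leaves the ASCII characters alone, strips the accent from "é", and replaces "€" from the table.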
&CharsetSwitcherScriptParamsStatic;
input: ${input}
output: ${output}/pipeline__temp/
excludeFromSubstitution: ${excludeCharset}
substitutionTables: ${substitutionTables}
fallbackToNonSpacingMarkRemovalTransliteration: ${nonSpacingMark}
fallbackToLatinTransliteration: ${latin}
fallbackToUCD: ${ucd}
performCharacterSubstitution: true
outputEncoding: utf-8

input: ${output}/pipeline__temp/$filename{input}
output: ${output}
encoding: ${charsetSwitcherEncoding}
breaks: ${charsetSwitcherLineBreaks}

delete: ${output}/pipeline__temp/
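
Read together, the parameter values above suggest three steps: the substitution result is written as UTF-8 to a `pipeline__temp` directory under the output directory, that temporary file is then re-encoded with the chosen encoding and line breaks into the output directory, and finally the temporary directory is deleted. A minimal Python sketch of that flow follows. The function name `run_steps`, the pluggable `substitute` callback, the assumption that the input is UTF-8, and the way line breaks are normalised are all illustrative choices, not the behaviour of the actual transformers.

```python
import shutil
from pathlib import Path


def run_steps(input_file: Path, output_dir: Path, encoding: str,
              line_break: str, substitute=lambda s: s) -> Path:
    """Hypothetical three-step flow: substitute, switch charset, delete temp."""
    temp_dir = output_dir / "pipeline__temp"
    temp_dir.mkdir(parents=True, exist_ok=True)
    temp_file = temp_dir / input_file.name

    # Step 1: character substitution, written to the temp directory as UTF-8.
    # Reading the input as UTF-8 is an assumption made for this sketch.
    text = input_file.read_bytes().decode("utf-8")
    temp_file.write_bytes(substitute(text).encode("utf-8"))

    # Step 2: re-encode the temp file with the requested encoding and line breaks.
    text = temp_file.read_bytes().decode("utf-8")
    text = text.replace("\r\n", "\n").replace("\r", "\n").replace("\n", line_break)
    final_file = output_dir / input_file.name
    final_file.write_bytes(text.encode(encoding))

    # Step 3: remove the temporary directory.
    shutil.rmtree(temp_dir)
    return final_file
```

Writing bytes rather than text keeps Python from translating line endings on its own, so the output contains exactly the requested line break.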