All Downloads are FREE. Search and download functionalities are using the official Maven repository.

com.cloudhopper.commons.charset.Charset Maven / Gradle / Ivy

Go to download

Utility classes for converting between charsets (mostly "mobile" in nature) such as Unicode to GSM and vice versa

There is a newer version: 3.0.2
Show newest version
package com.cloudhopper.commons.charset;

/*
 * #%L
 * ch-commons-charset
 * %%
 * Copyright (C) 2012 Cloudhopper by Twitter
 * %%
 * Licensed under the Apache License, Version 2.0 (the "License");
 * you may not use this file except in compliance with the License.
 * You may obtain a copy of the License at
 * 
 *      http://www.apache.org/licenses/LICENSE-2.0
 * 
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 * #L%
 */

/**
 * Interface for any charset.
 * 
 * @author joelauer (twitter: @jjlauer or http://twitter.com/jjlauer)
 */
public interface Charset {

    public int estimateEncodeByteLength(CharSequence str0);

    /**
     * Encode the Java string into a byte array.
     * @param str0 The Java string to convert into a byte array
     * @return A new byte array
     */
    public byte[] encode(CharSequence str0);

    public int estimateDecodeCharLength(byte[] bytes);

    /**
     * Decode the byte array to a Java string that is appended to the buffer.
     * Implementations of this method will not change any of the byte values
     * contained in the byte array.
     * @param bytes The array of bytes to decode
     * @param buffer The String buffer to append chars to
     */
    public void decode(final byte[] bytes, StringBuilder buffer);

    /**
     * Decode the byte array and return a new Java string.
     * Implementations of this method will not change any of the byte values
     * contained in the byte array.
     * @param bytes The array of bytes to decode
     * @return A new String with characters decoded from the byte array in
     *      the given charset.
     */
    public String decode(final byte[] bytes);
    
    /**
     * Normalize the characters of the source string to characters that can be
     * represented by this charset. Any characters in the input String that
     * cannot be represented by this charset are replaced with a '?' (question
     * mark character).

* The default implementation of this method is partially inefficient by * first encoding the input String to a byte array representing this charset * followed by decoding the byte array back into a Java String. During this * double conversion, any characters in the original Java String that don't * exist in this charset are replaced with '?' (question mark characters) * and then decoded back into a new Java String.

* Some charsets may choose to override this default behavior to achieve a * more efficient implementation. * * @param str0 The source string to normalize * @return The normalized string */ public String normalize(CharSequence str0); }




© 2015 - 2024 Weber Informatics LLC | Privacy Policy