All Downloads are FREE. Search and download functionalities are using the official Maven repository.

edu.stanford.nlp.trees.treebank.Mapper Maven / Gradle / Ivy

Go to download

Stanford Parser processes raw text in English, Chinese, German, Arabic, and French, and extracts constituency parse trees.

There is a newer version: 3.9.2
Show newest version
package edu.stanford.nlp.trees.treebank;

import java.io.File;

/**
 * Generic interface for mapping one string to another given some contextual evidence.
 *
 * @author Spence Green
 *
 */
public interface Mapper {

  /**
   * Perform initialization prior to the first call to map.
   *
   * @param path A filename for data on disk used during mapping
   * @param options Variable length array of strings for options. Option format may
   * vary for the particular class instance.
   */
  public void setup(File path, String... options);

  /**
   * Maps from one string representation to another.
   *
   * @param parent element's context (e.g., the parent node in a parse tree)
   * @param element The string to be transformed.
   * @return The transformed string
   */
  public String map(String parent, String element);

  /**
   * Indicates whether child can be converted to another encoding. In the ATB, for example,
   * if a punctuation character is labeled with the "PUNC" POS tag, then that character should not
   * be converted from Buckwalter to UTF-8.
   *
   * @param parent element's context (e.g., the parent node in a parse tree)
   * @param child The string to be transformed.
   * @return True if the string encoding can be changed. False otherwise.
   */
  public boolean canChangeEncoding(String parent, String child);
}




© 2015 - 2024 Weber Informatics LLC | Privacy Policy