All Downloads are FREE. Search and download functionalities are using the official Maven repository.

au.id.jericho.lib.html.OutputSegment Maven / Gradle / Ivy

Go to download

Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.

There is a newer version: 3.4
Show newest version
// Jericho HTML Parser - Java based library for analysing and manipulating HTML
// Version 2.4
// Copyright (C) 2007 Martin Jericho
// http://jerichohtml.sourceforge.net/
//
// This library is free software; you can redistribute it and/or
// modify it under the terms of either one of the following licences:
//
// 1. The Eclipse Public License (EPL) version 1.0,
// included in this distribution in the file licence-epl-1.0.html
// or available at http://www.eclipse.org/legal/epl-v10.html
//
// 2. The GNU Lesser General Public License (LGPL) version 2.1 or later,
// included in this distribution in the file licence-lgpl-2.1.txt
// or available at http://www.gnu.org/licenses/lgpl.txt
//
// This library is distributed on an "AS IS" basis,
// WITHOUT WARRANTY OF ANY KIND, either express or implied.
// See the individual licence texts for more details.

package au.id.jericho.lib.html;

import java.io.*;
import java.util.*;

/**
 * Defines the interface for an output segment, which is used in an {@link OutputDocument} to
 * replace segments of the source document with other text.
 * 

* All text in the OutputDocument between the character positions defined by {@link #getBegin()} and {@link #getEnd()} * is replaced by the content of this output segment. * If the begin and end character positions are the same, the content is simply * inserted at this position without replacing any text. * * @see OutputDocument#register(OutputSegment) */ public interface OutputSegment extends CharStreamSource { /** * The comparator used to sort output segments in the {@link OutputDocument} before output. *

* The following rules are applied in order compare two output segments: *

    *
  1. The output segment that {@linkplain #getBegin() begins} earlier in the document comes first. *
  2. If both output segments begin at the same position, the one that has zero-length comes first. *
  3. If both output segments are zero-length, neither is guaranteed to come before the other. *
  4. If neither segment is zero-length, the result is undefined as the segments are overlapping. * Note that this condition is detected at a later stage, so this comparator returns normally without throwing a * {@link OverlappingOutputSegmentsException}. *
*

* Note: this comparator has a natural ordering that may be inconsistent with the equals * method of classes implementing this interface. * This means that the comparator may treat two output segments as equal where calling the * equals(Object) method with the same two output segments returns false. */ public static final Comparator COMPARATOR=new OutputSegmentComparator(); /** * Returns the character position in the {@linkplain OutputDocument#getSourceText() source text of the output document} where this segment begins. * @return the character position in the {@linkplain OutputDocument#getSourceText() source text of the output document} where this segment begins. */ public int getBegin(); /** * Returns the character position in the {@linkplain OutputDocument#getSourceText() source text of the output document} where this segment ends. * @return the character position in the {@linkplain OutputDocument#getSourceText() source text of the output document} where this segment ends. */ public int getEnd(); /** * Writes the content of this output segment to the specified Writer. * @param writer the destination java.io.Writer for the output. * @throws IOException if an I/O exception occurs. */ public void writeTo(Writer writer) throws IOException; /** * Returns the content of this output segment as a String. *

* Note that before version 2.0 this returned a representation of this object useful for debugging purposes, * which can now be obtained via the {@link #getDebugInfo() getDebugInfo()} method. * * @return the content of this output segment as a String, guaranteed not null. * @see #writeTo(Writer) */ public String toString(); /** * Returns a string representation of this object useful for debugging purposes. * @return a string representation of this object useful for debugging purposes. */ public String getDebugInfo(); }





© 2015 - 2024 Weber Informatics LLC | Privacy Policy