
com.wcohen.ss.package.html Maven / Gradle / Ivy
Show all versions of com.wcohen.secondstring Show documentation
This package contains a bunch of approximate string comparators, plus code for
performing controlled experiments with this.
A {@link com.wcohen.ss.api.StringDistance} is the basic class
for computing distances. The score() function of this class outputs a
distance measure between its two arguments. The other methods are
there for efficiency, so that preprocessing steps (like tokenization)
can be amortized over multiple comparisons with the same string.
The way that preprocessing steps are saved is by creating a {@link
com.wcohen.ss.api.StringWrapper} object which contains the
preprocessed string, plus whatever else needs to be cached. To do
this, extend default implementation of StringWrapper.
Almost everything in this package implements StringDistance. The only
(public) exceptions are StringWrapper; PrintfFormat, pilfered from Sun
to make the explanations easier; {@link
com.wcohen.ss.CharMatchScore}, which is a character-based
distance metric; and {@link com.wcohen.ss.MemoMatrix}, a
utility for defining edit-distance-based methods.