All Downloads are FREE. Search and download functionalities are using the official Maven repository.

smile.stat.distribution.AbstractDistribution Maven / Gradle / Ivy

/*******************************************************************************
 * Copyright (c) 2010-2020 Haifeng Li. All rights reserved.
 *
 * Smile is free software: you can redistribute it and/or modify
 * it under the terms of the GNU Lesser General Public License as
 * published by the Free Software Foundation, either version 3 of
 * the License, or (at your option) any later version.
 *
 * Smile is distributed in the hope that it will be useful,
 * but WITHOUT ANY WARRANTY; without even the implied warranty of
 * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
 * GNU Lesser General Public License for more details.
 *
 * You should have received a copy of the GNU Lesser General Public License
 * along with Smile.  If not, see .
 ******************************************************************************/

package smile.stat.distribution;

import smile.math.MathEx;

/**
 * The base class of univariate distributions. Both rejection
 * and inverse transform sampling methods are implemented to provide some
 * general approaches to generate random samples based on probability density
 * function or quantile function. Besides, a quantile function is also
 * provided based on bisection searching. Likelihood and log likelihood
 * functions are also implemented here.
 *
 * @author Haifeng Li
 */
public abstract class AbstractDistribution implements Distribution {
    /**
     * Use the rejection technique to draw a sample from the given
     * distribution. WARNING: this simulation technique can
     * take a very long time. Rejection sampling is also commonly
     * called the acceptance-rejection method or "accept-reject algorithm".
     * It generates sampling values from an arbitrary probability distribution
     * function f(x) by using an instrumental distribution g(x), under the
     * only restriction that f(x) < M g(x) where
     * M > 1 is an appropriate bound on
     * f(x) / g(x).
     * 

* Rejection sampling is usually used in cases where the form of * f(x) makes sampling difficult. Instead of sampling * directly from the distribution f(x), we use an envelope * distribution M g(x) where sampling is easier. These * samples from M g(x) are probabilistically * accepted or rejected. *

* This method relates to the general field of Monte Carlo techniques, * including Markov chain Monte Carlo algorithms that also use a proxy * distribution to achieve simulation from the target distribution * f(x). It forms the basis for algorithms such as * the Metropolis algorithm. */ protected double rejection(double pmax, double xmin, double xmax) { double x; double y; do { x = xmin + MathEx.random() * (xmax - xmin); y = MathEx.random() * pmax; } while (p(x) < y); return x; } /** * Use inverse transform sampling (also known as the inverse probability * integral transform or inverse transformation method or Smirnov transform) * to draw a sample from the given distribution. This is a method for * generating sample numbers at random from any probability distribution * given its cumulative distribution function (cdf). Subject to the * restriction that the distribution is continuous, this method is * generally applicable (and can be computationally efficient if the * cdf can be analytically inverted), but may be too computationally * expensive in practice for some probability distributions. The * Box-Muller transform is an example of an algorithm which is * less general but more computationally efficient. It is often the * case that, even for simple distributions, the inverse transform * sampling method can be improved on, given substantial research * effort, e.g. the ziggurat algorithm and rejection sampling. */ protected double inverseTransformSampling() { double u = MathEx.random(); return quantile(u); } /** * Inversion of CDF by bisection numeric root finding of "cdf(x) = p" * for continuous distribution. */ protected double quantile(double p, double xmin, double xmax, double eps) { if (eps <= 0.0) { throw new IllegalArgumentException("Invalid epsilon: " + eps); } while (Math.abs(xmax - xmin) > eps) { double xmed = (xmax + xmin) / 2; if (cdf(xmed) > p) { xmax = xmed; } else { xmin = xmed; } } return xmin; } /** * Inversion of CDF by bisection numeric root finding of "cdf(x) = p" * for continuous distribution. The default epsilon is 1E-6. */ protected double quantile(double p, double xmin, double xmax) { return quantile(p, xmin, xmax, 1.0E-6); } }





© 2015 - 2025 Weber Informatics LLC | Privacy Policy