All Downloads are FREE. Search and download functionalities are using the official Maven repository.

org.apache.hadoop.fs.statistics.package-info Maven / Gradle / Ivy

The newest version!
/*
 * Licensed to the Apache Software Foundation (ASF) under one
 * or more contributor license agreements.  See the NOTICE file
 * distributed with this work for additional information
 * regarding copyright ownership.  The ASF licenses this file
 * to you under the Apache License, Version 2.0 (the
 * "License"); you may not use this file except in compliance
 * with the License.  You may obtain a copy of the License at
 *
 *     http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */

/**
 * This package contains support for statistic collection and reporting.
 * This is the public API; implementation classes are to be kept elsewhere.
 * 

* This package defines two interfaces: *

* {@link org.apache.hadoop.fs.statistics.IOStatisticsSource}: * a source of statistic data, which can be retrieved * through a call to * {@link org.apache.hadoop.fs.statistics.IOStatisticsSource#getIOStatistics()} . *

* {@link org.apache.hadoop.fs.statistics.IOStatistics} the statistics retrieved * from a statistics source. *

* The retrieved statistics may be an immutable snapshot -in which case to get * updated statistics another call to * {@link org.apache.hadoop.fs.statistics.IOStatisticsSource#getIOStatistics()} * must be made. Or they may be dynamic -in which case every time a specific * statistic is retrieved, the latest version is returned. Callers should assume * that if a statistics instance is dynamic, there is no atomicity when querying * multiple statistics. If the statistics source was a closeable object (e.g. a * stream), the statistics MUST remain valid after the stream is closed. *

* Use pattern: *

* An application probes an object (filesystem, stream etc) to see if it * implements {@code IOStatisticsSource}, and, if it is, * calls {@code getIOStatistics()} to get its statistics. * If this is non-null, the client has statistics on the current * state of the statistics. *

* The expectation is that a statistics source is dynamic: when a value is * looked up the most recent values are returned. * When iterating through the set, the values of the iterator SHOULD * be frozen at the time the iterator was requested. *

* These statistics can be used to: log operations, profile applications, * and make assertions about the state of the output. *

* The names of statistics are a matter of choice of the specific source. * However, {@link org.apache.hadoop.fs.statistics.StoreStatisticNames} * contains a * set of names recommended for object store operations. * {@link org.apache.hadoop.fs.statistics.StreamStatisticNames} declares * recommended names for statistics provided for * input and output streams. *

* Utility classes: *

    *
  • * {@link org.apache.hadoop.fs.statistics.IOStatisticsSupport}. * General support, including the ability to take a serializable * snapshot of the current state of an IOStatistics instance. *
  • *
  • * {@link org.apache.hadoop.fs.statistics.IOStatisticsLogging}. * Methods for robust/on-demand string conversion, designed * for use in logging statements and {@code toString()} implementations. *
  • *
  • * {@link org.apache.hadoop.fs.statistics.IOStatisticsSnapshot}. * A static snaphot of statistics which can be marshalled via * java serialization or as JSON via jackson. It supports * aggregation, so can be used to generate aggregate statistics. *
  • *
* *

* Implementors notes: *

    *
  1. * IOStatistics keys SHOULD be standard names where possible. *
  2. *
  3. * An IOStatistics instance MUST be unique to that specific instance of * {@link org.apache.hadoop.fs.statistics.IOStatisticsSource}. * (i.e. not shared the way StorageStatistics are) *
  4. *
  5. * MUST return the same values irrespective of which thread the statistics are * retrieved or its keys evaluated. *
  6. *
  7. * MUST NOT remove keys once a statistic instance has been created. *
  8. *
  9. * MUST NOT add keys once a statistic instance has been created. *
  10. *
  11. * MUST NOT block for long periods of time while blocking operations * (reads, writes) are taking place in the source. * That is: minimal synchronization points (AtomicLongs etc.) may be * used to share values, but retrieval of statistics should * be fast and return values even while slow/blocking remote IO is underway. *
  12. *
  13. * MUST support value enumeration and retrieval after the source has been * closed. *
  14. *
  15. * SHOULD NOT have back-references to potentially expensive objects * (filesystem instances etc.) *
  16. *
  17. * SHOULD provide statistics which can be added to generate aggregate * statistics. *
  18. *
*/ @InterfaceAudience.Public @InterfaceStability.Unstable package org.apache.hadoop.fs.statistics; import org.apache.hadoop.classification.InterfaceAudience; import org.apache.hadoop.classification.InterfaceStability;




© 2015 - 2025 Weber Informatics LLC | Privacy Policy