All Downloads are FREE. Search and download functionalities are using the official Maven repository.

ai.platon.pulsar.skeleton.crawl.PageEvents.kt Maven / Gradle / Ivy

The newest version!
package ai.platon.pulsar.skeleton.crawl

import ai.platon.pulsar.skeleton.crawl.event.*

/**
 * Event handlers during the crawl phase of the webpage lifecycle.
 * */
interface CrawlEventHandlers {

    /**
     * Fire when the url is about to be loaded.
     * */
    val onWillLoad: UrlAwareEventHandler

    /**
     * Fire to load the url.
     * */
    @Deprecated("No such event handler required", level = DeprecationLevel.WARNING)
    val onLoad: UrlAwareEventHandler

    /**
     * Fire when the url is loaded.
     * */
    val onLoaded: UrlAwareWebPageEventHandler

    /**
     * Chain the other crawl event handler to the tail of this one.
     * */
    fun chain(other: CrawlEventHandlers): CrawlEventHandlers
}

/**
 * Event handlers during the loading phase of the webpage lifecycle.
 * */
interface LoadEventHandlers {

    /**
     * Fire when the url is about to be normalized.
     * The event handlers normalize the url, for example, remove the fragment part of the url.
     * */
    val onNormalize: UrlFilterEventHandler

    /**
     * Fire when the url is about to be loaded.
     * */
    val onWillLoad: UrlEventHandler

    /**
     * Fire when the url is about to be fetched.
     * */
    val onWillFetch: WebPageEventHandler

    /**
     * Fire when the url is fetched.
     * */
    val onFetched: WebPageEventHandler

    /**
     * Fire when the webpage is about to be parsed.
     * */
    val onWillParse: WebPageEventHandler

    /**
     * Fire when the html document is about to be parsed.
     * */
    val onWillParseHTMLDocument: WebPageEventHandler

    /**
     * Fire when the html document is parsed.
     * */
    val onHTMLDocumentParsed: HTMLDocumentEventHandler

    /**
     * Fire when the webpage is parsed.
     * */
    val onParsed: WebPageEventHandler

    /**
     * Fire when the webpage is loaded.
     * */
    val onLoaded: WebPageEventHandler

    /**
     * Chain the other load event handler to the tail of this one.
     * */
    fun chain(other: LoadEventHandlers): LoadEventHandlers
}

/**
 * Event handlers during the browsing phase of the webpage lifecycle.
 * */
interface BrowseEventHandlers {
    /**
     * Fire when the browser is about to be launched.
     * */
    val onWillLaunchBrowser: WebPageEventHandler
    /**
     * Fire when the browser is launched.
     * */
    val onBrowserLaunched: WebPageWebDriverEventHandler

    /**
     * Fire when the url is about to be fetched.
     * */
    val onWillFetch: WebPageWebDriverEventHandler
    /**
     * Fire when the url is fetched.
     * */
    val onFetched: WebPageWebDriverEventHandler

    /**
     * Fire when the url is about to be navigated.
     * */
    val onWillNavigate: WebPageWebDriverEventHandler
    /**
     * Fire when the url is navigated, just like we clicked the `Go` button on the browser's navigation bar.
     * */
    val onNavigated: WebPageWebDriverEventHandler

    /**
     * Fire when the interaction with the webpage is about to begin.
     * */
    val onWillInteract: WebPageWebDriverEventHandler
    /**
     * Fire when the interactions with the webpage have been completed.
     * This event is fired after the interactions are completed, such as clicking a button, filling a form, and so on.
     *
     * This event is fired after the completion of the following actions:
     *
     * 1. Checking the document state
     * 2. Completing webpage scrolling
     * 3. Computing webpage features
     *
     * The event is fired before the following actions:
     * 1. Stopping the browser tab
     * */
    val onDidInteract: WebPageWebDriverEventHandler

    /**
     * Fire when the document state is about to be checked.
     * */
    val onWillCheckDocumentState: WebPageWebDriverEventHandler

    /**
     * Fire when the document is actually ready. The actuallyReady state is determined using an algorithm that is executed
     * within the browser.
     *
     * This actuallyReady state differs from the standard Document.readyState, which describes the loading state of the
     * document. When Document.readyState changes, a readystatechange event fires on the document object.
     *
     * @see [https://developer.mozilla.org/en-US/docs/Web/API/Document/readyState]
     * */
    val onDocumentActuallyReady: WebPageWebDriverEventHandler

    /**
     * Fire when we are about to perform scrolling on the page.
     * */
    val onWillScroll: WebPageWebDriverEventHandler
    /**
     * Fire when we have performed scrolling on the page.
     * */
    val onDidScroll: WebPageWebDriverEventHandler

    /**
     * Fire when we have performed scrolling on the page, at which point the document is considered not to change
     * unless other interactive actions occur. It is a good time to perform custom actions.
     *
     * Custom actions are defined by the user using code snippets that are written for a specific purpose, such as
     * clicking a button, filling a form, and so on.
     *
     * The event is fired after the completion of the following actions:
     * onDocumentActuallyReady, onWillScroll, onDidScroll
     *
     * The event is fired before the following actions:
     * onWillComputeFeature, onFeatureComputed, onDidInteract, onWillStopTab, onTabStopped
     * */
    val onDocumentSteady: WebPageWebDriverEventHandler

    /**
     * Fire when the webpage features are about to be computed.
     *
     * */
    val onWillComputeFeature: WebPageWebDriverEventHandler
    /**
     * Fire when the webpage features have been computed.
     * */
    val onFeatureComputed: WebPageWebDriverEventHandler

    /**
     * Fire when the browser tab is about to be stopped.
     *
     * This event is fired after the completion of the following actions:
     * 1. Checking the document state
     * 2. Completing webpage scrolling
     * 3. Computing webpage features
     * 4. Interacting with the webpage
     * */
    val onWillStopTab: WebPageWebDriverEventHandler
    /**
     * Fire when the browser tab is stopped.
     * */
    val onTabStopped: WebPageWebDriverEventHandler

    /**
     * Chain the other browse event handler to the tail of this one.
     * */
    fun chain(other: BrowseEventHandlers): BrowseEventHandlers
}

/**
 * The `PageEventHandlers` class specifies all event handlers that are triggered at various stages of a webpage’s lifecycle.
 *
 * The events are fall into three groups:
 *
 * 1. [LoadEventHandlers] triggers in loading stage.
 * 2. [BrowseEventHandlers] triggers in browsing stage.
 * 3. [CrawlEventHandlers] triggers in crawl stage, which is before and after loading the page.
 * */
interface PageEventHandlers {
    /**
     * Event handlers during the loading stage.
     * */
    var loadEventHandlers: LoadEventHandlers
    /**
     * Event handlers during the browsing stage.
     * */
    var browseEventHandlers: BrowseEventHandlers
    /**
     * Event handlers during the crawl stage.
     * */
    var crawlEventHandlers: CrawlEventHandlers
    /**
     * Chain the other page event handlers to the tail of this one.
     * */
    fun chain(other: PageEventHandlers): PageEventHandlers

    @Deprecated("Use loadEventHandlers instead", ReplaceWith("loadEventHandlers"))
    var loadEvent: LoadEvent
    @Deprecated("Use browseEventHandlers instead", ReplaceWith("browseEventHandlers"))
    var browseEvent: BrowseEvent
    @Deprecated("Use crawlEventHandlers instead", ReplaceWith("crawlEventHandlers"))
    var crawlEvent: CrawlEvent
}

@Deprecated("Use LoadEventHandlers instead", ReplaceWith("LoadEventHandlers"))
typealias LoadEvent = LoadEventHandlers

@Deprecated("Use BrowseEventHandlers instead", ReplaceWith("BrowseEventHandlers"))
typealias BrowseEvent = BrowseEventHandlers

@Deprecated("Use CrawlEventHandlers instead", ReplaceWith("CrawlEventHandlers"))
typealias CrawlEvent = CrawlEventHandlers

@Deprecated("Use PageEventHandlers instead", ReplaceWith("PageEventHandlers"))
typealias PageEvent = PageEventHandlers




© 2015 - 2025 Weber Informatics LLC | Privacy Policy