pyspark.sql.streaming.StatefulProcessor.handleInitialState

StatefulProcessor.handleInitialState(key, initialState, timerValues)

Optional to implement. Invoked only in the first batch so that users can process initial states. Acts as a no-op if it is not defined or if no initial state input is provided.

Parameters

key: Any: Grouping key.
initialState: pandas.DataFrame: One dataframe in the initial state associated with the key.
timerValues: TimerValues: Timer values for the current batch that processes the input rows. Users can get the processing time or event time timestamp from TimerValues.
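
Examples

The following is a minimal sketch of how an override might seed per-key state from the initial state, not a definitive implementation. The class name `RunningCountProcessor`, the state name `"count"`, and the integer `count` column in the initial state are all hypothetical and chosen only for illustration. The import falls back to a plain base class so that the sketch can be read and exercised without a Spark installation.

```python
try:
    from pyspark.sql.streaming import StatefulProcessor
except ImportError:
    # Fallback so the sketch can be exercised without PySpark installed.
    StatefulProcessor = object


class RunningCountProcessor(StatefulProcessor):
    """Hypothetical processor that keeps one running count per grouping key."""

    def init(self, handle):
        # "count" and its schema are illustrative names, not part of the API.
        self.count_state = handle.getValueState("count", "count long")

    def handleInitialState(self, key, initialState, timerValues):
        # initialState holds the initial-state rows for this key; an
        # integer "count" column is assumed here.
        seed = sum(int(v) for v in initialState["count"])
        self.count_state.update((seed,))

    def handleInputRows(self, key, rows, timerValues):
        # Continue from the seeded value, or from 0 if no initial state was given.
        current = self.count_state.get()[0] if self.count_state.exists() else 0
        for batch in rows:
            current += len(batch)
        self.count_state.update((current,))
        return iter([])  # this sketch emits no output rows

    def close(self):
        pass
```

A processor like this would typically be passed to `transformWithStateInPandas` on a grouped streaming DataFrame, with the initial state supplied as a grouped DataFrame through the `initialState` argument. Check the exact signature against the Spark version in use.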