Skip to content

Question in the blog of structured streaming part 11 #3

@bithw1

Description

@bithw1

Hi @phatak-dev
First, thank you very much for the great effort to write the structured streaming series.
I got a question: in http://blog.madhukaraphatak.com/introduction-to-spark-structured-streaming-part-11/

Running the Example
Enter the below records in socket console. These are records for AAPL with time stamps.

First Event
The first records is for time Wed, 27 Apr 2016 11:34:22 GMT.

1461756862000,"aapl",500.0
Spark outputs below results which indicates start of window


Batch: 0

+---------------------------------------------+----------+
|window |sum(value)|
+---------------------------------------------+----------+
|[2016-04-27 17:04:20.0,2016-04-27 17:04:30.0]|500.0 |
+---------------------------------------------+----------+

The first record is 2016-04-27 11:34:22, why in batch 0 the window interval is
2016-04-27 17:04:20.0,2016-04-27 17:04:30.0, I thought it should be around 2016-04-27 11:34:22,

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions