Similar to the 1.0 version, it would be very useful to support an external progress st

Support pluggable progress store about azure-event-hubs-spark HOT 5 CLOSED

azure commented on September 19, 2024

Support pluggable progress store

from azure-event-hubs-spark.

Comments (5)

CodingCat commented on September 19, 2024

no...in direct dstream progress dir is very critical to the progress of the whole streaming pipeline...we will not open that to the user at least in the recent future

from azure-event-hubs-spark.

slyons commented on September 19, 2024

It's a generic ProgressTracker, in what way is having a file system necessary for the dstream approach?

from azure-event-hubs-spark.

CodingCat commented on September 19, 2024

because ProgressTracker is mostly about how to translate seq number to offset in the next batch which is very critical to the whole pipeline...we will not open it to the user to avoid unnecessary troubles

Similarly, Structured Streaming in Spark doesn't open offsetLog to the user and enforce it to be based on HDFS

so that's it

from azure-event-hubs-spark.

slyons commented on September 19, 2024

ProgressTracker seems to only manage saving the progress state back to storage and loading it back up again. It's up to the EH client to determine what the next sequence number is. The tracker is obviously important as things are blocked until the latest numbers are committed, but that doesn't mean it has to be tied to the file system.

from azure-event-hubs-spark.

CodingCat commented on September 19, 2024

ok, let me end the discussion,

you can implement whatever you want as the storage place for progress files. However, that does not mean we will adopt it in this project

from azure-event-hubs-spark.

Support pluggable progress store about azure-event-hubs-spark HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent