Comments (3)
I would also find this useful, and it matches a pattern that Yelp engineers have written about: https://engineeringblog.yelp.com/2016/10/redshift-connector.html
I'm not aware that Kafka Connect provides any built-in support for this, and it would likely be tricky to implement while maintaining the delivery guarantees of the connector.
Another approach would be to enable S3 events on your bucket and have them sent to an SNS topic. You could then write an application that subscribes to SNS or you could use an SNS source connector to get the event data onto a Kafka topic.
from kafka-connect-storage-cloud.
Although the workaround is definitely clear, and using Kafka as the only dependency for a connector is is very often meaningful and a good pattern (no new dependency on an external system is introduced, since Kafka is already a dependency for a connector, source or sink), I don't think this workaround is justified yet, in the context of this S3 connector.
Keeping this ticket open, in case you want to discuss this more, but currently I don't find the proposal very appealing.
from kafka-connect-storage-cloud.
This workaround doesn’t work if you are using a non amazon s3 compatible storage system without SNS like functionality. It would be very handy to have some sort of finalizer hook to build off of.
from kafka-connect-storage-cloud.
Related Issues (20)
- Convert to Parquet S3 Sink problems HOT 5
- Wrong Last Modified Time of S3 Object HOT 1
- Add option to keep tombstone events?
- Handling Close call in ICR mode
- java.lang.IllegalStateException
- Where is S3 source connector? HOT 1
- Unable to build v10.5.7 tag locally, missing io.confluent:common:[7.6.0, 7.6.1) HOT 1
- Direct Memory Buffer Leak? HOT 1
- AWS MSK S3 Sink Connector to Deserialize AVRO without Schema Registry HOT 2
- commit timeout question HOT 1
- Failed to put when S3 object lock feature is enabled
- Request for Feature: Confirming Completion and Handling Old Data in S3 Kafka Connector
- io.confluent.connect.s3.format.parquet.ParquetFormat and ParquetRecordWriterProvider writes value in record field of type map and enum as bytes in Parquet result
- Null values being replaced with default HOT 5
- failed to write parquet with error for Fixed Binary size mismatch
- Add ZSTD compression support for JSON output
- Getting SchemaProjectorException after schema evolution with enum
- S3 Kafka Sink connector can't change default path and filename
- S3 Kafka Sink: Tombstone encoded partition doesn't support timebased partioning
- S3SinkConnector error: The bucket is in this region
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from kafka-connect-storage-cloud.