Comments (11)
Can you share your configuration?
from spline.
Something like this in spline.properties, run via spark-submit --class za.co.absa.spline.sample.batch.SampleJob2 spline-sample-0.3.1.jar
# Spline lineage config
spline.persistence.factory=za.co.absa.spline.persistence.api.composition.ParallelCompositeFactory
spline.persistence.composition.factories=za.co.absa.spline.persistence.atlas.AtlasPersistenceFactory,za.co.absa.spline.persistence.mongo.MongoPersistenceFactory
# MongoDB config for UI
spline.mongodb.url=mongodb://bar.domain:27017
spline.mongodb.name=spline
# Atlas Kafka config
atlas.kafka.bootstrap.servers=foo.domain:9027
# Combinations tested with Spline SampleJob2
# HDFS + Atlas + Mongo = fail
#spline.persistence.composition.factories=za.co.absa.spline.persistence.hdfs.HdfsPersistenceFactory,za.co.absa.spline.persistence.atlas.AtlasPersistenceFactory,za.co.absa.spline.persistence.mongo.MongoPersistenceFactory
# Atlas + Mongo = fail
#spline.persistence.composition.factories=za.co.absa.spline.persistence.atlas.AtlasPersistenceFactory,za.co.absa.spline.persistence.mongo.MongoPersistenceFactory
# HDFS + Atlas = works
#spline.persistence.composition.factories=za.co.absa.spline.persistence.hdfs.HdfsPersistenceFactory,za.co.absa.spline.persistence.atlas.AtlasPersistenceFactory
# HDFS + Mongo = works
#spline.persistence.composition.factories=za.co.absa.spline.persistence.hdfs.HdfsPersistenceFactory,za.co.absa.spline.persistence.mongo.MongoPersistenceFactory
from spline.
Does this occur also when using Atlas only?
from spline.
hm I can see mutable ArrayBuffer in the stack that could be the problem.
from spline.
I reproduced the exception. I will have a quick look now.
from spline.
This is caused by simultaneously linking to other lineages and saving to Atlas as well. I will implement fix storing non-linked lineages only to Atlas.
from spline.
Just confirming that it doesn't occur when using Atlas only. Thanks for checking it out.
from spline.
The fix commit is available on release branch 0.3 here: 179283e
I dont want to release new release just yet due to significant overhead of it.
Can you build it from the source using the snapshot version at the moment?
from spline.
Let me know if u have issues with above solution.
from spline.
Sorry, got busy. I'll have a look this weekend.
from spline.
Yep, that works for me.
from spline.
Related Issues (20)
- How do I analyze the column of wirte from the plan detail, which column of the read comes from?
- Admin: print a backup reminder before database migration
- Admin: add `--dry-run` option
- Fix override of JaCoCo argLine settings in pom file
- Security layer HOT 2
- Spline support for expand operation
- Improve code-coverage & add GH check action HOT 1
- Multi-arch docker builds
- Add incoming REST payload size into the persistent object extra info
- Delete older or unusable execution events HOT 4
- How different execution events are related between them HOT 2
- `TxInfo` property name discrepancy, and missing index.
- Kafka :: message failure handling HOT 2
- spline kafka trust packages HOT 3
- Spline Admin - replication factor set to 1 HOT 1
- The class 'ExecutionPlan_1.2' is not in the trusted packages | Old Changes in "develop" branch not available under latest release tags. HOT 1
- Add date to default log message
- 400 Arango error on inserting to executionPlan HOT 1
- Useless AQL error logs
- Databricks- Support of Delta live tables
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from spline.