Comments (8)
Maybe this is what you need: #8609, but you are right, it looks like the table.properties has been updated frequently after the upgrade, can you add some log in HoodieTableConfig
and let's see why the update is triggered.
from hudi.
Are these failing jobs uses separate compaction with Spark, or do they have concurernt write with Spark writers?
from hudi.
Spark enables the MDT while Flink does not, maybe that's the reason why the table properties are updated frequently.
from hudi.
Flink for writing.
from hudi.
enables
How can we configure it to avoid this issue?
from hudi.
I mean did you have both Spark and Flink job writing into the same table, if it is, you might need to disable the MDT on Spark writer.
from hudi.
I mean did you have both Spark and Flink job writing into the same table, if it is, you might need to disable the MDT on Spark writer.
Oh. We don't have that scenario. Each table only has one fink task responsible for writing; there is no situation where multiple sinks correspond to one table.
from hudi.
@danny0405 :I found that this situation tends to occur after Session tasks report errors.
from hudi.
Related Issues (20)
- [SUPPORT] For example, when two writers write to non overlapping files, both writes are allowed to succeed. However, when the writes from different writers overlap (touch the same set of files), only one of them will succeed. Please note that this feature is currently experimental and requires external lock providers to acquire locks briefly at critical sections during the write. More on lock providers below. HOT 1
- [SUPPORT] in https://hudi.apache.org/docs/concurrency_control its written ``` Please note that this feature is currently experimental and requires external lock providers to acquire locks briefly at critical sections during the write. More on lock providers below.``` is this general for OCC or is it experimental for internal lock providers ??
- [SUPPORT] OCC experimental ? HOT 2
- [SUPPORT] Compile Hudi 0.15 with Spark 3.5 and Scala 2.13 HOT 1
- [SUPPORT] No easy way to append classpath in hudi hive sync HOT 1
- [SUPPORT] Hudi Sync tool is dependant on Hadoop 2.10.2 and Hadoop AWS 2.10.2. Need upgrade to newer versions like 3.3.4 HOT 2
- [SUPPORT] run_sync_tool.sh hudi-sync needs to upgraded to avoid AWS SDK V1 warning message
- [SUPPORT] When using AWS Hadoop 3.3.4 libraries, Hudi Sync will give java.lang.ClassNotFoundException: org.apache.hadoop.fs.statistics.IOStatisticsSource HOT 1
- [SUPPORT] Hudi sync requires hadoop and hive installed. Very heavy weight HOT 2
- [SUPPORT] COW+hiveStylePartitioning+glob.paths on Spark: reads incomplete values of partition column HOT 2
- flinksql writes to hudi and then synchronizes hive HOT 4
- [SUPPORT]Schema evolution setting affects Spark's 'describe table' output HOT 2
- flinksql uses hudi to write to hdfs and synchronize to hive HOT 15
- [SUPPORT] "Failed to read schema/check compatibility" on Hudi upgrade from 0.12.2 to Hudi 0.14.1 HOT 5
- [SUPPORT] Specified partition compaction HOT 1
- [SUPPORT] Huge Performance Issue With BLOOM Index On A 1.6 Billion COW Table HOT 8
- [SUPPORT] Table or view not found after create table success HOT 1
- [SUPPORT] hudi-cli.sh. Error creating bean with name 'exportCommand' defined in URL [jar:file:/opt/hudi/hudi-cli/target/hudi-cli-0.15.0.jar!/org/apache/hudi/cli/commands/ExportCommand.class] HOT 2
- [SUPPORT] spark task execute too long and can not finish when ObjectSizeCalculator.getObjectSize HOT 9
- [SUPPORT] Hudi CLI doesn't respect ENDPOINT, AWS_ENDPOINT or AWS_S3_ENDPOINT HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hudi.