Comments (9)
While trying to do insert operation after bulk-insert, ran into above error.
Not sure what to do here ?
from hudi.
@xushiyan @ad1happy2go @bhasudha
Could you please help me here thank you.
from hudi.
Did you try to use the drop column
statement?
from hudi.
@SamarthRaval hoodie.datasource.write.reconcile.schema should ideally handle that,. can you try removing hoodie.write.set.null.for.missing.columns.
from hudi.
@SamarthRaval Let's try to reproduce with sample dataset if possible.
from hudi.
@SamarthRaval hoodie.datasource.write.reconcile.schema should ideally handle that,. can you try removing hoodie.write.set.null.for.missing.columns.
yes actually it should handle it, even if I have few columns missing from writeSchema.
Problem is for other customer it does work with same configurations, with no problem at all.
from hudi.
@SamarthRaval hoodie.datasource.write.reconcile.schema should ideally handle that,. can you try removing hoodie.write.set.null.for.missing.columns.
I tried to follow this
from hudi.
Did you try to use the
drop column
statement?
No I am not dropping any column but when checked closing there are some columns which are missing, but shouldn't it automatically take care of it as per hoodie.datasource.write.reconcile.schema
from hudi.
@SamarthRaval Let's try to reproduce with sample dataset if possible.
Hello @ad1happy2go @danny0405
I was able to reproduce this in research, and was able to get exact same error.
In research with in-between column is missing, it throws above error.
My understanding was with reconcile.schema enabled, it will just populated null for missing column, but seems this not the case.
Any idea with this ?
from hudi.
Related Issues (20)
- [SUPPORT]Performance degrade for migrating from Hudi 0.7 to Hudi 0.14 HOT 6
- [SUPPORT] Pulsar connection error for Hoodie Streamer HOT 1
- [SUPPORT] Datadog Metrics reporter fails with null pointer exception using hudi 0.14.0
- HUDI 0.14.1 and AWS GLUE 4.0 issues with schema evolution HOT 2
- [logical delete data] How to use flink-cdc to logical delete the hudi data HOT 1
- [SUPPORT] Flink bucket index partitioner may cause data skew HOT 6
- [SUPPORT] Failed to parse HoodieCommitMetadata HOT 1
- [SUPPORT] NPE when using PySpark with release-0.15.0 HOT 4
- org.apache.hudi.exception.HoodieException: org.apache.avro.AvroTypeException: Cannot encode decimal with precision 14 as max precision 13 HOT 6
- [SUPPORT] Failed to upsert for commit time xxxx ,HUDI 0.14.1 & Glue 4.0 HOT 4
- [SUPPORT] - Partial update of the MOR table after compaction with Hudi Streamer HOT 7
- [SUPPORT] Spark-Hudi: Unable to perform Hard delete using Pyspark on HUDI table from AWS Glue HOT 7
- [SUPPORT] Issue with RECORD_INDEX Initialization Falling Back to GLOBAL_SIMPLE HOT 1
- duplicated records when use insert overwrite HOT 4
- [SUPPORT] CVE problems in latest 0.14.1
- [SUPPORT] using spark's observe feature on dataframes saved by hudi is stuck HOT 3
- Corrupted parquet file in hudi partition | Deletion of partition in Hudi HOT 6
- [SUPPORT] Multi Writer Jobs with OCC (U1 and U2) with Async Cleaner
- [SUPPORT] how to migrate exist bloom index table to bucket table HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hudi.