Antalya 26.3: Fix condition for using parquet metadata cache#1751
Antalya 26.3: Fix condition for using parquet metadata cache#1751zvonand wants to merge 5 commits into
Conversation
…next commit) --- Original cherry-pick message follows: Merge pull request #1631 from Altinity/arthurpassos-patch-11 Fix condition for using parquet metadata cache # Conflicts: # src/Storages/ObjectStorage/StorageObjectStorageSource.cpp # tests/integration/test_storage_iceberg_with_spark/test_read_constant_columns_optimization.py
98cefd0 to
304b298
Compare
|
AI audit note: This review comment was generated by AI (Claude Opus 4.5). Audit update for PR #1751Antalya 26.3: Fix condition for using parquet metadata cache Confirmed defectsHigh: Case comparison mismatch in parquet metadata cache guard
Coverage summary
|
PR #1751 CI TriagePR: #1751 - Antalya 26.3: Fix condition for using parquet metadata cache Summary
PR ChangesThe PR modifies // Before:
object_info->getFileFormat().value_or(configuration->getFormat()) == "Parquet"
// After:
Poco::toLower(object_info->getFileFormat().value_or(...)) == "Parquet"The fix addresses the issue where Apache Iceberg manifest files return file format in uppercase ( New Fails in PR (Potentially Caused by PR Changes)1.
|
|
this is a forward-port of an existing PR, thus not paying attention to the defects found. the defects shall be addressed by the original PR author in a separate PR |
The original version is incorrect, the issue is regarding the port. Original changes have |
Can you please fix the PR then? |
done |
Changelog category (leave one):
Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):
Fix apache iceberg queries not hitting the parquet metadata cache (#1631 by @arthurpassos).
CI/CD Options
Exclude tests:
Regression jobs to run:
Cherry-picked from #1631.
Apache Iceberg queries were not htiting the parquet metadata cache because
object_info->getFileFormat()resolves toIcebergDataObjectInfo::getFileFormat, which gets its return value fromIcebergObjectSerializableInfo. This field is filled with the value from Apache Iceberg manifest file, and it is upper case by default, which then fails clickhouse check for parquet metadata cache usage.Documentation entry for user-facing changes
...