Impala 4.3.0 Change Log
Release Notes - IMPALA - Version Impala 4.3.0
New Feature
- [IMPALA-1766] - Misc. statistical functions
- [IMPALA-6433] - Add read support for PageHeaderV2 to the parquet scanner
- [IMPALA-9551] - Allow embedding complex types into other complex types
- [IMPALA-10262] - Linux Packaging Support
- [IMPALA-10851] - Codegen for structs
- [IMPALA-11013] - Support migrating external tables to Iceberg tables
- [IMPALA-11158] - Add support for Iceberg tables with AVRO data files
- [IMPALA-11339] - Implement LOAD DATA INPATH for Iceberg tables
- [IMPALA-11470] - Support Caching Codegen Functions
- [IMPALA-11482] - Implement Iceberg table rollback feature
- [IMPALA-11565] - Support IF NOT EXISTS in ALTER TABLE ADD COLUMNS for Kudu tables
- [IMPALA-11629] - Support Huawei OBS (Object Storage Service) FileSystem
- [IMPALA-11728] - Set fallback database for functions
- [IMPALA-11745] - Add Hive's ESRI geospatial functions as builtins
- [IMPALA-11809] - Support non unique primary key for Kudu
- [IMPALA-11877] - Add support for DELETE statements for Iceberg tables
- [IMPALA-11922] - Add Startup Flag to Select TLS Certificate Verification on JWKS URL
- [IMPALA-11961] - Add support for Ubuntu 22.04
- [IMPALA-11970] - Add query timeline display to Impala WebUI
- [IMPALA-12053] - Expose event-processor error message in catalogd WebUI
Improvement
- [IMPALA-3880] - Add list of all tables queried to runtime profile
- [IMPALA-4052] - CREATE TABLE LIKE for Kudu tables
- [IMPALA-6665] - Tag CatalogOp logs with query IDs
- [IMPALA-7003] - Support erasure coding in Impala
- [IMPALA-7969] - Always admit trivial queries immediately
- [IMPALA-8731] - Balance queries between executor groups
- [IMPALA-10173] - Allow implicit casts between numeric and string types when inserting into table
- [IMPALA-10856] - Show connections in the web UI
- [IMPALA-11375] - impala-shell: log important messages for RPC requests/responses
- [IMPALA-11476] - Support for Ozone with Erasure Coding
- [IMPALA-11477] - Codegen Heapify in SortedRunMerger
- [IMPALA-11534] - Skip reloading file metadata for some ALTER_TABLE events
- [IMPALA-11535] - Skip events that happen before manual REFRESH
- [IMPALA-11563] - Optimized /etc/sysconfig/clock to find the time zone
- [IMPALA-11603] - Investigate using cloudflare's zlib library
- [IMPALA-11604] - Planner changes for CPU usage
- [IMPALA-11617] - Pool service should be made aware of cpu-usage limit for each executor group set
- [IMPALA-11619] - Improve Iceberg V2 reads with a custom Iceberg Position Delete operator
- [IMPALA-11624] - Update Impyla dependency to 0.18.0
- [IMPALA-11626] - Handle COMMIT_COMPACTION_EVENT from HMS
- [IMPALA-11658] - Implement Iceberg manifest caching configuration for Impala
- [IMPALA-11662] - Improve "refresh iceberg_tbl_on_oss;" performance
- [IMPALA-11694] - Use C++ atomics for the implementation of gutil/atomicops.h
- [IMPALA-11713] - Switch to C++17
- [IMPALA-11717] - Use rapidjson for printing collections
- [IMPALA-11730] - Add support for spilling to Ozone
- [IMPALA-11758] - Databases named "iceberg" confuse the parser, throwing ParseException
- [IMPALA-11784] - Don't call Iceberg's planFiles redundantly during table load
- [IMPALA-11785] - impala-shell should produce a warning if Thrift's fastbinary package isn't working
- [IMPALA-11808] - Improve catalogD cache performance in a multi service environment for a refresh/invalidate query
- [IMPALA-11822] - Optimize the Refresh/Invalidate event processing by skipping unnecessary events
- [IMPALA-11823] - Add more items to Impala web UI queries page
- [IMPALA-11850] - impala-shell: add tracing headers to hs2-http requests
- [IMPALA-11859] - Metric tracking encrypted bytes read
- [IMPALA-11865] - Set thread names for Java thread pools
- [IMPALA-11869] - Use to_string() in PrintValue for printing Thrift types
- [IMPALA-11880] - JWT Token based auth support for Impala Shell
- [IMPALA-11886] - Data cache should support asynchronous writes
- [IMPALA-11891] - Update executor group min size without restarting coordinators
- [IMPALA-11895] - Need accessor methods for third party extension
- [IMPALA-11904] - Data cache should support dumping metadata for reloading
- [IMPALA-11916] - base::IsAarch64 should be inlined
- [IMPALA-11935] - Generate core dumps if ASAN/TSAN/UBSAN built tests crash
- [IMPALA-11941] - Support Java 17 in Impala
- [IMPALA-11966] - Enable cache_ozone_file_handles by default
- [IMPALA-11992] - Support setting query options in JDBC's connection URL
- [IMPALA-12005] - Explain executor group set selection criteria in query profile
- [IMPALA-12014] - Output a warning message on failed KeepAlive RPC for a Kudu scanner
- [IMPALA-12024] - For CTAS, include time to create the table in the query profile timeline
- [IMPALA-12025] - Hide misleading TotalTime and InactiveTotalTime
- [IMPALA-12029] - Query can be under parallelized in multi executor group set setup
- [IMPALA-12031] - Add security-related HTTP headers
- [IMPALA-12036] - Web UI incorrectly shows root.default resource pool for all queries in /queries page
- [IMPALA-12059] - Make Thrift compilation stop spewing warnings
- [IMPALA-12064] - Remove unused toolchain components versions (boost, thrift)
- [IMPALA-12065] - Use pigz when compressing the tarballs for native-toolchain
- [IMPALA-12069] - Remove BUILD_HISTORICAL mode from native-toolchain
- [IMPALA-12070] - Don't build bison and autotools in the native-toolchain
- [IMPALA-12071] - Update ccache archive or disable downloading ccache for toolchain
- [IMPALA-12085] - The description in cluster-membership-mgr.h should be updated
- [IMPALA-12091] - Control scan fragment parallelism by its processing cost
- [IMPALA-12096] - Impala Web UI should list process start time and version of all backends
- [IMPALA-12111] - Speed up DATE to STRING conversion
- [IMPALA-12116] - Bump GBN to get HMS thrift API change #hive-25205
- [IMPALA-12120] - Set appropriate output writer parallelism when using new processing cost planner
- [IMPALA-12121] - Add non empty check for UDF location in getLastModifiedTime method
- [IMPALA-12142] - Default fetch_size of 10240 is suboptimal
- [IMPALA-12150] - Use protocol version to isolate cluster components
- [IMPALA-12155] - Support Impala CatalogD HA
- [IMPALA-12179] - Switch from detection via lsb_release to /etc/os-release
- [IMPALA-12188] - Various build / utility scripts source bin/impala-config.sh without redirecting output
- [IMPALA-12198] - Create RANGER_LOG_DIR if it doesn't exist for kill-ranger-server.sh
- [IMPALA-12200] - Cap stats NDV from SetOperationStmt.createMetadata
- [IMPALA-12202] - Switch native-toolchain from lsb_release to /etc/os-release
- [IMPALA-12203] - Use python = python 3 for modern toolchain docker images
- [IMPALA-12207] - Add support for Redhat 9
- [IMPALA-12210] - In external frontend mode, catalogd.INFO shows get_table_req() forwarded to HMS and an exception
- [IMPALA-12212] - Upgrade Maven to 3.9 to enable parallel dependency downloads
- [IMPALA-12218] - ORC dependency via the Ranger plugin can conflict with Impala's own ORC dependency
- [IMPALA-12231] - Bump GBN to get HMS thrift API change HIVE-27319
- [IMPALA-12246] - Record client network address on the same line when logging session open or close
- [IMPALA-12284] - Use maven's batch mode when building jamm
- [IMPALA-12286] - Make leader election robust for Impala CatalogD HA
- [IMPALA-12287] - For dataload, use "INSERT OVERWRITE TABLE" syntax rather than "INSERT OVERWRITE"
- [IMPALA-12288] - Avoid building all the backend tests when running a packaging build
- [IMPALA-12290] - test_cache_metrics is flaky due to tight thresholds
- [IMPALA-12304] - Fix the sequence number issue for update_catalogd RPC when statestore is restarted
- [IMPALA-12314] - Revisit pre-compiled IR at O1
- [IMPALA-12321] - Fix the race condition for updating active catalogd when CatalogD HA is enabled
- [IMPALA-12326] - Impala daemons should only subscribe to statestore once rpc services are ready
- [IMPALA-12357] - Skip scheduling runtime filter from PK-FK join with full build scan
- [IMPALA-12360] - Switch to using official jamm 0.4.0
- [IMPALA-12372] - Only use -Wno-deprecated / -Wno-deprecated-declaration for OpenSSL3
- [IMPALA-12387] - PartialUpdates is misleading for LOCAL filter
- [IMPALA-12408] - Planner spends a lot of time in HdfsPartition.getLocation()
- [IMPALA-12420] - Bump Kudu to 1.17.0 for Impala 4.3
- [IMPALA-12424] - Allow third party extensibility for JniFrontend
- [IMPALA-12430] - Optimize sending rows within the same process
Sub-task
- [IMPALA-9487] - SHOW and DESCRIBE statements should display EC policies
- [IMPALA-10798] - Prototype a simple JSON File reader
- [IMPALA-10893] - Use old schema during time travel
- [IMPALA-11121] - Support loading testdata in JSON format
- [IMPALA-11516] - Create better cardinality estimates for Iceberg V2 tables with deletes
- [IMPALA-11660] - Add support for AVRO-only Iceberg tables
- [IMPALA-11693] - Switch allow_erasure_coded_files default to true
- [IMPALA-11701] - Skip pushing down Iceberg predicates to Impala scanner if not needed
- [IMPALA-11708] - Add support for mixed Iceberg tables with AVRO file format
- [IMPALA-11802] - Optimize count(*) queries for Iceberg V2 tables
- [IMPALA-11908] - Iceberg Metadata querying parser change
- [IMPALA-11950] - Iceberg Metadata querying planner change
- [IMPALA-11951] - Add a script that compiles the python code with Python 2 and Python 3
- [IMPALA-11952] - Fix all syntax errors for Python 3
- [IMPALA-11956] - Update toolchain python 3 to 3.7.16
- [IMPALA-11957] - Implement Regression functions : regr_slope(), regr_intercept() and regr_r2()
- [IMPALA-11959] - Add impala-python3 virtualenv
- [IMPALA-11973] - Add absolute_import and division __future__ imports to all python code
- [IMPALA-11974] - Fix range/xrange/map/filter/ifilter/zip/izip issues regarding iterators being lazy
- [IMPALA-11975] - Fix dictionary methods that became lazy
- [IMPALA-11976] - Fix use of deprecated functions / packages in Python3 (long, maxint, file, etc)
- [IMPALA-11977] - Fix broken imports on Python 3 and Python 3 object model differences
- [IMPALA-11996] - Iceberg Metadata querying executor change
- [IMPALA-12019] - Support ORDER BY for collections of fixed length types in select list
- [IMPALA-12263] - Update CMake Avro module with C++ lib
- [IMPALA-12407] - Create a simple test table with equality deletes
Bug
- [IMPALA-5392] - Stack depth for threads printed in the Catalog UI under JVM Threads is not deep enough
- [IMPALA-5851] - Estimate number of rows for sum_init_zero scans should be number of files not table cardinality
- [IMPALA-6876] - Entries in CatalogUsageMonitor are not cleared after invalidation
- [IMPALA-8054] - Implicit cast fails with {const INT} BETWEEN FLOAT and INT
- [IMPALA-10086] - SqlCastException when comparing char with varchar
- [IMPALA-10186] - Invalid Parquet PageLocations are written when the table is sorted by some columns
- [IMPALA-10399] - test_show_grant_hive_privilege() fails in the exhaustive tests if being executed after test_grant_revoke_with_role()
- [IMPALA-10569] - Impala should determine Iceberg data file format from Iceberg metadata
- [IMPALA-10829] - Memory leak while inserting kudu with rand()
- [IMPALA-10971] - Data cache metrics count error when LIRS is enabled
- [IMPALA-10986] - Specific privilege should be required to execute a UDF in Impala
- [IMPALA-11081] - Partition key scan optimization may return incorrect results when partition files have more than one block
- [IMPALA-11195] - Disable SSL session renegotiation
- [IMPALA-11223] - ASM files from different fragments conflict when using asm_module_dir
- [IMPALA-11260] - Catalog cache item sizes of CatalogdMetaProvider are underestimated on Java9+
- [IMPALA-11282] - Toolchain centos 8 docker image no longer builds
- [IMPALA-11284] - INSERT query with concat operator fails with 'Function not set in thrift node' error
- [IMPALA-11285] - TestAsyncLoadData.test_async_load timeout on S3
- [IMPALA-11390] - Describe formatted statement on materialized view should show the view definition
- [IMPALA-11479] - Create unit tests for IcebergUtil
- [IMPALA-11509] - Dropping files of an Iceberg table during table loading may cause Impalad to get stuck in an infinite loop
- [IMPALA-11536] - Invalid push down predicates in outer join simplification
- [IMPALA-11549] - Incorrect return type mismatch in Hive generic UDFs
- [IMPALA-11572] - TestHdfsScannerSkew.test_mt_dop_skew_lpt is flaky
- [IMPALA-11588] - lastProcessedEvent is not updated in MetastoreEventsProcessor.processEvents()
- [IMPALA-11596] - Document EXECUTE ROLLBACK functionality
- [IMPALA-11620] - Enable setting 'write.format.default'
- [IMPALA-11633] - Create table as select (CTAS) tests occasionally timeout for S3, Ozone
- [IMPALA-11638] - Iceberg time travel and column masking/row filtering do not work together
- [IMPALA-11696] - Incorrect warnings when creating text/sequence table with row format delimiters
- [IMPALA-11715] - test_resource_limits_kudu fails
- [IMPALA-11736] - LOAD DATA statement with Ozone data can not load data from different bucket
- [IMPALA-11738] - Data loading failed at load-functional-query-exhaustive-hive-generated-orc-def-block.sql
- [IMPALA-11744] - Table mask view should preserve the original column order in Hive
- [IMPALA-11751] - Crash in processing partition columns of Avro table with MT_DOP>1
- [IMPALA-11753] - CatalogD OOMkilled due to natively allocated memory
- [IMPALA-11755] - Impala-shell ldap_password_cmd fails on Python 3.8
- [IMPALA-11766] - In HdfsScanNode.buildBinaryStatsPredicate, an Expr needs to be cloned
- [IMPALA-11767] - Hudi tests fail on Ozone with INVALID_VOLUME_NAME org.apache.hadoop.ozone.om.exceptions.OMException: Bucket or Volume name cannot start with a period or dash
- [IMPALA-11768] - EventProcessor should ignore failures on non-existing dbs/tables
- [IMPALA-11777] - Bump CDP_BUILD_NUMBER to get HIVE-24498
- [IMPALA-11779] - Codegen crash due to null slots
- [IMPALA-11780] - Wrong FILE__POSITION values for multi row group Parquet files when page filtering is used
- [IMPALA-11787] - Cardinality estimate for UNION in Iceberg position-delete plans can double the actual table cardinality
- [IMPALA-11788] - IMPALA-10986 seems to break test_show_grant_hive_privilege
- [IMPALA-11794] - TestLoadDataExternal.test_load is flaky
- [IMPALA-11795] - Failed to load table metadata from catalog due to EXCEPTION:Unsupported type encountered in setLowAndHighValue()
- [IMPALA-11798] - CREATE Iceberg table with external.table.purge=false is ignored
- [IMPALA-11801] - Catalogd failed to start when compiled with Apache Hive3
- [IMPALA-11803] - impalad hit DCHECK when running union on empty table with MT_DOP>1
- [IMPALA-11806] - TestIcebergTable.test_load failed
- [IMPALA-11807] - TestIcebergTable.test_avro_file_format and TestIcebergTable.test_mixed_file_format failed
- [IMPALA-11811] - Avoid storing unregistered predicate objects in a Map
- [IMPALA-11812] - Catalogd OOM due to lots of HMS FieldSchema instances
- [IMPALA-11818] - Flaky TestDdlStatements.test_describe_materialized_view
- [IMPALA-11826] - Avoid calling planFiles() on Iceberg V2 tables when there are no predicates
- [IMPALA-11833] - Test iceberg files have wrong length in them
- [IMPALA-11834] - Iceberg LOAD DATA hdfsDelete can crash JVM
- [IMPALA-11842] - Improve memory estimation for streaming aggregate operator
- [IMPALA-11843] - IndexOutOfBoundsException in analytic limit pushdown
- [IMPALA-11845] - Select STAR with table ref is not resolved correctly on column/row masked views
- [IMPALA-11846] - impala-shell packaging is broken with python 3.8 and setuptools 66.0.0
- [IMPALA-11854] - ImpalaStringWritable's underlying array can't be changed in UDFs
- [IMPALA-11855] - Upgrade jetty to 9.4.47+ due to CVE-2022-2047, CVE-2022-2048
- [IMPALA-11856] - Loglevel endpoints don't follow API best practices
- [IMPALA-11857] - Join build fragments not displaying correctly in graphical plan
- [IMPALA-11860] - Query failure due to large buffer allocation provides insufficient context
- [IMPALA-11862] - Document that the default value of ssl_cipher_list is not empty
- [IMPALA-11864] - LOAD DATA should not try to load hidden files for Iceberg tables
- [IMPALA-11873] - run-mini-dfs failed by "OZONE_ERASURECODE_POLICY: unbound variable"
- [IMPALA-11883] - erasure-coded-bytes-read metric doesn't work for Ozone
- [IMPALA-11892] - impala-shell tarball with Python 2 now requires setuptools
- [IMPALA-11900] - Test table iceberg_partitioned_orc has wrong metadata
- [IMPALA-11901] - COPY TESTCASE is broken in LocalCatalog mode
- [IMPALA-11903] - Ozone tests failing due to mismatch in ERASURECODE_POLICY
- [IMPALA-11911] - Incorrect handling of NULL arguments in Hive GenericUDFs
- [IMPALA-11914] - EXPLAIN_LEVEL=VERBOSE is broken by join build fragment (MT_DOP>0)
- [IMPALA-11920] - Spill to HDFS/Ozone can't be addressed by service name
- [IMPALA-11924] - Bloom filter size is unaffected by column NDV
- [IMPALA-11931] - enhance ExecutorBlacklist::IsBlacklisted to avoid null pointer exception
- [IMPALA-11939] - Catalogd can't process hms event once creating table contains extra whitespace
- [IMPALA-11942] - Consider restricting --trusted_domain=localhost to 127.0.0.1
- [IMPALA-11945] - JwtHttpTest.testJwtAuthWithUntrustedJwksHttpsUrl failed in core tests
- [IMPALA-11953] - num_trues and num_falses in TIntermediateColumnStats should be optional
- [IMPALA-11954] - Partition an Iceberg table on a string col with '/' char gives incorrect results
- [IMPALA-11960] - Incorrect expression rewrites of date and timestamp conjuncts
- [IMPALA-11964] - Make sure Impala returns error for Iceberg tables with equality deletes
- [IMPALA-11965] - TestCodegenCache fails in exhaustive runs
- [IMPALA-11998] - The iterator provided by ImpalaServer::GetQueryRecord() may become invalid
- [IMPALA-11999] - TestIcebergTable.test_execute_rollback fails in Ozone build
- [IMPALA-12002] - COMMIT_COMPACTION_EVENT events are not correctly skipped if the db is not found
- [IMPALA-12006] - Outer/inner join cardinality highly overestimated
- [IMPALA-12008] - End Time in profile is set before DML finishes
- [IMPALA-12011] - Shell command of webserver_private_key_password_cmd doesn't unset JAVA_TOOL_OPTIONS variable
- [IMPALA-12013] - Included pkg_resources fails in Python 2 when file in HOME directory contains special character
- [IMPALA-12017] - Skip memory and cpu limit check if REQUEST_POOL is set
- [IMPALA-12023] - Consider skipping resource checking on last executor group set
- [IMPALA-12032] - PROCESSING_COST_MIN_THREADS fails to cap minimum parallelism
- [IMPALA-12033] - Impalad crashes when --dump_exec_request_path is used
- [IMPALA-12034] - TestIcebergTable.test_drop_corrupt_table failed in Ozone build
- [IMPALA-12035] - Impala accepts very big numbers but fails to store them correctly
- [IMPALA-12041] - Trivial query fails with "No mapping found for request" error message
- [IMPALA-12042] - Invalid casts in set operations calculation
- [IMPALA-12043] - Large catalog info triggers "TTransportException: MaxMessageSize reached"
- [IMPALA-12044] - Update Spring Framework to 5.3.26
- [IMPALA-12045] - generate_junitxml.py fails when the content contains ANSI escape sequences
- [IMPALA-12049] - test_drop_corrupt_table is flaky, fails with "No failure encountered for query"
- [IMPALA-12050] - Exclude kryo-shaded from the Java build
- [IMPALA-12056] - Child queries could get scheduled on improper executor group sets
- [IMPALA-12058] - Impala create table as select throws exception for Kudu tables with non unique primary key
- [IMPALA-12060] - statestore should only allow a single catalog instance to be part of cluster
- [IMPALA-12063] - Upgrade to a version of zlib with fix for CVE-2022-37434
- [IMPALA-12073] - If topic_update_tbl_max_wait_time_ms is set to 1 in the catalog, the metadata gathering thread will not work
- [IMPALA-12079] - NoneType object is not iterable due to uncaught exceptions
- [IMPALA-12082] - INVALIDATE METADATA on new table fails if db is not loaded
- [IMPALA-12084] - INVALIDATE METADATA under unloaded db hits DCHECK in LocalCatalog mode
- [IMPALA-12086] - Fix BitUtil::CountLeadingZeros for zero
- [IMPALA-12094] - Impala Shell fails to identify "Query id ... not found" messages and return concise message
- [IMPALA-12097] - AnalysisException for a COUNT(*) query on Iceberg table
- [IMPALA-12099] - Catalogd NPE when refreshing a partitioned table
- [IMPALA-12100] - native-toolchain's gflags build should use REGISTER_INSTALL_PREFIX=Off
- [IMPALA-12102] - The JniUtil::GetJniExceptionMsg() method has the risk of memory leakage when handling JNI exceptions
- [IMPALA-12106] - Union fragment without scan node can be overparallelized by backend scheduler by 1
- [IMPALA-12110] - Impala returns parsing error when creating Kudu table in CTAS without specifying primary key
- [IMPALA-12113] - Elapsed time incorrect in query timeline
- [IMPALA-12114] - SSL Thrift connections disconnect if idle more than ~150 seconds
- [IMPALA-12117] - Intermittent pip error when building shell's venv
- [IMPALA-12119] - compilation fails on arm64
- [IMPALA-12123] - SIGSEGV in ScanRange::ReadSubRanges() when using HDFS caching
- [IMPALA-12128] - Bump ORC C++ version
- [IMPALA-12129] - Query timeline not working for running query
- [IMPALA-12131] - Execute update catalog request failed, caused by Exception:java.util.NoSuchElementException
- [IMPALA-12133] - Fix query timeline's display height for compact query plans
- [IMPALA-12148] - Create table as select (CTAS) tests time out
- [IMPALA-12153] - Parquet STRUCT reader doesn't fill position slots
- [IMPALA-12154] - LIRS cache implementation is missing some locations to trim the recency list
- [IMPALA-12158] - "pgrep impalad" does not work anymore
- [IMPALA-12163] - Shell RPC Duration Does Not Match Start/End Time
- [IMPALA-12164] - The query fails with "IllegalStateException: Illegal reference to non-materialized slot: tid=x sid=x"
- [IMPALA-12167] - TestJvmMemTracker.test_jvm_mem_tracking crashing impalad during startup
- [IMPALA-12170] - TestWebPage.test_webserver_hide_logs_link fails due to new /events link added in catalogd
- [IMPALA-12174] - Work around Kudu repo.spring.io build issue for native-toolchain
- [IMPALA-12177] - The exception in CatalogOpExecutor.unsetPartitionsColStats() was not thrown
- [IMPALA-12183] - Maintain cardinality clamping across multi-phase aggregation
- [IMPALA-12184] - Java UDF increment on an empty string is inconsistent
- [IMPALA-12185] - bootstrap_system.sh fails to install snappy-devel on Redhat 8.4
- [IMPALA-12189] - updateCatalog not releasing the catalog lock if createTblTransaction() throws exceptions
- [IMPALA-12192] - Scan fragment parallelism is limited by its ScanNode cost
- [IMPALA-12193] - DataCacheTest.SetReadOnly fails for TSAN builds
- [IMPALA-12194] - DataCacheTest.SetReadOnly intermittently fails on debug builds
- [IMPALA-12197] - Time Travel Queries on Iceberg tables fail with "ERROR: IllegalArgumentException: null"
- [IMPALA-12199] - Precommit builds break on Maven 3.5.4 download location change
- [IMPALA-12204] - Redundant codegen info of HashJoinBuilder inside a subplan
- [IMPALA-12211] - OpenSSL 3 doesn't provide FIPS_mode()
- [IMPALA-12213] - Not a self-event because service id of this catalog does not match with one in event
- [IMPALA-12217] - cgroup memory limit detection doesn't work for cgroups v2
- [IMPALA-12220] - impala-shell tarball's sasl dependency does not work on Redhat 9 / Ubuntu 22
- [IMPALA-12223] - Coordinator crash in serializing huge profile
- [IMPALA-12224] - Interactive shell tests should detect connection failures rather than hanging
- [IMPALA-12225] - LLVM 5.0.1's ASAN cannot work with glibc >=2.34
- [IMPALA-12228] - Extend Java DebugAction to simulate the failure of an Iceberg transaction
- [IMPALA-12233] - Partitioned hash join with a limit can hang when using mt_dop>0
- [IMPALA-12235] - test_multiple_coordinator() failed because _start_impala_cluster() returned non-zero exit status
- [IMPALA-12237] - Add information about the table type in the lineage log
- [IMPALA-12240] - On Redhat 9, Impala fails to build the impala-python virtualenv
- [IMPALA-12242] - bin/cmake_aux/create_py3_virtualenv.sh doesn't successfully fall back to impala-virtualenv
- [IMPALA-12244] - CGroupInfo.ErrorHandling test failed in impala-asf-master-core-asan
- [IMPALA-12245] - TestWebPage::test_query_progress is flaky
- [IMPALA-12247] - Add unimplemented methods to ModifyStmt
- [IMPALA-12249] - progress of queries on WebUI has unexpected word wrap
- [IMPALA-12252] - Support distcc server on Ubuntu 20.04
- [IMPALA-12256] - Stale DROP_PARTITION events might not be skipped correctly
- [IMPALA-12257] - createInsertEvents failed by NullPointerException: Invalid partition name
- [IMPALA-12270] - Deserialization failure, because of 'InternalException: couldn't execute a batch of exprs'
- [IMPALA-12271] - Deserialization failure, because of 'InternalException: couldn't execute a batch of exprs'
- [IMPALA-12272] - Deserialization failure, because of 'InternalException: couldn't execute a batch of exprs'
- [IMPALA-12273] - Deserialization failure, because of 'InternalException: couldn't execute a batch of exprs'
- [IMPALA-12274] - Memory leak because of the local reference created by `NewObject` in class Catalog was not released
- [IMPALA-12275] - Cannot read Hive tables created with DeflateCodec
- [IMPALA-12281] - Child queries mistakenly unset REQUEST_POOL if running query through impala-shell
- [IMPALA-12294] - Cookie handling broken in Impala Shell with Python 3
- [IMPALA-12295] - Statestore crashed when restarting catalogd
- [IMPALA-12305] - Impala server hanging when processing DDL if CatalogD HA is enabled
- [IMPALA-12307] - test_75_percent_availability fails on object stores
- [IMPALA-12319] - NullPointerException in UnnestExpr.isBoundByTupleIds()
- [IMPALA-12323] - DDL hang with SYNC_DDL=1 when CatalogD HA enabled
- [IMPALA-12327] - Iceberg V2 operator wrong results in PARTITIONED mode
- [IMPALA-12331] - Installing Maven 3.9.2 at the start of a precommit build fails sometimes
- [IMPALA-12332] - Undo IMPALA-8615 as the corresponding configs have been removed
- [IMPALA-12340] - custom_cluster.test_catalogd_ha.TestCatalogdHA.test_two_catalogd_with_force_active fails in exhaustive tests
- [IMPALA-12346] - ARM core job fails with link error due to Boost locale's ICU dependency
- [IMPALA-12359] - Add missing package-info file used by HiveVersionInfo
- [IMPALA-12366] - If Thrift messages are between 1GB and 2GB, the max message size will trigger
- [IMPALA-12383] - Aggregation with num_nodes=1 and limit returns too many rows
- [IMPALA-12384] - Restore NullLiteral's uncheckedCastTo function signature
- [IMPALA-12386] - NullExpr substitution failure with unsafe casts enabled
- [IMPALA-12393] - DictEncoder uses inconsistent hash function for TimestampValue
- [IMPALA-12395] - Planner overestimates scan cardinality for queries using count star optimization
- [IMPALA-12403] - Kerberos authentication fails when connecting with a proxy user that passes LDAP user and group filters but does not delegate another user
- [IMPALA-12409] - Don't allow EXTERNAL Iceberg tables to point to another Iceberg table in Hive catalog
- [IMPALA-12413] - Make Iceberg tables created by Trino compatible with Impala
- [IMPALA-12416] - test_skipping_older_events and some other catalog tests failing
- [IMPALA-12417] - Query timeline not working when asynchronous codegen is enabled
- [IMPALA-12425] - Upgrade json-smart to 2.4.10 due to CVE-2023-1370
Test
- [IMPALA-11253] - Add test coverage on JDK11
- [IMPALA-11829] - Flaky TestCorruptTableStats.test_corrupt_stats
- [IMPALA-12028] - test_execute_rollback fails if not run with US west coast timezone
- [IMPALA-12400] - Test expected executors used for planning when no executor groups are healthy
Task
- [IMPALA-9627] - Update Impala utility Python scripts to be Python3 compatible
- [IMPALA-10860] - Allow setting separate mem_limit for coordinators
- [IMPALA-11060] - Add option in create-load-data.sh to run setup-ranger only
- [IMPALA-11273] - Get rid of deprecated API in Java 11
- [IMPALA-11478] - Cleanup JniCatalog
- [IMPALA-11584] - Run some HDFS minicluster-only tests with Ozone
- [IMPALA-11821] - Enable new avro/mixed iceberg tests for Ozone
- [IMPALA-11827] - Do not cache admission control service's IP address in coordinator
- [IMPALA-11852] - Upgrade postgresql to 42.5.1 due to CVE-2022-41946
- [IMPALA-11913] - Upgrade datatables js library
- [IMPALA-11944] - Add SLES 15 support
- [IMPALA-11955] - Support building without Python 2
- [IMPALA-11989] - ImpalaRuntimeException if Kudu and Impala use different HMS
- [IMPALA-11990] - Output on Ozone test run failure without snapshot is confusing
- [IMPALA-11997] - impala-shell: base64.encodestring has been removed in python3.9
- [IMPALA-12037] - Update chart.js
- [IMPALA-12052] - Update tests for new Ozone EC chunk size format
- [IMPALA-12077] - Prepare for Avro 1.9+
- [IMPALA-12080] - Test test_recover_many_partitions is very slow on S3, Ozone
- [IMPALA-12081] - Build Java 11 Docker containers
- [IMPALA-12214] - Upgrade Spring Framework to 5.3.27
- [IMPALA-12227] - Ozone: configure datanode client port in minicluster
- [IMPALA-12283] - Remove HIVE_HOME/lib/py from the PYTHONPATH
- [IMPALA-12411] - TSAN ThreadSanitizer: data race during expr-test teardown
- [IMPALA-12414] - Add scripts to run Trino in the dev environment
- [IMPALA-12432] - Keep LdapKerberosImpalaShellTest* compatible with older guava versions
- [IMPALA-12434] - Isolate shell/pkg_resources.py to keep it off of the PYTHONPATH
- [IMPALA-12441] - Simplify local toolchain development
Documentation
- [IMPALA-11867] - Add Ozone to docs
- [IMPALA-11906] - Impala Doc: Support non unique primary key for Kudu table
- [IMPALA-11940] - Document manifest caching settings for Iceberg tables
- [IMPALA-11985] - Impala Doc: Support Kudu's multi-rows transaction
- [IMPALA-12010] - Impala Doc: (Addendum) Support non unique primary key for Kudu table
- [IMPALA-12335] - Document Iceberg DELETE