Update dependency io.openlineage:openlineage-java to v1.44.1 by renovate[bot] · Pull Request #3002 · MarquezProject/marquez

renovate · 2025-01-01T00:29:00Z

ℹ️ Note

This PR body was truncated due to platform limits.

This PR contains the following updates:

Package	Change	Age	Confidence
io.openlineage:openlineage-java	`1.23.0` → `1.44.1`

Release Notes

OpenLineage/OpenLineage (io.openlineage:openlineage-java)

`v1.44.1`

Compare Source

Fixed

dbt: Attach ExtractionErrorRunFacet on metadata extraction failures #4349 @harels
Attach ExtractionErrorRunFacet to run events when @handle_keyerror-decorated extraction methods fail, making previously invisible extraction errors visible to downstream consumers instead of silently emitting incomplete events.
dbt: Fix KeyError handling in _get_model_node #4348 @harels
Fix exception type mismatch in _get_model_node() by raising KeyError instead of RuntimeError, allowing the @handle_keyerror decorator to catch it and return None gracefully when a node_id is not found in the manifest.
dbt: Use .get() for optional project version retrieval #4345 @zagoodman
Fix crash when version key is absent from dbt_project.yml, which became optional in dbt 1.5, by using .get() instead of direct key access.

`v1.44.0`

Compare Source

Added

Client: Add JWT authentication support #4313 @jakub-moravec
Add JWT authenticator for Java and Python clients, enabling token-based authentication without requiring a custom authenticator implementation.
Spark: Extract input symlinks from DataSourceRDD #4283 @kchledowski
Enable extraction of input dataset symlinks from DataSourceRDD, providing richer lineage information for RDD-based Iceberg operations.

Changed

Spark: Disable column-level lineage for LogicalRDD plans #4329 @kchledowski
Disable column-level lineage extraction for LogicalRDD plans to prevent incorrect lineage caused by lost schema and transformation context.
Spark: Disable input schema extraction for LogicalRDD and add schema extraction for Iceberg DataSourceRDD #4331 @kchledowski
Disable unreliable input schema extraction from LogicalRDD and instead extract schemas from Iceberg table metadata when reading via DataSourceRDD.

Fixed

dbt: Align schema definition for dbt facets #4285 @LegendPawel-Marut
Align schema definitions for dbt-run-run-facet and dbt-version-run-facet to fix validation inconsistencies.
dbt: Better error handling for finding the directory of profiles.yml #4320 @ah12068
Handle missing profiles_dir key in run_results.json gracefully, falling back to default profile directory resolution.
dbt: Fix --target-path CLI flag being ignored #4298 @gaurav-atlan
Fix the --target-path CLI argument not being parsed and passed to artifact processors, causing the default target path to always be used.
Flink: Use Flink 2.x-only class for version detection #4312 @mobuchowski
Fix false Flink 2.x detection when modern V2-based connectors are used with Flink 1.x by using JobStatusChangedListenerFactory for version detection.
Java: Send Content-Encoding header for compressed requests #4282 @Lukas-Riedel
Send the Content-Encoding header when request body compression is enabled in the Java client, consistent with the Python client behavior.
Spark: Early return in DatabricksEventFilter for non-Databricks platforms #4315 @mobuchowski
Add fast environment detection check to skip Databricks-specific event filtering on non-Databricks platforms, reducing overhead.
Spark: Handle null Glue ARN in IcebergHandler #4311 @mobuchowski
Fix NullPointerException when processing Iceberg datasets with AWS Glue catalog by safely handling null Glue ARN values.
Spark: Handle ResolvedIdentifier in DropTableVisitor #4316 @mobuchowski
Fix ClassCastException on DROP TABLE commands in Databricks Runtime 14.2+ by handling ResolvedIdentifier alongside ResolvedTable.
Python: Remove unneeded build runtime dependency #4344 @mobuchowski
Remove the build package from runtime dependencies as it is only needed at build time and is already handled by the build system configuration.
Spark: Resolve parent job name in ParentRunFacet for AWS Glue #4340 @tstrilka
Fix incorrect parent job name in ParentRunFacet for child events (SQL_JOB, RDD_JOB) on AWS Glue, where the raw spark.app.name was used instead of the resolved application name from platform-specific name resolvers.
Spark: Skip Iceberg RDD integration tests on Java 8 with Spark 3.5 @mobuchowski
Skip testAppendWithRDDTransformations and testAppendWithRDDProcessing on Java 8 + Spark 3.5 where the Iceberg vendor module is not compiled due to Iceberg 1.7 requiring Java 11+.

`v1.43.0`

Compare Source

Added

dbt: Extract test severity from dbt tests #4258 @mobuchowski
Add support for extracting and reporting severity information from dbt tests in OpenLineage events.
Spec: Add severity field to DataQualityAssertionsDatasetFacet #4257 @mobuchowski
Extend the DataQualityAssertionsDatasetFacet with a severity field to indicate the importance level of data quality assertions.
Spark: Add support for DataSourceRDD with Iceberg #4263 @kchledowski
Enable lineage extraction from DataSourceRDD when reading Iceberg data sources, supporting mixed RDD and DataFrame operations common in AWS Glue environments.

Fixed

dbt: Fix send-events command argument parsing #4262 @mobuchowski
Fix IndexError when running 'dbt-ol send-events' command without additional arguments by properly checking args length.
Spark: Fix BigQuery column-level lineage namespace #4264 @mobuchowski
Fix incorrect "namespace" value in BigQuery column-level lineage to match the format used by regular BigQuery dataset collection.
Spark: Fix isDeltaPlan() to handle multiple Spark extensions #4268 @mobuchowski
Fix Delta detection failing when multiple Spark extensions are configured (comma-separated), enabling proper event filtering in environments like Azure Fabric or when using Gluten.
Spark: Fix META-INF/services files of relocated packages #4243 @kchledowski
Fix service provider interface conflicts caused by package relocation not updating META-INF/services configuration files.

`v1.42.1`: OpenLineage 1.42.1

Compare Source

Added

DataZone transport: Add cross-region support #4218 @RohithKayathi
Enable posting lineage events to DataZone domains in different regions from where data transformation jobs run.
Spark: Add config for disabling RDD event emitting #4118 @kchledowski
Add new configuration option spark.openlineage.filter.rddEventsDisabled to selectively disable OpenLineage event emission for RDD operations while keeping SQL-based operations enabled.
Spark: Add schema and CLL facets for Snowflake writes #4124 @kchledowski
Add schema and column-level lineage support for Snowflake datasets when using the Spark-Snowflake connector.
Spark: Add spark.openlineage.applicationRunId override #4215 @wslulciuc
Add support to override the application runID via the property spark.openlineage.applicationRunId.
Spec: Add ExecutionParametersRunFacet #4182 @jakub-moravec
Add a new facet to capture input parameters supplied to a job at the time of execution, enabling reproducibility, debugging, and richer lineage context.

Changed

Java: Update GCP Lineage transport version and fix dependency shading #3768 @tnazarew
Update GCP Lineage transport to use new version of the producer library with fixed dependency shading.
Python: Show what transport failed to create #4220 @mobuchowski
Improve error messages to indicate which transport failed to create.
Spark: Prevent classloader issue by gating log behind additional flag #4207 @mobuchowski
Fix classloader conflicts with BigQuery connector by gating DEBUG toJSON() logging behind an additional flag and logging exceptions.

Fixed

Python: Fix .with_additional_properties() annotation #4197 @dolfinus
Fix type annotation for .with_additional_properties() method to correctly accept keyword arguments.
Spark: Fix BigQuery symlinks with ".db" suffix #4192 @kchledowski
Fix BigQuery symlink namespace incorrectly having ".db" suffix in RUNNING and COMPLETE events by avoiding mutation of the Identifier object.
Spark: Fix Glue Data Catalog detection in YARN cluster mode #4229 @lawofcycles
Add fallback mechanism to retrieve AWS region from EC2 Instance Metadata Service when environment variables are unavailable in YARN cluster mode.
Spark: Fix missing inputs and CLL for AWS DynamicFrame #4222 @kchledowski
Fix missing inputs and column-level lineage when writing from AWS DynamicFrame by treating NewHadoopRDD as file-like.
Spark: Remove path pattern in ColumnLineageFacet as well #4228 @RohithKayathi
Apply spark.openlineage.dataset.removePath.pattern to input field names in ColumnLineageFacet, and fix hashCode/equals methods to include additionalProperties.

Removed

Airflow: Remove Airflow integration from OpenLineage repository #4212 @kacpermuda
The deprecated Airflow integration has been removed from the OpenLineage repository.

`v1.42.0`

Compare Source

Added

DataZone transport: Add cross-region support #4218 @RohithKayathi
Enable posting lineage events to DataZone domains in different regions from where data transformation jobs run.
Spark: Add config for disabling RDD event emitting #4118 @kchledowski
Add new configuration option spark.openlineage.filter.rddEventsDisabled to selectively disable OpenLineage event emission for RDD operations while keeping SQL-based operations enabled.
Spark: Add schema and CLL facets for Snowflake writes #4124 @kchledowski
Add schema and column-level lineage support for Snowflake datasets when using the Spark-Snowflake connector.
Spark: Add spark.openlineage.applicationRunId override #4215 @wslulciuc
Add support to override the application runID via the property spark.openlineage.applicationRunId.
Spec: Add ExecutionParametersRunFacet #4182 @jakub-moravec
Add a new facet to capture input parameters supplied to a job at the time of execution, enabling reproducibility, debugging, and richer lineage context.

Changed

Java: Update GCP Lineage transport version and fix dependency shading #3768 @tnazarew
Update GCP Lineage transport to use new version of the producer library with fixed dependency shading.
Python: Show what transport failed to create #4220 @mobuchowski
Improve error messages to indicate which transport failed to create.
Spark: Prevent classloader issue by gating log behind additional flag #4207 @mobuchowski
Fix classloader conflicts with BigQuery connector by gating DEBUG toJSON() logging behind an additional flag and logging exceptions.

Fixed

Python: Fix .with_additional_properties() annotation #4197 @dolfinus
Fix type annotation for .with_additional_properties() method to correctly accept keyword arguments.
Spark: Fix BigQuery symlinks with ".db" suffix #4192 @kchledowski
Fix BigQuery symlink namespace incorrectly having ".db" suffix in RUNNING and COMPLETE events by avoiding mutation of the Identifier object.
Spark: Fix Glue Data Catalog detection in YARN cluster mode #4229 @lawofcycles
Add fallback mechanism to retrieve AWS region from EC2 Instance Metadata Service when environment variables are unavailable in YARN cluster mode.
Spark: Fix missing inputs and CLL for AWS DynamicFrame #4222 @kchledowski
Fix missing inputs and column-level lineage when writing from AWS DynamicFrame by treating NewHadoopRDD as file-like.
Spark: Remove path pattern in ColumnLineageFacet as well #4228 @RohithKayathi
Apply spark.openlineage.dataset.removePath.pattern to input field names in ColumnLineageFacet, and fix hashCode/equals methods to include additionalProperties.

Removed

Airflow: Remove Airflow integration from OpenLineage repository #4212 @kacpermuda
The deprecated Airflow integration has been removed from the OpenLineage repository.

`v1.41.0`

Compare Source

Added

Spec: Add arbitrary extra info to JobDependency in JobDependenciesRunFacet #4189 @kacpermuda
Add support for arbitrary extra information in JobDependency within JobDependenciesRunFacet.
Python: Add debug mode to file transport #4185 @kacpermuda
Add debug mode support to file transport for better troubleshooting.
dbt: Add dbt model meta.owner to OpenLineage events #4160 @harels
Add support for capturing dbt model owner information from meta.owner in OpenLineage events.
dbt: Add DbtNodeJobFacet #4151 @mobuchowski
Add DbtNodeJobFacet to provide additional dbt node information in job facets.
Spark: Add default name to hive catalog facet #4161 @tnazarew
Add default name support to Hive catalog facet in Spark integration.
Spark: Fetch input statistics for single input RDD #4134 @pawel-big-lebowski
Add support for fetching input statistics for single input RDD jobs.

Changed

SQL: Migrate from sqlparser fork to upstream 0.59 #4153 @kchledowski
Migrate SQL parser from fork to upstream version 0.59 for better maintenance and compatibility.
Spark: Less aggressive normalization in UUID case #4178 @mobuchowski
Reduce aggressiveness of UUID normalization in Spark integration.

Fixed

Python: Small log change #4186 @kacpermuda
Improve logging output in Python client.
Spark: Ensure relation size in bytes is sane #4165 @dolfinus
Fix relation size calculation to ensure values are within reasonable bounds.
Spec: Add missing job facet schema #4154 @mobuchowski
Add missing job facet schema to specification.

`v1.40.1`

Compare Source

Fixed

Python: re-add missing version variables in top of releaseable modules #4135 @mobuchowski
Fixes breaking change in version 1.40.0.

`v1.40.0`

Compare Source

Added

Spec: standardize batch API endpoint #4109 @jakub-moravec
Add a standardized batch API endpoint to OpenLineage specification for handling multiple events in a single request.
Spec: Add ordinal position to SchemaDatasetFacet #4116 @mobuchowski
Add ordinal_position field to track the position of fields in schema (1-indexed).
Spec: Add JobDependenciesRunFacet #4112 @kacpermuda
Introduce JobDependenciesRunFacet to track dependencies between jobs.
Spec: Add support for temporary datasets #4103 @jakub-moravec
Add support for temporary datasets to enable job-to-job lineage tracking.
Spark: Add fallback for BigQuery project ID configuration #4075 @luke-hoffman1
Add fallback configuration for BigQuery project ID in Metastore integration.
Spark: Add COALESCE transformation support #4123 @kacpermuda
Include examples in Python generated classes for better documentation.
Java: Add support for jTDS JDBC URL format #4077 @dolfinus
Add support for parsing jTDS JDBC URL format in Java client.
Hive: Add ParentRunFacet #4066 @tnazarew
Add ParentRunFacet to Hive integration for tracking parent-child run relationships.
Hive: Add LOAD and IMPORT handling #4097 @tnazarew
Add support for tracking LOAD and IMPORT operations in Hive.
Hive: Add EXPORT handling #4085 @tnazarew
Add support for tracking EXPORT operations in Hive.
Hive: Add START event emission #4079 @tnazarew
Add START event emission support to Hive integration.

Fixed

Spark: Fix dataset facet builders for inputs #4121 @usamakunwar
Fix Spark dataset facet builders for input datasets.
Spark: Fix job name trimming #4114 @kchledowski
Fix job name trimming logic in Spark integration.
Spark: Fix putAll on immutable maps #4113 @pawel-big-lebowski
Fix putAll operation failing on immutable maps.
Spark: Fix RDD job handling #4108 @pawel-big-lebowski
Fix multiple issues with RDD job handling in Spark.
Spark: Fix JDBC dbtable parsing #4102 @kchledowski
Fix JDBC dbtable parsing to support any FROM clauses.
Spark: Fix Databricks setup #4083 @pawel-big-lebowski
Fix Spark connector configuration for Databricks environments.
Spark: Catch NoClassDefFoundError #4099 @mobuchowski
Catch NoClassDefFoundError when buggy implementations exist on classpath.
Spark: Fix Snowflake identifier parsing #4104 @mobuchowski
Fix Snowflake identifier parsing to handle quoted identifiers correctly.
Spark: Fix Snowflake account name handling #4105 @mobuchowski
Strip quotes from Snowflake account names for proper handling.
Spec: Fix facet property names #4092 @fm100
Fix facet property names from snake_case to camelCase for consistency.
Python: Fix facet generator after UV migration #4111 @kacpermuda
Fix Python client facet generator after moving to UV build system.
Python: Fix retry config merge #4093 @antonlin1
Fix retry configuration default merge with user-defined config in HTTP transports.
Java: Fix CVE in commons-lang3 #4084 @mandalbalmukund
Upgrade commons-lang3 version to fix CVE security vulnerability.
Hive: Generate same runId for START and STOP events #4126 @dolfinus
Ensure START and STOP events share the same runId in Hive integration.

`v1.39.0`

Compare Source

Added

Spark: Normalize dataset names with configurable trimmers #3996 @pawel-big-lebowski
Add configurable dataset name normalization with support for date patterns, key-value pairs, and S3 location detection to enable proper dataset subsetting.
Spark: Add missing facets in inputs for Databricks Unity Catalog #4057 @kchledowski
Add missing input symlink facets for Databricks Unity Catalog tables.

Changed

Spark: Refactor tests for dependency collector #4058 @kchledowski
Refactor column-level lineage dependency collector tests for better organization and maintainability.

Fixed

Spec: Fix typo in iceberg commit report facet spec file #4069 @fm100
Fix typo in IcebergCommitReportOutputDatasetFacet property name.
Spark: Fix dataset trimming for CLL inputs #4061 @pawel-big-lebowski
Fix dataset name trimming for column-level lineage inputs.
Python: Remove numpy import #4062 @kacpermuda
Remove unnecessary numpy import from Python client.

Removed

Dagster: Remove Dagster integration #3844 @kacpermuda
Remove Dagster integration from the repository.

`v1.38.0`

Compare Source

Added

Spec: Add subset dataset facets to spec #4008 @pawel-big-lebowski
Add subset dataset facets to OpenLineage specification for representing dataset relationships.
Spec: Add DatasetQualityMetricsDatasetFacet #3978 @heron--
Allow attaching dataset quality information outside of InputDatasetFacet.
Spark: Add support for microbatch source write #4018 @tnazarew
Add support for Spark structured streaming microbatch source write operations.
Spark: Add catalog properties to catalog facet #4016 @ddebowczyk92
Add catalog properties support to Spark integration for better catalog metadata tracking.
Spark: Add GCP project ID and location to BigQuery Metastore catalog properties #4039 @ddebowczyk92
Enhance BigQuery integration with GCP project ID and location in catalog properties.
Spark: Add support for COALESCE transformation #3972 @kchledowski
Add support for tracking COALESCE transformations in Spark jobs.
Spark: Add catalog facet when using vanilla Hive tables #3982 @ddebowczyk92
Add catalog facet support for vanilla Hive table operations.
Spark: Make output statistics available within complete event #4013 @pawel-big-lebowski
Output statistics now available in complete events for better observability.
Spark: Add output stats for RDD jobs #3977 @pawel-big-lebowski
Add output statistics tracking for Spark RDD-based jobs.
Java: Add equals and hashcode methods into generated classes #4050 @pawel-big-lebowski
Improve generated model classes with proper equals and hashcode implementations.
dbt: Capture dbt tags #4022 @mobuchowski
Add support for capturing dbt tags in OpenLineage events.
dbt: Add dbt Cloud account ID to DbtRunRunFacet #4017 @mobuchowski
Add dbt Cloud account ID tracking to dbt run facets.
dbt: Update DbtRunRunFacet to add more useful information #3987 @mobuchowski
Enhance DbtRunRunFacet with additional metadata for better observability.
Python: Add GCP Lineage transport #4006 @ddebowczyk92
Add native Google Cloud Platform Lineage transport for Python client.
Python: Add fsspec support for FileTransport #3983 @JDarDagran
Add fsspec filesystem support to FileTransport for broader filesystem compatibility.
Python: Add default tags with OL client version #3980 @kacpermuda
Automatically add OpenLineage client version as default tag in events.
Airflow: Add GCP Composer facets #3986 @gabrysiaolsz
Add GCP Cloud Composer environment metadata facets to Airflow integration.

Changed

dbt: Use alias when naming datasets #4055 @mobuchowski
Use dbt model aliases when generating dataset names for more accurate lineage.
Spark: Serialize event to JSON for logging #4029 @EugeneYushin
Serialize OpenLineage events to JSON format for improved debug logging.
Spark: Respect overridden appName in EventEmitter #4030 @EugeneYushin
Properly respect user-overridden application names in event emission.
Spark: Refactor CLL ExpressionDependencyCollector #4003 @kchledowski
Refactor column-level lineage expression dependency collector for better maintainability.
Spark: Improve logging in IcebergInputStatisticsInputDatasetFacetBuilder #3994 @JDarDagran
Enhance logging for Iceberg input statistics collection.
Spark: Limit external getFileStatus calls when dealing with lots of S3 objects #3985 @pawel-big-lebowski
Optimize S3 operations by limiting external getFileStatus calls for large object sets.
Java/Spark/Hive: Move TransformationInfo to Java client to reuse across integrations #3964 @kchledowski
Refactor TransformationInfo into shared Java client for cross-integration reuse.
Python: Improve logging in AsyncHttpTransport #4026 @dolfinus
Enhance logging capabilities in asynchronous HTTP transport.
Python: Allow type aliases #4000 @JDarDagran
Support Python type aliases in client code generation.
Python: Fix classes generation for almost identical classes #3997 @JDarDagran
Improve code generation to properly handle nearly identical class definitions.
Python: Raise errors if custom token provider cannot be loaded #4014 @dolfinus
Fail fast with clear errors when custom token providers fail to load.
Python: Don't silence import errors in DefaultTransportFactory #4015 @dolfinus
Improve error visibility by not silencing import errors in transport factory.
Python: Import from facet_v2 and event_v2 instead of generated modules #3968 @kacpermuda
Update import paths to use versioned facet and event modules.
Java: Refactor ExecutorService management in OpenLineageClientUtils #4012 @JDarDagran
Improve thread pool management in Java client utilities.
CI: Replace pre-commit with prek across CI and documentation #3965 @JDarDagran
Migrate from pre-commit to prek for pre-commit hook management.

Fixed

Spark: Fix false Hive Glue detection #4053 @jsjasonseba
Fix incorrect Glue catalog detection due to always attempting ARN resolution.
Spark: Fix CLL on hiveless runtimes #4052 @kchledowski
Fix column-level lineage failures on Spark runtimes without spark-hive package.
Spark: Fix missing inputs and CLL on some table creation commands #4031 @kchledowski
Fix missing input datasets and column-level lineage for CreateDataSourceTableAsSelect and CreateHiveTableAsSelect commands.
Spark: Rely on BQ bucket info inside BigQueryIntermediateJobFilter #4044 @EugeneYushin
Fix BigQuery intermediate job filtering by using bucket configuration.
Spark: Fix for TypeNotPresentException/RefreshTableCommand errors in Spark 3.0.2 #4002 @MaciejGajewski
Add additional exception handling for TypeNotPresentException in Spark 3.0.2.
Python: Fix license field in pyproject.toml when using build module #4034 @JDarDagran
Correct license field specification in Python package metadata.
Python: Accept both apikey and api_key in token provider #4045 @kacpermuda
Support both naming conventions for API key configuration parameter.
Java: Fix empty sources jar generation #4037 @EugeneYushin
Fix build issue causing empty sources JAR files to be generated.

`v1.37.0`

Compare Source

Added

Python: Add Datadog transport with configurable async routing #3950 @mobuchowski
Add Datadog transport with intelligent routing between sync/async transports based on configurable rules. Supports wildcard matching and provides seamless integration with Datadog's observability platform.
Spark: Implement support for WriteDelta, WriteIcebergDelta logical plan nodes #3860 @orthoxerox
Add support for WriteDelta and WriteIcebergDelta logical plan nodes in Spark integration.
dbt: Add option to override dbt job name #3933 @mobuchowski
Add configuration option to override dbt job names in OpenLineage events.
Java: Add Jackson Blackbird module for JSON performance optimization #3923 @kyungryun
Improve JSON serialization performance with Jackson Blackbird module.

Changed

Spark: Remove Spark 2 support #3904 @pawel-big-lebowski
Drop support for Spark 2.x versions. Minimum supported version is now Spark 3.x.
Python: Change gzip compression level in HTTP transport #3956 @dolfinus
Optimize HTTP transport performance by adjusting gzip compression level.
Spark: Add support for Spark 4 in streaming tests #3925 @SalvadorRomo
Extend streaming integration tests to support Spark 4.0.

Fixed

Spark: Improve performance of column level lineage #3946 @pawel-big-lebowski
Limit memory consumption, provide limits for the amount of dependencies processed (1M) and input fields returned in the facet (100K). Turns on dataset lineage by default.
Spark: Add schema size limit for column level lineage processing #3949 @ddebowczyk92
Add limits to prevent performance issues with large schemas in column-level lineage processing.
Spark: Fix context factory for Spark 4 #3934 @pawel-big-lebowski
Fix context factory implementation for Spark 4.0 compatibility.
Spark: Fix LogicalRelation constructor compatibility for Spark 4 #3930 @yunchipang
Fix LogicalRelation constructor to maintain compatibility with Spark 4.0.
Spark: Fix vendors parsing in SparkOpenLineageConfig #3947 @ddebowczyk92
Fix parsing of vendor configurations in Spark OpenLineage configuration.
dbt: Use correct namespace for dbt externalQuery facet #3953 @jroachgolf84
Fix namespace handling in dbt external query facets.
Python: Fix tags configuration #3943 @JDarDagran
Fix configuration handling for user-supplied tags in Python client.

`v1.36.0`

Compare Source

Added

Spark: support Delta 4.0 and cover it with tests on Spark 4.0. #3877 @pawel-big-lebowski
Fix failing tests for Spark 4.0. Make delta integration tests pass with Delta 4.0 on Spark 4.
Spark: Add memory info to debug facet. #3914 @pawel-big-lebowski
Extend DebugFacet with additional information on Spark's driver memory configuration and current memory usage.
Spark: Add new AlterTableCommandDatasetBuilder for Spark 4.0. #3921 @pawel-big-lebowski
Add support for AlterTableCommand dataset building in Spark 4.0.
dbt: Add query IDs for dbt. #3890 @jroachgolf84
Add query ID tracking to dbt integration.
dbt: Add query ID capture in structured logs. #3918 @mobuchowski
Capture query IDs from dbt structured logs for better traceability.
Python: Formalize dataset naming for Python client. #3816 @ddebowczyk92
Formalize dataset naming conventions in Python client implementation.

Changed

Spark: bump minor versions 3.4.3 -> 3.4.4, 3.5.4 -> 3.5.6. #3907 @pawel-big-lebowski
Bump tested Spark versions.
Spark: Close OpenLineageClient in onApplicationEnd. #3851 @dolfinus
Ensure proper cleanup of OpenLineageClient when Spark application ends.
Python: Do not use f-strings with logging module. #3895 @dolfinus
Replace f-string usage in logging calls with proper logging formatting.
Python: Update protobuf version to be compatible with newer libraries. #3899 @Shadi
Update protobuf dependency to maintain compatibility with newer library versions.
Website: Documentation for compatibility tests. #3869 @mobuchowski
Add documentation explaining compatibility testing processes.

Fixed

Spark: make visitors stateless - avoid memory leak. #3902 @pawel-big-lebowski
Merge SqlExecutionRDDVisitor and LogicalRDDVisitor classes to avoid memory leak.
Spark: refactor iceberg handler. #3909 @pawel-big-lebowski
Refactor Iceberg handler implementation for better maintainability.
Spark: retry exception on empty row. #3908 @pawel-big-lebowski
Add retry logic for handling empty row exceptions.
Spark: fix Spark version for databricks test. #3911 @pawel-big-lebowski
Fix Spark version configuration in Databricks test environment.
Flink: Fix connector of type kafka-upsert not identifying kafka topics correctly. #3915 @fetta
Fix kafka-upsert connector to properly identify kafka topics.
Airflow: Fail fast and reduce timeout for airflow tests. #3905 @kacpermuda
Improve test performance by implementing fail-fast behavior and reduced timeouts.
dbt: more telemetry, fix quadratic file reading. #3916 @mobuchowski
Improve telemetry collection and fix performance issues with file reading.
dbt: Fix dbt version. #3894 @mobuchowski
Fix dbt version compatibility issues.
Python: Fix filenames for windows users. #3889 @pawel-big-lebowski
Fix filename handling to work correctly on Windows systems.
Transport: Adjust log level when aliasing default_http transport. #3897 @dolfinus
Adjust logging level for transport aliasing messages.
Build: Improve comments and add some tests. #3901 @kacpermuda
Improve code documentation and add additional test coverage.

`v1.35.0`

Compare Source

Added

Spark: Include spark_applicationDetails facet to all events #3848 @dolfinus
Add spark_applicationDetails facet to all OpenLineage events emitted by the Spark integration
Spark: Support additional facets #3850 @ddebowczyk92
Adds support for additional facets in Spark integration
Spark: disable connector by Spark config parameter #3880 @pawel-big-lebowski
Add spark.openlineage.disabled entry to disable OpenLineage integration through Spark config parameters
Spark: Fine-grained timeout config #3779 @pawel-big-lebowski
Add extra timeout options to emit incomplete OpenLineage events in case of timeout when building facets. See buildDatasetsTimePercentage and facetsBuildingTimePercentage in docs for more details
Python: Asynchronous HTTP transport implementation #3812 @mobuchowski
Adds high-performance asynchronous HTTP transport with event ordering guarantees, configurable concurrency, and comprehensive error handling. Features START-before-completion event ordering, bounded queues, and real-time statistics
dbt: Add DbtRun facet to dbt run events #3764 @dolfinus
Adds DbtRun facet for tracking dbt run information
Python: Add continue_on_success and sorting transport in CompositeTransport #3829 @kacpermuda
Adds configuration options for CompositeTransport to control behavior and ordering
Hive: Add jobType facet #3789 @dolfinus
Adds jobType facet to Hive integration
Hive: Add dialect=hive to SqlJobFacet #3863 @dolfinus
Adds dialect field to SqlJobFacet for Hive integration
Spec: SqlJobFacet now contains dialect #3819 @mobuchowski
Adds dialect field to SqlJobFacet specification
Spec: Formalize job naming #3826 @ddebowczyk92
Formalizes job naming conventions in the specification
Spec: Formalize dataset naming #3775 @ddebowczyk92
Formalizes dataset naming conventions in the specification

Changed

**S

Configuration

📅 Schedule: Branch creation - "every 3 months on the first day of the month" (UTC), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

♻ Rebasing: Whenever PR is behind base branch, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.

If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

netlify · 2025-01-01T00:29:48Z

❌ Deploy Preview for peppy-sprite-186812 failed.

Name	Link
🔨 Latest commit	`cfc84da`
🔍 Latest deploy log	https://app.netlify.com/projects/peppy-sprite-186812/deploys/68e551cf01b1d20008f3ae52

codecov · 2025-01-20T19:53:19Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 81.18%. Comparing base (a89b89c) to head (42f8ad2).

Additional details and impacted files

@@            Coverage Diff            @@
##               main    #3002   +/-   ##
=========================================
  Coverage     81.18%   81.18%           
  Complexity     1506     1506           
=========================================
  Files           268      268           
  Lines          7356     7356           
  Branches        325      325           
=========================================
  Hits           5972     5972           
  Misses         1226     1226           
  Partials        158      158

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Signed-off-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>

renovate bot force-pushed the renovate/openlineageversion branch from d415db2 to a641137 Compare January 20, 2025 19:42

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.26.0~~ Update dependency io.openlineage:openlineage-java to v1.27.0 Jan 20, 2025

renovate bot force-pushed the renovate/openlineageversion branch 7 times, most recently from 403306b to 96a3b7b Compare January 22, 2025 13:56

renovate bot force-pushed the renovate/openlineageversion branch 3 times, most recently from dc448cf to ebaf22c Compare February 7, 2025 00:42

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.27.0~~ Update dependency io.openlineage:openlineage-java to v1.28.0 Feb 7, 2025

renovate bot force-pushed the renovate/openlineageversion branch from ebaf22c to a164571 Compare February 25, 2025 18:55

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.28.0~~ Update dependency io.openlineage:openlineage-java to v1.29.0 Feb 25, 2025

renovate bot force-pushed the renovate/openlineageversion branch from a164571 to f11be42 Compare March 17, 2025 12:11

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.29.0~~ Update dependency io.openlineage:openlineage-java to v1.30.0 Mar 17, 2025

renovate bot force-pushed the renovate/openlineageversion branch 6 times, most recently from 432f8d4 to 9d7550b Compare March 26, 2025 17:23

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.30.0~~ Update dependency io.openlineage:openlineage-java to v1.30.1 Mar 26, 2025

renovate bot force-pushed the renovate/openlineageversion branch 3 times, most recently from 43064c3 to 7c19900 Compare March 27, 2025 07:34

renovate bot force-pushed the renovate/openlineageversion branch from 7c19900 to 828aa83 Compare April 10, 2025 15:36

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.32.1~~ Update dependency io.openlineage:openlineage-java to v1.33.0 May 19, 2025

renovate bot force-pushed the renovate/openlineageversion branch from 42f8ad2 to ca5d001 Compare June 18, 2025 20:50

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.33.0~~ Update dependency io.openlineage:openlineage-java to v1.34.0 Jun 18, 2025

renovate bot force-pushed the renovate/openlineageversion branch from ca5d001 to dcce969 Compare July 11, 2025 20:06

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.34.0~~ Update dependency io.openlineage:openlineage-java to v1.35.0 Jul 11, 2025

renovate bot force-pushed the renovate/openlineageversion branch from dcce969 to d45585c Compare July 22, 2025 20:54

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.35.0~~ Update dependency io.openlineage:openlineage-java to v1.36.0 Jul 22, 2025

renovate bot force-pushed the renovate/openlineageversion branch from d45585c to d31b116 Compare August 11, 2025 22:00

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.36.0~~ Update dependency io.openlineage:openlineage-java to v1.37.0 Aug 11, 2025

renovate bot force-pushed the renovate/openlineageversion branch from d31b116 to bec3771 Compare October 1, 2025 22:10

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.37.0~~ Update dependency io.openlineage:openlineage-java to v1.38.0 Oct 1, 2025

renovate bot force-pushed the renovate/openlineageversion branch from bec3771 to cfc84da Compare October 7, 2025 17:45

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.38.0~~ Update dependency io.openlineage:openlineage-java to v1.39.0 Oct 7, 2025

renovate bot force-pushed the renovate/openlineageversion branch from cfc84da to f9f7c18 Compare November 14, 2025 02:13

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.39.0~~ Update dependency io.openlineage:openlineage-java to v1.40.0 Nov 14, 2025

renovate bot force-pushed the renovate/openlineageversion branch from f9f7c18 to 3b1b2ea Compare November 14, 2025 12:55

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.40.0~~ Update dependency io.openlineage:openlineage-java to v1.40.1 Nov 14, 2025

renovate bot force-pushed the renovate/openlineageversion branch from 3b1b2ea to 00ac754 Compare December 11, 2025 13:36

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.40.1~~ Update dependency io.openlineage:openlineage-java to v1.41.0 Dec 11, 2025

renovate bot force-pushed the renovate/openlineageversion branch from 00ac754 to e8d4cc6 Compare January 7, 2026 14:39

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.41.0~~ Update dependency io.openlineage:openlineage-java to v1.42.0 Jan 7, 2026

renovate bot force-pushed the renovate/openlineageversion branch from e8d4cc6 to 2d1359e Compare January 8, 2026 23:36

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.42.0~~ Update dependency io.openlineage:openlineage-java to v1.42.1 Jan 8, 2026

renovate bot force-pushed the renovate/openlineageversion branch from 2d1359e to cd81291 Compare January 23, 2026 01:38

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.42.1~~ Update dependency io.openlineage:openlineage-java to v1.43.0 Jan 23, 2026

renovate bot force-pushed the renovate/openlineageversion branch from cd81291 to 3f63859 Compare February 17, 2026 14:48

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.43.0~~ Update dependency io.openlineage:openlineage-java to v1.44.0 Feb 17, 2026

Update dependency io.openlineage:openlineage-java to v1.44.1

96d5e69

Signed-off-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>

renovate bot force-pushed the renovate/openlineageversion branch from 3f63859 to 96d5e69 Compare February 20, 2026 18:12

renovate bot changed the title ~~Update dependency io.openlineage:openlineage-java to v1.44.0~~ Update dependency io.openlineage:openlineage-java to v1.44.1 Feb 20, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update dependency io.openlineage:openlineage-java to v1.44.1#3002

Update dependency io.openlineage:openlineage-java to v1.44.1#3002
renovate[bot] wants to merge 1 commit intomainfrom
renovate/openlineageversion

renovate bot commented Jan 1, 2025 •

edited

Loading

Uh oh!

netlify bot commented Jan 1, 2025 •

edited

Loading

Uh oh!

codecov bot commented Jan 20, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

Conversation

renovate bot commented Jan 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Release Notes

Fixed

Added

Changed

Fixed

Added

Fixed

v1.42.1: OpenLineage 1.42.1

Added

Changed

Fixed

Removed

Added

Changed

Fixed

Removed

Added

Changed

Fixed

Fixed

Added

Fixed

Added

Changed

Fixed

Removed

Added

Changed

Fixed

Added

Changed

Fixed

Added

Changed

Fixed

Added

Changed

Configuration

Uh oh!

netlify bot commented Jan 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

❌ Deploy Preview for peppy-sprite-186812 failed.

Uh oh!

codecov bot commented Jan 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

renovate bot commented Jan 1, 2025 •

edited

Loading

`v1.42.1`: OpenLineage 1.42.1

netlify bot commented Jan 1, 2025 •

edited

Loading

codecov bot commented Jan 20, 2025 •

edited

Loading