Class DeltaMergeBuilder
- Object
-
- io.delta.tables.DeltaMergeBuilder
-
- All Implemented Interfaces:
org.apache.spark.internal.Logging, org.apache.spark.sql.delta.util.AnalysisHelper
public class DeltaMergeBuilder extends Object implements org.apache.spark.sql.delta.util.AnalysisHelper, org.apache.spark.internal.Logging
Builder to specify how to merge data from a source DataFrame into the target Delta table. You can specify any number of whenMatched and whenNotMatched clauses. The following constraints apply to these clauses.
- whenMatched clauses:
  - There can be at most one update action and one delete action in whenMatched clauses.
  - Each whenMatched clause can have an optional condition. However, if there are two whenMatched clauses, then the first one must have a condition.
  - When there are two whenMatched clauses and their conditions (or the lack of one) are such that a row matches both clauses, the first clause/action is executed. In other words, the order of the whenMatched clauses matters.
  - If none of the whenMatched clauses matches a source-target row pair that satisfies the merge condition, then the target row is not updated or deleted.
  - If you want to update all the columns of the target Delta table with the corresponding columns of the source DataFrame, you can use whenMatched(...).updateAll(). This is equivalent to whenMatched(...).updateExpr(Map( ("col1", "source.col1"), ("col2", "source.col2"), ...))
- whenNotMatched clauses:
  - This clause can have only an insert action, which can have an optional condition.
  - If the whenNotMatched clause is not present, or if it is present but the non-matching source row does not satisfy the condition, then the source row is not inserted.
  - If you want to insert all the columns of the target Delta table with the corresponding columns of the source DataFrame, you can use whenNotMatched(...).insertAll(). This is equivalent to whenNotMatched(...).insertExpr(Map( ("col1", "source.col1"), ("col2", "source.col2"), ...))
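The clause rules above can be illustrated with a small in-memory model. This is only a sketch of the documented semantics, not Delta Lake code: the class MergeClauseModel, its nested types, and the use of plain integers for row values are all assumptions made for illustration. It shows that the first whenMatched clause whose condition holds decides the action, that a matched row is left untouched when no clause applies, and that a non-matching source row is inserted only when a whenNotMatched clause exists and its optional condition holds.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.BiPredicate;
import java.util.function.IntPredicate;

/** Hypothetical in-memory model of the clause rules; not part of Delta Lake. */
final class MergeClauseModel {

    enum Action { UPDATE, DELETE, NONE }

    /** One whenMatched clause: an action plus an optional condition (null means unconditional). */
    static final class MatchedClause {
        final Action action;
        final BiPredicate<Integer, Integer> condition; // arguments: (targetValue, sourceValue)

        MatchedClause(Action action, BiPredicate<Integer, Integer> condition) {
            this.action = action;
            this.condition = condition;
        }
    }

    /** Returns the action of the first clause whose condition holds, or NONE if no clause applies. */
    static Action resolveMatched(List<MatchedClause> clauses, int targetValue, int sourceValue) {
        for (MatchedClause clause : clauses) {
            if (clause.condition == null || clause.condition.test(targetValue, sourceValue)) {
                return clause.action;
            }
        }
        return Action.NONE;
    }

    /** A non-matching source row is inserted only if a clause exists and its condition (if any) holds. */
    static boolean shouldInsert(boolean hasNotMatchedClause, IntPredicate condition, int sourceValue) {
        return hasNotMatchedClause && (condition == null || condition.test(sourceValue));
    }

    public static void main(String[] args) {
        // Conditional delete first, unconditional update second: order decides ties.
        List<MatchedClause> clauses = Arrays.asList(
            new MatchedClause(Action.DELETE, (target, source) -> source == 0),
            new MatchedClause(Action.UPDATE, null));

        System.out.println(resolveMatched(clauses, 5, 0));  // both clauses apply, first wins: DELETE
        System.out.println(resolveMatched(clauses, 5, 7));  // only the second applies: UPDATE
        System.out.println(shouldInsert(true, value -> value > 0, -1)); // condition fails: false
    }
}
```

In the model, swapping the two clauses would make the unconditional update shadow the delete, which is why the real builder requires the first of two whenMatched clauses to carry a condition.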
Scala example to update a key-value Delta table with new key-values from a source DataFrame:
    deltaTable
      .as("target")
      .merge(
        source.as("source"),
        "target.key = source.key")
      .whenMatched
      .updateExpr(Map(
        "value" -> "source.value"))
      .whenNotMatched
      .insertExpr(Map(
        "key" -> "source.key",
        "value" -> "source.value"))
      .execute()
Java example to update a key-value Delta table with new key-values from a source DataFrame:
    deltaTable
      .as("target")
      .merge(
        source.as("source"),
        "target.key = source.key")
      .whenMatched()
      .updateExpr(
        new HashMap<String, String>() {{
          put("value", "source.value");
        }})
      .whenNotMatched()
      .insertExpr(
        new HashMap<String, String>() {{
          put("key", "source.key");
          put("value", "source.value");
        }})
      .execute();
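The Java example above writes each expression map by hand. When every target column should take the value of the same-named source column, the map can be generated from a column list instead. The helper below is a minimal sketch: MergeExprMaps and fromSource are hypothetical names, not part of Delta Lake, and the code uses only java.util.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper for building merge expression maps; not part of Delta Lake. */
final class MergeExprMaps {

    /** Maps each column name to "<sourceAlias>.<column>", preserving the given column order. */
    static Map<String, String> fromSource(String sourceAlias, List<String> columns) {
        Map<String, String> exprs = new LinkedHashMap<>();
        for (String column : columns) {
            exprs.put(column, sourceAlias + "." + column);
        }
        return exprs;
    }

    public static void main(String[] args) {
        // prints {key=source.key, value=source.value}
        System.out.println(fromSource("source", Arrays.asList("key", "value")));
    }
}
```

The resulting map has the same shape as the HashMap literals passed to updateExpr and insertExpr in the example above, so it could be passed in their place. When all columns are copied, updateAll() and insertAll() achieve the same result without building a map.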
- Since:
- 0.3.0
-
-
Constructor Summary
Constructor Description
DeltaMergeBuilder()
-
Method Summary
Modifier and Type Method Description
void
execute()
Execute the merge operation based on the built matched and not matched actions.
DeltaMergeMatchedActionBuilder
whenMatched()
Build the actions to perform when the merge condition was matched.
DeltaMergeMatchedActionBuilder
whenMatched(String condition)
Build the actions to perform when the merge condition was matched and the given condition is true.
DeltaMergeMatchedActionBuilder
whenMatched(org.apache.spark.sql.Column condition)
Build the actions to perform when the merge condition was matched and the given condition is true.
DeltaMergeNotMatchedActionBuilder
whenNotMatched()
Build the action to perform when the merge condition was not matched.
DeltaMergeNotMatchedActionBuilder
whenNotMatched(String condition)
Build the actions to perform when the merge condition was not matched and the given condition is true.
DeltaMergeNotMatchedActionBuilder
whenNotMatched(org.apache.spark.sql.Column condition)
Build the actions to perform when the merge condition was not matched and the given condition is true.
-
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.apache.spark.sql.delta.util.AnalysisHelper
improveUnsupportedOpError, resolveReferencesForExpressions, toDataset, tryResolveReferences, tryResolveReferencesForExpressions
-
Methods inherited from interface org.apache.spark.internal.Logging
initializeForcefully, initializeLogIfNecessary, initializeLogIfNecessary, initializeLogIfNecessary$default$2, isTraceEnabled, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning, org$apache$spark$internal$Logging$$log_, org$apache$spark$internal$Logging$$log__$eq
-
Method Detail
-
whenMatched
public DeltaMergeMatchedActionBuilder whenMatched()
Build the actions to perform when the merge condition was matched. This returns a DeltaMergeMatchedActionBuilder object which can be used to specify how to update or delete the matched target table row with the source row.
- Since:
- 0.3.0
-
whenMatched
public DeltaMergeMatchedActionBuilder whenMatched(String condition)
Build the actions to perform when the merge condition was matched and the given condition is true. This returns a DeltaMergeMatchedActionBuilder object which can be used to specify how to update or delete the matched target table row with the source row.
- Parameters:
condition - boolean expression as a SQL formatted string
- Since:
- 0.3.0
-
whenMatched
public DeltaMergeMatchedActionBuilder whenMatched(org.apache.spark.sql.Column condition)
Build the actions to perform when the merge condition was matched and the given condition is true. This returns a DeltaMergeMatchedActionBuilder object which can be used to specify how to update or delete the matched target table row with the source row.
- Parameters:
condition - boolean expression as a Column object
- Since:
- 0.3.0
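The three whenMatched overloads are subject to the constraints listed in the class description: at most one update action, at most one delete action, and a condition on the first clause when there are two. The check below is a minimal sketch of those rules as a standalone validator. MatchedClauseValidator and its nested types are hypothetical names used for illustration, and how Delta Lake itself detects and reports violations is not shown here.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical validator for the documented whenMatched constraints; not part of Delta Lake. */
final class MatchedClauseValidator {

    enum Action { UPDATE, DELETE }

    /** Records only what the constraints depend on: the action and whether a condition was given. */
    static final class Clause {
        final Action action;
        final boolean hasCondition;

        Clause(Action action, boolean hasCondition) {
            this.action = action;
            this.hasCondition = hasCondition;
        }
    }

    /** Returns null when the clauses are valid, otherwise a message naming the violated constraint. */
    static String validate(List<Clause> clauses) {
        int updates = 0;
        int deletes = 0;
        for (Clause clause : clauses) {
            if (clause.action == Action.UPDATE) {
                updates++;
            } else {
                deletes++;
            }
        }
        if (updates > 1) {
            return "at most one update action is allowed";
        }
        if (deletes > 1) {
            return "at most one delete action is allowed";
        }
        if (clauses.size() == 2 && !clauses.get(0).hasCondition) {
            return "with two whenMatched clauses, the first must have a condition";
        }
        return null;
    }

    public static void main(String[] args) {
        // Valid: conditional update followed by unconditional delete.
        System.out.println(validate(Arrays.asList(
            new Clause(Action.UPDATE, true), new Clause(Action.DELETE, false))));
        // Invalid: the first of two clauses has no condition.
        System.out.println(validate(Arrays.asList(
            new Clause(Action.UPDATE, false), new Clause(Action.DELETE, true))));
    }
}
```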
-
whenNotMatched
public DeltaMergeNotMatchedActionBuilder whenNotMatched()
Build the action to perform when the merge condition was not matched. This returns a DeltaMergeNotMatchedActionBuilder object which can be used to specify how to insert the new source row into the target table.
- Since:
- 0.3.0
-
whenNotMatched
public DeltaMergeNotMatchedActionBuilder whenNotMatched(String condition)
Build the actions to perform when the merge condition was not matched and the given condition is true. This returns a DeltaMergeNotMatchedActionBuilder object which can be used to specify how to insert the new source row into the target table.
- Parameters:
condition - boolean expression as a SQL formatted string
- Since:
- 0.3.0
-
whenNotMatched
public DeltaMergeNotMatchedActionBuilder whenNotMatched(org.apache.spark.sql.Column condition)
Build the actions to perform when the merge condition was not matched and the given condition is true. This returns a DeltaMergeNotMatchedActionBuilder object which can be used to specify how to insert the new source row into the target table.
- Parameters:
condition - boolean expression as a Column object
- Since:
- 0.3.0
-
execute
public void execute()
Execute the merge operation based on the built matched and not matched actions.
- Since:
- 0.3.0
-
-