Pyspark DataFrameWriter jdbc 函数的 ignore 选项是忽略整个事务还是只是有问题的行?

时间：2023-04-04

本文介绍了Pyspark DataFrameWriter jdbc 函数的 ignore 选项是忽略整个事务还是只是有问题的行?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着跟版网的小编来一起学习吧！

问题描述

Pyspark DataFrameWriter 类有一个 jdbc 函数用于将数据帧写入 sql.这个函数有一个 --ignore 选项，文档说:

The Pyspark DataFrameWriter class has a jdbc function for writing a dataframe to sql. This function has an --ignore option that the documentation says will:

如果数据已经存在，则静默忽略此操作.

Silently ignore this operation if data already exists.

但是它会忽略整个事务，还是只会忽略插入重复的行?如果我将 --ignore 与 --append 标志结合起来会怎样?行为会改变吗?

But will it ignore the entire transaction, or will it only ignore inserting the rows that are duplicates? What if I were to combine --ignore with the --append flag? Would the behavior change?

Pyspark DataFrameWriter jdbc 函数的 ignore 选项是忽略整个事务还是只是有问题的行?

问题描述

推荐答案

相关文章