A DataFrame is a two-dimensional data structure with labeled rows and columns. We can add new columns or rows to an existing DataFrame at any time.
In this short how-to article, we will learn how to add a new column of a constant value to Pandas and PySpark DataFrames.
Pandas
This is a simple operation in Pandas: assigning a scalar to a new column name fills every row with that value.
df["Bonus"] = 100
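As a minimal, self-contained sketch of the assignment above, assume a small DataFrame with hypothetical Name and Salary columns (the sample data is invented for illustration and is not from the original article):

```python
import pandas as pd

# Hypothetical sample data, for illustration only
df = pd.DataFrame(
    {"Name": ["Jane", "John", "Emily"], "Salary": [5000, 6000, 5500]}
)

# Assigning a scalar creates the column and fills every row with it
df["Bonus"] = 100

# assign() does the same thing but returns a new DataFrame,
# leaving the original one unchanged
df_copy = df.drop(columns="Bonus").assign(Bonus=100)

print(df)
```

Direct assignment modifies df in place, whereas assign is convenient when the operation is part of a method chain.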
PySpark
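We can perform this task using the withColumn method together with the lit function, as shown below. The lit function wraps the constant in a Column expression, which withColumn requires as its second argument.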
from pyspark.sql import functions as F
df = df.withColumn("Bonus", F.lit(100))
This question is also asked as:
- How to add a new column with string constant?
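A string constant works the same way as a numeric one. The sketch below shows the Pandas case, assuming a hypothetical Status column and sample names that are not part of the original example. In PySpark, the equivalent would be to pass the string to lit, for example df.withColumn("Status", F.lit("Active")).

```python
import pandas as pd

# Hypothetical sample data, for illustration only
df = pd.DataFrame({"Name": ["Jane", "John"]})

# A string scalar is broadcast to every row, just like a number
df["Status"] = "Active"

print(df)
```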