Everything you need for AI Performance in one platform.
We decided that Docs should have prime location.
Fundamentals of ML observability
Metrics, feature importance and more
We’re excited ???? to share that Forbes has named Aporia a Next Billion-Dollar Company. This recognition comes on the heels of our recent $25 million Series A funding and is a huge testament that Aporia’s mission and the need for trust in AI are more relevant than ever. We are very proud to be listed […]
DataFrame is a two-dimensional data structure with labeled rows and columns. We can use the labels (i.e. index) to access a particular cell. Row and column indices can be considered as the address of a cell.
In this short how-to article, we will learn how to set the value of a cell in Pandas and PySpark DataFrames using the cell index.
We can use both the loc and iloc methods for this task. Loc uses labels whereas iloc works with index values.
Consider the DataFrame shown in the above drawing. We want to change the value of the cell in green. Its address in terms of labels is “1-C”. In the case of using indices, the address is “1-2”
# with loc df.loc[1, "C"] = 20 # with iloc df.iloc[1, 2] = 20
The row labels are the same as row indices unless we assign a customized index to the DataFrame.
We can use the collect method to get the value in a particular cell.
# with index df.collect()[1][2] 15 # with labels df.collect()[1]["C"] 15
However, PySpark does not allow assigning a new value to a particular cell.