Aporia has been acquired by Coralogix, instantly bringing AI security and reliability to thousands of enterprises | Read the announcement

Back to Blog
How-To

How to Select Multiple Columns in a DataFrame?

Aporia Team Aporia Team 1 min read Sep 06, 2022

In this short how-to article, we will learn how to select multiple columns in Pandas and PySpark DataFrames.

select multiple columns pandas pyspark dataframe

Pandas

We can select multiple columns by writing them in a list.

cols = ["f2", "f4"]
df[cols]

The iloc method can be used for selecting columns based on their indices. Consider you have a DataFrame with 30 columns and you want to select the first 10. You can perform this task as follows:

# Select the first 10 columns
df.iloc[:,:10]

# Select from the second to fifth
df.iloc[:,2:5]

PySpark

The select function can be used for selecting multiple columns from a PySpark DataFrame.

# first method
df.select("f1", "f2")

# second method
df.select(df.f1, df.f2)

This question was also being asked as:

  • How to choose specific columns in a DataFrame?

People have also asked for:

Rate this article

Average rating 0 / 5. Vote count: 0

No votes so far! Be the first to rate this post.

On this page

Blog
Building a RAG app?

Consider AI Guardrails to get to production faster

Learn more
Table of Contents

Related Articles