How to Build an End-To-End ML Pipeline With Databricks & Aporia
This tutorial will show you how to build a robust end-to-end ML pipeline with Databricks and Aporia. Here’s what you’ll...
An empty DataFrame is one that does not contain any data points (i.e., rows). In this short how-to article, we will learn how to check if a Pandas or PySpark DataFrame is empty.
We can use the empty method which returns True if the DataFrame is empty.
df.empty
True
We can also check the number of rows in a DataFrame using the len function or the shape method. If they return 0, then the DataFrame is empty.
len(df) == 0
True
df.shape[0] == 0
True
We can count the number of rows using the count method and check if it equals to zero.
df.count() == 0
True
Another way of checking if a DataFrame is empty is the isEmpty method.
df.rdd.isEmpty()
True
This tutorial will show you how to build a robust end-to-end ML pipeline with Databricks and Aporia. Here’s what you’ll...
Dictionary is a built-in data structure of Python, which consists of key-value pairs. In this short how-to article, we will...
A row in a DataFrame can be considered as an observation with several features that are represented by columns. We...
DataFrame is a two-dimensional data structure with labeled rows and columns. Row labels are also known as the index of...
DataFrames are great for data cleaning, analysis, and visualization. However, they cannot be used in storing or transferring data. Once...
In this short how-to article, we will learn how to sort the rows of a DataFrame by the value in...
In a column with categorical or distinct values, it is important to know the number of occurrences of each value....
NaN values are also called missing values and simply indicate the data we do not have. We do not like...