Raw data rarely arrives in a format that is ready for analysis. It needs to be loaded, inspected, cleaned, and organized before anything useful can be done with it.
Pandas is the Python library built specifically for this job. It provides powerful, flexible data structures, particularly the DataFrame — that make working with structured, tabular data fast and intuitive.
In AI and data science, Pandas is the standard tool for data handling, and virtually every project begins with it.
Installing and Importing Pandas
Pandas comes pre-installed with Anaconda. To install manually:

Import it using the standard alias pd:

Pandas has two primary data structures:



A DataFrame is the primary structure you will work with. It consists of rows and columns, similar to an Excel spreadsheet.


In practice, data comes from external files. Pandas makes loading data straightforward.

Before working with data, always inspect it first to understand its structure and contents.


Accessing specific rows, columns, or cells is a daily operation in data handling.
Selecting Columns

Selecting Rows — .loc[] and .iloc[]



You can add new columns, update values, or drop unnecessary ones.

Real-world datasets almost always have missing values. Pandas provides simple tools to detect and handle them.
Grouping lets you summarize data by category — commonly used in exploratory data analysis.



We have a sales campaign on our promoted courses and products. You can purchase 1 products at a discounted price up to 15% discount.