How-to: formatting options for floating-point numbers in Pandas

n Pandas, pd.set_option(‘display.float_format’, …) is used to set the formatting options for floating-point numbers when they are displayed in the console or output. It allows you to customize how floating-point numbers are presented, including the number of decimal places, scientific notation, and other formatting details. In this example, the pd.set_option(‘display.float_format’, ‘{:,.2f}’.format) line sets the floating-point…

What is Plotly Library

lotly is a data visualization library that allows users to create interactive and visually appealing plots and dashboards. It supports a wide range of chart types, including scatter plots, line charts, bar charts, pie charts, 3D plots, geographic maps, and more. Plotly is known for its interactive features, allowing users to explore and interact with…

What is Seaborn Library

eaborn is a data visualization library for Python that is built on top of Matplotlib. It provides a high-level interface for creating attractive and informative statistical graphics. Seaborn is particularly well-suited for visualizing complex datasets with multiple variables. Key features of Seaborn include: To use a library in your Python code, you typically need to…

Feature Engineering: Scaling, Normalization, and Standardization

Feature scaling is considered a part of the data processing cycle that cannot be skipped, so that we can achieve stable and fast training of our ML algorithm. eature Scaling is a technique to standardize the independent features present in the data in a fixed range. It is performed during the data pre-processing to handle…

Handling missing data in a dataset

There are many ways to address missing data, each with pros and cons. Let’s take a look at the less complex options: Option 1: Delete data with missing rows. When we have a model that cannot handle missing data, the most prudent thing to do is to remove rows that have information missing. Let’s remove…

Finding missing data in a dataset

Do we have a complete dataset in a real-world scenario? No. We know from history that there is missing information in our data! How can we tell if the data we have available is complete? We could print the entire dataset, but this could involve human error, and it would become impractical with this many…