{"id":35247,"date":"2025-02-18T18:12:41","date_gmt":"2025-02-18T13:12:41","guid":{"rendered":"https:\/\/chartexpo.com\/blog\/?p=35247"},"modified":"2026-01-29T17:17:16","modified_gmt":"2026-01-29T12:17:16","slug":"data-cleansing-techniques","status":"publish","type":"post","link":"https:\/\/chartexpo.com\/blog\/data-cleansing-techniques","title":{"rendered":"Power BI Data Cleansing Techniques: Raw Data to Insights"},"content":{"rendered":"<p>Data cleansing is the practice of identifying and correcting data that is inaccurate. The data is then erased, changed, and replaced with newly developed or relevant data. There are various data cleansing techniques available within Power BI.<\/p>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2025\/02\/data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size-full wp-image-4345\" style=\"max-width: 100%;\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2025\/02\/data-cleansing-techniques.jpg\" alt=\"Data Cleansing Techniques\" \/><\/a><\/div>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/utmAction\/MTYrYmxvZytwYitjZXhwbytQQkk1NjIrU2Fua2V5Kw==\" target=\"_blank\" rel=\"noopener noreferrer nofollow\"><img decoding=\"async\" class=\"alignnone size-full wp-image-4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/04\/CTA-in-power-bi.jpg\" alt=\"\" width=\"205\" height=\"113\" \/><\/a> <a href=\"https:\/\/chartexpo.com\/utmAction\/MTYrYmxvZytncytjZXhwbytDRTU2Mis=\" target=\"_blank\" rel=\"noopener noreferrer nofollow\"><img decoding=\"async\" class=\"alignnone size-full wp-image-4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/04\/CTA-in-google-sheets.jpg\" alt=\"\" width=\"205\" height=\"113\" \/><\/a> <a href=\"https:\/\/chartexpo.com\/TrafficTracker\/MTYrYmxvZytzZStjZXhwbytDRTU2Mis=\" target=\"_blank\" rel=\"noopener noreferrer nofollow\"><img decoding=\"async\" class=\"alignnone size-full wp-image-4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/04\/CTA-in-microsoft-excel.jpg\" alt=\"\" width=\"205\" height=\"113\" \/><\/a><\/div>\n<p>Power BI provides a range of tools that facilitate the process of data cleansing. These tools include the ability to:<\/p>\n<ul>\n<li>eliminate duplicate entries,<\/li>\n<li>filter rows based on specified criteria,<\/li>\n<li>handle missing values,<\/li>\n<li>transform data types,<\/li>\n<li>manipulate text,<\/li>\n<li>standardize data,<\/li>\n<li>format data in a consistent manner,<\/li>\n<li>and merge and split columns as needed.<\/li>\n<\/ul>\n<p>These methodologies enhance precision, uniformity, and preparedness for data analysis and visualization.<\/p>\n<p>In this article, we explore data cleansing techniques. We begin by defining them and discussing why data cleansing is important. We then look at the benefits that data cleansing offers. We&#8217;ll also delve into some of the tools you can use.<\/p>\n<p>We then learn how to perform data cleansing using Power BI. We&#8217;ll use the ChartExpo Sankey diagram as an example.<\/p>\n<h3>Table of Content:<\/h3>\n<ol>\n<li><a href=\"#what-are-data-cleansing-techniques\">What are Data Cleansing Techniques?<\/a><\/li>\n<li><a href=\"#why-are-data-cleansing-techniques-important\">Why are Data Cleansing Techniques Important?<\/a><\/li>\n<li><a href=\"#7-data-cleansing-techniques\">7 Data Cleansing Techniques<\/a>\n<ul>\n<li><a href=\"#removing-duplicate-data\">Removing Duplicate Data<\/a><\/li>\n<li><a href=\"#handling-missing-values\">Handling Missing Values<\/a><\/li>\n<li><a href=\"#standardizing-data-formats\">Standardizing Data Formats<\/a><\/li>\n<li><a href=\"#correcting-Inaccuracies\">Correcting Inaccuracies<\/a><\/li>\n<li><a href=\"#removing-outliers\">Removing Outliers<\/a><\/li>\n<li><a href=\"#validating-data-integrity\">Validating Data Integrity<\/a><\/li>\n<li><a href=\"#converting-data-types\">Converting Data Types<\/a><\/li>\n<\/ul>\n<\/li>\n<li><a href=\"#tools-and-software-used-in-data-cleaning-method\">Tools &amp; Software Used in Data Cleaning Method<\/a><\/li>\n<li><a href=\"#data-cleansing-example\">Data Cleansing Example<\/a><\/li>\n<li><a href=\"#data-cleansing-vs-data-cleaning\">Data Cleansing vs. Data Cleaning<\/a><\/li>\n<li><a href=\"#data-cleansing-steps-in-power-bi\">Data Cleansing Steps in Power BI<\/a><\/li>\n<li><a href=\"#how-to-evaluate-data-cleansing-in-power-bi\">How to Evaluate Data Cleansing in Power BI?<\/a><\/li>\n<li><a href=\"#challenges-in-data-cleansing\">Challenges in Data Cleansing<\/a><\/li>\n<li><a href=\"#benefits-of-data-cleansing-techniques\">Benefits of Data Cleansing Techniques<\/a><\/li>\n<li><a href=\"#tips-for-data-cleansing-process\">Tips For Using Data Cleansing Process<\/a><\/li>\n<li><a href=\"#data-cleansing-techniques-faqs\">FAQs About Data Cleansing Techniques<\/a><\/li>\n<li><a href=\"#wrap-up\">Wrap Up<\/a><\/li>\n<\/ol>\n<p>First&#8230;<\/p>\n<h2 id=\"what-are-data-cleansing-techniques\">What are Data Cleansing Techniques?<\/h2>\n<p><strong>Definition:<\/strong> Data cleansing techniques are methods used to identify and rectify errors, inconsistencies, and inaccuracies within a <a href=\"https:\/\/chartexpo.com\/blog\/power-bi-dataset\" target=\"_blank\" rel=\"noopener noreferrer\">dataset<\/a>.<\/p>\n<p>These techniques are essential for ensuring data accuracy, consistency, and trustworthiness during data analysis.<\/p>\n<p>There are several data cleaning methods. These include:<\/p>\n<ul>\n<li><strong>Handling missing data<\/strong> &#8211; this involves identifying and addressing missing or null values in a dataset.<\/li>\n<li><strong>Removing duplicates <\/strong>&#8211; this process involves detecting and eliminating duplicate records or entries within a dataset.<\/li>\n<li><strong>Standardizing data formats<\/strong>: This method ensures uniform formatting of data elements like dates, names, addresses, or measurement units.<\/li>\n<li><strong>Handling outliers<\/strong>: Outliers are data points that significantly deviate from the rest of the data. Techniques may involve identifying and removing outliers or transforming or retaining them based on the analysis requirements.<\/li>\n<li><strong>Data type conversion<\/strong>: This technique involves converting data from one type to another, for example, text to numeric data. This ensures data consistency and compatibility with analysis tools or processes.<\/li>\n<\/ul>\n<h2 id=\"why-are-data-cleansing-techniques-important\">Why are Data Cleansing Techniques Important?<\/h2>\n<p>Data cleansing techniques are critical to maintaining <a href=\"https:\/\/chartexpo.com\/blog\/data-quality\" target=\"_blank\" rel=\"noopener\">data quality<\/a>, accuracy, and reliability. These factors are indispensable for organizations to derive meaningful insights and make informed decisions.<\/p>\n<ul>\n<li>\n<h3>Accuracy<\/h3>\n<\/li>\n<\/ul>\n<p>The accuracy of data is ensured by having clean data. Incorrect data can result in incorrect conclusions, ineffective decision-making, and squandered resources. Through data cleansing, inaccuracies, duplications, and disparities are recognized and remedied, leading to more dependable insights.<\/p>\n<ul>\n<li>\n<h3>Completeness<\/h3>\n<\/li>\n<\/ul>\n<p>Insufficient data can obstruct analysis and result in prejudiced outcomes. The procedure of data cleansing entails the identification and resolution of missing values or the elimination of deficient records.<\/p>\n<p>This guarantees that the dataset is all-encompassing and appropriate for analysis. Additionally, understanding concepts like <a href=\"https:\/\/chartexpo.com\/blog\/power-bi-cross-filter-direction\" target=\"_blank\" rel=\"noopener noreferrer\">Power BI cross-filter direction<\/a> is crucial, as it influences how data relationships are interpreted, ensuring accurate insights from the cleansed dataset.<\/p>\n<ul>\n<li>\n<h3>Data Integration<\/h3>\n<\/li>\n<\/ul>\n<p><a href=\"https:\/\/chartexpo.com\/blog\/customer-data-integration\" target=\"_blank\" rel=\"noopener\">Data integration<\/a> involves the incorporation of data from various sources, which often results in inconsistencies and discrepancies. Cross-tabulation can be a valuable technique in this process, allowing for a clearer comparison of different datasets. To ensure seamless integration and accurate analysis, it is essential to perform data cleansing to reconcile differences between these datasets.<\/p>\n<ul>\n<li>\n<h3>Enhanced Decision Making<\/h3>\n<\/li>\n<\/ul>\n<p>Accurate data leads to better insights, which help with well-informed <a href=\"https:\/\/chartexpo.com\/blog\/data-driven-decision-making\" target=\"_blank\" rel=\"noopener noreferrer\">decision-making<\/a>. Reliable data enables organizations to see opportunities, trends, and patterns more quickly, which improves their ability to make strategic decisions.<\/p>\n<ul>\n<li>\n<h3>Enhanced Data Consistency<\/h3>\n<\/li>\n<\/ul>\n<p>Consistency is key when it comes to effective decision-making and data analysis. Data cleaning techniques help to standardize formats, spellings, and other variations within a dataset. They ensure everything is consistent and reliable.<\/p>\n<p>With enhanced data consistency, such as that achieved through a <a href=\"https:\/\/chartexpo.com\/blog\/power-bi-income-statement\" target=\"_blank\" rel=\"noopener noreferrer\">Power BI income statement<\/a>, you can trust the insights you gain from your analysis and make better-informed decisions. So don&#8217;t overlook the importance of data cleansing, it&#8217;s an essential step towards success.<\/p>\n<h2 id=\"7-data-cleansing-techniques\">7 Data Cleansing Techniques<\/h2>\n<p data-start=\"37\" data-end=\"184\">Effective data cleansing ensures that data is accurate, consistent, and ready for analysis. Below are some key techniques used in data cleansing:<\/p>\n<h3 id=\"removing-duplicate-data\">1. Removing Duplicate Data<\/h3>\n<p data-start=\"223\" data-end=\"475\">Duplicate data can lead to misleading insights and errors in analysis. <a href=\"https:\/\/chartexpo.com\/blog\/power-bi-remove-duplicates\" target=\"_blank\" rel=\"noopener\">Removing duplicate data<\/a> helps maintain data integrity and prevents inflated results. Using automated tools or Excel functions like &#8220;Remove Duplicates&#8221; ensures a clean dataset.<\/p>\n<h3 id=\"handling-missing-values\">2. Handling Missing Values<\/h3>\n<p data-start=\"514\" data-end=\"765\">Incomplete data can affect analysis accuracy. Handling missing values involves techniques like mean imputation, predictive modeling, or removing incomplete records. Choosing the right approach depends on the data type and its impact on analysis.<\/p>\n<h3 id=\"standardizing-data-formats\">3. Standardizing Data Formats<\/h3>\n<p data-start=\"807\" data-end=\"1081\">Inconsistent formats can create confusion and errors. Standardizing data formats ensures that all data entries follow a uniform structure, such as consistent date formats, unit measurements, and naming conventions. This improves compatibility across different systems.<\/p>\n<h3 id=\"correcting-Inaccuracies\">4. Correcting Inaccuracies<\/h3>\n<p data-start=\"1120\" data-end=\"1393\">Incorrect data entries can distort analysis and lead to poor decisions. Correcting inaccuracies involves validating data against reliable sources, checking for typos, and ensuring all information is up to date. Automated validation tools help streamline this process.<\/p>\n<h3 id=\"removing-outliers\">5. Removing Outliers<\/h3>\n<p data-start=\"1426\" data-end=\"1710\">Extreme values can skew results and reduce the reliability of insights. Removing outliers helps in maintaining data accuracy by identifying unusual values that do not align with expected patterns. Statistical methods like Z-score analysis can help detect and eliminate outliers.<\/p>\n<h3 id=\"validating-data-integrity\">6. Validating Data Integrity<\/h3>\n<p data-start=\"1751\" data-end=\"2006\">Ensuring that data is accurate, complete, and reliable is essential. Validating data integrity includes cross-checking entries, performing audits, and using validation rules. This step prevents inconsistencies and ensures data is ready for analysis.<\/p>\n<h3 id=\"converting-data-types\">7. Converting Data Types<\/h3>\n<p data-start=\"2043\" data-end=\"2270\">Mismatched data types can cause errors in calculations and analysis. Converting data types ensures that numerical values, text entries, and date formats are correctly assigned, making data processing smooth and efficient.<\/p>\n<h2 id=\"tools-and-software-used-in-data-cleaning-method\">Tools &amp; Software Used in the Data Cleaning Method<\/h2>\n<p>Data cleaning is a crucial step in the data preprocessing stage of any data analysis. It involves identifying and correcting (or removing) errors and inconsistencies in data to improve its quality and ensure <a href=\"https:\/\/chartexpo.com\/blog\/data-integrity\" target=\"_blank\" rel=\"noopener\">data integrity<\/a>. Here are some common tools and techniques used in data cleaning, which can later help you create clear visualizations using a <a href=\"https:\/\/chartexpo.com\/charts\/sankey-diagram\" target=\"_blank\" rel=\"noopener\">Sankey diagram generator<\/a>.<\/p>\n<ul>\n<li>\n<h3>Power BI Query Editor<\/h3>\n<\/li>\n<\/ul>\n<p>In Power BI, the cleaning of data is predominantly carried out using the Power Query Editor. The Power Query Editor is an efficient and user-friendly <a href=\"https:\/\/chartexpo.com\/blog\/power-bi-transform-data\" target=\"_blank\" rel=\"noopener\">data transformation tool<\/a>. It is seamlessly integrated into Power BI.<\/p>\n<p>Users can link, clean, and alter data from various sources before importing it into the data model.<\/p>\n<ul>\n<li>\n<h3>OpenRefine<\/h3>\n<\/li>\n<\/ul>\n<p>OpenRefine is an open-source tool that has been designed to effectively clean and transform data. The tool is commonly utilized for data-cleaning purposes.<\/p>\n<p>OpenRefine supports the import of several data formats, so users can conveniently upload their datasets.<\/p>\n<p>Upon importing the data, OpenRefine provides users with features like faceting, filtering, and sorting. These features help users comprehend the structure and quality of the data.<\/p>\n<p>OpenRefine enables users to standardize data formats and values to ensure consistency across the dataset.<\/p>\n<p>The tool provides tools for identifying and removing duplicate records based on user-defined criteria.<\/p>\n<ul>\n<li>\n<h3>TIBCO Clarity<\/h3>\n<\/li>\n<\/ul>\n<p>TIBCO Clarity is a specialized platform designed for interactive data cleaning. The tool provides a user-friendly interface that streamlines data quality improvements, data discovery, and data transformation.<\/p>\n<p>This tool is capable of processing various types of raw data and preparing it for various applications. Additionally, it facilitates deduplication operations and address verification before moving the information to its destination.<\/p>\n<p>The cleansing process configuration can be reused for future raw data.<\/p>\n<ul>\n<li>\n<h3>DemandTools<\/h3>\n<\/li>\n<\/ul>\n<p>DemandTools is an efficient data quality suite that is intended to assist organizations in enhancing their data.<\/p>\n<p>It is compatible with Microsoft Dynamics 365 and Salesforce CRM.<\/p>\n<p>DemandTools has a module called Cleansing Tools that is dedicated to improving data quality by:<\/p>\n<ul>\n<li style=\"list-style-type: none;\">\n<ul>\n<li>Rectifying records and preventing duplicates.<\/li>\n<li>Managing lead conversions without creating duplicate contacts.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>The deduplication matching algorithm utilized in this module employs advanced techniques to identify more matches.<\/p>\n<p>The Discovery Tools module enables you to validate CRM data by comparing it with external data sources.<\/p>\n<p>The Maintenance Tools module streamlines CRM data management tasks, including loading, reporting, record reassignments, backups, and manipulation.<\/p>\n<ul>\n<li>\n<h3>IBM InfoSphere Information Server<\/h3>\n<\/li>\n<\/ul>\n<p>IBM InfoSphere Information Server is a comprehensive data integration platform. It offers a range of top-notch data-cleaning tools.<\/p>\n<p>This tool allows for various services like standardizing information, validating and classifying data, and deduplicating records.<\/p>\n<p>The platform ensures the cleanliness and quality of your data through continuous monitoring. Moreover, it also offers address cleaning services.<\/p>\n<p>IBM&#8217;s InfoSphere provides real-time integration, digital transformation, governance, data monitoring, and smooth scalability of data.<\/p>\n<h2 id=\"data-cleansing-example\">Data Cleansing Example<\/h2>\n<h3 data-start=\"243\" data-end=\"281\"><strong data-start=\"247\" data-end=\"279\">1. Salesforce Data Cleansing<\/strong><\/h3>\n<p data-start=\"282\" data-end=\"528\"><strong data-start=\"282\" data-end=\"311\">Salesforce data cleansing<\/strong> is essential for maintaining high-quality CRM data. Over time, Salesforce databases accumulate duplicate contacts, outdated leads, and incorrect entries. A company using Salesforce for sales tracking may encounter:<\/p>\n<ul data-start=\"529\" data-end=\"702\">\n<li data-start=\"529\" data-end=\"581\">Duplicate customer records lead to confusion.<\/li>\n<li data-start=\"582\" data-end=\"641\">Incomplete lead information affecting outreach efforts.<\/li>\n<li data-start=\"642\" data-end=\"702\">Outdated email addresses cause communication failures.<\/li>\n<\/ul>\n<h3 data-start=\"924\" data-end=\"955\"><strong data-start=\"928\" data-end=\"953\">2. B2B Data Cleansing<\/strong><\/h3>\n<p data-start=\"956\" data-end=\"1212\"><strong data-start=\"956\" data-end=\"978\">B2B data cleansing<\/strong> focuses on refining business contact databases for better marketing and sales outreach. Companies that rely on email campaigns, lead generation, and account-based marketing need clean and up-to-date B2B data. Common issues include:<\/p>\n<ul data-start=\"1213\" data-end=\"1395\">\n<li data-start=\"1213\" data-end=\"1271\">Inactive business emails lead to high bounce rates.<\/li>\n<li data-start=\"1272\" data-end=\"1336\">Incorrect industry classifications cause targeting errors.<\/li>\n<li data-start=\"1337\" data-end=\"1395\">Outdated company information reducing personalization.<\/li>\n<\/ul>\n<h2 id=\"data-cleansing-vs-data-cleaning\">Data Cleansing vs. Data Cleaning<\/h2>\n<table class=\"static\" style=\"table-layout: fixed; overflow-x: auto; border: 1px; font-size: 17px;\">\n<tbody>\n<tr data-start=\"44\" data-end=\"149\">\n<td width=\"90\"><strong>Aspect<\/strong><\/td>\n<td width=\"194\"><strong>Data Cleaning<\/strong><\/td>\n<td width=\"277\"><strong>Data Cleansing<\/strong><\/td>\n<\/tr>\n<tr data-start=\"243\" data-end=\"403\">\n<td width=\"90\"><strong>Definition<\/strong><\/td>\n<td width=\"194\">Removing errors, duplicates, and inconsistencies from data.<\/td>\n<td width=\"277\">Standardizing, correcting, and enriching data for accuracy and usability.<\/td>\n<\/tr>\n<tr data-start=\"404\" data-end=\"510\">\n<td width=\"90\"><strong>Scope<\/strong><\/td>\n<td width=\"194\">Basic error correction.<\/td>\n<td width=\"277\">More comprehensive, including validation and enrichment.<\/td>\n<\/tr>\n<tr data-start=\"511\" data-end=\"609\">\n<td width=\"90\"><strong>Focus<\/strong><\/td>\n<td width=\"194\">Fixing existing issues.<\/td>\n<td width=\"277\">Improving overall data quality and consistency.<\/td>\n<\/tr>\n<tr data-start=\"610\" data-end=\"762\">\n<td width=\"90\"><strong>Techniques<\/strong><\/td>\n<td width=\"194\">Removing duplicates, fixing typos, and handling missing values.<\/td>\n<td width=\"277\">Standardization, validation, deduplication, and data enhancement.<\/td>\n<\/tr>\n<tr data-start=\"763\" data-end=\"891\">\n<td width=\"90\"><strong>Outcome<\/strong><\/td>\n<td width=\"194\">Cleaned but not necessarily optimized data.<\/td>\n<td width=\"277\">High-quality, structured, and reliable data for analysis.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2 id=\"data-cleansing-steps-in-power-bi\">Data Cleansing Steps in Power BI<\/h2>\n<p data-start=\"54\" data-end=\"205\">Data cleansing in Power BI ensures accuracy, consistency, and reliability in reports and analysis. Follow these steps to clean your data effectively:<\/p>\n<h3 data-start=\"207\" data-end=\"241\">Step 1: Import Your Data<\/h3>\n<p data-start=\"242\" data-end=\"405\">Start by importing data from Excel, databases, or other sources into Power BI. Use Power Query Editor to access the raw data and begin the cleansing process.<\/p>\n<h3 data-start=\"407\" data-end=\"449\">Step 2: Remove Duplicate Records<\/h3>\n<p data-start=\"450\" data-end=\"637\">Duplicate data can cause inconsistencies in reports. In Power Query, select the relevant column, click Remove Duplicates, and ensure your dataset is free from redundant entries.<\/p>\n<h3 data-start=\"639\" data-end=\"678\">Step 3: Handle Missing Values<\/h3>\n<p data-start=\"679\" data-end=\"844\">Missing values can impact data accuracy. Use the Replace Values function to fill in missing entries or the Remove Rows option to eliminate incomplete records.<\/p>\n<h3 data-start=\"846\" data-end=\"888\">Step 4: Standardize Data Formats<\/h3>\n<p data-start=\"889\" data-end=\"1062\">Ensure consistency in data, text, and numerical values. Use Transform Options in Power Query to change text cases, modify date formats, and ensure numeric consistency.<\/p>\n<h3 data-start=\"1064\" data-end=\"1113\">Step 5: Correct Errors and Inaccuracies<\/h3>\n<p data-start=\"1114\" data-end=\"1265\">Check for incorrect spelling, formatting errors, and mismatched values. Use the Find &amp; Replace tool to correct typos and maintain data accuracy.<\/p>\n<h3 data-start=\"1267\" data-end=\"1311\">Step 6: Remove Unnecessary Columns<\/h3>\n<p data-start=\"1312\" data-end=\"1446\">Eliminate columns that are not needed for analysis. Select unwanted columns and click Remove Columns to keep only relevant data.<\/p>\n<h3 data-start=\"1448\" data-end=\"1492\">Step 7: Detect and Remove Outliers<\/h3>\n<p data-start=\"1493\" data-end=\"1644\">Extreme values can skew results. Use Conditional Formatting or statistical functions to identify and remove unusual values that distort insights.<\/p>\n<h3 data-start=\"1646\" data-end=\"1705\">Step 8: Validate and Apply Data Cleansing Changes<\/h3>\n<p data-start=\"1706\" data-end=\"1834\">Review the cleansed data, ensure it meets accuracy standards, and click Close &amp; Apply to finalize the changes in Power BI before creating insights with <a href=\"https:\/\/chartexpo.com\/tools\/power-bi-custom-visuals\" target=\"_blank\" rel=\"noopener\">Power BI charts<\/a>.<\/p>\n<h2 id=\"how-to-evaluate-data-cleansing-in-power-bi\">How to Evaluate Data Cleansing in Power BI?<\/h2>\n<p>In this section, we\u2019ll learn how to clean data using Power BI. We\u2019ll use the <a href=\"https:\/\/chartexpo.com\/blog\/sankey-diagram-in-power-bi\" target=\"_blank\" rel=\"noopener\">Sankey diagram in Power BI<\/a> (also known as the Sankey chart) as an example in Power BI Desktop.<\/p>\n<h3>Stage 1: Logging in to Power BI<\/h3>\n<ul>\n<li>Log in to Power BI.<\/li>\n<li>Enter your email. Click the \u201cSubmit\u201d button.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/05\/enter-email-to-login-to-power-bi.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/05\/enter-email-to-login-to-power-bi.jpg\" alt=\"Enter email to login to Power BI\" width=\"650\" \/><\/a><\/div>\n<ul>\n<li>Enter your password and click \u201cSign in.&#8221;<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/05\/enter-password-to-login-to-power-bi.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/05\/enter-password-to-login-to-power-bi.jpg\" alt=\"Enter Password to login to Power BI\" width=\"363\" \/><\/a><\/div>\n<ul>\n<li>Choose whether to stay signed in.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/05\/click-on-stay-signed-in.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/05\/click-on-stay-signed-in.jpg\" alt=\"Click on stay signed in\" width=\"392\" \/><\/a><\/div>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/utmAction\/MTYrYmxvZytwYitjZXhwbytQQkk1NjIrU2Fua2V5Kw==\" target=\"_blank\" rel=\"noopener noreferrer nofollow\"><img decoding=\"async\" class=\"alignnone size-full wp-image-4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/04\/CTA-in-power-bi.jpg\" alt=\"\" width=\"205\" height=\"113\" \/><\/a> <a href=\"https:\/\/chartexpo.com\/utmAction\/MTYrYmxvZytncytjZXhwbytDRTU2Mis=\" target=\"_blank\" rel=\"noopener noreferrer nofollow\"><img decoding=\"async\" class=\"alignnone size-full wp-image-4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/04\/CTA-in-google-sheets.jpg\" alt=\"\" width=\"205\" height=\"113\" \/><\/a> <a href=\"https:\/\/chartexpo.com\/TrafficTracker\/MTYrYmxvZytzZStjZXhwbytDRTU2Mis=\" target=\"_blank\" rel=\"noopener noreferrer nofollow\"><img decoding=\"async\" class=\"alignnone size-full wp-image-4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/04\/CTA-in-microsoft-excel.jpg\" alt=\"\" width=\"205\" height=\"113\" \/><\/a><\/div>\n<h3>Stage 2: Cleanse the Data to Use in Your Sankey Diagram<\/h3>\n<ul>\n<li>We&#8217;ll use the following <a href=\"https:\/\/chartexpo.com\/blog\/sample-data-for-power-bi\" target=\"_blank\" rel=\"noopener noreferrer\">sample data<\/a> for this example:<\/li>\n<\/ul>\n<table class=\"static\" style=\"table-layout: fixed; overflow-x: auto; border: 1px; font-size: 17px;\">\n<tbody>\n<tr>\n<td width=\"33\"><strong>Age<\/strong><\/td>\n<td width=\"85\"><strong>Gender<\/strong><\/td>\n<td width=\"85\"><strong>Marital Status<\/strong><\/td>\n<td width=\"85\"><strong>Occupation<\/strong><\/td>\n<td width=\"108\"><strong>Monthly Income<\/strong><\/td>\n<td width=\"154\"><strong>Educational Qualifications<\/strong><\/td>\n<td width=\"73\"><strong>Family size<\/strong><\/td>\n<\/tr>\n<tr>\n<td width=\"33\">20<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">4<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">24<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">Below Rs.10000<\/td>\n<td width=\"154\">Graduate<\/td>\n<td width=\"73\">3<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">22<\/td>\n<td width=\"85\">Male<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">Below Rs.10000<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">3<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">22<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Graduate<\/td>\n<td width=\"73\">6<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">22<\/td>\n<td width=\"85\">Male<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">Below Rs.10000<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">4<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">27<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Married<\/td>\n<td width=\"85\">Employee<\/td>\n<td width=\"108\">More than 50000<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">2<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">22<\/td>\n<td width=\"85\">Male<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Graduate<\/td>\n<td width=\"73\">3<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">24<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">3<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">23<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">2<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">23<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">4<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">22<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">5<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">23<\/td>\n<td width=\"85\">Male<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">Below Rs.10000<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">2<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">23<\/td>\n<td width=\"85\">Male<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">5<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">21<\/td>\n<td width=\"85\">Male<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Graduate<\/td>\n<td width=\"73\">4<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">23<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Self Employed<\/td>\n<td width=\"108\">10001 to 25000<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">5<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">24<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">6<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">28<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Employee<\/td>\n<td width=\"108\">25001 to 50000<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">2<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">23<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Graduate<\/td>\n<td width=\"73\">3<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">25<\/td>\n<td width=\"85\">Male<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">No Income<\/td>\n<td width=\"154\">Graduate<\/td>\n<td width=\"73\">4<\/td>\n<\/tr>\n<tr>\n<td width=\"33\">21<\/td>\n<td width=\"85\">Female<\/td>\n<td width=\"85\">Single<\/td>\n<td width=\"85\">Student<\/td>\n<td width=\"108\">Below Rs.10000<\/td>\n<td width=\"154\">Post Graduate<\/td>\n<td width=\"73\">1<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<ul>\n<li>Once you access Power BI&#8217;s dashboard, choose \u201cImport data from Excel.\u201d<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/import-data-from-excel-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/import-data-from-excel-for-applying-data-cleansing-techniques.jpg\" alt=\"Import data from Excel For Applying Data Cleansing Techniques\" width=\"650\" \/><\/a><\/div>\n<div style=\"text-align: center;\"><\/div>\n<ul>\n<li>Choose your dataset.<\/li>\n<li>It will be loaded into the Navigator pane.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/navigator-pane-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/navigator-pane-for-applying-data-cleansing-techniques.jpg\" alt=\"Navigator Pane For Applying Data Cleansing Techniques\" width=\"629\" \/><\/a><\/div>\n<ul>\n<li>Choose the Excel sheet containing the data.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/choose-excel-sheet-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/choose-excel-sheet-for-applying-data-cleansing-techniques.jpg\" alt=\"Choose Excel Sheet For Applying Data Cleansing Techniques\" width=\"603\" \/><\/a><\/div>\n<ul>\n<li>As you can see, our data set contains null Values.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/data-set-with-null-value-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/data-set-with-null-value-for-applying-data-cleansing-techniques.jpg\" alt=\"Data Set With Null Value For Applying Data Cleansing Techniques\" width=\"650\" \/><\/a><\/div>\n<ul>\n<li>Select \u201cTransform Data\u201d to remove null columns and rows.<\/li>\n<li>This opens the Power Query Editor tool dashboard.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/transform-data-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/transform-data-for-applying-data-cleansing-techniques.jpg\" alt=\"Transform Data For Applying Data Cleansing Techniques\" width=\"650\" \/><\/a><\/div>\n<ul>\n<li>In the &#8220;Home&#8221; tab, look for the &#8220;Manage Columns&#8221; group. Click on the &#8220;Choose Columns&#8221; icon (it looks like a table with highlighted columns).<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/manage-columns-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/manage-columns-for-applying-data-cleansing-techniques.jpg\" alt=\"Manage Columns For Applying Data Cleansing Techniques\" width=\"650\" \/><\/a><\/div>\n<ul>\n<li>The following window opens:<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/new-window-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/new-window-for-applying-data-cleansing-techniques.jpg\" alt=\"New Window For Applying Data Cleansing Techniques\" width=\"266\" \/><\/a><\/div>\n<ul>\n<li>You should see all the columns.<\/li>\n<li>Choose the columns to keep and click &#8220;OK.\u201d<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/click-ok-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/click-ok-for-applying-data-cleansing-techniques.jpg\" alt=\"Click OK For Applying Data Cleansing Techniques\" width=\"390\" \/><\/a><\/div>\n<ul>\n<li>To delete null rows, in the &#8220;Home&#8221; tab, look for the &#8220;Remove Rows&#8221; group. Click on the &#8220;Remove Rows&#8221; icon (it looks like a table with a row being removed).<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/click-remove-blanks-icon-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/click-remove-blanks-icon-for-applying-data-cleansing-techniques.jpg\" alt=\"Click Remove Blanks Icon For Applying Data Cleansing Techniques\" width=\"650\" \/><\/a><\/div>\n<ul>\n<li>Choose \u201cRemove Blank Rows.\u201d<\/li>\n<li>You now have clean data that you can use to create your Sankey diagram.<\/li>\n<li>Choose \u201cApply\u201d to save the changes you&#8217;ve made to your dataset.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/remove-blank-rows-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/remove-blank-rows-for-applying-data-cleansing-techniques.jpg\" alt=\"Remove Blank Rows For Applying Data Cleansing Techniques\" width=\"650\" \/><\/a><\/div>\n<ul>\n<li>Open Power Query Editor by selecting the \u201cTransform data\u201d option on the home tab of Power BI Desktop.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/transform-data-in-power-bi-desktop-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/transform-data-in-power-bi-desktop-for-applying-data-cleansing-techniques.jpg\" alt=\"Transform Data in Power BI Desktop For Applying Data Cleansing Techniques\" width=\"624\" \/><\/a><\/div>\n<p>The data in your selected query is displayed in the middle of the screen. To the left, the queries pane list is available and to the right, a list of your steps is available in the Query Settings pane.<\/p>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/query-settings-pane-for-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/query-settings-pane-for-applying-data-cleansing-techniques.jpg\" alt=\"Query Settings Pane For Applying Data Cleansing Techniques\" width=\"624\" \/><\/a><\/div>\n<h3>Stage 3: Adding the Power BI Sankey Diagram Extension<\/h3>\n<ul>\n<li>To finish creating our Sankey Diagram, we&#8217;ll use an add-in or Power BI visual from AppSource.<\/li>\n<li>Navigate to the Power BI Visualizations panel.<\/li>\n<li>Select the \u201cGet more visuals\u201d option.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/get-more-visuals-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/get-more-visuals-after-applying-data-cleansing-techniques.jpg\" alt=\"Get More Visuals After Applying Data Cleansing Techniques\" width=\"629\" \/><\/a><\/div>\n<ul>\n<li>Enter \u201cSankey Diagram for Power BI by ChartExpo\u201d in the highlighted search box.<\/li>\n<li>You should see the \u201cSankey Diagram for Power BI by ChartExpo\u201d in the following image.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/sankey-diagram-window-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/sankey-diagram-window-after-applying-data-cleansing-techniques.jpg\" alt=\"Sankey Diagram Window After Applying Data Cleansing Techniques\" width=\"650\" \/><\/a><\/div>\n<ul>\n<li>Click the highlighted \u201cAdd\u201d button.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/click-add-button-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/click-add-button-after-applying-data-cleansing-techniques.jpg\" alt=\"Click Add Button After Applying Data Cleansing Techniques\" width=\"624\" \/><\/a><\/div>\n<ul>\n<li>Power BI will add the \u201cSankey Diagram for Power BI by ChartExpo\u201d icon in the visualization panel.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/sankey-diagram-icon-in-pane-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/sankey-diagram-icon-in-pane-after-applying-data-cleansing-techniques.jpg\" alt=\"Sankey Diagram Icon in Pane After Applying Data Cleansing Techniques\" width=\"187\" \/><\/a><\/div>\n<h3>Stage 4: Drawing a Sankey Diagram With ChartExpo&#8217;s Power BI Extension<\/h3>\n<ul>\n<li>Select the \u201cSankey Diagram for Power BI by ChartExpo\u201d icon in the visualization panel.<\/li>\n<li>The following window opens in the report section of your dashboard:<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/select-sankey-diagram-icon-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/select-sankey-diagram-icon-after-applying-data-cleansing-techniques.jpg\" alt=\"Select Sankey Diagram Icon After Applying Data Cleansing Techniques\" width=\"624\" \/><\/a><\/div>\n<ul>\n<li>You can resize the visual as needed.<\/li>\n<li>Navigate to the right side of your Power BI dashboard. You should see \u201cFields\u201d next to &#8220;Visualizations.&#8221;<\/li>\n<li>You&#8217;ll select the fields to use in your Sankey chart here.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/select-fields-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/select-fields-after-applying-data-cleansing-techniques.jpg\" alt=\"Select Fields After Applying Data Cleansing Techniques\" width=\"333\" \/><\/a><\/div>\n<ul>\n<li>The ChartExpo visual needs to be selected, though.<\/li>\n<li>Select the fields in the following sequence:\n<ul>\n<li>Age<\/li>\n<li>Educational qualifications<\/li>\n<li>Family size<\/li>\n<li>Gender<\/li>\n<li>Marital Status<\/li>\n<li>Monthly Income<\/li>\n<li>Occupation<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<ul>\n<li>You&#8217;ll be asked for a ChartExpo license key or email address.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/license-key-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/license-key-after-applying-data-cleansing-techniques.jpg\" alt=\"License Key After Applying Data Cleansing Techniques\" width=\"531\" \/><\/a><\/div>\n<ul>\n<li>Select the ChartExpo visual. You should see three icons below \u201cBuild Visual\u201d in the Visualizations panel.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/choose-sankey-diagram-icon-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/choose-sankey-diagram-icon-after-applying-data-cleansing-techniques.jpg\" alt=\"Choose Sankey Diagram Icon After Applying Data Cleansing Techniques\" width=\"203\" \/><\/a><\/div>\n<ul>\n<li>Select the middle icon, \u201cFormat visual.&#8221;<\/li>\n<li>The visual properties will be populated as shown below.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/select-format-visuals-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/select-format-visuals-after-applying-data-cleansing-techniques.jpg\" alt=\"Select Format Visuals After Applying Data Cleansing Techniques\" width=\"183\" \/><\/a><\/div>\n<ul>\n<li>If you are a new user,\n<ul>\n<li>Type in your email under the section titled \u201cTrial \u201d<\/li>\n<li>This should be the email address that you used to subscribe to the ChartExpo add-in. It is where your ChartExpo license key will be sent.<\/li>\n<li>Ensure that your email address is valid.<\/li>\n<li>Click \u201cEnable Trial.&#8221; You&#8217;ll get a 7-day trial.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/remove-trial-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/remove-trial-after-applying-data-cleansing-techniques.jpg\" alt=\"Remove Trial After Applying Data Cleansing Techniques\" width=\"180\" \/><\/a><\/div>\n<ul>\n<li>You should receive a welcome email from ChartExpo.<\/li>\n<li>The Sankey Diagram you create under the 7-day trial contains the ChartExpo watermark.<\/li>\n<li>If you have obtained a license key:\n<ul>\n<li>Enter your license key in the \u201cChartExpo License Key\u201d textbox in the \u201cLicense Settings\u201d section (see below).<\/li>\n<li>Slide the toggle switch next to \u201cEnable License\u201d to &#8220;On.&#8221;<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/enter-license-key-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/enter-license-key-after-applying-data-cleansing-techniques.jpg\" alt=\"Enter License Key After Applying Data Cleansing Techniques\" width=\"197\" \/><\/a><\/div>\n<ul>\n<li>Your Sankey diagram should now be ready (see screenshot). Note that it does not have a watermark.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/remove-watermark-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/remove-watermark-after-applying-data-cleansing-techniques.jpg\" alt=\"Remove Watermark After Applying Data Cleansing Techniques\" width=\"650\" \/><\/a><\/div>\n<ul>\n<li>To add colors, expand the \u201cLevel Colors\u201d properties and select a color.<\/li>\n<li>Do this to change the color of each node.<\/li>\n<li>All changes are automatically saved.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/add-colors-after-applying-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/add-colors-after-applying-data-cleansing-techniques.jpg\" alt=\"Add Colors After Applying Data Cleansing Techniques\" width=\"181\" \/><\/a><\/div>\n<ul>\n<li>Your final visualization will look like the one below.<\/li>\n<\/ul>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/final-data-cleansing-techniques.jpg\"><img decoding=\"async\" class=\"alignnone size full wp image 4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2024\/05\/final-data-cleansing-techniques.jpg\" alt=\"Final Data Cleansing Techniques\" width=\"650\" \/><\/a><\/div>\n<div style=\"text-align: center;\"><a href=\"https:\/\/chartexpo.com\/utmAction\/MTYrYmxvZytwYitjZXhwbytQQkk1NjIrU2Fua2V5Kw==\" target=\"_blank\" rel=\"noopener noreferrer nofollow\"><img decoding=\"async\" class=\"alignnone size-full wp-image-4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/04\/CTA-in-power-bi.jpg\" alt=\"\" width=\"205\" height=\"113\" \/><\/a> <a href=\"https:\/\/chartexpo.com\/utmAction\/MTYrYmxvZytncytjZXhwbytDRTU2Mis=\" target=\"_blank\" rel=\"noopener noreferrer nofollow\"><img decoding=\"async\" class=\"alignnone size-full wp-image-4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/04\/CTA-in-google-sheets.jpg\" alt=\"\" width=\"205\" height=\"113\" \/><\/a> <a href=\"https:\/\/chartexpo.com\/TrafficTracker\/MTYrYmxvZytzZStjZXhwbytDRTU2Mis=\" target=\"_blank\" rel=\"noopener noreferrer nofollow\"><img decoding=\"async\" class=\"alignnone size-full wp-image-4345\" src=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2023\/04\/CTA-in-microsoft-excel.jpg\" alt=\"\" width=\"205\" height=\"113\" \/><\/a><\/div>\n<h4>Insights<\/h4>\n<p>The dataset above was gathered from an online based platform for ordering food. It includes different characteristics linked to occupation, family size, feedback, and more.<\/p>\n<p>This dataset is used to examine how demographic and location factors relate to online food ordering habits.<\/p>\n<ul>\n<li>At Level 1, the sales are based on educational qualifications. Postgraduates account for 43.76%, graduates for 45.67%, and Ph.D. holders for 6.40%. School-educated individuals account for 3.52%, and uneducated individuals account for 0.65%.<\/li>\n<li>At Level 2, sales for females account for 42%, while sales for males make up 58%.<\/li>\n<li>At Level 3, the sales data is based on marital status. 65.38% of clients are classified as single, while 31.28% are considered married. The remaining 3.36% of the customers have not disclosed their marital status.<\/li>\n<li>At Level 4, sales are classified based on customer income. The customer segment with no income constitutes 45% of the total. Those earning between 25,001 and 50,000 constitute 19%. Customers with an income between 10,001 and 25,000 make up 12% of the total. Those with an income below 10,000 accounts for 6%. The remaining 18% are customers with incomes exceeding 50,000.<\/li>\n<li>At Level 5, sales are occupation-based. Students accounted for 49%, employees 33%, self-employed individuals 15%, and housewives 3%.<\/li>\n<\/ul>\n<h2 id=\"challenges-in-data-cleansing\">Challenges in Data Cleansing<\/h2>\n<h3 data-start=\"138\" data-end=\"174\"><strong data-start=\"142\" data-end=\"172\">1. <\/strong>Handling Large Datasets<\/h3>\n<p data-start=\"175\" data-end=\"307\">Cleaning massive amounts of data requires significant time and computing power, making the process complex and resource-intensive.<\/p>\n<h3 data-start=\"309\" data-end=\"357\">2. Identifying and Removing Duplicates<\/h3>\n<p data-start=\"358\" data-end=\"494\">Duplicate records often appear in different formats, making it difficult to detect and merge them without losing critical information.<\/p>\n<h3 data-start=\"496\" data-end=\"548\">3. Dealing with Missing or Incomplete Data<\/h3>\n<p data-start=\"549\" data-end=\"674\">Missing values can lead to biased analysis. Deciding whether to remove, fill, or predict missing data is a major challenge.<\/p>\n<h3 data-start=\"676\" data-end=\"729\">4. Ensuring Data Consistency Across Sources<\/h3>\n<p data-start=\"730\" data-end=\"891\">Merging data from multiple sources often results in inconsistencies in formats, naming conventions, and data structures, requiring additional cleaning efforts.<\/p>\n<h3 data-start=\"893\" data-end=\"941\">5. Maintaining Data Accuracy Over Time<\/h3>\n<p data-start=\"942\" data-end=\"1064\">Even after cleansing, data quality can degrade due to outdated information, requiring continuous monitoring and updates.<\/p>\n<h2 id=\"benefits-of-data-cleansing-techniques\">Benefits of Data Cleansing Techniques<\/h2>\n<p>In this section, we delve into the transformative benefits of data cleansing techniques. We explore how they empower organizations to extract actionable insights from their data assets.<\/p>\n<p>Here are some key advantages of implementing data cleaning techniques:<\/p>\n<ul>\n<li>\n<h3>Boosting the Accuracy and Reliability of Your Data<\/h3>\n<\/li>\n<\/ul>\n<p>Power BI provides a variety of tools and techniques to identify and rectify data errors. Whether it&#8217;s incorrect values, misspellings, or typos, these tools can help you get rid of them. This boosts the accuracy and reliability of your data.<\/p>\n<ul>\n<li>\n<h3>Better Data Integration<\/h3>\n<\/li>\n<\/ul>\n<p>Clean data is easier to integrate across different systems and platforms. This facilitates seamless data exchange and interoperability between various applications within an organization.<\/p>\n<ul>\n<li>\n<h3><strong>Aligned Decision Making<\/strong><\/h3>\n<\/li>\n<\/ul>\n<p>Teams can make informed decisions based on a shared understanding of clean and accurate data. This alignment ensures that everyone is working towards common goals and objectives.<\/p>\n<ul>\n<li>\n<h3>Improved Data Security<\/h3>\n<\/li>\n<\/ul>\n<p>Data cleansing often involves identifying and removing redundant or obsolete data. This reduces the risk of data breaches and unauthorized access. By maintaining a clean data environment, organizations can enhance data security and protect sensitive information.<\/p>\n<ul>\n<li>\n<h3>Compliance and Regulatory Requirements<\/h3>\n<\/li>\n<\/ul>\n<p>Many industries have strict compliance and regulatory requirements regarding data accuracy and privacy. Data cleansing helps organizations ensure compliance with these regulations by maintaining accurate and secure data.<\/p>\n<h2 id=\"tips-for-data-cleansing-process\">Tips For Using Data Cleansing Process<\/h2>\n<h3 data-start=\"58\" data-end=\"96\">1. Regularly Audit Your Data<\/h3>\n<p data-start=\"97\" data-end=\"267\">Frequent audits help identify errors, duplicates, and inconsistencies before they impact decision-making. Schedule periodic reviews to maintain clean and reliable data.<\/p>\n<h3 data-start=\"269\" data-end=\"315\">2. Use Automation for Data Cleansing<\/h3>\n<p data-start=\"316\" data-end=\"497\">Leverage AI-driven tools and scripts to streamline the cleansing process. Automated solutions help remove duplicates, standardize formats, and validate missing values efficiently.<\/p>\n<h3 data-start=\"499\" data-end=\"536\">3. Standardize Data Formats<\/h3>\n<p data-start=\"537\" data-end=\"703\">Ensure consistency in data formats, such as dates, currencies, and text capitalization. This minimizes discrepancies and enhances data integration across platforms.<\/p>\n<h3 data-start=\"705\" data-end=\"742\">4. Remove Duplicate Entries<\/h3>\n<p data-start=\"743\" data-end=\"910\">Duplicates distort analysis and confuse. Use built-in deduplication features in tools like Excel, Power BI, or Salesforce to eliminate redundant records.<\/p>\n<h3 data-start=\"912\" data-end=\"946\">5. Validate Data Sources<\/h3>\n<p data-start=\"947\" data-end=\"1114\">Cross-check data from multiple sources before using it for analysis. Implement validation rules to prevent incorrect or incomplete entries from entering your system.<\/p>\n<h2 id=\"data-cleansing-techniques-faqs\">FAQs About Data Cleansing Techniques<\/h2>\n<h3>What are Data Cleansing Examples?<\/h3>\n<p>Data cleansing involves a range of tasks aimed at improving the quality and dependability of datasets.<\/p>\n<p>Examples of data cleansing include:<\/p>\n<ul>\n<li>Elimination of duplicates<\/li>\n<li>Standardization of data formats<\/li>\n<li>Correction of typographical errors and misspellings<\/li>\n<li>Handling of missing values<\/li>\n<li>Validation of data for accuracy and integrity<\/li>\n<li>Identification and removal of outliers to prevent skewed analysis<\/li>\n<li>Elimination of redundant information<\/li>\n<li>Cross-referencing of data with external sources for validation<\/li>\n<li>Cleaning of textual data<\/li>\n<\/ul>\n<h3>What are the Three Points to the Cleansing of Data?<\/h3>\n<p>Data cleansing involves three critical points: accuracy, completeness, and consistency.<\/p>\n<p>Accuracy entails the elimination of errors, inconsistencies, and duplicates. This ensures the data is reliable for decision-making and analysis.<\/p>\n<p>Completeness ensures that a dataset has all the necessary information. This information includes the missing values to provide a comprehensive view of the subject matter.<\/p>\n<p>Consistency focuses on standardizing data formats, resolving variations in entries, and promoting uniformity across different sources. This enables seamless integration and reliable analysis.<\/p>\n<p>Addressing these aspects can improve the quality and reliability of the data. This enables organizations to gain valuable insights and make informed decisions.<\/p>\n<h4 id=\"wrap-up\">Wrap Up<\/h4>\n<p>Data cleansing is an essential process for ensuring data quality, accuracy, and reliability.<\/p>\n<p>Businesses increasingly rely on data-driven insights for critical decisions. The importance of clean and trustworthy data, therefore, cannot be overstated.<\/p>\n<p>This article discusses data cleansing techniques that can be used to transform raw data into valuable assets. These include:<\/p>\n<ul>\n<li>Handling missing data,<\/li>\n<li>Removing duplicates,<\/li>\n<li>Standardizing formats,<\/li>\n<li>and Addressing outliers.<\/li>\n<\/ul>\n<p>Power BI provides the intuitive Power Query Editor to help you streamline the data cleansing process.<\/p>\n<p>After cleaning our data via Power Query, we then used it in a Sankey diagram.<\/p>\n<p>We hope that these data-cleaning techniques will empower you to work with accurate and reliable data. Better still, make decisions based on high-quality data and keep everyone moving in the same direction.<\/p>\n","protected":false},"excerpt":{"rendered":"<p><p>Uncover essential Data Cleansing Techniques to boost accuracy for analysis. Get tips on transforming data types, managing outliers, and standardizing formats.<\/p>\n&nbsp;&nbsp;<a href=\"https:\/\/chartexpo.com\/blog\/data-cleansing-techniques\"><\/a><\/p>","protected":false},"author":1,"featured_media":47517,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1017],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\r\n<title>Power BI Data Cleansing Techniques: Raw Data to Insights -<\/title>\r\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\r\n<link rel=\"canonical\" href=\"https:\/\/chartexpo.com\/blog\/data-cleansing-techniques\" \/>\r\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\r\n<meta name=\"twitter:title\" content=\"Power BI Data Cleansing Techniques: Raw Data to Insights -\" \/>\r\n<meta name=\"twitter:description\" content=\"Uncover essential Data Cleansing Techniques to boost accuracy for analysis. Get tips on transforming data types, managing outliers, and standardizing formats.\" \/>\r\n<meta name=\"twitter:image\" content=\"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2025\/02\/feature-ce562-200x200-1.jpg\" \/>\r\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"25 minutes\" \/>\r\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Power BI Data Cleansing Techniques: Raw Data to Insights -","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/chartexpo.com\/blog\/data-cleansing-techniques","twitter_card":"summary_large_image","twitter_title":"Power BI Data Cleansing Techniques: Raw Data to Insights -","twitter_description":"Uncover essential Data Cleansing Techniques to boost accuracy for analysis. Get tips on transforming data types, managing outliers, and standardizing formats.","twitter_image":"https:\/\/chartexpo.com\/blog\/wp-content\/uploads\/2025\/02\/feature-ce562-200x200-1.jpg","twitter_misc":{"Written by":"admin","Est. reading time":"25 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/chartexpo.com\/blog\/data-cleansing-techniques","url":"https:\/\/chartexpo.com\/blog\/data-cleansing-techniques","name":"Power BI Data Cleansing Techniques: Raw Data to Insights -","isPartOf":{"@id":"http:\/\/localhost\/blog\/#website"},"datePublished":"2025-02-18T13:12:41+00:00","dateModified":"2026-01-29T12:17:16+00:00","author":{"@id":"http:\/\/localhost\/blog\/#\/schema\/person\/6aceeb7c948a3f66ff6439ce5c24a280"},"breadcrumb":{"@id":"https:\/\/chartexpo.com\/blog\/data-cleansing-techniques#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/chartexpo.com\/blog\/data-cleansing-techniques"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/chartexpo.com\/blog\/data-cleansing-techniques#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"http:\/\/localhost\/blog"},{"@type":"ListItem","position":2,"name":"Power BI Data Cleansing Techniques: Raw Data to Insights"}]},{"@type":"WebSite","@id":"http:\/\/localhost\/blog\/#website","url":"http:\/\/localhost\/blog\/","name":"","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"http:\/\/localhost\/blog\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"http:\/\/localhost\/blog\/#\/schema\/person\/6aceeb7c948a3f66ff6439ce5c24a280","name":"admin","url":"https:\/\/chartexpo.com\/blog\/author\/admin"}]}},"_links":{"self":[{"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/posts\/35247"}],"collection":[{"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/comments?post=35247"}],"version-history":[{"count":25,"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/posts\/35247\/revisions"}],"predecessor-version":[{"id":58426,"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/posts\/35247\/revisions\/58426"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/media\/47517"}],"wp:attachment":[{"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/media?parent=35247"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/categories?post=35247"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/chartexpo.com\/blog\/wp-json\/wp\/v2\/tags?post=35247"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}