Management

Removing Duplicates in Excel and Google Sheets: 5 Easy Steps

Removing Duplicates in Excel and Google Sheets: 5 Easy Steps

Excel and Google Sheets: A Free Course for Everyone!

Learn More

How to Effectively Find and Process Duplicates in Excel

Before Deleting Duplicate Records Excel's Find and Highlight Duplicates feature is helpful. This tool helps you effectively analyze data and make informed decisions about which records to keep and which to delete. Proper use of this feature improves data quality and facilitates more accurate analysis.

Let's look at the process using a specific example. This will allow us to better understand all the steps and details, as well as illustrate the main points. Using a practical example will make the information more accessible and understandable, which will help you better absorb the material. We will discuss each stage so that you can fully understand the process.

The company has an employee table created from multiple data sources. This may result in duplicate employee names. Data analysis and cleansing are necessary to eliminate duplicate records and ensure that each record in the table is unique. This will help improve personnel management and increase the efficiency of working with employee data.

Source table with a list of employees. Screenshot: Excel / Skillbox Media

To identify duplicate names, use conditional formatting. This feature allows you to easily highlight duplicates in your list, greatly simplifying data analysis. By setting up conditional formatting, you can visually identify duplicate entries, which will help avoid errors and improve the accuracy of your analysis. Proper use of this feature promotes more efficient work with data and improves the quality of your reports.

First, select the range to be checked; in this case, it's the entire column of last names. On the Excel main tab, go to the "Conditional Formatting" menu. In the list that appears, select "Highlight Cells Rules," then click "Duplicate Values." This feature will allow you to quickly identify duplicates in your list of last names, which will help improve data quality and avoid errors in future work.

Selecting a range and accessing the Conditional Formatting menu. Screenshot: Excel / Skillbox Media
Setting up a range for formatting. Screenshot: Excel / Skillbox Media

In the window that appears, you can configure the data formatting parameters. Select the format to highlight duplicate values ​​so Excel highlights all last names that appear more than once. You can also change the fill color to increase their visibility. This will allow you to quickly identify duplicate records and improve your data management in Excel.

Once you have finished setting up the options, click OK to apply the changes.

Checking the formatting settings. Screenshot: Excel / Skillbox Media

After applying the format in Excel, all duplicate last names will be highlighted in red, while unique values ​​will remain unchanged. This allows you to quickly identify records that need to be deleted. Now you can effectively manage your data, improving its quality and eliminating duplicate information.

Result of highlighting duplicate values. Screenshot: Excel / Skillbox Media

To remove formatting, open the "Conditional Formatting" menu, select "Remove Rules," and then click "Remove Rules from Selected Cells." This will clear the cells of all applied styles and return them to their original formatting.

Removing formatting of duplicate values. Screenshot: Excel / Skillbox Media

Highlighting is performed for both duplicate and original values, which complicates the process of manual duplicate removal. It is important to ensure that only unique records remain in the table. To optimize this process, we recommend using the bulk duplicate removal tool, which will be described in the next section. Automation will significantly save time and reduce the likelihood of errors when working with data.

Training at Skillbox is an excellent option for those aspiring to become an Excel expert. The course program covers all key aspects of working with this powerful spreadsheet, from the basics to complex analytical tools. At Skillbox, you will gain the practical skills necessary to effectively use Excel in various fields, including business analysis and financial planning. A convenient training format and access to up-to-date materials will help you quickly master the necessary knowledge and become a sought-after specialist in the job market. Take a step towards a successful career with Excel courses from Skillbox.

  • Course "Excel + Google Sheets: From Beginner to Professional"
  • Course "Excel for Work and Personal Tasks"
  • Free online intensive "Excel Express Course: Mastering Spreadsheets in 3 Days"
  • Free video lessons on Excel and Google Sheets

Effective Ways to Remove Duplicates in Excel

In this article, we will discuss two methods for removing duplicates in Excel: the first is removing duplicate values ​​in one column, and the second is processing multiple columns simultaneously. Mastering these methods will allow you to organize your data more effectively and minimize the risk of confusion in your spreadsheets. Understanding how to properly remove duplicates will help you maintain the cleanliness and accuracy of your Excel data.

To begin removing duplicates in a table, select the data range from which you want to remove duplicate entries. For example, this could be a column of employee names. Then, go to the "Data" tab and select "Remove Duplicates." This process will help you remove unnecessary information from your table and improve its quality. Removing duplicates will simplify data analysis and improve their presentation.

Removing duplicates from one table column. Screenshot: Excel / Skillbox Media

In the menu that opens, select the columns to check for duplicates in Excel. In this case, since you only need to check one column, just click "OK" without changing the settings.

Selecting a column to search for duplicates. Screenshot: Excel / Skillbox Media

Excel will provide information about the number of duplicates detected and removed, as well as the number of remaining unique values. Review this information and click OK.

Information about deleted and remaining values. Screenshot: Excel / Skillbox Media

Only unique values ​​remain in your table, and cells with duplicate last names have been removed. It's important to note that Excel only removed last names that were duplicates, while leaving all unique records. This way, you can easily analyze data without unnecessary duplicates, which greatly simplifies working with the table.

Result of removing duplicates: before → after. Screenshot: Excel / Skillbox Media

In some situations, simply removing duplicates from a single column is not enough. For example, a list of cars may contain duplicate records by make and model, but with differences in color and year. In such cases, it is important to consider all parameters to avoid losing unique information. Effective data management requires deeper analysis and processing to preserve all the significant characteristics of each record and ensure information accuracy.

To remove duplicates in a table, you need to select multiple columns at once. To do this, select any cell in the table, go to the "Data" tab, and click the "Remove Duplicates" option. This process will allow you to effectively cleanse your data of duplicate records, which will improve the quality and accuracy of your information.

Select all the necessary columns in the menu for analysis. This will allow you to more accurately remove duplicates and optimize your data.

When using Excel to work with multiple columns, keep in mind that the program will only delete those rows whose values ​​match in all selected columns. This is important to keep in mind when processing data to avoid losing necessary information.

Removing duplicates in all columns of a table. Screenshot: Excel / Skillbox Media

If the car values ​​match in all respects, including brand, color, year, and price, then Excel will delete the row. Otherwise, if at least one of the parameters differs, both records will remain in the table. This allows you to effectively manage vehicle data and eliminate duplicate entries when analyzing information.

Result of removing duplicates in all columns. Screenshot: Excel / Skillbox Media

Let's change the duplicate search criteria by excluding the color column. This will allow Excel to ignore color characteristics when analyzing the data. This way, we will focus only on the contents of the cells, which will increase the accuracy of the duplicate search and simplify the information processing.

Changing the duplicate search conditions. Screenshot: Excel / Skillbox Media

After making changes to the duplicate search settings, several more records will disappear from the table. This highlights the importance of properly configuring the settings for effective duplicate management. Proper configuration not only optimizes the process but also improves data quality, which in turn facilitates more accurate analysis and decision-making.

Result of deleting duplicates taking into account the new conditions. Screenshot: Excel / Skillbox Media

Efficient Ways to Hide Duplicates in Excel Without Deleting Them

If you're concerned about losing important content when deleting duplicate data in Excel, there's an effective alternative: hiding duplicate values. This method preserves all cells and rows in a worksheet, allowing you to restore their visibility at any time. Hiding duplicates not only helps organize your data but also preserves the integrity of the information for further analysis and work with the table.

To get started, select any cell in the table and go to the "Data" tab. Then, click the "Filter" button to activate the data filtering feature. This will allow you to sort and find the information you need in the table more effectively. Data filtering makes it much easier to work with large amounts of information, allowing you to focus on the most important data.

Turn on table filtering. Screenshot: Excel / Skillbox Media

Select the "Advanced" option. In the dialog box that opens, check that your table range is correctly specified in the "List Range" line. Make sure the "Unique records only" checkbox is checked, then click "OK."

Selecting options for hiding duplicates. Screenshot: Excel / Skillbox Media
Settings for hiding duplicates. Screenshot: Excel / Skillbox Media

Excel has completed the data analysis and successfully removed duplicates, which can be seen by the changed row numbering.

Duplicates are hidden, but not deleted. Screenshot: Excel / Skillbox Media

To reveal hidden rows in your document, hover your cursor over the bold line separating the row numbers, for example, between rows 9 and 12. Hold down the left mouse button and drag the cursor down to reveal the hidden rows one by one. This simple method allows you to quickly and easily reveal the information you need in your text.

To restore all data to its original state, click the "Filter" button in the menu again. This will reset all applied filters and return the data to its original state.

Efficiently Removing Duplicates in Google Sheets

Removing duplicates in Google Sheets is similar to the process in Excel, making it intuitive for users. To begin, select the cell in the table from which you want to remove duplicate data. Then, go to the "Data" menu and select "Remove Duplicates". This will clear the table of redundant information, keeping only unique records. Make sure to check all the necessary columns to achieve the best result. Removing duplicates in Google Sheets is a simple and effective way to improve the quality of your data and simplify information analysis. Open the "Data" menu and select "Data Cleanup," then click "Remove Duplicates." This will allow you to effectively process your data and identify duplicates. Removing duplicate records will help improve the quality of your data and increase its accuracy.

Removing duplicates by selected columns in Google Sheets. Screenshot: Google Sheets / Skillbox Media
Selecting columns to remove duplicates. Screenshot: "Google Sheets" / Skillbox Media

In the window that opens, you will be asked to select columns to analyze for duplicates. After selecting the required columns, click the "Remove Duplicates" button to complete the duplicate removal process. This step will help you clean up your data and improve its quality.

The result of successful removal of duplicates in the table. Screenshot: Google Sheets / Skillbox Media

It's important to emphasize that the Google Sheets algorithm only deletes rows if the values ​​match in all selected columns at the same time. This protects unique data from accidental deletion and allows users to manage their spreadsheets more precisely. Using this algorithm makes the data management process more reliable and predictable.

If your spreadsheet exceeds the limits of Excel or Google Sheets and duplicates need to be removed, this can become a serious problem, especially when collecting data from various sources. It is important to know how to effectively manage redundant data to improve its quality and manageability.

There are several methods that can help you remove duplicates and optimize your spreadsheet. One of the most common ways is to use specialized tools and programs, such as Microsoft Access or SQL databases, which are capable of processing large volumes of information.

You can also use online data cleansing services to quickly and efficiently remove duplicate records. Don't forget about using scripts in programming languages ​​such as Python to automate the duplicate removal process.

It's important to remember that before removing duplicates, it's recommended to back up your data to avoid losing important information. Properly organizing and cleaning your data will help you create a more manageable and efficient spreadsheet.

In such situations, it's a good idea to use the Power Query add-in in Excel. Power Query allows you to extract data from multiple sources, process it, including removing duplicates, and then import it into a format convenient for analysis. One of the main advantages of Power Query is the unlimited data volume, making it an excellent tool for working with large data sets. Using Power Query significantly simplifies data processing and improves the efficiency of your analytical work.

For a deeper understanding of Power Query's functionality, we strongly recommend reading our detailed instructions on how to use it. This will significantly optimize your work with large volumes of data and increase the efficiency of analysis.

Excel and Google Sheets: from beginner to PRO in 30 days

Want to become a PRO in Excel and Google Sheets? Learn how to automate your work and create reports effectively! Read the article.

Find out more