Fuzzy Duplicate Finder is a tool for Microsoft Excel that helps you find and correct similar records. The add-in quickly performs approximate match according to the settings you select and changes all typos into the correct equivalents of your choice.
Open the add-in pane by clicking on the Find Fuzzy Duplicates icon in the Duplicate Remover drop-down list on the Ablebits Data tab, Dedupe group:
This tool is also available in the Search group on the Ablebits Tools tab: in the Find and Replace drop-down list, click Search for Typos:
There are some settings that you need to tweak to get more precise results:
Click the Search for typos button and the add-in will perform approximate matching in Excel. All found items will be grouped by duplicates (nodes) in the tool's tree view:
If you have several correct values in your node that differ in case, you may want to keep their original case or make them all look the same.
Tick the Keep case of the same values in the node option at the top to leave the items that differ only in case unchanged.
So, if you have "Street, street, street, and Streett" in the results and you consider only the value as a misprint, not the case, then select this option and the "street" value will keep its case.
If you want to remove an entry from the list of found duplicates, just select it and press the Delete button on your keyboard. Or, you can select several items by holding the Ctrl key and delete them all at once.
You can remove similar data from your Excel worksheet if necessary. Just click on the value you want to delete in the list of found items. The cell with this value will be automatically highlighted in Excel worksheet, so you can select it in your table and press the Delete button on your keyboard.
To see the found similar records in a separate Excel worksheet, click the Export button on the Fuzzy Duplicate Finder pane.
This option may be helpful if you want to see the changes in the nodes before applying them.
To do a new fuzzy match, click New Search at the top of the Fuzzy Duplicate Finder pane.
Responses
I am new to Fuzzy Lookup. I have a single column list of vendors names and some vendors are listed multiple times in various ways. What is the best setting to find similar names. I am wanting it to find from the list below as possible matches:
A1 Golf Cart Leasing
A-1 Industrial Parts and Supplies Inc
A-1 Key Service Inc
A-1 Golf Cart Leasing
A1 Golf Lessons Inc
A-1 Oil Inc
A-1 Pallet Co of Clarksville Inc
A-1 Rental Inc
A-1 Scale Service Inc
A-1 Welding, Inc.
A3 Taxi
AAA - 1 Lock and Key
AAA Business Systems Inc
AAA Cooper Transportation
AAA Galvanizing - Joliet
AAA Pallet & Lumber Co Inc
AAA Pallets & Lumber Company Inc
AAA Quality Services Inc
A1 Golf Cart Leasing an d A-1 Golf Cart Leasing
AAA Pallet & Lumber Co Inc and AAA Pallets & Lumber Company Inc
Hello Debbie,
Thank you for your comment. I am afraid, we can't offer you an easy solution to fulfill your task. Before running our Fuzzy Duplicate Finder tool, we'd recommend making the list of vendor names more convenient to work with. For example, you can remove the substrings before the names using our Remove Characters tool, then get rid of extra spaces in the cells with the help of the Trim Spaces tool. After that, make sure that each cell contains just one vendor name and try to run Fuzzy Duplicate Finder setting the maximum number of different characters from 7 to 10 and check what results you'll get.
Hope these recommendations will turn to be helpful.
Hello, I have a list with 40K rows. Would this fuzzy duplicates tool be able to handle 40K rows?
Hello Julia,
Thank you for your question. Our add-in doesn't have clearly defined limitations on the number of rows with the data to process. However, it may take some time for the add-in to handle 40K rows and the processing speed also depends on such things as the formatting of your data, other processes running at the moment, available system resources, etc.
You can install the trial version of our Ultimate Suite for Excel that contains Fuzzy Duplicate Finder and check how the tool works with your data amount. The trial is fully functional and allows you to test the add-in during 30 days for free. If you are interested, please download from this page.
Feel free to contact us at support@ablebits.com if you have any other questions or difficulties.
hello
i have an excel spreadsheet of customer names and addreses and i need to be able to quickly identify all duplicates. the problem is the data looks like this:
ABC school district 123 main street Cleveland Ohio
ABC Schools 123 Main St Cleveland OH
will your duplicate catcher fuzzy matcher work for these purposes?
Hello Tanya,
Thank you for contacting us.
For us to be able to help you better, please send us a small sample workbook with your source data and the result you expect to get to support@ablebnits.com. I kindly ask you to shorten the table to 10-20 rows.
We'll look into your task and see if our software can help.
Hi Tanya,
Did this helped you ? If not please let me know if there is a way out because I'm stuck with similar kind of situation.
Ex:
Kotak Life Insurance
Kotak Mahindra Life Insurance Company Limited.
I want to merge or remove this kind of duplicates
Hello Ashim,
Thank you for your comment. I'm sorry to tell but the add-in doesn't have the requested functionality.
What is the ideal setting to find typos in a list of company names?
Hello, Anton,
Thank you for using our product and for your question. To find typos in a list of company names, first count the number of characters the company name contains. This will be the number that you should put into the Min number of characters in a word/cell field. And then think of the number of possible different characters that the correct spelling of the company name and the typo may contain. Enter this number into the Max number of different characters field. If the name of the company consists of several words, check the The cells contain separate words delimited by box and in the drop-down list select Space. That’s it! You’ll get a list of the found entries where you can select typos and get them corrected right in the add-in pane.
Please do not hesitate to contact us if you still have questions or doubts.
Thank you.
Post a comment
Seen by everyone, do not publish license keys and sensitive personal info!