Comparing two large excel sheets

%3CLINGO-SUB%20id%3D%22lingo-sub-2861624%22%20slang%3D%22en-US%22%3EComparing%20two%20large%20excel%20sheets%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-2861624%22%20slang%3D%22en-US%22%3E%3CP%3EI%20have%20two%20large%20excel%20files.%26nbsp%3B%20I%20have%20to%20to%20combine%20the%20two%20files%20to%20see%20who%20is%20missing%20from%20the%20first%20file%20to%20the%20second%20file.%26nbsp%3B%20The%20files%20are%20mostly%20the%20same%20besides%20a%20handful%20of%20differences.%26nbsp%3B%20I%20cannot%20seem%20to%20find%20a%20way%20to%20isolate%20out%20the%20two%20files%20to%20only%20show%20who%20is%20missing%20from%20the%20first%20one%20compared%20to%20the%20second.%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EI%20can%20get%20the%20system%20to%20Highlight%20the%20duplicates%2C%20which%20is%20a%20step%20in%20the%20right%20direction%20BUT%20I%20dont%20want%20to%20see%20the%20duplicates%20and%20the%20list%20is%20over%2060k%20so%20its%20a%20nitemare%20to%20try%20and%20look%20at%20manually.%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EPlease%20help!!!%20%3CLI-EMOJI%20id%3D%22lia_slightly-smiling-face%22%20title%3D%22%3Aslightly_smiling_face%3A%22%3E%3C%2FLI-EMOJI%3E%3C%2FP%3E%3C%2FLINGO-BODY%3E%3CLINGO-LABS%20id%3D%22lingo-labs-2861624%22%20slang%3D%22en-US%22%3E%3CLINGO-LABEL%3EExcel%3C%2FLINGO-LABEL%3E%3C%2FLINGO-LABS%3E%3CLINGO-SUB%20id%3D%22lingo-sub-2861922%22%20slang%3D%22en-US%22%3ERe%3A%20Comparing%20two%20large%20excel%20sheets%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-2861922%22%20slang%3D%22en-US%22%3E%3CP%3E%3CA%20href%3D%22https%3A%2F%2Ftechcommunity.microsoft.com%2Ft5%2Fuser%2Fviewprofilepage%2Fuser-id%2F1189910%22%20target%3D%22_blank%22%3E%40Gluksker%3C%2FA%3E%26nbsp%3B%3C%2FP%3E%3CP%3EHey%2C%3CBR%20%2F%3Ethere%20are%20multiple%20ways%20to%20compare%20two%20files.%20But%20it%20would%20require%20some%20additional%20information%20two%20find%20a%20possible%20solution.%3C%2FP%3E%3CP%3EIf%20for%20example%20you%20have%20a%20unique%20identifier%20(e.g.%20an%20ID)%2C%20you%20could%20use%20a%20%3DUNIQUE()%20function%20on%20the%20column%20and%20use%20%3DCOUNTIF()%20to%20count%20the%20occurrences%20for%20each%20ID%20in%20%3CSTRONG%3Ea%20separate%20file%20or%20worksheet%3C%2FSTRONG%3E%20for%20both%20files%20and%20compare%20the%20results.%20Based%20on%20this%20comparison%20you%20at%20least%20would%20know%20which%20rows%20are%20missing.%3CBR%20%2F%3E%3CBR%20%2F%3E%3CBR%20%2F%3E%3C%2FP%3E%3C%2FLINGO-BODY%3E%3CLINGO-SUB%20id%3D%22lingo-sub-2862360%22%20slang%3D%22en-US%22%3ERe%3A%20Comparing%20two%20large%20excel%20sheets%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-2862360%22%20slang%3D%22en-US%22%3ESo%20they%20all%20do%20have%20ID%20numbers.%20It%E2%80%99s%20an%20Active%20roster%20sheet.%20So%20I%20get%20the%20first%20sheet%20and%20shows%20me%20all%20active%2C%20when%20I%20get%20the%20next%20sheet%2C%20shows%20me%20all%20active%20again%20but%20will%20now%20have%20people%20missing%20that%20have%20been%20terminated.%20So%20I%20need%20to%20compare%20the%20two%20to%20find%20who%20has%20disappeared%20from%20the%20original%20sheet%20compared%20to%20the%20second%20sheet%3C%2FLINGO-BODY%3E
New Contributor

I have two large excel files.  I have to to combine the two files to see who is missing from the first file to the second file.  The files are mostly the same besides a handful of differences.  I cannot seem to find a way to isolate out the two files to only show who is missing from the first one compared to the second.

 

I can get the system to Highlight the duplicates, which is a step in the right direction BUT I dont want to see the duplicates and the list is over 60k so its a nitemare to try and look at manually.

 

Please help!!!

4 Replies

@Gluksker 

Hey,
there are multiple ways to compare two files. But it would require some additional information two find a possible solution.

If for example you have a unique identifier (e.g. an ID), you could use a =UNIQUE() function on the column and use =COUNTIF() to count the occurrences for each ID in a separate file or worksheet for both files and compare the results. Based on this comparison you at least would know which rows are missing.


So they all do have ID numbers. It’s an Active roster sheet. So I get the first sheet and shows me all active, when I get the next sheet, shows me all active again but will now have people missing that have been terminated. So I need to compare the two to find who has disappeared from the original sheet compared to the second sheet

@Gluksker Hey again, 

 

in that case you can use a lookup function, e.g.: 

ID - Sheet 1

CheckID - Sheet 2
1=XLOOKUP(A2;C:C;C:C;"Missing")2
223
335
444
556
667
77 
8Missing 
9Missing 
10Missing 

 

But depending on your machine, this can take some time, or could lead the file to crash.

@Gluksker I'd look into Power Query (as long as you are not on a Mac). Ideal to connect to large data sets without loading everything into a workbook and merge these data sets mimicking VLOOKUP, with the possibility to not only find matching records, but also find the ones that don't. Would be helpful if you could share a representative sample of your data sets to demonstrate it.