Identifying Keywords in a Long List of Descriptions Shortcut


Hi everyone, I have 101,000 descriptions and 147 keywords, and I am searching the descriptions for those keywords to narrow down a target list. Column 1 contains the number that identifies each description and column 2 contains the description itself. I am trying to build a list so that, for a keyword such as dog, the numbers of all descriptions containing the word dog are returned. So far I have used the formula below, which works: =INDEX(the column with the numbers that correspond to the descriptions, SMALL(IF(ISNUMBER(SEARCH("dog (the keyword)", the descriptions column)), MATCH(ROW(the descriptions column), ROW(the descriptions column))), ROWS($A$1:A1))). My computer cannot cope with that many calculations across 101k descriptions. It stops responding. Does anyone have an alternative, or some advice that would speed up the calculation or solve this in fewer steps?

3 Replies

@NathanJB665 Perhaps Power Query is better suited to a data set of this size. I attached a small demo. Have you used PQ before?

 

Test it by expanding the table on the left with some of your own descriptions and the one on the right with real keywords. But don't change the column headers or table names. Now, from the Data ribbon select Refresh All. Does it work for you?
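
The attached demo is not reproduced in this thread, so the query below is only a rough sketch of how a keyword match could be set up in Power Query, not the demo itself. It assumes two worksheet tables with placeholder names: Descriptions (columns ID and Description) and Keywords (column Keyword). It also assumes every description and keyword is non-blank text. Adjust the names to match the actual workbook.

```
let
    // Assumed table and column names; they are placeholders, not taken from the demo.
    Descriptions = Excel.CurrentWorkbook(){[Name = "Descriptions"]}[Content],
    Keywords = Excel.CurrentWorkbook(){[Name = "Keywords"]}[Content],

    // Hold the keyword list in memory so it is not re-read for every description row.
    KeywordList = List.Buffer(Keywords[Keyword]),

    // For each description, collect the keywords it contains (case-insensitive).
    // Assumes Description is never null; a null value would raise an error here.
    AddMatches = Table.AddColumn(
        Descriptions,
        "Keyword",
        (row) => List.Select(
            KeywordList,
            (k) => Text.Contains(row[Description], k, Comparer.OrdinalIgnoreCase)
        )
    ),

    // One output row per (description, keyword) pair.
    Expanded = Table.ExpandListColumn(AddMatches, "Keyword"),

    // Descriptions with no matching keyword expand to a null row; drop those.
    Matches = Table.SelectRows(Expanded, each [Keyword] <> null),

    // Keep only the keyword and the number that identifies the description.
    Result = Table.SelectColumns(Matches, {"Keyword", "ID"})
in
    Result
```

Loaded to a sheet, a result like this can be filtered or sorted by keyword, and it is only recalculated when the query is refreshed rather than on every worksheet change. Like SEARCH, Text.Contains matches substrings, so dog would also match dogma.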

@Riny_van_Eekelen 

Thank you so much for getting back to me. I was not aware of Power Query and have never used it before. Would you mind letting me know how you would go about using it? Thanks again.

@NathanJB665 It will take a bit of getting used to. The link below is a good starting point, but there are many others online.

 

https://exceloffthegrid.com/power-query-introduction/