manipulating data in excel

%3CLINGO-SUB%20id%3D%22lingo-sub-2645545%22%20slang%3D%22en-US%22%3Emanipulating%20data%20in%20excel%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-2645545%22%20slang%3D%22en-US%22%3E%3CP%3EHi%2C%26nbsp%3B%3CBR%20%2F%3EI%20have%20a%20csv%20file%20where%20in%20column%20A%20there%20are%20IDs%2C%20and%20in%20column%20B%20there%20are%20associated%20data.%20The%20data%20is%20separated%20with%20a%20semicolon%20(%3B).%20And%20the%20hardest%20bit%20is%20if%20one%20data%20point%20shares%20the%20same%20category%20with%20another%20data%2C%20then%20their%20category%20name%20precedes%20it%20with%20a%20colon%20(%3A)%20between%20them.%20So%20something%20like%20this%3A%3C%2FP%3E%3CP%3Ecolumn%20A%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3BColumn%20B%3C%2FP%3E%3CP%3EthisisID%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3Babc%3Bcat1%3Axyz%3Bcat1%3Ayyy%2Ccat1%3Azzz%3Bcat2%3Addd%2Ccat2%3Aeee%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EI%20want%20to%20transform%20this%20format%20to%20another%20format%20where%2C%3C%2FP%3E%3CP%3EColumn%20A%20(ID)%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20Column%20B%20(abc)%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20Column%20C%20(cat1)%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20Column%20D%20(cat2)%3C%2FP%3E%3CP%3EthisisID%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20abc%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3Bxyz%7Cyyy%7Czzz%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20%26nbsp%3B%20ddd%7Ceee%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EIs%20there%20any%20formula%20to%20change%20the%20format%20to%20the%20new%20format%3F%20Maybe%20this%20is%20something%20that%20a%20formula%20cannot%20solve.%20Any%20advise%2Fhelp%20is%20appreciated.%26nbsp%3B%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3C%2FLINGO-BODY%3E%3CLINGO-LABS%20id%3D%22lingo-labs-2645545%22%20slang%3D%22en-US%22%3E%3CLINGO-LABEL%3EFormulas%20and%20Functions%3C%2FLINGO-LABEL%3E%3CLINGO-LABEL%3EMacros%20and%20VBA%3C%2FLINGO-LABEL%3E%3C%2FLINGO-LABS%3E%3CLINGO-SUB%20id%3D%22lingo-sub-2650164%22%20slang%3D%22en-US%22%3ERe%3A%20manipulating%20data%20in%20excel%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-2650164%22%20slang%3D%22en-US%22%3EHow%20many%20rows%20of%20data%20do%20you%20have%20now%2C%20and%20what%20is%20the%20anticipated%20number%20of%20rows%20of%20data%20will%20you%20have%20in%20the%20future%20and%20will%20you%20be%20summarizing%20the%20data%20now%20or%20in%20the%20future%20by%20category%20and%20their%20associated%20values%20now%20or%20in%20the%20future%3C%2FLINGO-BODY%3E
New Contributor

Hi, 
I have a csv file where in column A there are IDs, and in column B there are associated data. The data is separated with a semicolon (;). And the hardest bit is if one data point shares the same category with another data, then their category name precedes it with a colon (:) between them. So something like this:

column A         Column B

thisisID             abc;cat1:xyz;cat1:yyy,cat1:zzz;cat2:ddd,cat2:eee

 

I want to transform this format to another format where,

Column A (ID)        Column B (abc)          Column C (cat1)        Column D (cat2)

thisisID                    abc                             xyz|yyy|zzz                ddd|eee

 

Is there any formula to change the format to the new format? Maybe this is something that a formula cannot solve. Any advise/help is appreciated. 

 

3 Replies
How many rows of data do you have now, and what is the anticipated number of rows of data will you have in the future and will you be summarizing the data now or in the future by category and their associated values now or in the future
select comun B and use the text to columns function (using ; as the delimiter), which should separate it out into separate columns. Then, if you still need to change : with | (or anything else), select those columns and use the 'find and replace' function. hope this helps.
.....don't forget to use 'save as' and change document type to an excel format, rather than .csv