Formula challenge: re-shape and cleanup

= LET( Fnλ, HSTACK(FirstWordλ,FinalWordλ,Identityλ,Identityλ), baseTbl, WRAPROWS(TOCOL(alumni,1,1),3), extendedTbl, CHOOSECOLS(baseTbl,{1,1,2,3}), u, SEQUENCE(ROWS(extendedTbl),,1,0), Fnsλ, CHOOSEROWS(Fnλ,u), processedTbl, MAP(Fnsλ,extendedTbl,Applyλ), NoBellλ(processedTbl) )

mtarler
Silver Contributor
Jun 02, 2022
Speaking of how people would do the removal of 'bellman' records I noticed in a couple of solution I thought it could be more efficient if that step was moved up earlier. If I read the code right, it looked like it could be done before the splitting of the first and last name in some cases which means that name splitting step would act on a slightly smaller set.
- Patrick2788
  Silver Contributor
  Jun 03, 2022
  mtarler
  I've put thought into removing the 'bellman' records as early as possible. The way I designed this problem (Placing blank rows intermittently) it makes it very difficult to pull those records until before using TOCOL and WRAPROWS. I've considered using MAP to pull the records but the blank rows are an issue (and using MAP to cleanup before TOCOL/WRAPROWS is probably not very efficient). I had also considered BYROW to identify the rows to be used in the array but again, removing the blank rows first would have to be done. FILTER appears to be the best option at the moment. Maybe the calculations are a bit quicker if the 'bellman' records are pulled earlier. It would certainly have to be done if the record set was much larger and the 'bellman' records had to be removed to allow HSTACK to run.
  My first draft of this exercise actually included arbitrary header information every thousand rows or so (The sheet would've appeared to have been pasted from a poorly designed Word document). I like creating a puzzle and working towards a solution when it seems impossible. I might revisit that first draft.
  - PeterBartholomew1
    Silver Contributor
    Jun 04, 2022
    If you wish to add to the pain, delete Ron Palmer's email address (row 2) and see what happens
- PeterBartholomew1
  Silver Contributor
  Jun 03, 2022
  mtarler
  It has been an interesting exercise; we have learnt something about the limits of HSTACK, but it has also pushed me into trying strategies that were previously only possibilities. I had to modify your named Lambda to apply it at the start of my formula without invalidating the existing version
  NoBellλ = LAMBDA(arr, col, FILTER(arr, CHOOSECOLS(arr, col) <> "Bellman"));
  Overall, I was pleasantly surprised at how easy it was to return to an existing formula and modify it using modern methods. A traditional formula nested 7-deep would not be fun though, to be fair, there would probably have been a whole raft of helper columns to ease the pain.
  For me, the function that I am most likely to reuse is
  Applyλ = LAMBDA(λ, x, λ(x));
  Lambdas don't have to be long to be useful!
PeterBartholomew1
Silver Contributor
Jun 02, 2022
Patrick2788
SergeiBaklan might know better, but I do not think FILTER is especially slow, given that it evaluates the entire criterion for each row. The conditional aggregation functions are faster because they don't evaluate secondary conditions once the first condition fails. If you are only trying to extract one or two matches XLOOKUP would be a preferred option because of its bisection search which requires order O(ln(n)) steps.

Forum Discussion

Formula challenge: re-shape and cleanup

Resources