A generalisation of the MAP lambda helper function

Discussion topic. Your thoughts are welcome. On Saturday I finally bit the bullet and completed a MAPλ Lambda function that generalises the in-built MAP Lambda helper function. As examples, I t...

MAP AmortisationTables KroneckerMatrixProduct.xlsx29 KB

excel

Formulas and Functions

office 365

PeterBartholomew1

Sep 08, 2024

This is a demonstration of the proposed MAP function used to solve a challenge provided by Oz_Sunray and Wyn Hopkins on data cleansing and vote counting. The key element of my solution is to build a table of validated votes for each voter ID and each candidate (i.e. an array of arrays - much the same as any other problem of interest). The formula is modularised using Lambda functions. They are not that pretty, but neither are they impossible to follow.

"Worksheet formula"
= AggregateVotesλ(
      tblVOTES[VoterID],
      tblVOTES[[Vote1]:[Vote3]],
      weight,
      tblAvailable[AVAILABLE]
  )

which calls

/*  "FUNCTION NAME:  AggregateVotesλ"
    "DESCRIPTION:    Aggregates all votes to candidate colors" */
/*  "REVISIONS:      Date            Developer           Description"
    "                08 Sep 2024     Peter Bartholomew   Original Development"
*/
AggregateVotesλ
= LAMBDA(voterID, votesCast, weight, available,
     LET(
        person, TOROW(UNIQUE(voterID)),
        votes,  MAPλ(person, AssignVotesλ(voterID, votesCast, weight, available)),
        count,  HSTACK(available, BYROW(votes, SUM)),
        return, TAKE(SORT(count, 2, -1), 4),
        return
     )
  );

to build and aggregate the table of votes. This, in turn, calls another Lambda function to clean and validate the votes corresponding to a single voterID

/*  "FUNCTION NAME:  AssignVotesλ"
    "DESCRIPTION:   Validates and assigns votes to candidate colors for a single voterID " */
/*  "REVISIONS:      Date            Developer           Description"
    "                08 Sep 2024     Peter Bartholomew   Original Development"
*/
AssignVotesλ
= LAMBDA(voterID, ballots, weight, available,
    LAMBDA(p,
        LET(
            // "In the case of duplicate ballots return first"
            votesCast, XLOOKUP(p, voterID, ballots),
            // "Remove invalid and repeated votes"
            valid,     COUNTIFS(available, votesCast),
            distinct,  EXPAND(UNIQUE(FILTER(votesCast, valid), TRUE), , 3, ""),
            // "Assign votes to candidate colours"
            assigned,  BYROW((distinct = available) * weight, SUM),
            assigned
        )
    )
);

I took the problem from Wyn's recording

https://www.youtube.com/watch?v=FIcxyLqzWcE

Wyn - Peter - Expert Challenge.xlsx5.2 MB

djclements

Bronze Contributor

Sep 13, 2024

PeterBartholomew1 Once again, great job on producing a generalized solution to the nested arrays issue. Hopefully Microsoft will take note of your efforts and come up with a built-in solution in the future.

Your reference to Oz_Sunray's live challenge caught my attention, as I too had come across that video in my YouTube feed. A lambda-free variant to avoid the array of arrays issue for that particular challenge could be:

=LET(
    names, UNIQUE(Votes[Ballot]),
    colors, INDEX(Votes, XMATCH(names, Votes[Ballot]), {2,3,4}),
    names_colors, TOCOL(names & "|" & colors),
    keys, UNIQUE(names_colors),
    weight, XLOOKUP(keys, names_colors, TOCOL(IF({1}, {3,2,1}, colors))),
    TAKE(SORT(HSTACK(Available, MMULT(N(Available = TOROW(TEXTAFTER(keys, "|"))), weight)), 2, -1), 4)
)

Note: the weight array can be adjusted as desired (e.g. {1,1,1} instead of {3,2,1}).

This is one situation where INDEX / XMATCH is preferred over XLOOKUP, as it is perfectly capable of returning multiple columns of data for multiple lookup values, whereas XLOOKUP is not.

While I typically seek out alternative methods to overcome the nested arrays issue, complex scenarios can quickly become convoluted, making it difficult for the average user to follow. Your MAPλ function seems to take care of that nicely. Thank you for sharing!

PeterBartholomew1

Silver Contributor

Sep 14, 2024

djclements

Whilst we are chatting, you mention a 'Lambda free' variant. In general terms I am, at present' travelling in the opposite direction; that is trying to turn every formula into a Lambda. The reasoning goes, a basic formula exposes the functions it uses and, scattered through, the references it makes to data. Conversely, a named Lambda conceals the 'how' but describes the task it is performing and lists its precedents. The 'how' is concealed but it is available by listing the function.

Forum Discussion

A generalisation of the MAP lambda helper function

Resources