Forum Discussion

Elmar Bischof's avatar
Elmar Bischof
Copper Contributor
Aug 04, 2017
Solved

MaxIfIs vs Earliest vs Aggregation to Find Douplicates {Get&Transform}

After having a view looks on variouse forums I found that I was not the first with the same data filtering requirement of my source data.   Case: You have a Dataset where you need to append freque...
  • Elmar Bischof's avatar
    Elmar Bischof
    Aug 08, 2017

    Hi.

     

    After playing arround for it for a couple of days I found a quicker solution than #2, while using the Query Editor.

     

    Hence here #4:

    1. Import your data and duplicate it.
    2. Remove all columuns in one data set up to the Uniqe ID and the "Date Created" of the source file.
    3. Use "GroupBy" within "Transform". Group by the ID, return Max of the Date.
    4. Select both columns, go to "Transform" and use "Merge Columns"
    5. Change the "Load To" to of this Grouped dataset to connection only without loading it to the datamodel.
    6. Go back to the other dataset that you intend to load into the Datamodel and merge the "Date created" and your Unique ID
    7. Make an Inner Join

    I get a refresh time of less then one minute.

    Still not the quick single Table solution without the use of "Join" I have been looking for, though at least it is within the "Query Editor", quick and keeps the file size small.

     

    Ps.: You get "Date Created" at the very first ste of Applied Steps, under "Source"

Resources