Analysis Tools for Data Exercise

%3CLINGO-SUB%20id%3D%22lingo-sub-1435000%22%20slang%3D%22en-US%22%3EAnalysis%20Tools%20for%20Data%20Exercise%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-1435000%22%20slang%3D%22en-US%22%3E%3CDIV%20class%3D%22vk_c%22%3E%3CDIV%3E%3CDIV%20class%3D%22jhH5U%22%3E%3CDIV%20class%3D%22tw-src-ltr%22%3E%3CDIV%20class%3D%22oSioSc%22%3E%3CDIV%3E%3CDIV%20class%3D%22g9WsWb%22%3E%3CDIV%20class%3D%22tw-ta-container%20hide-focus-ring%20tw-nfl%22%3E%3CP%3ESomebody%20can%20help%20me%20with%20this%20exercise%3F%20Attached%20is%20a%20file%20with%2010.000%20ids%20to%20use%20as%20a%20learning%20population%2C%20divided%20into%20two%20classes%3A%20positive%20and%20negative.%20The%20data%20includes%2050%20demographic%20variables%20and%2050%20behavioral%20variables.%20I%20have%20to%20develop%20a%20model%20that%20will%20allow%20me%20to%20better%20identify%20the%20positive%20class%20that%20has%20a%20rare%20frequency.%20The%20file%20also%20contains%204.000%20cases%20to%20be%20used%20as%20an%20evaluation%20population%20with%20the%20same%20set%20of%20variables%2C%20but%20for%20which%20the%20Y%20factor%20to%20be%20forecast%20is%20not%20revealed.%20The%20exercise%20is%20select%20800%20cases%2C%20seeking%20to%20maximize%20the%20selection%20of%20positive%20cases.%3C%2FP%3E%3C%2FDIV%3E%3C%2FDIV%3E%3C%2FDIV%3E%3C%2FDIV%3E%3C%2FDIV%3E%3C%2FDIV%3E%3C%2FDIV%3E%3C%2FDIV%3E%3C%2FLINGO-BODY%3E%3CLINGO-LABS%20id%3D%22lingo-labs-1435000%22%20slang%3D%22en-US%22%3E%3CLINGO-LABEL%3EBI%20%26amp%3B%20Data%20Analysis%3C%2FLINGO-LABEL%3E%3CLINGO-LABEL%3EExcel%3C%2FLINGO-LABEL%3E%3CLINGO-LABEL%3EFormulas%20and%20Functions%3C%2FLINGO-LABEL%3E%3C%2FLINGO-LABS%3E
Highlighted
Regular Visitor

Somebody can help me with this exercise? Attached is a file with 10.000 ids to use as a learning population, divided into two classes: positive and negative. The data includes 50 demographic variables and 50 behavioral variables. I have to develop a model that will allow me to better identify the positive class that has a rare frequency. The file also contains 4.000 cases to be used as an evaluation population with the same set of variables, but for which the Y factor to be forecast is not revealed. The exercise is select 800 cases, seeking to maximize the selection of positive cases.

0 Replies