Categorical variables in data set - how do i clean?

Copper Contributor

Hi all,

I have to clean my dataset for missing values. The feature "Last_New_Job" states the amount of years since last job.

There is some 1.000 missing values which i would like to clean by replace with the median value.

 

The problem is that the dataset has "<4", which makes the dataset a "string" dataset. What solution can i do to make the ">4" a numerical number? Or should i just replace the missing value with mode?

 

image.png

0 Replies