A function to find the “Penultimax”
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Penulti-what? Let me explain: Today I had to iteratively go through each row of a donor history dataset and compare a donor’s maximum yearly donation total to the second highest yearly donation total. In even more concrete terms, for each row I had to compare the maximum value across 5 columns against the next highest number. This seemed to be a rather unique task, and so I had to make an R function to help carry it out.
So, I named the function “penultimax”, to honour the idea that it’s finding the second highest, or second max. It works pretty simply, really just by removing the maximum value from the input vector, and returning the maximum of the new vector, if it’s there at all. Following is the code for it (notice that it draws on an earlier function that I made, called safe.max, that returns an NA when it can’t find a maximum, instead of an error):
penultimax = function(invector) { | |
# If the vector starts off as only having 1 or 0 numbers, return NA | |
if (length(invector) <= 1) { | |
return(NA) | |
} | |
first.max = safe.max(invector) | |
#Once we get the max, take it out of the vector and make newvector | |
newvector = invector[!invector == first.max] | |
#If newvector now has nothing in it, return NA | |
if (length(newvector) == 0) { | |
return(NA) | |
} | |
#Now we get the second highest number in the vector. | |
#So long as it's there, we return that second highest number (the penultimax) | |
#or else we just return NA | |
second.max = safe.max(newvector) | |
if (is.na(first.max) & is.na(second.max)) { | |
return (NA) } | |
else if (!is.na(first.max) & is.na(second.max)) { | |
return (NA) } | |
else if (!is.na(first.max) & !is.na(second.max)) { | |
return (second.max)} | |
} | |
safe.max = function(invector) { | |
na.pct = sum(is.na(invector))/length(invector) | |
if (na.pct == 1) { | |
return(NA) } | |
else { | |
return(max(invector,na.rm=TRUE)) | |
} | |
} |
Did I miss something out there that’s simpler than what I wrote?
data:image/s3,"s3://crabby-images/e2f2a/e2f2ae46928d1c614ea1a787458272b909716b07" alt=""
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.