…or your money back.
author = "Ben Ogorek"<br>Twitter = "@baogorek"<br>email = paste0(sub("@", "", Twitter), "@gmail.com")<br>
Setup Pretend this is Big Data:
doc1 <- "Stray cats are running all over the place. I see 10 a day!"<br>doc2 <- "Cats are killers. They kill billions of animals a year."<br>doc3 <- "The best food in Columbus, OH is the North Market."<br>doc4 <- "Brand A is the best tasting cat food around. Your cat will love it."<br>doc5 <- "Buy Brand C cat food for your cat. Brand C makes healthy and happy cats."<br>doc6 <- "The Arnold Classic came to town this weekend. It reminds us to be healthy."<br>doc7 <- "I have nothing to say. In summary, I have told you nothing."<br>
and this is the Big File System:
doc.list <- list(doc1, doc2, doc3, doc4, doc5, doc6, doc7)<br>N.docs <- length(doc.list)<br>names(doc.list) <- paste0("doc", c(1:N.docs))<br>
You have an information need that is expressed via the following text query:
query <- "Healthy cat food"<br>
How will you meet your information need amidst all this unstructured text? Jokes aside, we're going ...
[Read more...]