
Why balloons are better than balls (in urn schemes)

[This article was first published on BioStatMatt » R, and kindly contributed to R-bloggers].

The following is taken from a work in progress:

The Polya urn is a heuristic associated with Dirichlet process mixtures. We present the scheme in a modified format, using balloons instead of balls, where the probability of drawing a balloon from the urn is proportional to its volume. Balloons are preferred because their volume may be adjusted by fractional amounts, whereas a ball count may be adjusted only in whole amounts.
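To make "proportional to its volume" concrete, here is a minimal Python sketch. The urn contents, the color names, and the helper names `draw_probabilities` and `draw_balloon` are illustrative assumptions rather than part of the original text; the point is only that fractional volumes work as draw weights without any special handling.

```python
import random


def draw_probabilities(volumes):
    """Return each balloon's draw probability, proportional to its volume."""
    total = sum(volumes.values())
    return {color: volume / total for color, volume in volumes.items()}


def draw_balloon(volumes, rng=random):
    """Draw one color at random, weighted by balloon volume."""
    colors = list(volumes)
    weights = [volumes[color] for color in colors]
    return rng.choices(colors, weights=weights, k=1)[0]


# Hypothetical urn: volumes need not be whole numbers.
urn = {"red": 1.0, "green": 0.5, "blue": 2.5}
probs = draw_probabilities(urn)
print(probs)  # {'red': 0.25, 'green': 0.125, 'blue': 0.625}
```

With balls, the green entry above would have no direct counterpart, since half a ball cannot be placed in the urn.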

The Polya urn initially contains n uniquely colored balloons, each filled with the same volume of air. At each draw, a single balloon is selected at random from the balloons in the urn, with probability proportional to its volume, and its color is recorded. If the recorded color was observed in a previous draw, the balloon is inflated by an amount equal to its original volume. Finally, the balloon is returned to the urn. Hence, a balloon drawn from the Polya urn is more likely to be observed in subsequent draws. ‘The rich get richer’ is a fitting mnemonic for the Polya urn scheme.
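The scheme can be simulated in a few lines. The Python sketch below follows the inflation rule as worded above: a balloon is inflated only when its color has already been recorded in an earlier draw. The function name `simulate_urn`, the default starting volume of 1.0, and the seed are assumptions made for illustration, not details from the original text.

```python
import random


def simulate_urn(n_colors, n_draws, initial_volume=1.0, seed=None):
    """Simulate the balloon urn; return (draw history, final volumes)."""
    rng = random.Random(seed)
    volumes = [initial_volume] * n_colors  # one balloon per color
    seen = [False] * n_colors              # colors recorded so far
    history = []
    for _ in range(n_draws):
        # Draw a balloon with probability proportional to its volume.
        color = rng.choices(range(n_colors), weights=volumes, k=1)[0]
        history.append(color)
        # Inflate only if this color was recorded in a previous draw.
        if seen[color]:
            volumes[color] += initial_volume
        seen[color] = True
        # The balloon is returned to the urn (it never left the list).
    return history, volumes


history, volumes = simulate_urn(n_colors=5, n_draws=200, seed=1)
print(volumes)
```

In runs like this one, the final volumes are typically quite uneven: colors that happen to be drawn early accumulate volume and are drawn more often afterwards, which is the "rich get richer" behavior described above.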

