Use nanoparquet instead of readr/CSV

October 8, 2024 | Stephen Turner

 This is reposted from the original at https://blog.stephenturner.us/p/use-nanoparquet-instead-of-readr-csv.Parquet is interoperable between Python and R, fast to read+write, works well with databases, and stores complex data types (e.g., tib...
[Read more...]

PowerQuery Puzzle solved with R

October 8, 2024 | Numbers around us

#223–224PuzzlesAuthor: ExcelBIAll files (xlsx with puzzle and R with solution) for each and every puzzle are available on my Github. Enjoy.Puzzle #223As usual on weekends we are mainly doing table transformations. Sometimes it need simple manouvers and...
[Read more...]

DuckDB vs dplyr vs base R

October 8, 2024 | Stephen Turner

Reposted from https://blog.stephenturner.us/p/duckdb-vs-dplyr-vs-base-r.TL;DR: For a very simple analysis (means by group on 100M rows), duckdb was 125x faster than base R, and 28x faster than readr+dplyr, without having to read data from disk int...
[Read more...]

The Vibe of Flanders: Part 2

October 7, 2024 | Method Matters

This blog post is the second installment in a series detailing analyses of the 2023 De Gemeente-Stadsmonitor (The Municipality and City Monitor) survey, conducted in the region of Flanders in Belgium. You can check out the first post here. In the previous post, we used Principal Components Analysis and data visualization ...
[Read more...]

R Solution for Excel Puzzles

October 7, 2024 | Numbers around us

Puzzles no. 554–558PuzzlesAuthor: ExcelBIAll files (xlsx with puzzle and R with solution) for each and every puzzle are available on my Github. Enjoy.Puzzle #554Another interesting number called after Indian mathematician. We need to find first 50 Kapr...
[Read more...]

Nested unit tests with testthat

October 6, 2024 | R some blog

The testthat package is the most widely used tool for unit testing in R. However, many users may not be aware of the possibility to nest test blocks within each other. In this post, I demonstrate how this underused feature provides a great way to struc... [Read more...]

Dudeney’s Remainder Problem

October 6, 2024 | John Mount

The remainder problem The description of this puzzle really cracks me up (Dudeney, Strand Magazine, January 1924). Health risks aside, how do we find the maximal integer d such that (480608 % d) = (508811 % d) = (723217 % d)? The solution A good puzzle strategy is to try […]
[Read more...]

Is round(0.5) 0 or 1?

October 5, 2024 | jhk0530

Actually, it’s both possible This Article was originally published before on YOZM-IT as Korean Various way of data science  There are many programming languages in the world and software that utilizes them. And those play an important role in “Data science”. For example, if you’re using funnel analysis ...
[Read more...]

Getting started with SQL with R

October 4, 2024 | Albert Rapp

In today’s blog post, I’m showing you how to work with databases from within R. As always, you can find the video version of this blog post on YouTube: Establish a connection Let us start by setting up a database connection. library(odbc) lib... [Read more...]

An Easy Puzzle: The Perplexed Banker

October 4, 2024 | John Mount

Nina Zumel continues with the puzzles. This one is “The Perplexed Banker”. In my opinion, this one captures the essence of the “mathematical” aspect of a puzzle. For a mathematical puzzle one often hopes there is a systematic method that makes the puzzle easy. In this case there is indeed […] [Read more...]
1 16 17 18 19 20 2,158