
Smart Extraction: Converting PDF Tables into Usable Data with R workshop

[This article was first published on R-posts.com, and kindly contributed to R-bloggers].

Join our workshop on Smart Extraction: Converting PDF Tables into Usable Data with R, which is part of our Workshops for Ukraine series!


Here’s some more info: 


Title: Smart Extraction: Converting PDF Tables into Usable Data with R


Date: Thursday, May 1st, 18:00 – 20:00 CEST (Rome, Berlin, Paris timezone)


Speaker: Flávia E. Rius, PhD, is a data scientist at Mendelics, Latin America’s leading genomics company, and a postdoctoral researcher at the University of São Paulo. With a strong background in molecular biology and bioinformatics, she combines research and applied genomics to advance precision medicine in Brazil. Passionate about sharing knowledge, she also mentors students and professionals in R, data science, and bioinformatics.


Description: In this workshop, we’ll dive into the extraction of tables from PDFs using R, an essential skill for turning static documents into usable data. We’ll explore two approaches: first, using {tabulizer} to extract structured tables, and second, using the ocr() function from {tesseract}, a powerful tool for when text can’t be extracted directly. Our focus will be on academic journal articles, a rich source of data for both research and industry applications. Join me to level up your data wrangling skills and add a valuable asset to your R toolkit!
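To give a feel for the two approaches named in the description, here is a minimal sketch, not the workshop's actual material. It assumes a hypothetical file `article.pdf` with a table on page 1, assumes {pdftools} is available to render the page as an image before OCR, and uses a hypothetical helper `ocr_text_to_df()` that splits OCR text on whitespace (so it only suits tables whose cells contain no spaces). Note that {tabulizer} depends on Java via {rJava}.

```r
# Hypothetical input file; replace with the path to a real article PDF.
pdf_path <- "article.pdf"

# Hypothetical helper: turn raw OCR text into a data frame.
# Assumption: the first non-empty line is the header and cells are
# separated by whitespace (no spaces inside a cell).
ocr_text_to_df <- function(text) {
  lines <- trimws(strsplit(text, "\n", fixed = TRUE)[[1]])
  lines <- lines[nzchar(lines)]
  cells <- strsplit(lines, "\\s+")
  n_col <- max(lengths(cells))
  # Pad short rows with NA so every row has the same number of cells.
  cells <- lapply(cells, function(x) c(x, rep(NA_character_, n_col - length(x))))
  mat <- do.call(rbind, cells)
  df <- as.data.frame(mat[-1, , drop = FALSE], stringsAsFactors = FALSE)
  names(df) <- mat[1, ]
  df
}

if (file.exists(pdf_path)) {
  # Approach 1: the PDF has a text layer, so {tabulizer} can read the table.
  if (requireNamespace("tabulizer", quietly = TRUE)) {
    tables <- tabulizer::extract_tables(pdf_path, pages = 1, output = "data.frame")
    if (length(tables) > 0) str(tables[[1]])
  }

  # Approach 2: no usable text layer, so render the page and run OCR on it.
  if (requireNamespace("tesseract", quietly = TRUE) &&
      requireNamespace("pdftools", quietly = TRUE)) {
    png_file <- pdftools::pdf_convert(pdf_path, format = "png", pages = 1, dpi = 300)
    text <- tesseract::ocr(png_file, engine = tesseract::tesseract("eng"))
    print(ocr_text_to_df(text))
  }
}
```

In practice OCR output usually needs more cleaning than this (misread characters, merged columns), so treat the helper as a starting point rather than a general parser.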


Minimum registration fee: 20 EUR (or 20 USD or 800 UAH)



Please note that the registration confirmation is sent one day before the workshop to all registered participants, rather than immediately after registration.


How can I register?





If you are not personally interested in attending, you can also contribute by sponsoring the participation of a student, who will then be able to attend for free. If you choose to sponsor a student, all proceeds will also go directly to organisations working in Ukraine. You can either sponsor a particular student or leave it up to us to allocate the sponsored place to a student who has signed up for the waiting list.


How can I sponsor a student?





If you are a university student and cannot afford the registration fee, you can also sign up for the waiting list here. (Note that signing up for the waiting list does not guarantee you a place.)



You can also find more information about this workshop series, a schedule of our future workshops, and a list of our past workshops, for which you can get the recordings and materials, here.


Looking forward to seeing you during the workshop!

Smart Extraction: Converting PDF Tables into Usable Data with R workshop was first posted on April 2, 2025 at 9:20 am.
