The Wild World of Data Repositories
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
There is no one-size-fits-all protocol for depositing your research data into a public repository in a way that maximizes its reuse and citation. We’ve assembled a panel that will help you understand the issues and opportunities for developing new tools and documentation.
This 1-hour event, moderated by Kara Woo, includes 5 speakers and 20 minutes for Q & A on:
- Where and how to deposit data (or data + software!)
- Challenges in data deposition for reuse
- How to reuse data
- Where are the gaps?
Speakers will cover nuances of general and domain-specific repositories (Dryad, the DataONE federation of repositories, Arctic Data Center), what curators look for, what to do when you want to deposit both data and software, designing tools to help researchers provide the right metadata to maximize reuse, making data reuse easier with R tools like piggyback, and their vision for new tools and documentation.
All are welcome.
Speakers
Kara Woo
Kara Woo is a Principal Bioinformatics Engineer at Sage Bionetworks where she leads a team of developers building tools and infrastructure for open science. Kara is a member of the rOpenSci Code of Conduct committee. Kara on GitHub, Twitter, Website
Daniella Lowenberg
Daniella Lowenberg is the Product Manager at Dryad and Principal Investigator of Make Data Count within the California Digital Library at University of California. Daniella on GitHub, Twitter
Matt Jones
Matt Jones is Director of Informatics R&D at the National Center for Ecological Analysis and Synthesis (NCEAS), Principal Investigator of the NSF Arctic Data Center, and Director of DataONE at University of California Santa Barbara. Matt on GitHub, Twitter, Website
Carl Boettiger
Carl Boettiger is Assistant Professor in the Department of Environmental Science, Policy and Management at UC Berkeley, a Co-founder and strategic advisor of rOpenSci. Carl on GitHub, Twitter, Website
Karthik Ram
Karthik Ram is a Senior Research Scientist with the Berkeley Institute for Data Science, Project Lead and Co-founder of rOpenSci, Editor for rOpenSci Software Peer Review. He has a PhD in Ecology and Evolution. Karthik on GitHub, Twitter, Website
Resources
- Discussion on GitHub, Read the thread that led to this Community Call
- A Realistic Guide to Making Data Available Alongside Code to Improve Reproducibility, arXiv preprint by Nicholas Tierney, Karthik Ram
- Enforcing public data archiving policies in academic publishing. A study of ecology journals, Sholler D, Ram K, Boettiger C, Katz DS (2019) Big Data & Society.
- The citation advantage of linking publications to research data, Colavizza G, Hrynaszkiewicz I, Staden I, Whitaker K, McGillivray B (2020) PLoS ONE 15(4), e0230416.
Join Us!
- Who Everyone is welcome. No RSVP needed, simply connect and/or dial in at the time of the event.
- When Wednesday, 16 December 2020 10:00 PST (Wednesday, 16 December 2020 18:00 UTC)
- Find your timezone
- Add to Calendar.
-
How
Meeting ID: 922 9890 9939
Passcode: 896415
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.