Accessing and Manipulating Biological Databases Exercises (Part 1)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
In the exercises below we cover how we can Access and Manipulate Biological Data bases through rentrez & seqinr packages
Install Packages
rentrez
seqinr
Answers to the exercises are available here
If you obtained a different (correct) answer than those listed on the solutions page, please feel free to post your answer as a comment on that page.
Exercise 1
Print all the available data bases which you can access through rentrez package
Exercise 2
Print all the searchable terms in a database
Exercise 3
Display the details of any database of your choice
Exercise 4
Retrieve and print 10 ids of nucleotide sequences from nuccore database about Human.
Exercise 5
Retrieve and print 20 ids of protein sequences from protein database about Human.
- import data into R in several ways while also beeing able to identify a suitable import tool
- use SQL code within R
- And much more
Exercise 6
Create a Fasta File for a particular human protein sequence from the listed ids.
Exercise 7
Create a Fasta File for a particular human nucleotide sequence from the listed ids.
Exercise 8
Open the Nucleotide Fasta file and print the details using seqinr package.
Exercise 9
Open the Protein Fasta file and print the details using seqinr package
Exercise 10
Open the Nucleotide Fasta file and print only sequence from the created Fasta file striping all other information.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.