Extending Commodity time series
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
I want to follow up with Extending Gold time series post by showing how we can extend Commodity time series. Most Commodity ETFs began trading in 2006, please see the List of Commodity ETFs page. I will use DBC – PowerShares DB Commodity Fund, one on the most liquid Commodity ETFs as my proxy for Commodity time series.
Unfortunately the historical data for the underlying index for the DBC – PowerShares DB Commodity Fund is not publicly available. Instead I will use Thomson Reuters/Jefferies CRB Index to extend DBC – PowerShares DB Commodity Fund time series.
I created a helper function get.CRB() function in data.r at github to download and extract index information. Please note that I used gdata package to read index data and you might need local Perl available to make it work. There is a great tutorial on setting everything up at How to read an excel file (dot xls and dot xlsx) into a data frame with r: gdata and perl post.
Now let’s plot the CRB index and a few Commodity ETFs side by side to visual check the similarity:
############################################################################### # Load Systematic Investor Toolbox (SIT) # http://systematicinvestor.wordpress.com/systematic-investor-toolbox/ ############################################################################### setInternet2(TRUE) con = gzcon(url('http://www.systematicportfolio.com/sit.gz', 'rb')) source(con) close(con) #***************************************************************** # Load historical data #****************************************************************** load.packages('quantmod') CRB = get.CRB() tickers = spl('GSG,DBC') getSymbols(tickers, src = 'yahoo', from = '1970-01-01') #***************************************************************** # Compare different indexes #****************************************************************** out = na.omit(merge(Ad(CRB), Ad(GSG), Ad(DBC))) colnames(out) = spl('CRB,GSG,DBC') temp = out / t(repmat(as.vector(out[1,]),1,nrow(out))) # Plot side by side layout(1:2, heights=c(4,1)) plota(temp, ylim=range(temp)) plota.lines(temp[,1],col=1) plota.lines(temp[,2],col=2) plota.lines(temp[,3],col=3) plota.legend(colnames(temp),1:3) # Plot correlation table temp = compute.cor(temp / mlag(temp)- 1, 'pearson') temp[] = plota.format(100 * temp, 0, '', '%') plot.table(temp)
The CRB index performed similar to Commodity ETFs; its not the exact replica, but will do given the lack of public historical data for Commodity index.
Next let’s extend the DBC’s time series with CRB index and use it for a simple equal weight strategy:
#***************************************************************** # Create simple equal weight back-test #****************************************************************** tickers = spl('GLD,DBC,TLT') data <- new.env() getSymbols(tickers, src = 'yahoo', from = '1980-01-01', env = data, auto.assign = T) for(i in ls(data)) data[[i]] = adjustOHLC(data[[i]], use.Adjusted=T) # extend Gold and Commodity time series data$GLD = extend.GLD(data$GLD) data$DBC = extend.data(data$DBC, get.CRB(), scale=T) bt.prep(data, align='remove.na') #***************************************************************** # Code Strategies #****************************************************************** prices = data$prices n = ncol(prices) # find period ends period.ends = endpoints(prices, 'months') period.ends = period.ends[period.ends > 0] models = list() #***************************************************************** # Equal Weight #****************************************************************** data$weight[] = NA data$weight[period.ends,] = ntop(prices[period.ends,], n) models$equal.weight = bt.run.share(data, clean.signal=F) #***************************************************************** # Create Report #****************************************************************** plotbt.custom.report.part1(models) plotbt.custom.report.part2(models)
By extending DBC’s and GLD’s time series, the back-test now goes back to the July 2002, the TLT’s first trading date.
To view the complete source code for this example, please have a look at the bt.extend.DBC.test() function in bt.test.r at github.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.