Rendering Mermaid Diagrams on a Hugo Website Using Quarto

I find a lot of the methods for drawing diagrams and flow charts using code to be a bit of a nightmare. I’m not sure what it is, perhaps I’m just a bit stupid, but they always seem a little more convoluted than I can handle. It’s possible that the design is actually of some value when it comes to drawing diagrams that are much larger and more complicated than I am generally dealing with, but in my case, it always seems to be a lot of work for what little I’m trying to do....

September 17, 2022 · 5 min · 1065 words

Setting up a Hugo Website with Quarto

Like every other nerd that likes R and has used RMarkdown more than one time in their entire life, I am now in the process of transitioning to using Quarto, and I want to let the world know! I wanted to make good use of Quarto, including utilising the really nice job it does with rendering R, Python, Julia, and JavaScript (not that I’m writing a lot of Julia or JavaScript) and outputting it in a HTML document....

September 11, 2022 · 4 min · 852 words

Resources for Learning Bayesian Statistics

As a social scientist turned data scientist, my graduate school training taught me a lot of frequentist statistics that has served me well in my career since. However, there’s one thing that frequentist statistics can’t solve, and that’s my lack of a personality. For that, we’ve got Bayesian statistics, the perfect substitute. I’ve spent the last year or so gradually trying to become more Bayesian in my approach to statistical reasoning and analysis, and while there’s quite a few really good resources for supporting that journey, it can be a little difficult to know where to start....

May 10, 2022 · 8 min · 1603 words

Combining Tidymodels and Caret for Machine Learning in R

The two main approaches to building machine learning models in R are caret and tidymodels. Having tried both, I found that I struggled to pick my favorite. There’s elements of both that made more intuitive sense to me than the other. I think it’s a product of having become very familiar with the tidyverse, particularly dplyr, for data wrangling, but still using a lot of Base R functions for statistical modeling....

May 7, 2022 · 6 min · 1244 words

Make More Packages!

Having learned R in an academic context, I wasn’t exposed to programming in a linear path that taught me the basics of programming theory and application. I learned how to carry out statistical analysis using R, and my interaction with packages was that they would help me get the job done without having to do everything manually. As a result, package development has always intimidated me a little bit. It seemed like the responsibility of much smarter people that know what they’re doing, and that can build packages that can help me figure out what I’m doing too....

February 19, 2022 · 7 min · 1319 words

Making Myself Accountable

One of my goals when setting up this site was to use the blog section reasonably frequently to share some of my work and share anything particularly interesting that I’ve learned in my development as a data scientist. There hasn’t been much sharing going on. I’d like to think that isn’t because I haven’t done or learned anything interesting (…), I just haven’t found the time to share it. I’m the ideas man....

January 17, 2022 · 1 min · 213 words

Recent Football Analytics Articles

As a side-project/hobby, I write about football. Some of my recent work (as of 04/20) can be found below: Past his peak? Assessing goalkeeper performance by age and Manuel Neuer vs Alexander Nübel For Bavarian Football Works, I analyzed goalkeeper peak age, and the performance of Bayern Munich’s great but ageing Manuel Neuer. I found that Neuer continues to be a very good goalkeeper, but that his performances are gradually declining, suggesting that the effects of age are starting to wear on him, and showing that even the greatest athletes ever can’t beat Father Time....

April 14, 2020 · 1 min · 202 words