Analysing Twitter data Oct 24, 2020 Earlier this month, I was the curator of @WeAreRLadies for a week. They have a different person tweeting every seven days; I was in charge in the week commencing 5th October. ...
Making it work from home Sep 26, 2020 It’s been about six months since the UK first went into lockdown. Many of us around the world have drastically changed our lifestyles and ways of doing things. I’ve become much better at working from home and there are lots of things I prefer about it. ...
Using callbacks and logging during training with gensim Aug 24, 2020 How long should you train an LDA model for? This post is less to do with the actual minutes and hours it takes to train a model, which is impacted in several ways, but more do with the number of opportunities the model has during training to learn from the data, and therefore the ultimate quality of the model. ...
Data cleaning and exploration with data.table Jul 19, 2020 In May, I delivered a few training sessions for R beginners and I’m using those materials for some posts here. Following on from my two posts on ggplot2, this is based on my session of data. ...
Stop using iris Jun 24, 2020 The iris dataset is very widely used in the data science community, whether as a training aid, a tool for trying out new skills, or just a well-known set of numbers that can be used as background while demonstrating something in a blog. ...
Changing the look of your ggplot2 objects May 30, 2020 In my last post, I went through the basics of creating a ggplot2 object. Here, I’ll be focusing on how to alter the standard look of ggplot2 and make other changes to make the visualisation right for you. ...
Data visualisation and hawks with ggplot2 May 23, 2020 Conferences and meetups have been important throughout my data science career - especially when I was starting out and learning so much, but still critical as ways of keeping on top of new developments and making new connections. ...
Password invalid when cloning a GitHub repo Apr 12, 2020 I haven’t written a ‘things I learnt’ post for a while, mainly because I’ve been wrapping those into my main posts. However, after going round in circles for ages with this problem, I wanted to share the solution quickly in case it helps anybody else! ...
My favourite RStudio tricks Mar 28, 2020 My first post on this blog was on the 31st March 2019, almost exactly a year ago. My aim was to post at least once a month. Looking back over the year, I’ve posted 20 times (not including this) across topics like web scraping, regular expressions and creating Shiny apps, as well as chiming in with my thoughts and experiences around becoming a data scientist without a traditional STEM academic background. ...
But are degrees actually required for data science? Feb 24, 2020 Looking at job ads for data scientists, half of them specify they want you to have a degree. At least, that was what I found looking at jobs based in London posted on Indeed, which I covered in my most recent post. ...