Tales of Data Science

Conditional Random Fields (CRF): Short Survey

The picture above shows a random field.

These days, many of us are overwhelmed by the mighty power of Deep Learning, and we are starting to forget about humble graphical models. CRF is not as trendy as LSTM, but it is robust, reliable, and worth noting.

In this post, you will find a short summary of CRF (Conditional Random Fields): what it is, what it is for, and some interesting facts. Enjoy!


Examples of Data Analysis Reports


Recently, I discovered some old examples of data analyses that my colleagues and I carried out for study purposes in 2013, during the Data Analysis course on Coursera. The examples are based on analyses of two datasets: the Lending Club dataset and the Samsung smartphones dataset. They DO NOT contain advanced approaches to Data Analysis and Data Mining, but they will come in handy for anyone who needs to see what a decent data analysis report should look like.

But remember: the following data analysis reports were written for readers who are at least acquainted with standard approaches to data analysis and predictive modeling.

A business-driven data analysis for non-technical people (such as managers) should be composed differently:

  1. with far fewer technical details, or none if possible,
  2. with a thorough yet simple description of what you did and why,
  3. with clear practical recommendations that can be applied directly to the business.

If you still want to continue, you’re welcome! If not – you’re still welcome!


Top 10 countries on StackOverflow and GitHub


Here I would like to show you the results of an analysis I conducted in late October 2014 to find and outline statistical trends among users from different countries.

How does the number of users from different countries change over time? Which countries have the most users? Citizens of which countries commit more on GitHub? These are the questions I wanted to answer in this analysis. Please note that the analysis is fairly shallow: I did not analyze the StackOverflow rating system thoroughly, only in a nutshell. Please keep this in mind.

The study was conducted on 24 October 2014. If you are interested, I may bring it up to date; just let me know 🙂

You may also read this post in Russian if you like. The Russian version contains more information about the presence of Russian citizens on StackOverflow and GitHub.
