by dvorakt | Mar 20, 2018 | data science
Tableau’s LOD expressions are extremely powerful. They allow users to control how Tableau aggregates data. This is particularly important when we join tables with different levels of aggregation. For example, combining transaction-level data with store-level data...
by dvorakt | Feb 14, 2018 | data science
The purpose of this post is to discuss the key elements in developing an interactive web application that displays data with a geographic component. I discuss developing an app using Shiny – a powerful R package. I briefly compare that process to building a...
by dvorakt | Jan 24, 2018 | data science
Let’s consider a small data set with 12 observations sorted from lowest to highest: (1,1,4),(4,5,8),(8,9,10),(10,12,13). I grouped the observations into four equal groups so that we can easily spot the quartiles. (I purposefully made the numbers at the border...
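As a rough illustration of the quartile computation on these 12 observations, here is a minimal Python sketch. It is not the code from the post; it assumes Python 3.8+ and uses the standard library's `statistics.quantiles`, and the variable names are made up for this example. It computes the cut points under two common interpolation conventions to see whether they agree on this particular data set.

```python
import statistics

# The 12 observations from the excerpt, already sorted from lowest to highest.
data = [1, 1, 4, 4, 5, 8, 8, 9, 10, 10, 12, 13]

# Three cut points (Q1, median, Q3) under two interpolation conventions.
# "exclusive" and "inclusive" are the two methods the statistics module offers.
q_exclusive = statistics.quantiles(data, n=4, method="exclusive")
q_inclusive = statistics.quantiles(data, n=4, method="inclusive")

print("exclusive:", q_exclusive)  # exclusive: [4.0, 8.0, 10.0]
print("inclusive:", q_inclusive)  # inclusive: [4.0, 8.0, 10.0]
```

On this data the two conventions give the same cut points, because the values on either side of each group boundary are equal (4 and 4, 8 and 8, 10 and 10). Other data sets can produce different answers depending on the method.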
by dvorakt | Aug 23, 2017 | data science
I created a project that illustrates the use of programming code to perform empirical analysis from beginning to end: from database retrieval, through cleaning, manipulating and analyzing the data, to compiling the write-up and displaying the results. It consists of...
by dvorakt | Aug 18, 2017 | data science
As a huge fan of Stata, I was super-excited about the dynamic markdown documents newly available in the latest Stata 15 release. I have played with the feature for the last few days, and can report that I was able to produce a decent-looking markdown document using Stata....
by dvorakt | Jul 24, 2017 | data science
I created my first shiny app. It visualizes data from IPEDS and College Scorecard. For now it is mostly for my soon-to-be college-bound children and their friends to play with, but I can see that shiny is a very flexible and powerful tool. Let me know what you think. You can fork the...
by dvorakt | Nov 24, 2016 | retirement plan design
The Chronicle of Higher Education published my short piece about the design of retirement plans. It is a non-technical review of the current state of the literature on the design of retirement plans. Sadly, the Chronicle did not include the links to many of the papers...
by dvorakt | Jul 21, 2016 | data science
My Project TIER colleagues and I did a session at the 2016 AEA CTREE conference on transparency and reproducibility in empirical research. My piece of the session argued that teaching programming, i.e. coding, should be part of the undergraduate econometrics curriculum....