July 2018

Mastering your day as a programmer

July 3, 2018 | Lorenz Walthert

How do you program? I feel like there is a lot of advice out there, but most of it focuses on the micro level rather than on the macro level. That’s why decided to think about what makes one an efficient programmer on a macro level. Think long-term Spend ... [Read more...]

Mastering your day as a programmer

July 3, 2018 | Lorenz Walthert

How do you program? I feel like there is a lot of advice out there, but most of it focuses on the micro level rather than on the macro level. That’s why decided to think about what makes one an efficient programmer on a macro level. Think long-te... [Read more...]

A brief guide to data visuals in R in 2018

July 3, 2018 | Bluecology blog

Data visuals 2018 Supplementary notes for CJ Brown’s talks on dataviz in 2018 for Griffith University’s honours students and the UQ Winterschool in Bioinformatics. These notes run through some of the principles I present and also show R code for generating data visuals. If you want to take the dataviz ... [Read more...]

Tidily evaluated ggplot2

July 3, 2018 | HighlandR

Test driving the new release of ggplot2 - A new release of ggplot2 Now that tidy evaluation is baked into ggplot2, as of TODAY, let’s take it for a spin: ggplot2 3.0.0 %__% create function %__% test function %__% end
<span>library</span><span>(</span><span>dplyr</span><span>)</span><span>
</span><span>library</span><span>(</span><span>ggplot2</span><span>)</span><span>
</span><span>library</span><span>(</span><span>tidyr</span><span>)</span><span>
</span><span>library</span><span>(</span><span>tibble</span><span>)</span><span>

</span><span>data</span><span> </span><span><-</span><span> </span><span>list</span><span>(</span><span>fdeaths</span><span>,</span><span>mdeaths</span><span>,</span><span>ldeaths</span><span>)</span><span> </span><span>#time series data- needs prep</span><span>

</span><span>names</span><span>(</span><span>data</span><span>)[</span><span>1</span><span>:</span><span>3</span><span>]</span><span> </span><span><-</span><span> </span><span>c</span><span>(</span><span>"fdeaths"</span><span>,</span><span>"mdeaths"</span><span>,</span><span>"ldeaths"</span><span>)</span><span>

</span><span>data</span><span> </span><span><-</span><span> </span><span>as_tibble</span><span>(</span><span>data</span><span>)</span><span>

</span><span>startdate</span><span> </span><span><-</span><span> </span><span>as.Date</span><span>(</span><span>'1974-1-1'</span><span>)</span><span>

</span><span>data</span><span>$</span><span>date</span><span> </span><span><-</span><span> </span><span>seq.Date</span><span>(</span><span>startdate</span><span>,</span><span>by</span><span> </span><span>=</span><span> </span><span>'month'</span><span>,</span><span>length.out</span><span> </span><span>=</span><span> </span><span>72</span><span>)</span><span>


</span><span>newdata</span><span> </span><span><-</span><span> </span><span>tidyr</span><span>::</span><span>gather</span><span>(</span><span>data</span><span>,</span><span> </span><span>key</span><span> </span><span>=</span><span> </span><span>key</span><span>,</span><span> </span><span>value</span><span> </span><span>=</span><span> </span><span>value</span><span>,</span><span>-</span><span>date</span><span>)</span><span>

</span><span>newdata</span><span>$</span><span>value</span><span> </span><span><-</span><span> </span><span>as.numeric</span><span>(</span><span>newdata</span><span>$</span><span>value</span><span>)</span><span>

</span><span># create generic function</span><span>

</span><span>gtest</span><span> </span><span><-</span><span> </span><span>function</span><span>(</span><span>df</span><span>,</span><span>x</span><span>,</span><span>y</span><span>,</span><span> </span><span>group</span><span>)</span><span> </span><span>{</span><span>
  
  
  </span><span>x_quo</span><span> </span><span><-</span><span> </span><span>enquo</span><span>(</span><span>x</span><span>)</span><span>
  
  </span><span>y_quo</span><span> </span><span><-</span><span> </span><span>enquo</span><span>(</span><span>y</span><span>)</span><span>
  
  </span><span>group_quo</span><span> </span><span><-</span><span> </span><span>enquo</span><span>(</span><span>group</span><span>)</span><span>
  
  
  
  </span><span>p</span><span> </span><span><-</span><span> </span><span>ggplot</span><span>(</span><span>df</span><span>,</span><span>aes</span><span>(</span><span>x</span><span> </span><span>=</span><span> </span><span>!!</span><span>x_quo</span><span>,</span><span> </span><span>y</span><span> </span><span>=</span><span> </span><span>!!</span><span>y_quo</span><span>))</span><span> </span><span>+</span><span>   </span><span>#bangin'   </span><span>
    
    </span><span>geom_line</span><span>(</span><span>colour</span><span> </span><span>=</span><span> </span><span>"blue"</span><span>,</span><span> </span><span>group</span><span> </span><span>=</span><span> </span><span>1</span><span>)</span><span> </span><span>+</span><span> 
    
    </span><span>geom_point</span><span>(</span><span>colour</span><span> </span><span>=</span><span> </span><span>"blue"</span><span>)</span><span> </span><span>+</span><span>
    
    </span><span>facet_wrap</span><span>(</span><span>group_quo</span><span>,</span><span> </span><span>ncol</span><span> </span><span>=</span><span> </span><span>3</span><span>)</span><span> </span><span>#look Ma, no need to bang bang here!</span><span>
  
  
  </span><span>p</span><span> </span><span><-</span><span> </span><span>p</span><span> </span><span>+</span><span> </span><span>ggtitle</span><span>(</span><span>label</span><span> </span><span>=</span><span> </span><span>"Easy Tidy Eval in ggplot 3.0.0"</span><span>,</span><span>
                   
                   </span><span>subtitle</span><span> </span><span>=</span><span> </span><span>"ggplot with tidy evaluation & facetting with no strings"</span><span>)</span><span>
  
  </span><span>p</span><span> </span><span><-</span><span> </span><span>p</span><span> </span><span>+</span><span> </span><span>labs</span><span>(</span><span>x</span><span> </span><span>=</span><span> </span><span>NULL</span><span>,</span><span> </span><span>y</span><span> </span><span>=</span><span> </span><span>NULL</span><span>,</span><span> </span><span>caption</span><span> </span><span>=</span><span> </span><span>""</span><span>)</span><span> </span><span>+</span><span>
    </span><span>theme_bw</span><span>()</span><span>
  
  </span><span>p</span><span>
</span><span>}</span><span>
</span>
Let’s test it:
<span>gtest</span><span>(</span><span>newdata</span><span>,</span><span>date</span><span>,</span><span>value</span><span>,</span><span>key</span><span>)</span><span>

</span>
End
[Read more...]

More tidy evaluation with ggplot2

July 3, 2018 | HighlandR

Following up the follow up - After my previous post, my good friend David Henderson asked a good question on Twitter ( it happens :)) To be honest, I hadn’t even noticed there was no tilde (the ‘~’ ) in my call to facet_wrap. So I did a little experiment :
<span>library</span><span>(</span><span>dplyr</span><span>)</span><span>

</span><span>library</span><span>(</span><span>ggplot2</span><span>)</span><span>

</span><span>library</span><span>(</span><span>tidyr</span><span>)</span><span>

</span><span>library</span><span>(</span><span>tibble</span><span>)</span><span>


</span><span>data</span><span> </span><span><-</span><span> </span><span>list</span><span>(</span><span>fdeaths</span><span>,</span><span>mdeaths</span><span>,</span><span>ldeaths</span><span>)</span><span> </span><span>#time series data- needs prep</span><span>

</span><span>names</span><span>(</span><span>data</span><span>)[</span><span>1</span><span>:</span><span>3</span><span>]</span><span> </span><span><-</span><span> </span><span>c</span><span>(</span><span>"fdeaths"</span><span>,</span><span>"mdeaths"</span><span>,</span><span>"ldeaths"</span><span>)</span><span>

</span><span>data</span><span> </span><span><-</span><span> </span><span>as_tibble</span><span>(</span><span>data</span><span>)</span><span>

</span><span>startdate</span><span> </span><span><-</span><span> </span><span>as.Date</span><span>(</span><span>'1974-1-1'</span><span>)</span><span>

</span><span>data</span><span>$</span><span>date</span><span> </span><span><-</span><span> </span><span>seq.Date</span><span>(</span><span>startdate</span><span>,</span><span>by</span><span> </span><span>=</span><span> </span><span>'month'</span><span>,</span><span>length.out</span><span> </span><span>=</span><span> </span><span>72</span><span>)</span><span>


</span><span>newdata</span><span> </span><span><-</span><span> </span><span>tidyr</span><span>::</span><span>gather</span><span>(</span><span>data</span><span>,</span><span> </span><span>key</span><span> </span><span>=</span><span> </span><span>key</span><span>,</span><span> </span><span>value</span><span> </span><span>=</span><span> </span><span>value</span><span>,</span><span>-</span><span>date</span><span>)</span><span>

</span><span>newdata</span><span>$</span><span>value</span><span> </span><span><-</span><span> </span><span>as.numeric</span><span>(</span><span>newdata</span><span>$</span><span>value</span><span>)</span><span>
</span><span># adding in an additional colum to facet by</span><span>
</span><span>newdata</span><span>$</span><span>random</span><span> </span><span><-</span><span> </span><span>stringr</span><span>::</span><span>str_sub</span><span>(</span><span>newdata</span><span>$</span><span>key</span><span>,</span><span>1</span><span>,</span><span>1</span><span>)</span><span>


</span><span>gtest</span><span> </span><span><-</span><span> </span><span>function</span><span>(</span><span>df</span><span>,</span><span>x</span><span>,</span><span>y</span><span>,</span><span>...</span><span>)</span><span> </span><span>{</span><span>
  
  </span><span># check out the ... in the function parameters</span><span>
  
  </span><span>x_quo</span><span> </span><span><-</span><span> </span><span>enquo</span><span>(</span><span>x</span><span>)</span><span>
  
  </span><span>y_quo</span><span> </span><span><-</span><span> </span><span>enquo</span><span>(</span><span>y</span><span>)</span><span>
  
  </span><span>groupvars</span><span> </span><span><-</span><span> </span><span>quos</span><span>(</span><span>...</span><span>)</span><span> </span><span>## this is a new bit too</span><span>
  
  
  
  </span><span>p</span><span> </span><span><-</span><span> </span><span>ggplot</span><span>(</span><span>df</span><span>,</span><span>aes</span><span>(</span><span>x</span><span> </span><span>=</span><span> </span><span>!!</span><span>x_quo</span><span>,</span><span> </span><span>y</span><span> </span><span>=</span><span> </span><span>!!</span><span>y_quo</span><span>))</span><span> </span><span>+</span><span>   </span><span>#bangin'   </span><span>
    
    </span><span>geom_line</span><span>(</span><span>colour</span><span> </span><span>=</span><span> </span><span>"blue"</span><span>,</span><span> </span><span>group</span><span> </span><span>=</span><span> </span><span>1</span><span>)</span><span> </span><span>+</span><span> 
    
    </span><span>geom_point</span><span>(</span><span>colour</span><span> </span><span>=</span><span> </span><span>"blue"</span><span>)</span><span> </span><span>+</span><span>
    
    </span><span>facet_wrap</span><span>(</span><span>groupvars</span><span>,</span><span> </span><span>ncol</span><span> </span><span>=</span><span> </span><span>2</span><span>)</span><span> </span><span>#look Ma, no tilde. Also, 2 cols, not 3</span><span>
  
  
  </span><span>p</span><span> </span><span><-</span><span> </span><span>p</span><span> </span><span>+</span><span> </span><span>ggtitle</span><span>(</span><span>label</span><span> </span><span>=</span><span> </span><span>"Easy Tidy Eval in ggplot 3.0.0"</span><span>,</span><span>
                   
                   </span><span>subtitle</span><span> </span><span>=</span><span> </span><span>"ggplot with tidy evaluation & facetting with no strings"</span><span>)</span><span>
  
  </span><span>p</span><span> </span><span><-</span><span> </span><span>p</span><span> </span><span>+</span><span> </span><span>labs</span><span>(</span><span>x</span><span> </span><span>=</span><span> </span><span>NULL</span><span>,</span><span> </span><span>y</span><span> </span><span>=</span><span> </span><span>NULL</span><span>,</span><span> </span><span>caption</span><span> </span><span>=</span><span> </span><span>""</span><span>)</span><span> </span><span>+</span><span>
    </span><span>theme_bw</span><span>()</span><span>
  
  </span><span>p</span><span>
</span><span>}</span><span>

</span><span>gtest</span><span>(</span><span>newdata</span><span>,</span><span>date</span><span>,</span><span>value</span><span>,</span><span>random</span><span>,</span><span>key</span><span>)</span><span>
</span>
Pretty ...
[Read more...]

anytime 0.3.1

July 3, 2018 | Thinking inside the box

A new minor release of the anytime package is now on CRAN. This is the twelveth release, and the first in a little over a year as the package has stabilized. anytime is a very focused package aiming to do just one thing really well: to convert anything in integer, ... [Read more...]

Marginal Effects for Regression Models in R #rstats #dataviz

July 3, 2018 | Daniel

Regression coefficients are typically presented as tables that are easy to understand. Sometimes, estimates are difficult to interpret. This is especially true for interaction or transformed terms (quadratic or cubic terms, polynomials, splines), in particular for more complex models. In such cases, coefficients are no longer interpretable in a direct ...
[Read more...]

A Comparative Review of the Rattle GUI for R

July 2, 2018 | Bob Muenchen

Introduction Rattle is a popular free and open source Graphical User Interface (GUI) for the R software, one that focuses on beginners looking to point-and-click their way through data mining tasks. Such tasks are also referred to as machine learning or predictive analytics.  … Continue reading →
[Read more...]

An introduction to Tensorflow

July 2, 2018 | R on Coding Club UC3M

Tensorflow has been widely used for many applications in machine learning and deep learning. However, Tensorflow is more than that, it is a general purpose computing library. Based on that, people have created a rich ecosystem for quickly developing models. In this talk, I will show how statisticians can get ...
[Read more...]

A Tour of Timezones (& Troubles) in R

July 2, 2018 | Rstats on pi: predict/infer

In any programming tool, dates, times, and timezones are hard. Deceptively hard. They’ve been shaped by politics and whimsy for hundreds of years: timezones can shift with minimal notice, countries have skipped or repeated certain days, some are offset by weird increments, some observe Daylight Saving Time, leap years, ... [Read more...]

Elo and EloBeta models in snooker

July 2, 2018 | QuestionFlow

Research about adequacy of Elo based models applied to snooker match results. Contains a novel approach (EloBeta) targeted for sport results with variable "best of N" format. Prologue For many years I’ve been following snooker as a sport. It has it all: hypnotic beauty of smart play, elegance of ... [Read more...]

tikzDevice has a new home

July 2, 2018 | Ralf Stubner

Back in February the tikzDevice package became ORPHANED on CRAN. Consequently Kirill Müller and Yihui Xie searched for a new maintainer. When I read about it some time later, we decided that it makes sense for us to step in here. After a brief mail exchange with Yihui Xie ... [Read more...]

seplyr 0.5.8 Now Available on CRAN

July 2, 2018 | John Mount

We are pleased to announce that seplyr version 0.5.8 is now available on CRAN. seplyr is an R package that provides a thin wrapper around elements of the dplyr package and (now with version 0.5.8) the tidyr package. The intent is to give the part time R user the ability to easily ... [Read more...]

Why R 2018 Winners

July 2, 2018 | jumpingrivers

So it’s here… After lots of entries (147 to be precise), we can finally announce the winner of the WhyR 2018 Competition! But first, we have to tell you quickly about how we picked the winner. How we did it So it really wasn’t that hard. We held the questionnaire ... [Read more...]

Why R 2018 Winners

July 2, 2018 | jumpingrivers

So it’s here… After lots of entries (147 to be precise), we can finally announce the winner of the WhyR 2018 Competition! But first, we have to tell you quickly about how we picked the winner. How we did it So it really wasn’t that hard. We held the questionnaire ... [Read more...]

Handling Outliers with R

July 2, 2018 | jameshunterbr

Recently, I attended a presentation where the following graph was shown illustrating the response to stimulation with Thalidomide among a cohort of HIV-1 patients. The biomarker used to measure the response in this case was TNF (tumor necrosis factor) and … Continue reading →
[Read more...]

Freeing PDF Data to Account for the Unaccounted

July 2, 2018 | hrbrmstr

I’ve mentioned @stiles before on the blog but for those new to my blatherings, Matt is a top-notch data journalist with the @latimes and currently stationed in South Korea. I can only imagine how much busier his life has gotten since that fateful, awful November 2016 Tuesday, but I’m ...
[Read more...]

Pushing Ordinary Least Squares to the limit with Xy()

July 2, 2018 | André Bleier

Introduction to Xy() Simulation is mostly about answering particular research questions. Whenever the word simulation appears somewhere in a discussion, everyone knows that this means additional effort. At STATWORX we are using simulations as a first step to proof concepts we are developing. Sometimes such a simulation is simple, in ...
[Read more...]
1 14 15 16 17

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)