Blog Archives

Speeding up R packages’ installation process

There is a time for some things, and a time for all things; a time for great things, and a time for small things — Miguel de Cervantes Building R packages from sources may take a long time, especially if

Tagged with: , , ,
Posted in Blog/R, Blog/R-bloggers

Pull the (character) strings with stringi 0.5-2

A reliable string processing toolkit is a must-have for any data scientist. A new release of the stringi package is available on CRAN (please wait a few days for Windows and OS X binary builds). As for now, about 850

Tagged with:
Posted in Blog/News, Blog/R, Blog/R-bloggers

SimilaR

Introduction Being a teacher can be a very gratifying job. If you teach programming, which is your favorite hobby too, nothing can be better than that. Only thing can spoil your dream: cheating students. As we all know, one can

Posted in Blog/R, Blog/R-bloggers

Using Hadoop Streaming API to perform a word count job in R and C++

by Marek Gagolewski, Maciej Bartoszuk, Anna Cena, and Jan Lasek (Rexamine). Introduction In a recent blog post we explained how we managed to set up a working Hadoop environment on a few CentOS7 machines. To test the installation, let’s play

Tagged with: , , , , ,
Posted in Blog/Hadoop, Blog/R, Blog/R-bloggers

stringi 0.4-1 released – fast, portable, consistent character string processing

A new release of the stringi package is available on CRAN (please wait a few days for Windows and OS X binary builds). # install.packages("stringi") or update.packages() library("stringi") Here’s a list of changes in version 0.4-1. In the current release,

Tagged with: , , ,
Posted in Blog/News, Blog/R, Blog/R-bloggers

Faster, easier, and more reliable character string processing with stringi 0.3-1

A new release of the stringi package is available on CRAN (please wait a few days for Windows and OS X binary builds). # install.packages("stringi") or update.packages() library("stringi") stringi is an R package providing (but definitely not limiting to) equivalents

Tagged with: , , , , , ,
Posted in Blog/R, Blog/R-bloggers

R now will keep children away from drugs

Do you find this plot fancy? If yes, you can find the code at the end of this article BUT if you spend a little time to read it thoroughly, you can learn how to create better ones. We would

Tagged with: , , ,
Posted in Blog/R, Blog/R-bloggers

Playing with GUIs in R with RGtk2

Sometimes when we create some nice functions which we want to show other people who don’t know R we can do two things. We can teach them R what is not easy task which also takes time or we can

Tagged with: , , ,
Posted in Blog/R, Blog/R-bloggers

Text mining in R – Automatic categorization of Wikipedia articles

Text mining is currently a live issue in data analysis. Enoromus text data resourses on the Internet made it an important component of Big Data world. The potential of information hidden in the words is the reason why I find

Tagged with: , , ,
Posted in Blog/R, Blog/R-bloggers

ICU Unicode text transforms in the R package stringi

The ICU (International Components for Unicode) library provides very powerful and flexible ways to apply various Unicode text transforms. These include: Full (language-specific) case mappings, Unicode normalization, Text transliteration (e.g. script-to-script conversion). All of these are available to R programmers/users

Tagged with: , , , ,
Posted in Blog/R, Blog/R-bloggers