Monthly Archives: May 2014

ICU Unicode text transforms in the R package stringi

The ICU (International Components for Unicode) library provides very powerful and flexible ways to apply various Unicode text transforms. These include: Full (language-specific) case mappings, Unicode normalization, Text transliteration (e.g. script-to-script conversion). All of these are available to R programmers/users

Tagged with: , , , ,
Posted in Blog/R, Blog/R-bloggers

Counting the number of words in a LaTeX file with stringi

In my recent post I promised to present the most interesting features of the stringi package in more detail. Here's one of such jolly features. Many LaTeX users may find it very useful. Loading a text file with encoding auto-detection

Tagged with: , , , ,
Posted in Blog/LaTeX, Blog/R, Blog/R-bloggers

(String/text processing)++: stringi 0.2-3 released

A new release of the stringi package is available on CRAN (please wait a few days for Windows and OS X binary builds). stringi is a package providing (but definitely not limiting to) replacements for nearly all the character string

Tagged with: , , , ,
Posted in Blog/R, Blog/R-bloggers