ARCHIVED: What is the Programming with Big Data in R (pbdR) project?

This content has been archived, and is no longer maintained by Indiana University. Information here may no longer be accurate, and links may no longer be available or reliable.

Programming with Big Data in R (pbdR) is an Extreme Science and Engineering Discovery Environment (XSEDE) project that enables high-level distributed data parallelism in R for the analysis of "Big Data" on large distributed systems.

The pbdR project provides several packages featuring simple interfaces to scalable, high-performance libraries (e.g., MPI, ScaLAPACK, and NetCDF4). The packages are intended for use in the Single Program/Multiple Data (SPMD) programming model for batch parallel computing.

Project partners include the Oak Ridge National Laboratory (ORNL), the Oak Ridge Leadership Computing Facility (OLCF), and the National Institute for Computational Sciences (NICS).

For more, see:

If you have questions, comments, or bug reports, email RBigData@gmail.com.

This document was developed with support from National Science Foundation (NSF) grants 1053575 and 1548562. Any opinions, findings, conclusions, or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NSF.

This is document bcrw in the Knowledge Base.
Last modified on 2018-02-21 14:05:42.