Article Open access Peer-reviewed

PivotalR: A Package for Machine Learning on Big Data

2014; Volume: 6; Issue: 1 Language: English

10.32614/rj-2014-006

ISSN

2073-4859

Authors

Hai Qian

Topic(s)

Data Mining Algorithms and Applications

Abstract

PivotalR is an R package that provides a front-end to PostgreSQL and all PostgreSQL-like databases, such as Pivotal Inc.'s Greenplum Database (GPDB) and HAWQ. When running on the products of Pivotal Inc., PivotalR utilizes the full power of parallel computation and distributed storage, and thus gives ordinary R users access to big data. PivotalR also provides an R wrapper for MADlib, an open-source library for scalable in-database analytics. MADlib provides data-parallel implementations of mathematical, statistical and machine-learning algorithms for structured and unstructured data. Thus PivotalR also enables the user to apply machine learning algorithms to big data.
