Gå direkt till innehållet
Parallel R
Parallel R
Spara

Parallel R

Läs i Adobe DRM-kompatibel e-boksläsareDen här e-boken är kopieringsskyddad med Adobe DRM vilket påverkar var du kan läsa den. Läs mer
Its tough to argue with R as a high-quality, cross-platform, open source statistical software productunless youre in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets, including three chapters on using R and Hadoop together. Youll learn the basics of Snow, Multicore, Parallel, Segue, RHIPE, and Hadoop Streaming, including how to find them, how to use them, when they work well, and when they dont.With these packages, you can overcome Rs single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address Rs memory barrier.Snow: works well in a traditional cluster environmentMulticore: popular for multiprocessor and multicore computersParallel: part of the upcoming R 2.14.0 releaseR+Hadoop: provides low-level access to a popular form of cluster computingRHIPE: uses Hadoops power with Rs language and interactive shellSegue: lets you use Elastic MapReduce as a backend for lapply-style operations
Undertitel
Data Analysis in the Distributed World
ISBN
9781449320331
Språk
Engelska
Utgivningsdatum
2011-10-21
Tillgängliga elektroniska format
  • Epub - Adobe DRM
Läs e-boken här
  • E-boksläsare i mobil/surfplatta
  • Läsplatta
  • Dator