Data Algorithms: Recipes for Scaling Up with Hadoop and Spark by Mahmoud Parsian

Data Algorithms: Recipes for Scaling Up with Hadoop and Spark

Mahmoud Parsian
778 pages
O'Reilly Media, 2015.
Aug 2015
Computers & Internet WSBN
1
Readers
0
Reviews
0
Discussions
0
Quotes
<p>If you are ready to dive into the MapReduce framework for processing large datasets, this practical book takes you step by step through the algorithms and tools you need to build distributed MapReduce applications with Apache Hadoop or Apache Spark. Each chapter provides a recipe for solving a massive computational problem, such as building a recommendation system. You'll learn how to implement the appropriate MapReduce solution with code that you can use in your projects.</p><p>Dr. Mahmoud Parsian covers basic design patterns, optimization techniques, and data mining and machine learning solutions for problems in bioinformatics, genomics, statistics, and social network analysis. This book also includes an overview of MapReduce, Hadoop, and Spark.</p><p>Topics include:</p>Market basket analysis for a large set of transactionsData mining algorithms (K-means, KNN, and Naive Bayes) Using huge genomic data to sequence DNA and RNANaive Bayes theorem and Markov chains for data and market predictionRecommendation algorithms and pairwise document similarityLinear regression, Cox regression, and Pearson correlationAllelic frequency and mining DNASocial network analysis (recommendation systems, counting triangles, sentiment analysis)
Join the conversation

No discussions yet. Join BookLovers to start a discussion about this book!

No reviews yet. Join BookLovers to write the first review!

No quotes shared yet. Join BookLovers to share your favorite quotes!

Earn Points
Your voice matters. Every comment, review, and quote earns you reward points redeemable for Bitcoin.
Comment +5 pts Review +20 pts Quote +7 pts Upvote +1 pt
BookMatch Quiz
Find books similar to this one
About this book
Pages 778
Publisher O'Reilly Media, 2015...
Published 2015
Readers 1