Original poster: Nicolle

Apache Mahout Clustering Designs [Share]

Nicolle (verified student), posted on 2016-6-25 10:36:00

About This Book
  • Use Mahout for clustering datasets and gain useful insights
  • Explore the different clustering algorithms used in day-to-day work
  • A practical guide to creating and evaluating your own clustering models using real-world datasets
Who This Book Is For

This book is for developers who want to try out clustering on large datasets using Mahout. It will also be useful for readers who have no background in Mahout but know basic programming and are familiar with the basics of machine learning and clustering. Prior experience with clustering techniques in another tool will be helpful.

What You Will Learn
  • Explore clustering algorithms and cluster evaluation techniques
  • Learn different types of clustering and distance measuring techniques
  • Perform clustering on your data using K-Means clustering
  • Discover how canopy clustering is used as a preprocessing step for K-Means
  • Use the Fuzzy K-Means algorithm in Apache Mahout
  • Implement Streaming K-Means clustering in Mahout
  • Learn Mahout's implementation of Spectral K-Means clustering
In Detail

As more and more organizations are discovering the use of big data analytics, interest in platforms that provide storage, computation, and analytic capabilities has increased. Apache Mahout caters to this need and paves the way for the implementation of complex algorithms in the field of machine learning to better analyse your data and get useful insights into it.

Starting with an introduction to clustering algorithms, this book provides insight into Apache Mahout and the different algorithms it uses for clustering data. It gives a general introduction to algorithms such as K-Means, Fuzzy K-Means, and StreamingKMeans, and shows how to use Mahout to cluster your data with a particular algorithm. You will study the different types of clustering and learn how to use Apache Mahout with real-world datasets to implement and evaluate your clusters.

This book also discusses cluster improvement and visualization using Mahout APIs, and explores model-based clustering and topic modelling using the Dirichlet process. Finally, you will learn how to build and deploy a model for production use.

Style and approach

This book is a hands-on guide with examples using real-world datasets. Each chapter begins by explaining the algorithm in detail and then shows how to use Mahout for that algorithm with example datasets.





蒼茫河漢橫, posted on 2017-2-22 13:30:30
How do I view this
jxf5245 (verified student), posted on 2017-4-22 01:21:05
Thanks, OP
蒼茫河漢橫, posted on 2017-5-22 14:41:19
How do I view this?