It's been said that data is the new "dirt"—the raw material from which and on which you build the structures of the modern world. And like dirt, data can seem like a limitless, undifferentiated mass. The ability to take raw data, access it, filter it, process it, visualize it, understand it, and communicate it to others is possibly the most essential business problem for the coming decades.
"Machine learning," the process of automating tasks once considered the domain of highly-trained analysts and mathematicians, is the key to efficiently extracting useful information from this sea of raw data. By implementing the core algorithms of statistical data processing, data analysis, and data visualization as reusable computer code, you can scale your capacity for data analysis well beyond the capabilities of individual knowledge workers.
Machine Learning in Action is a unique book that blends the foundational theories of machine learning with the practical realities of building tools for everyday data analysis. In it, you'll use the flexible Python programming language to build programs that implement algorithms for data classification, forecasting, recommendations, and higher-level features like summarization and simplification.
As you work through the numerous examples, you'll explore key topics like classification, numeric prediction, and clustering. Along the way, you'll be introduced to important established algorithms, such as Apriori, through which you identify association patterns in large datasets and Adaboost, a meta-algorithm that can increase the efficiency of many machine learning tasks.
Peter Harrington holds Bachelors and Masters Degrees in Electrical Engineering. He worked for Intel Corporation for seven years in California and China. Peter holds five US patents and his work has been published in three academic journals. He is currently the chief scientist for Zillabyte Inc. Peter spends his free time competing in programming competitions, and building 3D printers.
这本书最大的优点在于有源码实现,很赞,但是理论部分太差了,看了逻辑回归和支持向量机两章,发现好多理论都没讲,就比如逻辑回归中的Cost函数都没说,如果不了解,源码读起来也是一头雾水,所以对于初学者还需要一本理论较强的书,推荐李航博士的统计机器学习方法,刚好配套~
评分1. 这本书的价值是提供了一系列有趣的「实验作业」和「对应的数据」,以及乱七八糟的 Python 代码,迫使读者在同样数据集上自己写一个更好的。 2. 作者的 Python 代码写得真的真的很渣。 3. 作者的 SVM 写错了,不是 Platt 的原始 SMO 算法,里面的 error cache 形同虚设。 ...
评分如果你是机器学习的入门者,如果你想快速看到算法的执行效果,那么这本书适合你。 作者把算法的基本原理讲的很清楚,而且代码是完整可执行的。当然,如果你想了解算法背后的数学原理,还需要花时间去复习一下概率论、高等数学和线性代数。 BTW:读者最好有编程经验,有抽象思维。
评分客观说,完全不能当入门书。 缺少必要的证明过程,有些甚至连公式都没有。 我觉得既然要学习机器学习,光改改代码完全是不够的,起码还得知道各个算法的基本公式和过程,不幸的是,这本书没有。 就比如逻辑斯蒂回归那章,他连损失函数都没提,就开始说梯度法了。问题是梯度法的...
评分对ML主要工具简单介绍 上手快 挺好 FP Tree没看 SVM/CART/AdaBoost/Apriori还需要再看看
评分随便翻翻,当复习Python和相关库了。适合初学者。
评分看这书可以同时入门机器学习,python,mapreduce,作者可以几个方面都讲清楚,真不容易
评分随便翻翻,当复习Python和相关库了。适合初学者。
评分入门好书
本站所有内容均为互联网搜索引擎提供的公开搜索信息,本站不存储任何数据与内容,任何内容与数据均与本站无关,如有需要请联系相关搜索引擎包括但不限于百度,google,bing,sogou 等
© 2025 book.wenda123.org All Rights Reserved. 图书目录大全 版权所有