It's been said that data is the new "dirt"—the raw material from which and on which you build the structures of the modern world. And like dirt, data can seem like a limitless, undifferentiated mass. The ability to take raw data, access it, filter it, process it, visualize it, understand it, and communicate it to others is possibly the most essential business problem for the coming decades.
"Machine learning," the process of automating tasks once considered the domain of highly-trained analysts and mathematicians, is the key to efficiently extracting useful information from this sea of raw data. By implementing the core algorithms of statistical data processing, data analysis, and data visualization as reusable computer code, you can scale your capacity for data analysis well beyond the capabilities of individual knowledge workers.
Machine Learning in Action is a unique book that blends the foundational theories of machine learning with the practical realities of building tools for everyday data analysis. In it, you'll use the flexible Python programming language to build programs that implement algorithms for data classification, forecasting, recommendations, and higher-level features like summarization and simplification.
As you work through the numerous examples, you'll explore key topics like classification, numeric prediction, and clustering. Along the way, you'll be introduced to important established algorithms, such as Apriori, through which you identify association patterns in large datasets and Adaboost, a meta-algorithm that can increase the efficiency of many machine learning tasks.
Peter Harrington holds Bachelors and Masters Degrees in Electrical Engineering. He worked for Intel Corporation for seven years in California and China. Peter holds five US patents and his work has been published in three academic journals. He is currently the chief scientist for Zillabyte Inc. Peter spends his free time competing in programming competitions, and building 3D printers.
1. 这本书的价值是提供了一系列有趣的「实验作业」和「对应的数据」,以及乱七八糟的 Python 代码,迫使读者在同样数据集上自己写一个更好的。 2. 作者的 Python 代码写得真的真的很渣。 3. 作者的 SVM 写错了,不是 Platt 的原始 SMO 算法,里面的 error cache 形同虚设。 ...
评分如果你是机器学习的入门者,如果你想快速看到算法的执行效果,那么这本书适合你。 作者把算法的基本原理讲的很清楚,而且代码是完整可执行的。当然,如果你想了解算法背后的数学原理,还需要花时间去复习一下概率论、高等数学和线性代数。 BTW:读者最好有编程经验,有抽象思维。
评分为什么我会力荐这本书? 也许书中分类器都非常的简单,数学理论都非常的粗浅(为了看明白书中SVM分类器的训练过程,不得不去复习了二次凸优化解法,自己推导被作者略去的中间过程),算法测试也只在轻量级的数据集上完成。 不过,大可不必像其他评论一样对贬低本书。聪明的读...
评分 评分这本书的最大好处是让你能够用最基本的pyton语法,从底层上让你构建代码,实现我们常说的比如邮件过滤,数据分类的应用。很多时候你要写最基本的代码和结构去做这些工作,而不是像kaggle的tutorial或者其他的工程大多数告诉你一个lib库函数去调用,你能看到底层在干什么...
理论条理清楚、举重若轻。可惜程序代码水平稍差。
评分看这书可以同时入门机器学习,python,mapreduce,作者可以几个方面都讲清楚,真不容易
评分看这书可以同时入门机器学习,python,mapreduce,作者可以几个方面都讲清楚,真不容易
评分读它是为了熟悉Python语言;内容是在不敢恭维。
评分是本好书,有些章节还看的不是最明白。值得反复阅读
本站所有内容均为互联网搜索引擎提供的公开搜索信息,本站不存储任何数据与内容,任何内容与数据均与本站无关,如有需要请联系相关搜索引擎包括但不限于百度,google,bing,sogou 等
© 2026 book.wenda123.org All Rights Reserved. 图书目录大全 版权所有