Building Machine Learning Systems with Python - Third Edition

Read it now on the O’Reilly learning platform with a 10-day free trial.

O’Reilly members get unlimited access to books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Book description

Get more from your data by creating practical machine learning systems with Python

Key Features

Develop your own Python-based machine learning system
Discover how Python offers multiple algorithms for modern machine learning systems
Explore key Python machine learning libraries to implement in your projects

Book Description

Machine learning allows systems to learn things without being explicitly programmed to do so. Python is one of the most popular languages used to develop machine learning applications, which take advantage of its extensive library support. This third edition of Building Machine Learning Systems with Python addresses recent developments in the field by covering the most-used datasets and libraries to help you build practical machine learning systems.

Using machine learning to gain deeper insights from data is a key skill required by modern application developers and analysts alike. Python, being a dynamic language, allows for fast exploration and experimentation. This book shows you exactly how to find patterns in your raw data. You will start by brushing up on your Python machine learning knowledge and being introduced to libraries. You'll quickly get to grips with serious, real-world projects on datasets, using modeling and creating recommendation systems. With Building Machine Learning Systems with Python, you'll gain the tools and understanding required to build your own systems, all tailored to solve real-world data analysis problems.

By the end of this book, you will be able to build machine learning systems using techniques and methodologies such as classification, sentiment analysis, computer vision, reinforcement learning, and neural networks.

What you will learn

Build a classification system that can be applied to text, images, and sound
Employ Amazon Web Services (AWS) to run analysis on the cloud
Solve problems related to regression using scikit-learn and TensorFlow
Recommend products to users based on their past purchases
Understand different ways to apply deep neural networks on structured data
Address recent developments in the field of computer vision and reinforcement learning

Who this book is for

Building Machine Learning Systems with Python is for data scientists, machine learning developers, and Python developers who want to learn how to build increasingly complex machine learning systems. You will use Python's machine learning capabilities to develop effective solutions. Prior knowledge of Python programming is expected.

Show and hide more Table of contents Product information

Title Page
Copyright and Credits
1. Building Machine Learning Systems with Python Third Edition
1. Why subscribe?
2. PacktPub.com
1. About the authors
2. About the reviewers
3. Packt is searching for authors like you
1. Who this book is for
2. What this book covers
3. To get the most out of this book
  1. Download the example code files
  2. Download the color images
  3. Conventions used
  1. Reviews
  1. Machine learning and Python – a dream team
    1. What the book will teach you – and what it will not
    2. How to best read this book
    3. What to do when you are stuck
    4. Getting started
      1. Introduction to NumPy, SciPy, Matplotlib, and TensorFlow
      2. Installing Python
      3. Chewing data efficiently with NumPy and intelligently with SciPy
      4. Learning NumPy
        
        Indexing
        
        Handling nonexistent values
        
        Comparing the runtime
        
        Asking a question
        
        Getting answers
        
        Reading in the data
        
        Preprocessing and cleaning the data
        
        Choosing the right model and learning algorithm
        
        Before we build our first model
        
        Starting with a simple straight line
        
        Toward more complex models
        
        Stepping back to go forward - another look at our data
        
        Training and testing
        
        Answering our initial question
        
        The Iris dataset
        
        Visualization is a good first step
        
        Classifying with scikit-learn
        
        Building our first classification model
        
        Learning about the seeds dataset
        
        Features and feature engineering
        
        Nearest neighbor classification
        
        Looking at the decision boundaries
        
        Predicting house prices with regression
        
        Multidimensional regression
        
        Cross-validation for regression
        
        Penalized or regularized regression
        
        L1 and L2 penalties
        
        Visualizing the Lasso path
        
        P-greater-than-N scenarios
        
        An example based on text documents
        
        Setting hyperparameters in a principled way
        
        Sketching our roadmap
        
        Learning to classify classy answers
        
        Tuning the instance
        
        Tuning the classifier
        
        Slimming the data down to chewable chunks
        
        Preselecting and processing attributes
        
        Defining what a good answer is
        
        Engineering the features
        
        Training the classifier
        
        Measuring the classifier's performance
        
        Designing more features
        
        Bias, variance and their trade-off
        
        Fixing high bias
        
        Fixing high variance
        
        High or low bias?
        
        A bit of math with a small example
        
        Applying logistic regression to our post-classification problem
        
        Sketching our roadmap
        
        Selecting features
        
        Detecting redundant features using filters
        
        Correlation
        
        Mutual information
        
        Principal component analysis
        
        Sketching PCA
        
        Applying PCA
        
        Measuring the relatedness of posts
        
        How not to do it
        
        How to do it
        
        Converting raw text into a bag of words
        
        Counting words
        
        Normalizing word count vectors
        
        Removing less important words
        
        Stemming
        
        Installing and using NLTK
        
        Extending the vectorizer with NLTK's stemmer
        
        K-means
        
        Getting test data to evaluate our ideas
        
        Clustering posts
        
        Another look at noise
        
        Rating predictions and recommendations
        
        Splitting into training and testing
        
        Normalizing the training data
        
        A neighborhood approach to recommendations
        
        A regression approach to recommendations
        
        Combining multiple methods
        
        Basket analysis
        
        Obtaining useful predictions
        
        Analyzing supermarket shopping baskets
        
        More advanced basket analysis
        
        Using TensorFlow
        
        TensorFlow API
        
        Graphs
        
        Sessions
        
        Useful operations
        
        Training neural networks
        
        Convolutional neural networks
        
        Recurrent neural networks
        
        Sketching our roadmap
        
        Fetching the Twitter data
        
        Introducing the Naïve Bayes classifier
        
        Getting to know the Bayes theorem
        
        Being naïve
        
        Using Naïve Bayes to classify
        
        Accounting for unseen words and other oddities
        
        Accounting for arithmetic underflows
        
        Solving an easy problem first
        
        Using all classes
        
        Tuning the classifier's parameters
        
        Determining the word types
        
        Successfully cheating using SentiWordNet
        
        Our first estimator
        
        Putting everything together
        
        Latent Dirichlet allocation
        
        Building a topic model
        
        Comparing documents by topic
        
        Modeling the whole of Wikipedia
        
        Choosing the number of topics
        
        Sketching our roadmap
        
        Fetching the music data
        
        Converting into WAV format
        
        Decomposing music into sine-wave components
        
        Increasing experimentation agility
        
        Training the classifier
        
        Using a confusion matrix to measure accuracy in multiclass problems
        
        An alternative way to measure classifier performance using receiver-operator characteristics
        
        Introducing image processing
        
        Loading and displaying images
        
        Thresholding
        
        Gaussian blurring
        
        Putting the center in focus
        
        Types of reinforcement learning
        
        Policy and value network
        
        Q-network
        
        A small example
        
        Using Tensorflow for the text game
        
        Playing breakout
        
        Learning about big data
        
        Using jug to break up your pipeline into tasks
        
        An introduction to tasks in jug
        
        Online courses
        
        Books
        
        Blogs
        
        Data sources
        
        Getting competitive
        
        Leave a review - let other readers know what you think
        
        Show and hide more
        Product information
        
        Title: Building Machine Learning Systems with Python - Third Edition
        
        Author(s): Luis Pedro Coelho, Wilhelm Richert, Matthieu Brucher
        
        Release date: July 2018
        
        Publisher(s): Packt Publishing
        
        ISBN: 9781788623223
        
        You might also like
        
        Check it out now on O’Reilly
        
        Dive in for free with a 10-day trial of the O’Reilly learning platform—then explore all the other resources our members count on to build skills and solve problems every day.

Building Machine Learning Systems with Python - Third Edition

Book description

Key Features

Book Description

What you will learn

Who this book is for

Table of contents

Product information

You might also like

Check it out now on O’Reilly