Building Machine Learning Systems with Python - Third Edition

Building Machine Learning Systems with Python - Third Edition

Read it now on the O’Reilly learning platform with a 10-day free trial.

O’Reilly members get unlimited access to books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Book description

Get more from your data by creating practical machine learning systems with Python

Key Features

Book Description

Machine learning allows systems to learn things without being explicitly programmed to do so. Python is one of the most popular languages used to develop machine learning applications, which take advantage of its extensive library support. This third edition of Building Machine Learning Systems with Python addresses recent developments in the field by covering the most-used datasets and libraries to help you build practical machine learning systems.

Using machine learning to gain deeper insights from data is a key skill required by modern application developers and analysts alike. Python, being a dynamic language, allows for fast exploration and experimentation. This book shows you exactly how to find patterns in your raw data. You will start by brushing up on your Python machine learning knowledge and being introduced to libraries. You'll quickly get to grips with serious, real-world projects on datasets, using modeling and creating recommendation systems. With Building Machine Learning Systems with Python, you'll gain the tools and understanding required to build your own systems, all tailored to solve real-world data analysis problems.

By the end of this book, you will be able to build machine learning systems using techniques and methodologies such as classification, sentiment analysis, computer vision, reinforcement learning, and neural networks.

What you will learn

Who this book is for

Building Machine Learning Systems with Python is for data scientists, machine learning developers, and Python developers who want to learn how to build increasingly complex machine learning systems. You will use Python's machine learning capabilities to develop effective solutions. Prior knowledge of Python programming is expected.

Show and hide more Table of contents Product information

Table of contents

  1. Title Page
  2. Copyright and Credits
    1. Building Machine Learning Systems with Python Third Edition
    1. Why subscribe?
    2. PacktPub.com
    1. About the authors
    2. About the reviewers
    3. Packt is searching for authors like you
    1. Who this book is for
    2. What this book covers
    3. To get the most out of this book
      1. Download the example code files
      2. Download the color images
      3. Conventions used
      1. Reviews
      1. Machine learning and Python – a dream team
        1. What the book will teach you – and what it will not
        2. How to best read this book
        3. What to do when you are stuck
        4. Getting started
          1. Introduction to NumPy, SciPy, Matplotlib, and TensorFlow
          2. Installing Python
          3. Chewing data efficiently with NumPy and intelligently with SciPy
          4. Learning NumPy
            1. Indexing
            2. Handling nonexistent values
            3. Comparing the runtime
            1. Asking a question
            2. Getting answers
            1. Reading in the data
            2. Preprocessing and cleaning the data
            3. Choosing the right model and learning algorithm
              1. Before we build our first model
              2. Starting with a simple straight line
              3. Toward more complex models
              4. Stepping back to go forward - another look at our data
              5. Training and testing
              6. Answering our initial question
              1. The Iris dataset
                1. Visualization is a good first step
                2. Classifying with scikit-learn
                3. Building our first classification model
                1. Learning about the seeds dataset
                2. Features and feature engineering
                3. Nearest neighbor classification
                4. Looking at the decision boundaries
                1. Predicting house prices with regression
                2. Multidimensional regression
                3. Cross-validation for regression
                  1. Penalized or regularized regression
                  2. L1 and L2 penalties
                  1. Visualizing the Lasso path
                  2. P-greater-than-N scenarios
                  3. An example based on text documents
                  4. Setting hyperparameters in a principled way
                  1. Sketching our roadmap
                  2. Learning to classify classy answers
                    1. Tuning the instance
                    2. Tuning the classifier
                    1. Slimming the data down to chewable chunks
                    2. Preselecting and processing attributes
                    3. Defining what a good answer is
                    1. Engineering the features
                    2. Training the classifier
                    3. Measuring the classifier's performance
                    4. Designing more features
                    1. Bias, variance and their trade-off
                    2. Fixing high bias
                    3. Fixing high variance
                    4. High or low bias?
                    1. A bit of math with a small example
                    2. Applying logistic regression to our post-classification problem
                    1. Sketching our roadmap
                    2. Selecting features
                      1. Detecting redundant features using filters
                        1. Correlation
                        2. Mutual information
                        1. Principal component analysis
                          1. Sketching PCA
                          2. Applying PCA
                          1. Measuring the relatedness of posts
                            1. How not to do it
                            2. How to do it
                            1. Converting raw text into a bag of words
                              1. Counting words
                              2. Normalizing word count vectors
                              3. Removing less important words
                              4. Stemming
                                1. Installing and using NLTK
                                2. Extending the vectorizer with NLTK's stemmer
                                1. K-means
                                2. Getting test data to evaluate our ideas
                                3. Clustering posts
                                1. Another look at noise
                                1. Rating predictions and recommendations
                                2. Splitting into training and testing
                                3. Normalizing the training data
                                4. A neighborhood approach to recommendations
                                5. A regression approach to recommendations
                                6. Combining multiple methods
                                7. Basket analysis
                                  1. Obtaining useful predictions
                                  2. Analyzing supermarket shopping baskets
                                  1. More advanced basket analysis
                                  1. Using TensorFlow
                                    1. TensorFlow API
                                    2. Graphs
                                    3. Sessions
                                    4. Useful operations
                                    1. Training neural networks
                                    2. Convolutional neural networks
                                    3. Recurrent neural networks
                                    1. Sketching our roadmap
                                    2. Fetching the Twitter data
                                    3. Introducing the Naïve Bayes classifier
                                      1. Getting to know the Bayes theorem
                                      2. Being naïve
                                      3. Using Naïve Bayes to classify
                                      4. Accounting for unseen words and other oddities
                                      5. Accounting for arithmetic underflows
                                      1. Solving an easy problem first
                                      2. Using all classes
                                      3. Tuning the classifier's parameters
                                      1. Determining the word types
                                      2. Successfully cheating using SentiWordNet
                                      3. Our first estimator
                                      4. Putting everything together
                                      1. Latent Dirichlet allocation
                                        1. Building a topic model
                                        2. Comparing documents by topic
                                        3. Modeling the whole of Wikipedia
                                        4. Choosing the number of topics
                                        1. Sketching our roadmap
                                        2. Fetching the music data
                                          1. Converting into WAV format
                                          1. Decomposing music into sine-wave components
                                          1. Increasing experimentation agility
                                          2. Training the classifier
                                          3. Using a confusion matrix to measure accuracy in multiclass problems
                                          4. An alternative way to measure classifier performance using receiver-operator characteristics
                                          1. Introducing image processing
                                            1. Loading and displaying images
                                            2. Thresholding
                                            3. Gaussian blurring
                                            4. Putting the center in focus
                                            1. Types of reinforcement learning
                                              1. Policy and value network
                                              2. Q-network
                                              1. A small example
                                              2. Using Tensorflow for the text game
                                              3. Playing breakout
                                              1. Learning about big data
                                                1. Using jug to break up your pipeline into tasks
                                                2. An introduction to tasks in jug
                                                1. Online courses
                                                2. Books
                                                  1. Blogs
                                                  2. Data sources
                                                  3. Getting competitive
                                                  1. Leave a review - let other readers know what you think
                                                  Show and hide more

                                                  Product information

                                                  • Title: Building Machine Learning Systems with Python - Third Edition
                                                  • Author(s): Luis Pedro Coelho, Wilhelm Richert, Matthieu Brucher
                                                  • Release date: July 2018
                                                  • Publisher(s): Packt Publishing
                                                  • ISBN: 9781788623223

                                                  You might also like

                                                  Check it out now on O’Reilly

                                                  Dive in for free with a 10-day trial of the O’Reilly learning platform—then explore all the other resources our members count on to build skills and solve problems every day.