Search results for Pattern Recognition and Machine Learning

Inference in Statistical Modelling and Machine Learning

A Concise Introduction
James Burridge, Nick Tosh
Coming soon
Expected online publication date:

May 2026

Print publication:

31 May 2026
- Book
- Export citation
Statistical modelling and machine learning offer a vast toolbox of inference methods with which to model the world, discover patterns and reach beyond the data to make predictions when the truth is not certain. This concise book provides a clear introduction to those tools and to the core ideas – probabilistic model, likelihood, prior, posterior, overfitting, underfitting, cross-validation – that unify them. A mixture of toy and real examples illustrates diverse applications ranging from biomedical data to treasure hunts, while the accompanying datasets and computational notebooks in R and Python encourage hands-on learning. Instructors can benefit from online lecture slides and exercise solutions. Requiring only first-year university-level knowledge of calculus, probability and linear algebra, the book equips students in statistics, data science and machine learning, as well as those in quantitative applied and social science programmes, with the tools and conceptual foundations to explore more advanced techniques.

Accelerating Deep Neural Networks

Ryoma Sato
Coming soon
Expected online publication date:

May 2026

Print publication:

31 May 2026
- Book
- Export citation
Deep learning models are powerful, but often large, slow, and expensive to run. This book is a practical guide to accelerating and compressing neural networks using proven techniques such as quantization, pruning, distillation, and fast architectures. It explains how and why these methods work, fostering a comprehensive understanding. Written for engineers, researchers, and advanced students, the book combines clear theoretical insights with hands-on PyTorch implementations and numerical results. Readers will learn how to reduce inference time and memory usage, lower deployment costs, and select the right acceleration strategy for their task. Whether you're working with large language models, vision systems, or edge devices, this book gives you the tools and intuition needed to build faster, leaner AI systems, without sacrificing performance. It is perfect for anyone who wants to go beyond intuition and take a principled approach to optimizing AI systems

All of Regression

Isabella Verdinelli, Larry Wasserman
Coming soon
Expected online publication date:

May 2026

Print publication:

30 June 2026
- Book
- Export citation
This comprehensive modern look at regression covers a wide range of topics and relevant contemporary applications, going well beyond the topics covered in most introductory books. With concision and clarity, the authors present linear regression, nonparametric regression, classification, logistic and Poisson regression, high-dimensional regression, quantile regression, conformal prediction and causal inference. There are also brief introductions to neural nets, deep learning, random effects, survival analysis, graphical models and time series. Suitable for advanced undergraduate and beginning graduate students, the book will also serve as a useful reference for researchers and practitioners in data science, machine learning, and artificial intelligence who want to understand modern methods for data analysis.

Markov Decision Processes and Reinforcement Learning

Martin L. Puterman, Timothy C. Y. Chan
Coming soon
Expected online publication date:

April 2026

Print publication:

30 April 2026
- Book
- Export citation
This book offers a comprehensive introduction to Markov decision process and reinforcement learning fundamentals using common mathematical notation and language. Its goal is to provide a solid foundation that enables readers to engage meaningfully with these rapidly evolving fields. Topics covered include finite and infinite horizon models, partially observable models, value function approximation, simulation-based methods, Monte Carlo methods, and Q-learning. Rigorous mathematical concepts and algorithmic developments are supported by numerous worked examples. As an up-to-date successor to Martin L. Puterman's influential 1994 textbook, this volume assumes familiarity with probability, mathematical notation, and proof techniques. It is ideally suited for students, researchers, and professionals in operations research, computer science, engineering, and economics.

Differential Equations and Variational Methods on Graphs

With Applications to Machine Learning and Image Analysis
Yves van Gennip, Jeremy Budd
Coming soon
Expected online publication date:

April 2026

Print publication:

30 April 2026
- Book
- Export citation
The burgeoning field of differential equations on graphs has experienced significant growth in the past decade, propelled by the use of variational methods in imaging and by its applications in machine learning. This text provides a detailed overview of the subject, serving as a reference for researchers and as an introduction for graduate students wishing to get up to speed. The authors look through the lens of variational calculus and differential equations, with a particular focus on graph-Laplacian-based models and the graph Ginzburg-Landau functional. They explore the diverse applications, numerical challenges, and theoretical foundations of these models. A meticulously curated bibliography comprising approximately 800 references helps to contextualise this work within the broader academic landscape. While primarily a review, this text also incorporates some original research, extending or refining existing results and methods.

Bandit Convex Optimisation

Tor Lattimore
Coming soon
Expected online publication date:

February 2026

Print publication:

28 February 2026
- Book
- Export citation
This comprehensive reference brings readers to the frontier of research on bandit convex optimization or zeroth-order convex optimization. The focus is on theoretical aspects, with short, self-contained chapters covering all the necessary tools from convex optimization and online learning, including gradient-based algorithms, interior point methods, cutting plane methods and information-theoretic machinery. The book features a large number of exercises, open problems and pointers to future research directions, making it ideal for students as well as researchers.

10 - Support Vector Machine Classifier
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 495-532
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To understand the working principle of support vector machine (SVM).
• To comprehend the rules for identification of correct hyperplane.
• To understand the concept of support vectors, maximized margin, positive and negative hyperplanes.
• To apply an SVM classifier for a linear and non-linear dataset.
• To understand the process of mapping data points to higher dimensional space.
• To comprehend the working principle of the SVM Kernel.
• To highlight the applications of SVM.
10.1 Support Vector Machines
Support vector machines (SVMs) are supervised machine learning (ML) models used to solve regression and classification problems. However, it is widely used for solving classification problems. The main goal of SVM is to segregate the n-dimensional space into labels or classes by defining a decision boundary or hyperplanes. In this chapter, we shall explore SVM for solving classification problems.
10.1.1 SVM Working Principle
SVM Working Principle | Parteek Bhatia, https://youtu.be/UhzBKrIKPyE
To understand the working principle of the SVM classifier, we will take a standard ML problem where we want a machine to distinguish between a peach and an apple based on their size and color.
Let us suppose the size of the fruit is represented on the X-axis and the color of the fruit is on the Y-axis. The distribution of the dataset of apple and peach is shown in Figure 10.1.
To classify it, we must provide the machine with some sample stock of fruits and label each of the fruits in the stock as an “apple” or “peach”. For example, we have a labeled dataset of some 100 fruits with corresponding labels, i.e., “apple” or “peach”. When this data is fed into a machine, it will analyze these fruits and train itself. Once the training is completed, if some new fruit comes into the stock, the machine will classify whether it is an “apple” or a “peach”.
Most of the traditional ML algorithms would learn by observing the perfect apples and perfect peaches in the stock, i.e., they will train themselves by observing the ideal apples of stock (apples which are very much like apples in terms of their size and color) and the perfect peaches of stock (peaches which are very much like peaches in terms of their size and color). These standard samples are likely to be found in the heart of stock. The heart of the stock is shown in Figure 10.2.

1 - Overview of Machine Learning
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 1-52
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To define machine learning (ML) and discuss its applications.
• To learn the differences between traditional programming and ML.
• To understand the importance of labeled and unlabeled data and its various usage for ML.
• To understand the working principle of supervised, unsupervised, and reinforcement learnings.
• To understand the key terms like data science, data mining, artificial intelligence, and deep learning.
1.1 Introduction
In today’s data-driven world, information flows through the digital landscape like an untapped river of potential. Within this vast data stream lies the key to unlocking a new era of discovery and innovation. Machine learning (ML), a revolutionary field, acts as the gateway to this wealth of opportunities. With its ability to uncover patterns, make predictive insights, and adapt to evolving information, ML has transformed industries, redefined technology, and opened the door to limitless possibilities. This book is your gateway to the fascinating realm of ML—a journey that empowers you to harness the power of data, enabling you to build intelligent systems, make informed decisions, and explore the boundless possibilities of the digital age.
ML has emerged as the dominant approach for solving problems in the modern world, and its wide-ranging applications have made it an integral part of our lives. Right from search engines to social networking sites, everything is powered by ML algorithms. Your favorite search engine uses ML algorithms to get you the appropriate search results. Smart home assistants like Alexa and Siri use ML to serve us better. The influence of ML in our day-to-day activities is so much that we cannot even realize it. Online shopping sites like Amazon, Flipkart, and Myntra use ML to recommend products. Facebook is using ML to display our feed. Netflix and YouTube are using ML to recommend videos based on our interests.
Data is growing exponentially with the Internet and smartphones, and ML has just made this data more usable and meaningful. Social media, entertainment, travel, mining, medicine, bioinformatics, or any field you could name uses ML in some form.
To understand the role of ML in the modern world, let us first discuss the applications of ML.

16 - Artificial Neural Network
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 821-864
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To understand the concept of artificial neural network (ANN).
• To comprehend the working of the human brain as an inspiration for the development of neural network.
• To understand the mapping of human brain neurons to an ANN.
• To understand the working of ANN with case studies.
• To understand the role of weights in building ANN.
• To perform forward and backward propagation to train the neural networks.
• To understand different activation functions like threshold function, sigmoid function, rectifier linear unit function, and hyperbolic tangent function.
• To find the optimized value of weights for minimizing the cost function by using the gradient descent approach and stochastic gradient descent algorithm.
• To understand the concept of the mini-batch method.
16.1 Introduction to Artificial Neural Network
Neural networks and deep learning are the buzzwords in modern-day computer science. And, if you think that these are the latest entrants in this field, you probably have a misconception. Neural networks have been around for quite some time, and they have only started picking up now, putting up a huge positive impact on computer science.
Artificial neural network (ANN) was invented in the 1960s and 1970s. It became a part of common tech talks, and people started thinking that this machine learning (ML) technique would solve all the complex problems that were challenging the researchers during that time. But sooner, the hopes and expectations died off over the next decade.
The decline could not be attributed to some loopholes in neural networks, but the major reason for the decline was the “technology” itself. The technology back then was not up to the right standard to facilitate neural networks as they needed a lot of data for training and huge computation resources for building the model. During that time, both data and computing power were scarce. Hence, the resulting neural network remained only on paper rather than taking centerstage of the machine to solve some real-world problems.
Later on, at the beginning of the 21st century, we saw a lot of improvements in storage techniques resulting in reduced cost per gigabyte of storage. Humanity witnessed a huge rise in big data due to the Internet boom and smartphones.

13 - Implementation of Clustering
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 699-738
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To implement the k-means clustering algorithm in Python.
• To determining the ideal number of clusters by implementing its code.
• To understand how to visualize clusters using plots.
• To create the dendrogram and find the optimal number of clusters for agglomerative hierarchical clustering.
• To compare results of k-means clustering with agglomerative hierarchical clustering.
• To implement clustering through various case studies.
13.1 Implementation of k-means Clustering and Hierarchical Clustering
In the previous chapter, we discussed various clustering algorithms. We learned that clustering algorithms are broadly classified into partitioning methods, hierarchical methods, and density-based methods. The k-means clustering algorithm follows partitioning method; agglomerative and divisive algorithms follow the hierarchical method, while DBSCAN is based on density-based clustering methods.
In this chapter, we will implement each of these algorithms by considering various case studies by following a step-by-step approach. You are advised to perform all these steps on your own on the mentioned databases stated in this chapter.
The k-means algorithm is considered a partitioning method and an unsupervised machine learning (ML) algorithm used to identify clusters of data items in a dataset. It is one of the most prominent ML algorithms, and its implementation in Python is quite straightforward. This chapter will consider three case studies, i.e., customers shopping in the mall dataset, the U.S. arrests dataset, and a popular Iris dataset. We will understand the significance of k-means clustering techniques to implement it in Python through these case studies. Along with the clustering of data items, we will also discuss the ways to find out the optimal number of clusters. To compare the results of the k-means algorithm, we will also implement hierarchical clustering for these problems.
We will kick-start the implementation of the k-means algorithm in Spyder IDE using the following steps.
Step 1: Importing the libraries and the dataset—The dataset for the respective case study would be downloaded, and then the required libraries would be imported.
Step 2: Finding the optimal number of clusters—We will find the optimal number of clusters by the elbow method for the given dataset.
Step 3: Fitting k-means to the dataset—A k-means model will be prepared by training the model over the acquired dataset.
Step 4: Visualizing the clusters—The clusters formed by the k-means model would then be visualized in the form of scatter plots.

List of Figures
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp xxiii-xlvi
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

14 - Association Mining
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 739-806
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To comprehend the concept of association mining and its applications.
• To understand the role of support, confidence, and lift.
• To understand the naive algorithm for finding association mining rules, its limits, and improvements.
• To learn about different ways to store transaction database storage.
• To understand and apply the Apriori algorithm to identify the association mining rules.
14.1 Introduction to Association Rule Mining
Association rule mining is a rule-based technique to discover the relation between the attributes of a dataset. It is used to find the relation between the sales of item X and item Y. It is often called a “market basket” analysis, as shown in Figure 14.1. Here, the market analyst examines the items that consumers often purchase together to find the relation between the sale of item X and item Y.
In other words, when customers visit a store, they may buy a certain type of items together during a shopping trip. For example, as shown in Figure 14.1, a database of customer’s transactions (e.g., shopping baskets) is shown where each transaction consists of a set of items (e.g., products) purchased during a visit, machine learning (ML) engineers can use association mining for finding out a group of items which are frequently purchased together (customers purchasing behavior). This is also referred to as an analysis of customer purchasing behavior. For example, “IF one buys bread, THEN there is a high probability of buying butter with it”, as it is common that people who buy bread often buy butter with it. The store manager can use this information and arrange the items accordingly to increase sales and the overall efficiency of the store.
Let us consider a situation where the store manager feels that there is a lot of rush and customers always complain about the slow working of his store. He is exploring different ways to improve the efficiency of his store. He performed an association analysis and prepared a list of associated items like bread and butter. He may decide to put all these associated items together on the same shelf or near each other so that customers can find them quickly, reducing their shopping time. It will also improve the overall efficiency of the store and the sale of the products. To further improve the shopping experience of his customers, he can create different combos and put sales over these combos.

22 - Genetic Algorithm
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 1041-1084
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To know the inspiration behind the genetic algorithm.
• To understand the concept of natural selection, recombination, and mutation.
• To understand the correlation between nature and genetic algorithm.
• To formulate the mathematic representation of genes and fitness theory.
• To implement natural selection through roulette wheel.
• To implement recombination or crossover.
• To implement the process of mutation.
• To understand the elitism and its implementation.
• To discuss the advantages and disadvantages of genetic algorithms.
22.1 Intuition of Genetic Algorithm
Genetic algorithm (GA) is inspired by nature, and it plays a vital role in the field of machine learning (ML). It selects the best-optimized solution from all available possible solutions or candidates. As nature selects the best possible candidates using the theory of evolution, in the same way, the GA selects the best possible solution from the available solutions.
One of the applications of GAs in ML is to select the global minima from all possible (local) minima by using natural selection. In earlier chapters, we learned that during the training of an artificial neural network, the main goal is to obtain the weights with a minimum cost function value. The gradient descent algorithm is commonly used to find the local minima of the cost function. But, we must find the global minima to reach the optimal weights. A GA can be used to find the global minima out of all available local minima or possible solutions. In this case, the set of possible local minima becomes the population containing possible candidates.
In this chapter, we will discuss inspiration from nature which is the main driving concept in working of GAs and their implementation. To get a good idea about the GA, we will discuss the basics of natural selection by revisiting the theory of evolution in the next section.
22.2 The Inspiration behind Genetic Algorithm
The concepts discussed in this chapter are also available in the form of the free online Udemy Course, Genetic Algorithm for Machine Learning by Parteek Bhatia,
https://www.udemy.com/course/genetic-algorithm-for-machine-learning/
The GA is one of the first and most well-regarded evolutionary algorithms in computer science literature. John Holland, a researcher at the University of Michigan, gave this algorithm in the 1970s, but it became popular in the ‘90s.

3 - Data Pre-processing
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 125-154
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To understand the need for data pre-processing.
• To learn about different phases of data pre-processing like data cleaning, data integration, data transformation, and data reduction.
• To understand the need for feature scaling.
• To comprehend normalization and standardization techniques for feature scaling.
• To understand principal component analysis for feature extraction.
• To pre-process the categorical data for building machine learning models.
3.1 Need for Data Pre-processing
We live in an age where data is considered oil because we need data to train machine learning (ML) algorithms. The most important job for a data analyst is to collect, clean, and analyze the data and build ML models on the cleaned dataset. But often, the raw data that we obtain is noisy. It consists of many discrepancies, inconsistencies, and often missing values. To understand this situation, let us consider an example.
Suppose we have to predict the house price, and for this, we have collected data from a few previous transactions, as shown in Figure 3.1.
In a perfect situation, the captured data should be of this format, as shown in Figure 3.1. Here, we have the size of the house and the number of bedrooms as input features, while the price is the output attribute. We can predict the price of an unknown instance through regression.
But practically, in most situations, the captured data is not of good quality, and usually, we have a dataset, as shown in Figure 3.2.
You can see that this data is messy. There are a lot of unknown or missing values, and if we trained the model on this data, its prediction would be very poor. Also, you can identify the noise and incorrect labels like the second record price is incorrect and will result in poor model training.
We can also consider some more examples like if someone entered –1 in the “salary credited” column in the case of employee dataset. It does not make any sense and will be considered noise. Sometimes, we may have an unrealistic and impossible combination of data; for example, let us consider a record where we have Gender–Male and Pregnant–Yes.

2 - Introduction to Python
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 53-124
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To understand the main features of Python.
• To know about various integrated development environments of Python.
• To implement basic programming constructs using Python.
• To understand the usage of various data types like numbers, list, tuple, strings, set, and dictionary.
• To compare various data types like list, tuple, dictionary, and set.
• To use if and looping statements in Python.
• To define user-defined functions.
Today, Python is known to be one of the most in-demand programming languages. As per the stats of GitHub (a provider of Internet hosting for software development), Python is the second most popular programming language, following JavaScript, as shown in Figure 2.1, and soon it may be on the top of the chart. Python surpassed Java, PHP, and other prominent languages in 2019.
Python is easy and versatile. So it is acclaimed as the major programming language to work on many new-age technologies like machine learning (ML), artificial intelligence, data science, and natural language processing. The creator of Python, Guido van Rossum, in 1991, stated that Python is a high-level programming language, and its core design philosophy is about code readability and syntax, which allows programmers to express concepts in a few lines of code. Interestingly, the name Python is inspired by Guido's favorite television show Monty Python's Flying Circus.
In this chapter, we will discuss various programming constructs of Python so that you can easily implement ML algorithms by using it. Before writing the actual code in Python, let us focus on the features of Python that make it so popular and unique.
2.1 Features of Python
Features offered by Python can be visualized in Figure 2.2. Talking about them profoundly, the main features of Python are as follows:
• Beginner's Language: Python is not only just easy to code and learn, but also fast to grasp, and hence it is a suitable choice for any novice user who wants to learn to program. This is why nowadays this language is introduced to students in schools.
• Interpreted: Unlike other programming languages such as C or C++, Python does not require you to compile programs before executing them. It is an interpreted language, i.e., the code written in Python gets processed in real-time line by line.
• Interactive: The interactive feature of Python enables real-time feedback, allowing programmers to experiment, debug, and make adjustments on the go.

Frontmatter
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp i-iv
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

5 - Simple Linear Regression
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 187-240
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To understand the need for simple linear regression.
• To comprehend the concept of hypothesis and parameters of simple linear regression.
• To understand mathematical modeling of cost function and its minimization.
• To understand the importance and different steps of the gradient descent algorithm.
• To comprehend the mathematical modeling of the gradient descent algorithm.
• To understand the role of learning rate α.
5.1 Introduction to Simple Linear Regression
As discussed in earlier chapters, regression predicts a continuous value or real-valued output. This chapter will discuss how regression works (from a mathematical aspect) to predict the continuous value for the given dataset. Our first learning algorithm is simple linear regression. In this section, we will discuss the fundamental concepts and mathematical modeling of simple linear regression.
We usually have a dependent variable having a continuous value whose value we wish to predict based on one or more independent variables. If we have only one independent or input variable, this situation is known as simple linear regression (also called univariate regression). If we have multiple independent or input variables, it is known as multiple linear regression or multivariate regression.
Linear regression could be used for studying patterns in different real-life scenarios. Consider a research lab where a researcher wants to understand how the stipend is effected by the years of experience, or, in simple words, we wish to predict the stipend based on the years of experience of the researcher. Machine learning (ML) is about learning from past experiences or data. Thus, to predict the researcher's stipend, we have to collect some data about past researchers, specifically their stipend and experience.
In the supervised learning models, we need a dataset called a training set. We will use the dataset as given in Table 5.1 for training the model, and our job will be to build the ML model that learns from this data and hence predicts the stipend of a researcher based on his experience. Here, the stipend will be considered the dependent or output variable because it depends on the researcher's years of experience. Thus, years of experience will be considered an independent or input variable. So, we will use simple linear regression to build the ML model. For proceeding with this problem, we will use a dataset of researchers’ stipends with their corresponding years of experience, as shown in Table 5.1.

17 - Implementation of the ANN
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 865-888
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To understand the process of implementation of the artificial neural network (ANN).
• To understand the role of keras and its different modules in building the ANN.
• To understand the syntax for adding input layer, hidden layers, and output layer to ANN.
• To perform a compilation of the ANN model.
• To fit the ANN model on the training dataset.
• To make predictions with a trained ANN model.
• To evaluate the performance of the ANN classifier by using confusion matrix, precision, and recall.
17.1 Building Artificial Neural Network for Cancer Detection
Machine learning (ML) can play a crucial role in cancer detection. In this chapter, we will build a neural network for cancer detection by using a breast cancer dataset.
You can download this dataset by using the following link.
https://www.kaggle.com/datasets/uciml/breast-cancer-wisconsin-data
The acquired data contains the records of cancer patients in the United States. These records were created by Dr William H. Wolberg and others at the University of Wisconsin, USA. The whole data has 32 columns along with 569 rows. The prominent attributes are the radius, texture, perimeter, smoothness, concavity, symmetry, area, compactness, concave points, and the fractal dimension of the tumor. A snapshot of the dataset is depicted in Figure 17.1.
The dataset has a diagnosis column used as an output variable, while the remaining variables will be used as input data. The class attribute diagnosis has two classes, i.e., malignant identified as M and benign identified as B. Thus, it will be a binary classifier.
The code and dataset used in this chapter are also available at the following link.
https://github.com/bhatiaparteek/ml_with_python/tree/main/Chapter_17_ANN
To build ANN over this dataset, the whole procedure can be divided into three sub-parts below.
i. Loading the dataset and performing pre-processing of data
ii. Building the artificial neural network (ANN)
iii. Making predictions and performing the validations
Let us perform all these operations by following a step-by-step approach.
17.2 Loading the Dataset and Pre-processing
In this step, we will perform tasks of loading the dataset and pre-processing.
17.2.1 Step 1: Importing the Libraries
To perform this task, we need to import two libraries, i.e., Pandas and NumPy, as shown in code snippet 1. NumPy facilitates mathematical operations, while Pandas specializes in loading and extracting datasets.

15 - Implementation of Association Mining
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 807-820
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To implement the Apriori algorithm for transaction dataset.
• To prepare the dataset in the form of a transactions list for its processing.
• To learn parameter tuning of the Apriori algorithm.
• To understand and analyze the results produced by the model.
15.1 Building Association Mining Model
In this chapter, we will implement the Apriori algorithm in Python to solve a business problem. Association rule mining is one of the most popular machine learning applications, which is often used by supermarket chains and retail outlets to find the relation between the sales of item X and item Y. It is often called a “market basket” analysis. Discovering associations among attributes can lead to fact-based marketing strategies for store floor plans, special discounts, coupon offerings, product clustering, and catalog design to identify items that need to be put in combo packs.
Let us solve the problem of one such retail store by implementing the Apriori algorithm in Python.
Problem statement: Consider a supermarket store selling 16 products. To better understand the association between the sales of different items, the store manager decides to perform a market-basket analysis through the Apriori algorithm. The goal is to find association mining rules to improve the store's sales. The list of 16 items in-store is shown in Table 15.1, and the manager is analyzing 25 transactions of the sale of these items as given in Table 15.2. The number of items and transactions in a real application will be larger. In Table 15.2, each row in the table represents one transaction, i.e., the items bought by one customer.
In this dataset, the manager is interested in finding association rules that should have a minimum of 25% support and 70% confidence.
The implementation of association mining can be broken down into multiple steps. These steps are described below:
Step 1: Importing libraries and loading the dataset—We must import the required libraries for model building into the environment, and then we must load the required dataset.
Step 2: Making transactions—Apriori takes an input as a set of transactions in the desired structure; thus, we need to prepare a set of transactions containing the combination of items.
Step 3: Building the model—An Apriori model will be trained on the lists of transactions, and rules will be generated based on some key parameters (support and confidence).

4 - Implementing Data Pre-processing in Python
Parteek Bhatia, Thapar University, India
Book:

Machine Learning with Python

Published online:

22 February 2025

Print publication:

31 January 2026, pp 155-186
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter Objectives
• To understand the need for importing libraries like NumPy, Pandas, Matplotlib, Scikit–Learn.
• To learn the steps to import dataset.
• To understand the process for handling missing values.
• To discuss the steps for handling categorical data.
• To understand the need and process of splitting the dataset into training and testing datasets.
• To discuss the steps to perform feature scaling by using normalization and standardization.
Machine learning (ML) algorithms work on cleaned data. Usually, the data we collect for building ML models suffers from noise, missing values, inconsistent data types, and different data scales. This makes pre-processing of data a very important phase in preparing the data for building ML models. Pre-processing is when we apply transformations over the data before feeding it to the ML algorithm. In short, data pre-processing symbolizes a set of procedures applied to the data to make it fit for ML algorithms. It generally involves the following steps:
Step 1—Importing libraries: It involves importing the necessary libraries that are required to carry out the subsequent data manipulation and cleaning tasks.
Step 2—Loading the dataset: The dataset that needs to be pre-processed must be loaded.
Step 3—Handling the missing values: Dataset often contains missing or null values; these values need to be handled appropriately.
Step 4—Handling the categorical data: In the data pre-processing phase, it is crucial to address categorical attributes that often contain multiple categories. Handling categorical data becomes an important step to ensure proper treatment and transformation of these attributes.
Step 5—Splitting the dataset into training and testing datasets: Training and testing is the most important part of ML; thus, we need to split the dataset into training and testing subsets before building the ML models.
Step 6—Feature scaling: In datasets, the range of data often varies, or data is often of different scales. Thus, feature scaling needs to be done to ensure uniformity in results.
It is important to note that it is not necessary to apply all of these steps to pre-process the data. However, based on the nature of the dataset, some of these steps may be skipped for building the model. In the coming sections, we will discuss the importance or need of these steps and discuss how to perform these steps in Python.

Pattern Recognition and Machine Learning

Refine search

Refine search

Actions for selected content:

2327 results in Pattern Recognition and Machine Learning

Inference in Statistical Modelling and Machine Learning

Accelerating Deep Neural Networks

All of Regression

Markov Decision Processes and Reinforcement Learning

Differential Equations and Variational Methods on Graphs

Bandit Convex Optimisation

10 - Support Vector Machine Classifier

Summary

1 - Overview of Machine Learning

Summary

16 - Artificial Neural Network

Summary

13 - Implementation of Clustering

Summary

List of Figures

14 - Association Mining

Summary

22 - Genetic Algorithm

Summary

3 - Data Pre-processing

Summary

2 - Introduction to Python

Summary

Frontmatter

5 - Simple Linear Regression

Summary

17 - Implementation of the ANN

Summary

15 - Implementation of Association Mining

Summary

4 - Implementing Data Pre-processing in Python

Summary

Pattern Recognition and Machine Learning

Refine search

Refine search

Actions for selected content:

Save Search

2327 results in Pattern Recognition and Machine Learning

Inference in Statistical Modelling and Machine Learning

Accelerating Deep Neural Networks

All of Regression

Markov Decision Processes and Reinforcement Learning

Differential Equations and Variational Methods on Graphs

Bandit Convex Optimisation

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary