Search results for Computer Science

13 - Conjugate Gradient Method
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

17 February 2023

Print publication:

22 December 2022, pp 441-470
- Chapter

KLAUS-Tr: Knowledge & learning-based unit focused arithmetic word problem solver for transfer cases
Suresh Kumar, P. Sreenivasa Kumar
Journal:

Natural Language Engineering / Volume 30 / Issue 1 / January 2024

Published online by Cambridge University Press:

22 December 2022, pp. 96-131
- Article
- Open access
Solving arithmetic word problems (AWPs) using AI techniques has attracted much attention in recent years. We feel that current AWP solvers under-utilize the relevant domain knowledge. We present a knowledge- and learning-based system that effectively solves AWPs of a specific type: those that involve the transfer of objects from one agent to another, called transfer cases (TC). We represent the knowledge relevant to these problems as a TC ontology. The sentences in TC-AWPs contain information of essentially four types: before-transfer, transfer, after-transfer, and query. Our system (KLAUS-Tr) uses a statistical classifier to recognize the sentence types. The sentence types guide the information extraction process used to identify the agents, quantities, units, types of objects, and the direction of transfer from the AWP text. The extracted information is represented as an RDF graph that uses the TC ontology terminology. To solve the given AWP, we use Semantic Web Rule Language (SWRL) rules that capture how an object transfer affects the RDF graph of the AWP. Using the TC ontology, we also check whether the given problem is consistent. The different ways in which TC-AWPs can be inconsistent are encoded as SWRL rules, so KLAUS-Tr can identify an invalid AWP and notify the user accordingly. Since the existing datasets do not contain inconsistent AWPs, we create AWPs of this type and augment the datasets. We have implemented KLAUS-Tr and tested it on TC-type AWPs drawn from the All-Arith and other datasets. We find that TC-AWPs constitute about 40% of the AWPs in a typical dataset like All-Arith. Our system achieves an accuracy of 92%, a significant improvement over the state of the art. We plan to extend the system to handle AWPs that contain multiple transfers of objects and to offer explanations of the solutions.
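The transfer-case reasoning in the abstract can be illustrated with a small sketch. It uses a plain Python dictionary in place of the RDF graph and an ordinary function in place of the SWRL rules, so it is a stand-in under those assumptions and not the authors' implementation. The names `apply_transfer` and `InconsistentProblem` and the sample problem are hypothetical.

```python
class InconsistentProblem(ValueError):
    """Raised when a stated transfer cannot happen given the stated quantities."""


def apply_transfer(holdings, giver, receiver, item, amount):
    """Return new holdings after `giver` hands `amount` of `item` to `receiver`.

    `holdings` maps (agent, item) -> quantity, i.e. the before-transfer facts.
    The input dictionary is left unchanged.
    """
    if amount < 0:
        raise InconsistentProblem("transfer amount cannot be negative")
    available = holdings.get((giver, item), 0)
    if amount > available:
        # Hypothetical analogue of an inconsistency rule:
        # an agent cannot give away more than it owns.
        raise InconsistentProblem(
            f"{giver} has {available} {item} but is asked to give {amount}"
        )
    after = dict(holdings)
    after[(giver, item)] = available - amount
    after[(receiver, item)] = holdings.get((receiver, item), 0) + amount
    return after


# "Ann has 5 apples. Bob has 2 apples. Ann gives 3 apples to Bob.
#  How many apples does Bob have now?"
before = {("Ann", "apples"): 5, ("Bob", "apples"): 2}   # before-transfer facts
after = apply_transfer(before, "Ann", "Bob", "apples", 3)  # transfer
answer = after[("Bob", "apples")]                          # query
print(answer)
```

Asking Ann to give 9 apples instead would raise `InconsistentProblem`, which mirrors the idea of flagging an invalid problem rather than returning a number.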

35 - Particle Filters
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

17 March 2023

Print publication:

22 December 2022, pp 1380-1404
- Chapter
Summary

We develop a sequential version of the importance sampling technique from Chapter 33 in order to respond to streaming data, thus leading to a sequential Monte Carlo solution. The algorithm will lead to the important class of particle filters. This chapter presents the basic data model and the main construction that enables recursive inference. Many of the inference and learning methods in subsequent chapters will possess a recursive structure, which is a fundamental property to enable them to continually learn in response to the arrival of sequential data measurements. Particle filters are particularly well suited for scenarios involving nonlinear models and non-Gaussian signals, and they have found applications in a wide range of areas where these two features (nonlinearity and non-Gaussianity) are prevalent, including in guidance and control, robot localization, visual tracking of objects, and finance.
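The recursive structure described in the summary can be sketched as a bootstrap particle filter, one common member of the particle-filter family. The scalar random-walk state model, the Gaussian noise levels, the function names, and the constant observation sequence are all assumptions chosen for illustration; they do not come from the chapter.

```python
import math
import random


def gaussian_likelihood(y, mean, std):
    """Likelihood of y under N(mean, std^2), up to a constant factor."""
    return math.exp(-0.5 * ((y - mean) / std) ** 2)


def bootstrap_particle_filter(observations, num_particles=500,
                              process_std=0.3, obs_std=0.5, seed=0):
    """Track a scalar state under an assumed random-walk model.

    Assumed model (illustrative only):
        x_n = x_{n-1} + process noise,   y_n = x_n + observation noise.
    Returns the weighted-mean state estimate after each observation.
    """
    rng = random.Random(seed)
    particles = [rng.gauss(0.0, 2.0) for _ in range(num_particles)]
    estimates = []
    for y in observations:
        # 1. Propagate each particle through the state model (the proposal).
        particles = [x + rng.gauss(0.0, process_std) for x in particles]
        # 2. Weight each particle by how well it explains the new observation.
        weights = [gaussian_likelihood(y, x, obs_std) for x in particles]
        total = sum(weights)
        if total == 0.0:
            # All likelihoods underflowed; fall back to uniform weights.
            weights = [1.0 / num_particles] * num_particles
        else:
            weights = [w / total for w in weights]
        # 3. Estimate the state as the weighted mean of the particles.
        estimates.append(sum(w * x for w, x in zip(weights, particles)))
        # 4. Resample so that particles concentrate in high-weight regions.
        particles = rng.choices(particles, weights=weights, k=num_particles)
    return estimates


observations = [2.0] * 30  # hypothetical stream of measurements
estimates = bootstrap_particle_filter(observations)
print(round(estimates[-1], 2))
```

Each new measurement triggers one propagate, weight, estimate, and resample pass, so the filter never revisits past data. Nothing in the loop relies on the model being linear or Gaussian; only `gaussian_likelihood` and the propagation line would change for a different model.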

Dedication
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

24 February 2023

Print publication:

22 December 2022, pp v-vi
- Chapter

55 - Naïve Bayes Classifier
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

24 February 2023

Print publication:

22 December 2022, pp 2341-2356
- Chapter
Summary

The optimal Bayes classifier (52.8) requires knowledge of the conditional probability distribution $\mathbb{P}(\boldsymbol{r} = r \mid \boldsymbol{h} = h)$, which is generally unavailable. In this and the next few chapters, we describe data-based generative methods that approximate the joint probability distribution $f_{\boldsymbol{r},\boldsymbol{h}}(r, h)$, or its components $\mathbb{P}(\boldsymbol{r} = r)$ and $f_{\boldsymbol{h}|\boldsymbol{r}}(h \mid r)$, directly from the data.

64 - Generalization Theory
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

24 February 2023

Print publication:

22 December 2022, pp 2650-2714
- Chapter
Summary

We described several data-based methods for inference and learning in the previous chapters. These methods operate directly on the data to arrive at classification or inference decisions. One key challenge these methods face is that the available training data need not provide sufficient representation for the sample space.

2 - Vector Differentiation
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

17 February 2023

Print publication:

22 December 2022, pp 59-67
- Chapter

5 - Exponential Distributions
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

17 February 2023

Print publication:

22 December 2022, pp 167-195
- Chapter

Frontmatter
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

24 February 2023

Print publication:

22 December 2022, pp i-iv
- Chapter

66 - Deep Belief Networks
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

24 February 2023

Print publication:

22 December 2022, pp 2797-2837
- Chapter
Summary

We indicated in the concluding remarks of the previous chapter that feedforward neural networks have powerful modeling capabilities, as reflected by the universal approximation theorem. In one of its versions, the theorem asserts that networks with a single hidden layer are rich enough to model almost any arbitrary function.

1 - Matrix Theory
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

17 February 2023

Print publication:

22 December 2022, pp 1-58
- Chapter

20 - Convergence Analysis II: Stochastic Subgradient Algorithms
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

17 February 2023

Print publication:

22 December 2022, pp 730-755
- Chapter

52 - Nearest-Neighbor Rule
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

24 February 2023

Print publication:

22 December 2022, pp 2260-2289
- Chapter
Summary

We encountered one instance of Bayesian inference in Chapter 50, based on the quadratic loss in the context of mean-square-error (MSE) estimation. We explained there that the optimal solution for inferring a hidden zero-mean random variable $x$ from observations of another zero-mean random variable $y$ is given by the conditional estimator, $E(x \mid y)$, whose computation requires knowledge of the conditional distribution, $f_{x|y}(x \mid y)$.

72 - Meta Learning
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

24 February 2023

Print publication:

22 December 2022, pp 3099-3148
- Chapter
Summary

In supervised methods, learning is attained by training on a sufficient amount of labeled data in order to deliver reliable levels of classification. However, there are important situations in practice where data is scarce because it is either difficult or expensive to collect. This scenario leads to few-shot learning, where it is desired to train a classifier by using only a few training samples for each class.

65 - Feedforward Neural Networks
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

24 February 2023

Print publication:

22 December 2022, pp 2715-2796
- Chapter
Summary

We illustrated in Example 63.2 one limitation of linear separation surfaces by considering the XOR mapping (63.11). The example showed that certain feature spaces are not linearly separable and cannot be resolved by the perceptron algorithm. The result in the example was used to motivate one powerful approach to nonlinear separation surfaces by means of kernel methods.
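The limitation mentioned in this summary can be checked with a few lines of code. The sketch below trains a standard perceptron on the AND and XOR mappings. The 0/1 input encoding, the ±1 labels, and the function name `train_perceptron` are assumptions made for illustration and may differ from the book's formulation in (63.11).

```python
def train_perceptron(samples, epochs=100):
    """Train a perceptron on 2-D inputs with labels in {-1, +1}.

    Returns (weights, bias, converged), where converged is True if some
    full pass over the data produced no mistakes.
    """
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        mistakes = 0
        for (x1, x2), label in samples:
            score = w[0] * x1 + w[1] * x2 + b
            if label * score <= 0:  # misclassified, or exactly on the boundary
                w[0] += label * x1
                w[1] += label * x2
                b += label
                mistakes += 1
        if mistakes == 0:
            return w, b, True
    return w, b, False


AND_DATA = [((0, 0), -1), ((0, 1), -1), ((1, 0), -1), ((1, 1), +1)]
XOR_DATA = [((0, 0), -1), ((0, 1), +1), ((1, 0), +1), ((1, 1), -1)]

_, _, and_converged = train_perceptron(AND_DATA)
_, _, xor_converged = train_perceptron(XOR_DATA)
print(and_converged, xor_converged)  # prints: True False
```

AND is linearly separable, so the perceptron reaches a mistake-free pass. No line separates the XOR labels, so no number of epochs would change the second result.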

63 - Kernel Methods
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

24 February 2023

Print publication:

22 December 2022, pp 2587-2649
- Chapter
Summary

In the immediate past chapters we developed several techniques for the design of linear classifiers, such as logistic regression, perceptron, and support vector machines (SVM). These algorithms are suitable for data that are linearly separable; otherwise, their performance degrades significantly. In this chapter we explain how the methods can be adjusted to determine nonlinear separation surfaces.
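One way to see how a linear method can be adjusted to produce a nonlinear separation surface is a kernel perceptron. The sketch below is a minimal illustration under assumed choices: a degree-2 polynomial kernel, the XOR data with 0/1 inputs and ±1 labels, and hypothetical function names. It is not taken from the chapter.

```python
def poly_kernel(a, b, degree=2):
    """Inhomogeneous polynomial kernel (1 + a.b)^degree."""
    return (1.0 + sum(ai * bi for ai, bi in zip(a, b))) ** degree


def train_kernel_perceptron(samples, kernel, epochs=500):
    """Kernel perceptron: keeps one mistake counter (alpha) per training sample.

    Returns a decision function; its sign is the predicted label.
    """
    alphas = [0] * len(samples)

    def score(x):
        # The weight vector is never formed explicitly; the decision value
        # is a kernel-weighted sum over the training samples.
        return sum(a * label * kernel(xi, x)
                   for a, (xi, label) in zip(alphas, samples))

    for _ in range(epochs):
        mistakes = 0
        for i, (x, label) in enumerate(samples):
            if label * score(x) <= 0:
                alphas[i] += 1  # same update as the perceptron, in dual form
                mistakes += 1
        if mistakes == 0:
            break
    return score


XOR_DATA = [((0, 0), -1), ((0, 1), +1), ((1, 0), +1), ((1, 1), -1)]
decision = train_kernel_perceptron(XOR_DATA, poly_kernel)
predictions = [1 if decision(x) > 0 else -1 for x, _ in XOR_DATA]
print(predictions)  # prints: [-1, 1, 1, -1]
```

The update rule is the ordinary perceptron rule; only the inner product has been replaced by a kernel. The degree-2 kernel implicitly includes the product of the two inputs as a feature, which is what makes the XOR labels separable.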

49 - Policy Gradient Methods
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

17 March 2023

Print publication:

22 December 2022, pp 2047-2120
- Chapter
Summary

In most multistage decision problems, we are interested in determining the optimal strategy, $\pi^{\star}(a \mid s)$ (i.e., the optimal actions to follow in the state–action space). Most of the algorithms described in the previous chapters focused on evaluating the state and state–action value functions, $v^{\pi}(s)$ and $q^{\pi}(s, a)$, for a given policy $\pi(a \mid s)$. More is needed to learn the optimal policy.

17 - Adaptive Gradient Methods
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

17 February 2023

Print publication:

22 December 2022, pp 599-641
- Chapter

46 - Temporal Difference Learning
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

17 March 2023

Print publication:

22 December 2022, pp 1917-1970
- Chapter
Summary

We derived in the previous two chapters procedures for assessing the performance of strategies used by agents interacting with a Markov decision process (MDP), including obtaining optimal policies. Among other methods, we discussed the policy evaluation algorithm (44.116) and the value and policy iterations (45.23) and (45.43), respectively.

58 - Dictionary Learning
Ali H. Sayed, École Polytechnique Fédérale de Lausanne
Book:

Inference and Learning from Data

Published online:

24 February 2023

Print publication:

22 December 2022, pp 2424-2456
- Chapter
Summary

Principal component analysis (PCA) is a formidable tool for dimensionality reduction. Given feature vectors $\{h_{n}\}$ in $M$-dimensional space, PCA replaces them by lower-dimensional vectors $\{h_{n}'\}$ of size $M' \ll M$ each.
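The PCA reduction described here can be sketched for the simplest case $M' = 1$. The code below finds the leading principal direction by power iteration on the sample covariance and replaces each vector by its coordinate along that direction. The function names, the toy data, and the choice of power iteration are assumptions for illustration; the sketch covers only PCA, not dictionary learning itself.

```python
import math


def top_principal_direction(vectors, iterations=200):
    """Leading eigenvector of the sample covariance, found by power iteration.

    Assumes the data are not all identical and that the all-ones starting
    vector is not orthogonal to the leading eigenvector.
    """
    n, m = len(vectors), len(vectors[0])
    mean = [sum(v[j] for v in vectors) / n for j in range(m)]
    centered = [[v[j] - mean[j] for j in range(m)] for v in vectors]
    cov = [[sum(c[i] * c[j] for c in centered) / n for j in range(m)]
           for i in range(m)]
    u = [1.0] * m
    for _ in range(iterations):
        u = [sum(cov[i][j] * u[j] for j in range(m)) for i in range(m)]
        norm = math.sqrt(sum(x * x for x in u))
        u = [x / norm for x in u]
    return mean, u


def reduce_dimension(vectors, mean, u):
    """Replace each M-dimensional vector by its scalar coordinate along u."""
    return [sum((v[j] - mean[j]) * u[j] for j in range(len(u)))
            for v in vectors]


# Toy data: five 2-D feature vectors lying on the line h2 = 2 * h1.
H = [[float(t), 2.0 * t] for t in range(-2, 3)]
mean, u = top_principal_direction(H)
H_reduced = reduce_dimension(H, mean, u)
print(u, H_reduced)
```

Because the toy data lie exactly on a line, the single retained coordinate loses no information. With real data, the discarded directions carry the residual variance.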

48274 results in Computer Science
