In this chapter, we describe several common applications (including the ones we touched on before) and multiple possible neural approaches for each. We focus on simple neural approaches that work well and should be familiar to anybody beginning research in natural language processing or interested in deploying robust strategies in industry. In particular, we describe the implementation of the following applications: text classification, part-of-speech tagging, named entity recognition, syntactic dependency parsing, relation extraction, question answering, and machine translation.
The previous chapter introduced feed-forward neural networks and demonstrated that, theoretically, implementing the training procedure for an arbitrary feed-forward neural network is relatively simple. Unfortunately, neural networks trained this way will suffer from several problems such as instability of the training process – that is, slow convergence due to parameters jumping around a good minimum – and overfitting. In this chapter, we will describe several practical solutions that mitigate these problems. In particular, we discuss minibatching, multiple optimization algorithms, other activation and cost functions, regularization, dropout, temporal averaging, and parameter initialization and normalization.
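To make two of these remedies concrete, the sketch below (with made-up layer sizes and random toy data, not the book's actual model) shows how dropout and minibatching are typically combined with the Adam optimizer in PyTorch.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Hypothetical feed-forward classifier illustrating dropout and minibatching.
model = nn.Sequential(
    nn.Linear(100, 64),
    nn.ReLU(),            # ReLU activation
    nn.Dropout(p=0.5),    # randomly zero activations during training
    nn.Linear(64, 2),
)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

# Dummy data: 256 examples with 100 features each, binary labels
X = torch.randn(256, 100)
y = torch.randint(0, 2, (256,))
loader = DataLoader(TensorDataset(X, y), batch_size=32, shuffle=True)

model.train()
for xb, yb in loader:                 # each iteration sees one minibatch
    optimizer.zero_grad()
    loss = loss_fn(model(xb), yb)
    loss.backward()
    optimizer.step()
```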
In this chapter we investigate graph distances in preferential attachment models. We focus on typical distances as well as the diameter of preferential attachment models. We again rely on path-counting techniques, as well as local limit results. Since the local limit is a rather involved quantity, some parts of our analysis are considerably harder than those in Chapters 6 and 7.
As mentioned in the previous chapter, the perceptron does not perform smooth updates during training, which may slow down learning or cause it to miss good solutions entirely in real-world situations. In this chapter, we will discuss logistic regression, a machine learning algorithm that elegantly addresses this problem. We also extend the vanilla logistic regression, which was designed for binary classification, to handle multiclass classification. Through logistic regression, we introduce the concepts of the cost function (i.e., the function we aim to minimize during training) and of gradient descent, the algorithm that implements this minimization procedure.
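As a rough illustration of these two concepts, the following sketch trains a binary logistic regression classifier with batch gradient descent on the cross-entropy cost; the learning rate and epoch count are arbitrary placeholders rather than the chapter's settings.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_logistic_regression(X, y, lr=0.1, epochs=100):
    """Binary logistic regression trained with batch gradient descent
    on the cross-entropy cost (a sketch, not the book's exact code).
    X: (n, d) feature matrix; y: (n,) labels in {0, 1}."""
    n, d = X.shape
    w = np.zeros(d)
    b = 0.0
    for _ in range(epochs):
        p = sigmoid(X @ w + b)        # predicted probabilities
        grad_w = X.T @ (p - y) / n    # gradient of the cost w.r.t. weights
        grad_b = np.mean(p - y)       # gradient w.r.t. the bias
        w -= lr * grad_w              # gradient descent update
        b -= lr * grad_b
    return w, b
```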
In this chapter we investigate the distance structure of the configuration model by investigating its typical distances and its diameter. We adapt the path-counting techniques in Chapter 6 to the configuration model, and obtain typical distances from the “giant is almost local” proof. To understand the ultra-small distances for infinite-variance degree configuration models, we investigate the generation growth of infinite-mean branching processes. The relation to branching processes informally leads to the power-iteration technique that allows one to deduce typical distance results in random graphs in a relatively straightforward way.
Sensing is a key requirement for any but the simplest mobile behavior. In order for Robot to be able to warn the crew of Lost in Space that there is danger ahead, it must be able to sense and reason about its sensor responses. Sensing is a critical component of the fundamental tasks of pose estimation – determining where the robot is in its environment; pose maintenance – maintaining an ongoing estimate of the robot’s pose; and map construction – building a representation of the robot’s environment.
In the previous chapters, we have discussed the theory behind the perceptron and logistic regression, including mathematical explanations of how and why they are able to learn from examples. In this chapter, we will transition from math to code. Specifically, we will discuss how to implement these models in the Python programming language. All the code that we will introduce throughout this book is available in this GitHub repository as well: https://github.com/clulab/gentlenlp. To get a better understanding of how these algorithms work under the hood, we will start by implementing them from scratch. However, as the book progresses, we will introduce some of the popular tools and libraries that make Python the language of choice for machine learning – for example, PyTorch and Hugging Face's transformers. The code for all the examples in the book is provided in the form of Jupyter notebooks. Fragments of these notebooks are presented in the implementation chapters so that the reader can get the complete picture from the book alone.
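For a flavor of what such a from-scratch implementation looks like, here is a minimal perceptron sketch in NumPy; it assumes labels in {-1, +1} and is not the repository's exact code.

```python
import numpy as np

def train_perceptron(X, y, epochs=10):
    """Minimal perceptron training loop (labels in {-1, +1});
    the notebooks in the repository contain the book's full version."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            if yi * (xi @ w + b) <= 0:   # misclassified example: update
                w += yi * xi
                b += yi
    return w, b
```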
Robotic systems, and in particular mobile robotic systems, are the embodiment of a set of complex computational processes, mechanical systems, sensors, user interfaces, and communications infrastructure. The problems inherent in integrating these components into a working robot can be very challenging. Overall system control requires an approach that can properly handle the complexity of the system goals while dealing with poorly defined tasks and the existence of unplanned and unexpected events. This task is complicated by the non-standard nature of much robotic equipment. Often the hardware seems to have been built following a philosophy of “ease of design” rather than with an eye toward assisting with later system integration.
The previous chapter was our first exposure to recurrent neural networks, which included intuitions for why they are useful for natural language processing, various architectures, and training algorithms. In this chapter, we will put them to use in order to implement a common sequence modeling task. In particular, we implement a Spanish part-of-speech tagger using a bidirectional long short-term memory network and a set of pretrained, static word embeddings. Through this process, we also introduce several new PyTorch features such as the pad_sequence, pack_padded_sequence, and pad_packed_sequence functions, which allow us to work more efficiently with variable-length sequences in recurrent neural networks.
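The snippet below sketches how these three functions fit together for a batch of variable-length sentences; the tensor dimensions and the random "embeddings" are invented for illustration and do not correspond to the chapter's Spanish tagger.

```python
import torch
from torch import nn
from torch.nn.utils.rnn import pad_sequence, pack_padded_sequence, pad_packed_sequence

# Three sentences of different lengths, already mapped to 50-dimensional
# embedding vectors (toy values, generated randomly here).
sents = [torch.randn(5, 50), torch.randn(3, 50), torch.randn(7, 50)]
lengths = torch.tensor([len(s) for s in sents])

padded = pad_sequence(sents, batch_first=True)             # shape: (3, 7, 50)
packed = pack_padded_sequence(padded, lengths,
                              batch_first=True, enforce_sorted=False)

lstm = nn.LSTM(input_size=50, hidden_size=32,
               batch_first=True, bidirectional=True)
packed_out, _ = lstm(packed)                               # LSTM skips the padding
output, out_lengths = pad_packed_sequence(packed_out, batch_first=True)
print(output.shape)   # (3, 7, 64): the two directions of size 32 concatenated
```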
All the algorithms we covered so far rely on handcrafted features that must be designed and implemented by the machine learning developer. This is problematic for two reasons. First, designing such features can be a complicated endeavor. Second, most words in any language tend to be very infrequent. In our context, this means that word-occurrence features are very sparse, and a text classification algorithm trained on them may generalize poorly. For example, if the training data for a review classification dataset contains the word great but not the word fantastic, a learning algorithm trained on these data will not be able to properly handle reviews containing the latter word, even though there is a clear semantic similarity between the two. In this chapter, we will begin to address this limitation. In particular, we will discuss methods that learn numerical representations of words that capture some semantic knowledge. Under these representations, similar words such as great and fantastic will have similar forms, which will improve the generalization capability of our machine learning algorithms.
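The sketch below illustrates the idea with a handful of made-up three-dimensional vectors: under such representations, similar words end up with a high cosine similarity, while dissimilar ones do not. Real embeddings would be learned or loaded from pretrained vector files rather than hard-coded.

```python
import numpy as np

def cosine_similarity(u, v):
    """Cosine similarity between two word vectors."""
    return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

# Hypothetical embeddings; in practice these would come from pretrained
# static vectors (e.g., word2vec or GloVe) or be learned from a corpus.
embeddings = {
    "great":     np.array([0.8, 0.1, 0.3]),
    "fantastic": np.array([0.7, 0.2, 0.4]),
    "terrible":  np.array([-0.6, 0.5, -0.2]),
}

print(cosine_similarity(embeddings["great"], embeddings["fantastic"]))  # high
print(cosine_similarity(embeddings["great"], embeddings["terrible"]))   # low
```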
In this chapter we investigate the connectivity structure of preferential attachment models. We start by discussing an important tool: exchangeable random variables and their distribution described in de Finetti’s Theorem. We apply these results to Pólya urn schemes, which, in turn, we use to describe the distribution of the degrees in preferential attachment models. It turns out that Pólya urn schemes can also be used to describe the local limit of preferential attachment models. A crucial ingredient is the fact that the edges in the Pólya urn representation are conditionally independent, given the appropriate randomness. The resulting local limit is the Pólya point tree, a specific multi-type branching process with continuous types.
One of the key advantages of transformer networks is the ability to take a model that was pretrained over vast quantities of text and fine-tune it for the task at hand. Intuitively, this strategy allows transformer networks to achieve higher performance on smaller datasets by relying on statistics acquired at scale in an unsupervised way (e.g., through the masked language model training objective). To this end, in this chapter, we will use the Hugging Face library, which has a rich repository of datasets and pretrained models, as well as helper methods and classes that make it easy to target downstream tasks. Using pretrained transformer encoders, we will implement the two tasks that served as use cases in the previous chapters: text classification and part-of-speech tagging.
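As a rough sketch of this workflow, the snippet below fine-tunes a pretrained encoder for binary text classification with the Hugging Face transformers library; the checkpoint name, toy examples, and single update step are illustrative stand-ins for the chapter's full training loop.

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Illustrative checkpoint and data, not the chapter's exact setup.
checkpoint = "bert-base-cased"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint, num_labels=2)

texts = ["a great movie", "a terrible movie"]
labels = torch.tensor([1, 0])
inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")

optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
model.train()
outputs = model(**inputs, labels=labels)   # forward pass also computes the loss
outputs.loss.backward()                    # backpropagate through the whole encoder
optimizer.step()                           # update all parameters (fine-tuning)
```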
The ability to navigate purposefully through its environment is fundamental to most animals and to every intelligent organism. In this book we examine the computational issues specific to the creation of machines that move intelligently in their environment. From the earliest modern speculation regarding the creation of autonomous robots, it was recognized that regardless of the mechanisms used to move the robot around or the methods used to sense the environment, the computational principles that govern the robot are of paramount importance. As Powell and Donovan discovered in Isaac Asimov’s story “Runaround,” subtle definitions within the programs that control a robot can lead to significant changes in the robot’s overall behavior or action. Moreover, interactions among multiple complex components can lead to large-scale emergent behaviors that may be hard to predict.