Why you care: What is the point of running an experiment if you cannot analyze it in a trustworthy way? Variance is at the core of experiment analysis: almost all of the key statistical concepts we have introduced, such as statistical significance, p-value, power, and confidence interval, are related to it. It is imperative not only to estimate variance correctly, but also to understand how to reduce it and thereby improve the sensitivity of statistical hypothesis tests.
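To make the connection between variance and the other concepts concrete, here is a minimal sketch (using only the Python standard library, with made-up sample data) of how the variance of each sample feeds into a confidence interval for the difference in means between a treatment and a control group; lower variance yields a narrower interval and hence a more sensitive test:

```python
import statistics

def diff_ci_95(control, treatment):
    """Approximate 95% confidence interval for the difference in means
    (normal approximation; assumes independent samples)."""
    diff = statistics.mean(treatment) - statistics.mean(control)
    # The variance of the difference of two independent sample means is
    # the sum of each sample's variance divided by its sample size.
    se = (statistics.variance(control) / len(control)
          + statistics.variance(treatment) / len(treatment)) ** 0.5
    return diff - 1.96 * se, diff + 1.96 * se

# Hypothetical per-user metric values for each variant.
control = [10.1, 9.8, 10.4, 10.0, 9.9, 10.2]
treatment = [10.6, 10.3, 10.8, 10.5, 10.4, 10.7]

low, high = diff_ci_95(control, treatment)
# An interval that excludes zero suggests a statistically significant effect.
print(low > 0)  # → True
```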
In the previous chapter we were introduced to the concept of learning, both for humans and for machines. In either case, a primary way one learns is by first knowing the correct outcome, or label, of a given data point or behavior. As it happens, there are many situations in which we have training examples with correct labels; in other words, we have data for which we know the correct outcome value. This set of data problems collectively falls under supervised learning.
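As a toy illustration of the idea, here is a minimal sketch (with entirely hypothetical data and labels) of learning from labeled examples: a one-nearest-neighbor classifier that predicts the label of a new data point from the training example closest to it:

```python
def predict(train, query):
    """Return the label of the training example whose input value
    is closest to the query (1-nearest-neighbor)."""
    return min(train, key=lambda example: abs(example[0] - query))[1]

# Each training example pairs an input (here, a height in cm) with a
# known, correct label -- the hallmark of supervised learning.
train = [(150, "short"), (160, "short"), (180, "tall"), (190, "tall")]

print(predict(train, 185))  # → tall
print(predict(train, 152))  # → short
```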
Sherlock Holmes would have loved living in the twenty-first century. We are drenched in data, and so many of our problems (including a murder mystery) can be solved using large amounts of data existing at personal and societal levels.
These days it is fair to assume that most people are familiar with the term "data." We see it everywhere, and if you have a cellphone, chances are you have encountered it frequently. Assuming you are a "connected" person with a smartphone, you probably have a data plan from your phone service provider.
Why you care: In most experiment analyses, we assume that the behavior of each unit in the experiment is unaffected by variant assignment to other units. This is a plausible assumption in most practical applications. However, there are also many cases where this assumption fails.
Why you care: To design and run a good online controlled experiment, you need metrics that meet certain characteristics. They must be measurable in the short term (the experiment duration) and computable, as well as sufficiently sensitive and timely to be useful for experimentation. If you use multiple metrics to measure success for an experiment, you should ideally combine them into an Overall Evaluation Criterion (OEC): a single measure believed to causally impact long-term objectives. It often takes multiple iterations to adjust and refine the OEC, but as the quotation above by Eliyahu Goldratt highlights, it provides a clear alignment mechanism for the organization.
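One common way to combine multiple success metrics into a single OEC is a weighted sum of normalized component metrics. The metric names and weights below are hypothetical, purely to illustrate the mechanics:

```python
def oec(metrics, weights):
    """Combine normalized component metrics (each scaled to 0..1)
    into a single Overall Evaluation Criterion score."""
    assert set(metrics) == set(weights), "every metric needs a weight"
    return sum(weights[name] * metrics[name] for name in metrics)

# Hypothetical normalized metric values and weights reflecting how much
# each component is believed to drive long-term objectives.
metrics = {"sessions_per_user": 0.62, "task_success_rate": 0.80}
weights = {"sessions_per_user": 0.7, "task_success_rate": 0.3}

print(round(oec(metrics, weights), 3))  # → 0.674
```

Refining the OEC then amounts to revisiting which components belong in `metrics` and how the weights are set, which is where the iteration mentioned above happens.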
Why you care: Randomized controlled experiments are the gold standard for establishing causality, but sometimes running such an experiment is not possible. Given that organizations are collecting massive amounts of data, observational causal studies can sometimes be used to assess causality, although with a lower level of trust. Understanding the space of possible designs and their common pitfalls is useful when an online controlled experiment is not possible.
We started this book with a glimpse into data and data science. Then we spent the rest of the book, especially Parts II and III, learning various tools and techniques to solve data problems of different kinds. Our approach to all of this has been hands-on. And now we have come full circle. As we wrap up, it is important to take a look at where that data comes from, and how we should broadly think about analyzing it. This final chapter, therefore, is dedicated to those two goals, as you will see in the next two sections. One section is an overview of some of the most common methods for collecting/soliciting data, and the other provides information and ideas about how to approach a data analysis problem with broad methods. Then the final section provides a commentary on evaluation and experimentation.
“Just as trees are the raw material from which paper is produced, so too, can data be viewed as the raw material from which information is obtained.” To present and interpret information, one must start with a process of gathering and sorting data. And for any kind of data analysis, one must first identify the right kinds of information sources.
In the previous chapter, we discussed different forms of data. The height–weight data we saw was numerical and structured. When you post a picture using your smartphone, that is an example of multimedia data. The datasets mentioned in the section on public policy are government or open data collections.
Why you care: We begin with an end-to-end example of the design (with explicit assumptions), execution, and interpretation of an experiment to assess the importance of speed. Many examples of experiments focus on the User Interface (UI) because it is easy to show examples, but there are many breakthroughs on the back-end side, and as multiple companies have discovered, speed matters a lot! Of course, faster is better, but how important is it to improve performance by a tenth of a second? Should you have a person focused on performance? Maybe a team of five? The return-on-investment (ROI) of such efforts can be quantified by running a simple slowdown experiment. In 2017, every tenth-of-a-second improvement for Bing was worth $18 million in incremental annual revenue, enough to fund a sizable team. Based on these results and multiple replications at several companies over the years, we recommend using latency as a guardrail metric.