To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Harnessing the power of data and AI methods to tackle complex societal challenges requires transdisciplinary collaborations across academia, industry, and government. In this compelling book, Munther A. Dahleh, founder of the MIT Institute for Data, Systems, and Society (IDSS), offers a blueprint for researchers, professionals, and institutions to create approaches to problems of high societal value using innovative, holistic, data-driven methods. Drawing on his experience at IDSS and knowledge of similar initiatives elsewhere, Dahleh describes in clear, non-technical language how statistics, data science, information and decision systems, and social and institutional behavior intersect across multiple domains. He illustrates key concepts with real-life examples from optimizing transportation to making healthcare decisions during pandemics to understanding the media's impact on elections and revolutions. Dahleh also incorporates crucial concepts such as robustness, causality, privacy, and ethics and shares key lessons learned about transdisciplinary communication and about unintended consequences of AI and algorithmic systems.
Published in collaboration with The British Universities Industrial Relations Association (BUIRA), this book critically reviews the future of Industrial Relations (IR) in a changing work landscape and traces its historical evolution. Essential for academics, students and trade unions, it explores IR's significant changes over the past decade and its ongoing influence on our lives.
It is impossible to view the news at present without hearing talk of crisis: the economy, the climate, the pandemic. This book asks how these larger societal issues lead to a crisis with work, making it ever more precarious, unequal and intense. Experts diagnose the nature of the problem and offer a programme for transcending above the crises.
Offering theoretical frameworks from experts as well as practical examples to support women transitioning through menopause in the workplace, this is a go-to reference for academics and policy makers working in the field.
Introduction to Probability and Statistics for Data Science provides a solid course in the fundamental concepts, methods and theory of statistics for students in statistics, data science, biostatistics, engineering, and physical science programs. It teaches students to understand, use, and build on modern statistical techniques for complex problems. The authors develop the methods from both an intuitive and mathematical angle, illustrating with simple examples how and why the methods work. More complicated examples, many of which incorporate data and code in R, show how the method is used in practice. Through this guidance, students get the big picture about how statistics works and can be applied. This text covers more modern topics such as regression trees, large scale hypothesis testing, bootstrapping, MCMC, time series, and fewer theoretical topics like the Cramer-Rao lower bound and the Rao-Blackwell theorem. It features more than 250 high-quality figures, 180 of which involve actual data. Data and R are code available on our website so that students can reproduce the examples and do hands-on exercises.
This chapter explores the crucial alternative to traditional data processing methods, focusing on in-memory data processing. It discusses storing large volumes of data in DRAM for efficient and rapid data access, while using disk and SSD storage mainly for backup and archival purposes. The chapter sheds light on the benefits and significance of this approach, emphasizing its role in enabling efficient computing tasks. It also examines the implications of this shift for disk utilization, highlighting the transition towards using disk and SSD storage as secondary mediums, rather than primary data sources.
This chapter delves into the management of structured data using GPUs. It demonstrates the construction of a GPU-based SQL database engine, encompassing both hash-based and sorting-based relational operator algorithms. The chapter explores how complex SQL concepts like subqueries can be efficiently interacted with GPUs for optimal performance, offering insights into the advancements and potential of GPU computing in structured data management.
This opening chapter provides a historical perspective on the evolution of computing, tracing its journey from early computational methods to the emergence of networking and the advent of data-centric computing. The chapter sets out to inspire readers to develop a holistic understanding of the intricate interactions among hardware, software, and networking. It introduces the principle of hardware and software codesign as a critical approach in constructing efficient data management systems. The goal is to achieve high throughput and low latency in modern data processing, setting the stage for the detailed exploration that follows in subsequent chapters.
The concluding chapter consolidates the key learnings and projects future trends and emerging technologies in the dynamic field of data management and computing. It explores how the convergence of advanced hardware, sophisticated algorithms, and AI-driven solutions is shaping the next frontier of data management and computing. Emphasizing practical implications and future possibilities, this final chapter aims to equip readers with a comprehensive understanding and vision of how these integrated technologies will continue to transform the landscape of computing and data management.
This chapter delves into the transformative world of ray tracing, a technology reshaping computational graphics and data processing. It bridges the gap between advanced graphical rendering and general computational tasks, exploring how ray tracing hardware, originally designed for stunning visual effects, is now being harnessed for diverse applications beyond graphics. The chapter employs Nvidia GPU RT Cores and the OptiX programming framework as conduits to explain ray tracing’s fundamental concepts and practical implementations.
This chapter offers a comprehensive examination of main memory, considering both its architectural aspects and its critical role in systems software. The discussion includes the utilization of physical memory addresses as a linkage mechanism, connecting programs in virtual space to their corresponding execution spaces in the cache and main memory. The chapter also presents advancements in CPU and memory products, elucidating their relevance to memory management. Additionally, it introduces the concept of the OS buffer cache and the development of a key–value store at the user level, highlighting their significance in the broader context of data management systems.
Providing a comprehensive introduction to GPU programming and its application in data management, this chapter uses sorting algorithms as a case study. It explores how parallel programming and architecture-oriented performance tuning are integral to unlocking the full potential of GPUs as powerful computing devices. The chapter takes readers through the transformation process of a sequential bubble sorting algorithm into GPU-friendly bitonic sorting and odd–even merging sorting algorithms, illustrating the capabilities and advantages of GPU computing in data management.
Focusing on the intricate world of spatial data management, this chapter offers an in-depth analysis of how spatial data management tasks, specifically in the context of pathology imaging applications, are approached and optimized on traditional CPU-based computing platforms versus GPU-accelerated platforms. Employing a case-study methodology, the chapter not only delves into the specifics of these applications but also extrapolates broader methodologies and strategies for leveraging advanced hardware to enhance application performance.
Addressing the challenges associated with data movement within the memory hierarchy, this chapter explores solutions from both hardware and systems software perspectives. It places special emphasis on buffer management techniques aimed at optimizing data movement and reducing access latency. The chapter also delves into the significance of nonvolatile memory (NVM), particularly flash memory devices, and their role in mitigating access latency within the memory hierarchy. Readers gain insights into strategies employed to minimize data movement, enhancing overall memory performance, a critical aspect of efficient data management.