Skip to main content Accessibility help
×
Hostname: page-component-76fb5796d-9pm4c Total loading time: 0 Render date: 2024-04-25T14:34:04.020Z Has data issue: false hasContentIssue false

4 - Data Integration for Systemic Risk in the Financial System

from PART I - DATA: THE PREREQUISITE FOR MANAGING SYSTEMIC RISK

Published online by Cambridge University Press:  05 June 2013

Arnon Rosenthal
Affiliation:
Usa
Len Seligman
Affiliation:
Usa
Jean-Pierre Fouque
Affiliation:
University of California, Santa Barbara
Joseph A. Langsam
Affiliation:
University of Maryland, College Park
Get access

Summary

Abstract The need to integrate data from many sources is likely to be a key bottleneck in the analysis of systemic risk. Oversimplified approaches result in large amounts of code being written and maintained by hand, and many software engineers having access to sensitive data. We describe the separate dimensions of the problem – many “silver bullets” only address one or two. We explain why it is neither feasible nor desirable to use a single standard for all data sharing for financial research, and then explore the pros and cons of standards at different levels of detail. We then explore some of the tools needed to implement and evolve a set of data exchanges. Finally, we discuss how data difficulties raise important research issues for both financial analysis experts and computer scientists.

The systemic risk data integration challenge

Systemic risk analysis in the financial sector requires massive data integration. Information must be collected from hundreds (and ultimately, perhaps, thousands) of financial firms, corresponding entities must be matched, and all the data must be transformed to meet the needs of systemic risk analysis models. The Dodd–Frank act established an Office of Financial Research (OFR) and gave it the mission of collecting, integrating, and analyzing diverse data in order to better track and analyze financial systemic risk. The data integration challenge is daunting and, if handled poorly, will be a significant impediment to the OFR being able to do systemic analyses.

Type
Chapter
Information
Publisher: Cambridge University Press
Print publication year: 2013

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Army (2009). US Army Report and Message Formats. Department of Army (Headquarters) FM 6–99.2 (FM 101–5–2).
Aumueller, D., Do, H.-H., Massmann, S. and Rahm, E. (2005). Schema and ontology mapping with COMA++. In SIGMOD 2005: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, ACM, 906–908.Google Scholar
Brodie, M. and Stonebraker, M. (1995). Migrating Legacy Systems: Gateways, Interfaces & the Incremental ApproachMorgan Kaufmann.Google Scholar
Chen, K., Madhavan, J., and Halevy, A. (2009). Exploring schema repositories with Schemr. In ACM SIGMOD/PODS 2009 Conference, Providence, Rhode Island, June 30 – July 2.Google Scholar
Do, H.H. and Rahm, E. (2002). COMA – A system for flexible combination of schema matching approaches. In VLDB 2002, Proceedings of 28th International Conference on Very Large Data Bases, Hong Kong, China, August 20–23.Google Scholar
Doan, A., Domingos, P., and Halevy, A.Y. (2003). Learning to match the schemas of databases: a multistrategy approach. Machine Learning, 50 (3) 279–301.CrossRefGoogle Scholar
Doan, A., Madhavan, J., Domingos, P., and Halevy, A.Ontology mapping: a machine learning approach. In Handbook on Ontologies, S., Staab and R., Stuber (eds), Springer, 385–404.
Fagin, R., Kolaitis, P.G., Miller, R.J., and Popa, L. (2005). Data exchange: semantics and query answering. Theoretical Computer Science, 336 (1) 89–124.CrossRefGoogle Scholar
Fagin, R., Kolaitis, P.G., and Popa, L. (2005). Data exchange: getting to the core. ACM Trans. Database Syst., 30 (1) 174–210.CrossRefGoogle Scholar
Gartner, (2005). Gartner says more than 50 percent of data warehouse projects will have limited acceptance or will be failures through 2007. http://www.gartner.com/ it/page.jsp?id=492112.Google Scholar
Goh, C., Madnick, S.E., and Siegel, M.D. (1994). Context interchange: overcoming the challenges of large-scale interoperable database systems in a dynamic environment. In Proceedings of the Third International Conference on Information and Knowledge Management (CIKM 1994), ACM, 337–346.Google Scholar
Goh, C., Bressan, S., Madnick, S.E., and Siegel, M.D. (1999). Context interchange: new features and formalisms for the intelligent integration of information. ACM Transactions on Information Systems 17 (3) 270–293.CrossRefGoogle Scholar
Haas, L.M. (2007). Beauty and the beast: the theory and practice of information integration. In 11th International Conference on Database Theory (ICDT 2007), 28–43.Google Scholar
Madhavan, J., Bernstein, P.A., and Rahm, E. (2001). Generic schema matching with Cupid. In VLDB 2001, Proceedings of 27th International Conference on Very Large Data Bases, Rome Italy, September 11–14. Morgan Kaufmann, 49–58.Google Scholar
Madhavan, J., Cohen, S., Dong, X.L., Halevy, A.Y., Jeffery, S.R., Ko, D. and Yu, C. (2007). Web-scale data integration: you can afford to Pay as You Go. In CIDR 2007, Third Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 7–10. Online Proceedings www.cidrdb.org/cidr2007.Google Scholar
Mork, P., Seligman, L., Rosenthal, A., Morse, M., Wolf, C., Hoyt, J., and Smith, K. (2009). Galaxy: encouraging data sharing among sources with schema variants. Demo at IEEE International Conference on Data Engineering, Shanghai, China, March 29 – April 2 IEEE.Google Scholar
Mork, P., Seligman, L.J., Rosenthal, A., Korb, J., and Wolf, C. (2008). The Harmony Integration Workbench. Journal on Data Semantics 11 65–93.Google Scholar
Motro, A. (1996). Panorama: A database system that annotates its answers to queries with their properties. Journal of Intelligent Information Systems 7 (1) 51–73.CrossRefGoogle Scholar
Rahm, E., and Bernstein, P.A. (2001). A survey of approaches to automatic schema matching. The VDLB Journal 10 (4) 334–350.Google Scholar
Rosenthal, A., Allen, L.M.D., and Chapman, A. (2013). Fit for purpose: towards an engineering basis for data exchange standards. In IWEI 2013, IFIP Working Conference on Enterprise Interoperability, Enschede, The Netherlands, March 27–28.Google Scholar
Rosenthal, A., Seligman, L.J., and Renner, S. (2004). From semantic integration to semantics management: case studies and a way forward. SIGMOD Record 33 (4) 44–50.CrossRefGoogle Scholar
Sciore, E., Siegel, M., and Rosenthal, A. (1994). Using semantic values to facilitate interoperability among heterogeneous information systems. ACM Transactions on Database Systems 19 (2) 254–290.CrossRefGoogle Scholar
Seligman, L., Mork, P., Halevy, A., Smith, K., Carey, M.J., Chen, K., Wolf, C., Madhavan, J., Kannan, A., and Burdick, D. (2010). OpenII: an open source information integration toolkit. In Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2010, Indianapolis, Indiana, USA, June 6–10.Google Scholar
Smith, K., Bonaceto, C., Wolf, C., Yost, B., Morse, M., Mork, P., and Burdick, D. (2010). Exploring schema similarity at multiple resolutions. In Proceedings of the ACM SIG-MOD International Conference on Management of Data, SIGMOD 2010, Indianapolis, Indiana, USA, June 6–10.Google Scholar
Yu, C. and Jagadish, H.V. (2006). Schema summarization. In Proceedings of the 32nd International Conference on Very Large Data Bases, Seoul, Korea, September 12–15. ACM. http://www.pubzone.org/dblp/conf/vldb/YuJ06a.Google Scholar
Zhu, H. and Wu, H. (2011). Quality of data standards: framework and illustration using XBRL taxonomy and instances. Electronic Markets 21 (2) 129–139.CrossRefGoogle Scholar

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×