Skip to main content Accessibility help
×
×
Home

Annotating a broad range of anaphoric phenomena, in a variety of genres: the ARRAU Corpus

  • Olga Uryupina (a1), Ron Artstein (a2), Antonella Bristot (a1) (a2) (a3) (a4) (a5) (a6), Federica Cavicchio (a3), Francesca Delogu (a4), Kepa J. Rodriguez (a5) and Massimo Poesio (a6)...

Abstract

This paper presents the second release of arrau, a multigenre corpus of anaphoric information created over 10 years to provide data for the next generation of coreference/anaphora resolution systems combining different types of linguistic and world knowledge with advanced discourse modeling supporting rich linguistic annotations. The distinguishing features of arrau include the following: treating all NPs as markables, including non-referring NPs, and annotating their (non-) referentiality status; distinguishing between several categories of non-referentiality and annotating non-anaphoric mentions; thorough annotation of markable boundaries (minimal/maximal spans, discontinuous markables); annotating a variety of mention attributes, ranging from morphosyntactic parameters to semantic category; annotating the genericity status of mentions; annotating a wide range of anaphoric relations, including bridging relations and discourse deixis; and, finally, annotating anaphoric ambiguity. The current version of the dataset contains 350K tokens and is publicly available from LDC. In this paper, we discuss in detail all the distinguishing features of the corpus, so far only partially presented in a number of conference and workshop papers, and we also discuss the development between the first release of arrau in 2008 and this second one.

Copyright

Corresponding author

Corresponding author. Email: uryupina@gmail.com

References

Hide All
Amoia, M., Kunz, K. and Lapshinova-Koltunski, E. (2011). Discontinuous constituents: A problematic case for parallel corpora annotation and querying. In Proceedings of RANLP2011 Workshop on Annotation and Exploitation of Parallel Corpora, pp. 210.
Artstein, R. and Poesio, M. (2006). Identifying reference to abstract objects in dialogue. In Schlangen, D. and Fernandez, R. (eds.), Proceedings of the Workshop on the Semantics and Pragmatics of Dialogue.
Artstein, R. and Poesio, M. (2008). Inter-coder agreement for computational linguistics. Computational Linguistics 34(4), 555596.
Asher, N. (1993). Reference to Abstract Objects in English. Dordrecht: D. Reidel.
Björkelund, A. and Farkas, R. (2012). Data-driven multilingual coreference Resolution using resolver stacking. In Proceedings of the Conference on Computational Natural Language Learning: Shared Task, 4955.
Björkelund, A. and Kuhn, J. (2014). Learning structured perceptions for coreference resolution with latent antecedents and non-local features. In Proceedings of the Annual Meeting of the Association for Computational Linguistics, pp. 4757.
Botley, S.P. (2006). Indirect anaphora: Testing the limits of corpus-based linguistics. International Journal of Corpus Linguistics 11(1), 73112.
Boyd, A., Dickinson, M. and Meurers, D. (2008). On detecting errors in dependency treebanks. Research on Language and Computation 6(2), 113137.
Burga, A., Cajal, S., Codina-Filba, J. and Wanner, L. (2016). Towards multiple antecedent coreference resolution in specialized discourse. In Proceedings of the Language Resources and Evaluation Conference.
Byron, D. and Allen, J. (1998). Resolving demonstrative anaphora in the TRAINS-93 corpus. In Proceedings of the Second Colloquium on Discourse, Anaphora and Reference Resolution, University of Lancaster.
Byron, D. (2002). Resolving pronominal references to abstract entities. In Proceedings of the Annual Meeting of the Association for Computational Linguistics, pp. 8087.
Carletta, J. (1996). Assessing agreement on classification tasks: The Kappa statistic. Computational Linguistics 22(2), 249254.
Carlson, G.N. and Pelletier, F.J. (eds.) (1995). The Generic Book. Chicago, IL: University of Chicago Press.
Carlson, L., Marcu, D. and Okurowski, M.E. (2002). RST Discourse Treebank LDC2002T07.
Carlson, L., Marcu, D. and Okurowski, M.E. (2003). Building a discourse-tagged corpus in the framework of rhetorical structure theory. In Kuppevelt, J. and Smith, R. (eds.), Current Directions in Discourse and Dialogue. Dordrecht: Kluwer, pp. 85112.
Chafe, W.L. (1980). The Pear Stories: Cognitive, Cultural and Linguistic Aspects of Narrative Production. Norwood, NJ: Ablex.
Chamberlain, J., Poesio, M. and Kruschwitz, U. (2016). Phrase detectives corpus 1.0: Crowdsourced anaphoric coreference. In Proceedings of the Language Resources and Evaluation Conference.
Chiarcos, C. and Krasavina, O. (2005). Annotation Guidelines PoCoS—Potsdam Coreference Scheme: Core Scheme, Draft 0.912. Potsdam and Berlin: University of Potsdam and Humboldt University.
Clark, H.H. (1975). Bridging. In Proceedings of the Conference on Theoretical Issues in Natural Language Processing.
Clark, K. and Manning, C.D. (2016). Improving coreference resolution by learning entity-level distributed representations. In Proceedings of ACL, Berlin.
Davies, S., Poesio, M., Bruneseaux, F. and Romary, L. (1998). Annotating Coreference in Dialogues: A Proposal for a Scheme for MATE. Available at http://www.cogsci.ed.ac.uk/poesio/MATE/anno_manual.html.
Dickinson, M. and Lee, C.M. (2008). Detecting errors in semantic annotation. In Proceedings of the Language Resources and Evaluation Conference.
Dickinson, M. and Meurers, D. (2003). Detecting errors in part-of-speech annotation. In Proceedings of the 10th Conference of European chapter of the Association for Computational Linguistics, pp. 107114.
Dipper, S. and Zinsmeister, H. (2012). Annotating abstract anaphora. Language Resources and Evaluation 46(1), 3752.
Doddington, G., Mitchell, A., Przybocki, M., Ramshaw, L., Strassell, S. and Weischedel, R. (2004). The automatic content extraction (ACE) program–tasks, data, and evaluation. In Proceedings of the Language Resources and Evaluation Conference.
Durrett, G. and Klein, D. (2013). Easy victories and uphill battles in coreference resolution. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 19711982.
Eckert, M. and Strube, M. (2000). Dialogue acts, synchronising units and anaphora resolution. Journal of Semantics 17(1), 5189.
Fernandes, E.R., dos Santos, C.N. and Milidiú, R.L. (2014). Latent trees for coreference resolution. Computational Linguistics 40(4), 801835.
Frank, A., Bögel, T., Hellwig, O. and Reiter, N. (2012). Semantic annotation for the digital humanities using Markov logic networks for annotation consistency control. Linguistic Issues in Language Technology 121.
Friedrich, A., Palmer, A., Sorensen, M.P. and Pinkal, M. (2015). Annotating genericity: A survey, a scheme, and a corpus. In Proceedings of the Annual Meeting of the Association for Computational Linguistics.
Gardent, C. and Manuélian, H. (2005). Creation d’un corpus annotée pour le traitment des descriptions définies. Traitement Automatique des Langues 46(1), 115139.
Ge, N., Hale, J. and Charniak, E. (1998). A statistical approach to anaphora resolution. In Proceedings of the 6th Workshop on Very Large Corpora, pp. 161170.
Grishina, Y. and Stede, M. (2015). Knowledge-lean projection of coreference chains across languages. In Proceedings of the 8th Workshop on Building and Using Comparable Corpora, pp. 1422.
Guillou, L., Hardmeier, C., Smith, A., Tiedemann, J. and Webber, B. (2014). ParCor 1.0: A parallel pronoun-coreference corpus to support statistical MT. In Proceedings of the Language Resources and Evaluation Conference, pp. 31913198.
Gundel, J.K., Hedberg, N. and Zacharski, R. (2002). Pronouns without explicit antecedents: How do we know when a pronoun is referential?. In Proceedings of the Discourse Anaphora and Anaphor Resolution Colloquium.
Gundel, J.K., Hegarty, M. and Borthen, K. (2003). Cognitive status, information structure, and pronominal reference to clausally introduced entities. Journal of Logic, Language and Information 12(3), 281299.
Hasler, L., Orasan, C. and Naumann, K. (2006). NPs for events: Experiments in coreference annotation. In Proceedings of the Language Resources and Evaluation Conference.
Hendrickx, I., Bouma, G., Coppens, F., Daelemans, W., Hoste, V., Kloosterman, G., Mineur, A.-M., Van Der Vloet, J. and Verschelde, J.-L. (2008). A coreference corpus and resolution system for Dutch. In Proceedings of the Language Resources and Evaluation Conference.
Herbelot, A. and Copestake, A. (2008). Annotating genericity: How do humans decide? (a case study in ontology extraction). In Featherston, S. and Winkler, S. (eds.), The Fruits of Empirical Linguistics. Berlin: de Gruyter.
Hinrichs, E., Kübler, S. and Naumann, K. (2005). A unified representation for morphological, syntactic, semantic and referential annotations. In ACL Workshop on Frontiers in Corpus Annotation II: Pie in the Sky.
Hou, Y., Markert, K. and Strube, M. (2013). Global inference for bridging anaphora resolution. In Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics.
Hollenstein, N., Schneider, N. and Webber, B. (2016). Inconsistency detection in semantic annotation. In Proceedings of the Language Resources and Evaluation Conference, pp. 39863990.
Kolhatkar, V. and Hirst, G. (2012). Resolving “this-issue” anaphora. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 12551265.
Kolhatkar, V. and Hirst, G. (2014). Resolving shell nouns. In Proceedings of the Conference on Empirical Methods in Natural Language Processing.
Kolhatkar, V., Zinsmeister, H. and Hirst, G. (2013). Interpreting anaphoric shell nouns using antecedents of cataphoric shell nouns as training data. In Proceedings of the Conference on Empirical Methods in Natural Language Processing.
Kolhatkar, V. (2014). Resolving Shell Nouns. PhD Thesis, University of Toronto.
Krasavina, O. and Chiarcos, C. (2007). The potsdam coreference scheme. In Proceedings of the 1st Linguistic Annotation Workshop, 156163.
Krippendorf, K. (2004). Content Analysis: An Introduction to Its Methodology, 2nd Edn., chapter 11. Thousand Oaks, CA: Sage.
Kummerfeld, J.K., Bansal, M., Burkett, D. and Klein, D. (2011). Mention detection: Heuristics for the OntoNotes annotations. In Proceedings of the Conference on Computational Natural Language Learning: Shared Task, 102106.
Kunz, K. and Lapshinova-Koltunski, E. (2015). Cross-linguistic analysis of discourse variation across genres. Nordic Journal of English Studies 14(1), 258288.
Kunz, K., Lapshinova-Koltunski, E. and Martínez, J.M. (2016). Beyond identity coreference: Contrasting indicators of textual coherence in English and German. In Proceedings of the Workshop on Coreference Resolution beyond OntoNotes, pp. 2331.
Lapshinova-Koltunski, E. and Kunz, K.A. (2014). Annotating cohesion for multilingual analysis. In Proceedings of the LREC ISO Workshop on Interoperable semantic resources, pp. 5764.
Lee, H., Chang, A., Peirsman, Y., Chambers, N., Surdeanu, M. and Jurafsky, D. (2013). Deterministic coreference resolution based on entity-centric, precision-ranked rules. Computational Linguistics 39(4), 885916.
Lee, K., He, L., Lewis, M. and Zettlemoyer, L. (2017). End-to-end neural coreference resolution. In Proceedings of EMNLP.
Loukachevitch, N.V., Dobrov, G.B., Kibrik, A.A., Khudyajova, M.V. and Linnik, A.S. (2011). Factors in referential choice. In Proceedings of Dialogue, Moscow.
Marasović, A., Born, L., Opitz, J. and Frank, A. (2017). A mention-ranking model for abstract anaphora resolution. In Proceedings of the Conference on Empirical Methods in Natural Language Processing.
Marcus, M.P., Santorini, B. and Marcinkiewicz, M.A. (1993). Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics 19(2), 313330.
Martschat, S. and Strube, M. (2015). Latent structures for coreference resolution. Transactions of the Association for Computational Linguistics 3, 405418.
Moosavi, N.S. and Strube, M. (2016). A proposal for a link-based entity aware metric. In Proceedings of the Annual Meeting of the Association for Computational Linguistics.
Markert, K., Hou, Y. and Strube, M. (2012). Collective classification for fine-grained information status. In Proceedings of the Annual Meeting of the Association for Computational Linguistics.
Modieska, N.N. (2003). Resolving Other Anaphors. PhD Thesis, University of Edinburgh.
Müller, C. and Strube, M. (2006). Multi-level annotation of linguistic data with MMAX2. In Braun, S., Kohn, K. and Mukherjee, J. (eds.), Corpus Technology and Language Pedagogy. New Resources, New Tools, New Methods, vol. 3 of English Corpus Linguistics. New York: Peter Lang, pp. 197214.
Müller, M.-C. (2008). Fully Automatic Resolution of it, This and That in Unrestricted Multy-Party Dialog. PhD Thesis, Universität Tübingen.
Nasr, A., Damnati, G., Guerraz, A. and Bechet, F. (2016). Syntactic parsing of chat language in contact center conversation corpus. In Proceedings of the Annual SIGdial Meeting on Discourse and Dialogue, pp. 175184.
Navarretta, C. (2000). Abstract anaphora resolution in Danish. In Proceedings of SIGdial Workshop on Discourse and Dialogue, pp. 5665.
Navarretta, C. (2008). Pronominal types and abstract reference in the Danish and Italian DAD Corpora. In Johansson, C. (ed.), Proceedings of the Second Workshop on Anaphora Resolution NEALT Proceedings Series, Bergen. pp. 6371.
Nedoluzhko, A. (2013). Generic noun phrases and annotation of coreference and bridging relations in the Prague Dependency Treebank. In Proceedings of the Linguistic Annotation Workshop, pp. 103111.
Nedoluzhko, A., Mirokvský, J. and Pajas, P. (2009a). The coding scheme for annotating extended nominal coreference and bridging anaphora in the Prague Dependency Treebank. In Proceedings of the Linguistic Annotation Workshop, pp. 108111.
Nedoluzhko, A., Mírovský, J., Ocelák, R. and Pergler, J. (2009b). Extended coreferential relations and bridging anaphora in the Prague Dependency Treebank. In Proceedings of the 7th Discourse Anaphora and Anaphor Resolution Colloquium, 116.
Neumann, S. (2013). Contrastive Register Variation: A Quantitative Approach to the Comparison of English and German. Berlin: Mouton de Gruyter.
Nguyen, N.L.T., Kim, J.-D. and Tsujii, J. (2008). Challenges in pronoun resolution system for biomedical text. In Proceedings of the Language Resources and Evaluation Conference.
Nissim, M., Dingare, S., Carletta, J. and Steedman, M. (2004). An annotation scheme for information status in dialogue. In Proceedings of the Language Resources and Evaluation Conference.
Palmer, M., Gildea, D. and Kingsbury, (2005). The proposition bank: A corpus annotated with semantic roles. Computational Linguistics 31(1), 71106.
Passonneau, R.J. (1997). Instructions for applying discourse reference annotation for multiple applications (DRAMA). unpublished manuscript.
Peng, H., Chang, K.-W. and Roth, D. (2015). A joint framework for coreference resolution and mention head detection. In Proceedings of the Conference on Computational Natural Language Learning.
Poesio, M. (2000a). The GNOME Annotation Scheme Manual, 4th Edn. Scotland: University of Edinburgh, HCRC and Informatics. Available at http://cswww.essex.ac.uk/Research/nle/corpora/GNOME/anno_manual_4.htm.
Poesio, M. (2000b). Annotating a corpus to develop and evaluate discourse entity realization algorithms: Issues and preliminary results. In Proceedings of the Language Resources and Evaluation Conference, pp. 211218.
Poesio, M. (2004a). The MATE/gnome scheme for anaphoric annotation, revisited. In Proceedings of the SIGdial Workshop on Discourse and Dialogue.
Poesio, M. (2004b). Discourse annotation and semantic annotation in the gnome corpus. In Proceedings of the ACL Workshop on Discourse Annotation, pp. 7279.
Poesio, M., Stevenson, R., Di Eugenio, B. and Hitzeman, J.M. (2004a). Centering: A parametric theory and its instantiations. Computational Linguistics 30(3), 309363.
Poesio, M., Mehta, R., Maroudas, A. and Hitzeman, J. (2004b). Learning to solve bridging references. In Proceedings of the Annual Meeting of the Association for Computational Linguistics, pp. 143150.
Poesio, M., Pradhan, S., Recasens, M., Rodriguez, K. and Versley, Y. (2016). Annotated corpora and annotation tools. In Poesio, M., Stuckardt, R. and Versley, Y. (eds.), Anaphora Resolution: Algorithms, Resources and Applications, chapter 4. Berlin and Heidelberg: Springer.
Poesio, M. and Artstein, R. (2005a). The reliability of anaphoric annotation, reconsidered: Taking ambiguity into account. In Meyers, A. (ed.), Proceedings of ACL Workshop on Frontiers in Corpus Annotation. pp. 7683.
Poesio, M. and Artstein, R. (2005b). Annotating (anaphoric) ambiguity. In Proceedings of the Corpus Linguistics Conference, Birmingham.
Poesio, M., Delmonte, R., Bristot, A., Chiran, L. and Tonelli, S. (2004c). The VENEX Corpus of Anaphoric Information in Spoken and Written Italian. Available at http://cswww.essex.ac.uk/staff/poesio/publications/VENEX04.pdf.
Poesio, M., Patel, A. and Di Eugenio, B. (2006a). Discourse structure and anaphora in tutorial dialogues: An empirical analysis of two theories of the global focus. Research in Language and Computation 4, 229257 (special Issue on Generation and Dialogue).
Poesio, M., Chamberlain, J., Kruschwitz, U., Robaldo, L. and Ducceschi, L. (2013). Phrase detectives: Utilizing collective intelligence for Internet-scale language resource creation. ACM Transactions on Intelligent Interactive Systems 3(1), 144.
Poesio, M., Grishina, Y., Kolhatkar, V., Moosavi, N., Roesiger, I., Roussel, A., Simonjetz, F., Uma, A., Uryupina, O., Yu, J. and Zinsmeister, H. (2018). Anaphora resolution with the arrau corpus. In Proceedings of the NAACL Workshop on Computational Models of Reference, Anaphora and Coreference.
Poesio, M., Bruneseaux, F. and Romary, L. (1999). The MATE meta-scheme for coreference in dialogues in multiple languages. In Walker, M. (ed.), Proceedings of the ACL Workshop on Standards and Tools for Discourse Tagging, 6574.
Poesio, M., Sturt, P., Arstein, R. and Filik, R. (2006b). Underspecification and anaphora: Theoretical issues and preliminary evidence. Discourse Processes 42(2), 157175.
Poesio, M. and Reyle, U. (2001). Underspecification in anaphoric reference. In Bunt, E.T.H. and van der Sluis, I. (eds), Proceedings of the Fourth International Workshop on Computational Semantics, Tilburg: Tilburg University, pp. 286300.
Poesio, M. and Vieira, R. (1998). A corpus-based investigation of definite description use. Computational Linguistics 24(2), 183216. Also available as Research Paper CCS-RP-71, Centre for Cognitive Science, University of Edinburgh.
Poesio, M. and Artstein, R. (2008). Anaphoric annotation in the arrau corpus. In Proceedings of the Language Resources and Evaluation Conference.
Poesio, M. and Rieser, H. (2010). Completions, coordination, and alignment in dialogue. Dialogue and Discourse 1(1) 189.
Pradhan, S., Ramshaw, L., Weischedel, R., MacBride, J. and Micciulla, L. (2007). Unrestricted coreference: Identifying entities and events in OntoNotes. In in Proceedings of the IEEE International Conference on Semantic Computing.
Pradhan, S., Ramshaw, L., Marcus, M., Palmer, M., Weischedel, R. and Xue, N. (2011). CoNLL-2011 Shared Task: Modeling unrestricted coreference in OntoNotes. In Proceedings of the Conference on Computational Natural Language Learning.
Pradhan, S., Moschitti, A., Xue, N., Uryupina, O. and Zhang, Y. (2012). CoNLL-2012 shared task: Modeling multilingual unrestricted coreference in OntoNotes. In Proceedings of the Sixteenth Conference on Computational Natural Language Learning.
Pradhan, S., Luo, X., Recasens, M., Hovy, E., Ng, V. and Strube, M. (2014). Scoring coreference partitions of predicted mentions: A reference implementation. In Proceedings of the Annual Meeting of The Association for Computational Linguistics.
Prince, E.F. (1992). The ZPG letter: Subjects, definiteness, and information status. In Thompson, S. and Mann, W. (eds.), Discourse Description: Diverse Analyses of a Fund-Raising Text. Amsterdam, The Netherlands: John Benjamins, pp. 295325.
Reiter, N. and Frank, A. (2010). Identifying generic noun phrases. In Proceedings of the Annual Meeting of the Association for Computational Linguistics, pp. 4049.
Roesiger, I. (2018). Rule- and learning-based methods for bridging resolution in the arrau corpus. In Proceedings of the NAACL Worskhop on Computational Models of Reference, Anaphora and Coreference.
Recasens, M. (2008). Discourse deixis and coreference: Evidence from AnCoRa. In Johansson, C. (ed.), Proceedings of the 2nd Workshop on Anaphora Resolution.
Recasens, M., àrquez, L.M., Sapena, E., Mart, M.A.í, Taulé, M., Hoste, V., Poesio, M. and Versley, Y. (2010). SemEval-2010 Task 1: Coreference resolution in multiple languages. In Proceedings of the International Workshop on Semantic Evaluation.
Recasens, M. and Martí, M.A. (2010). AnCora-CO: Coreferentially annotated corpora for Spanish and Catalan. Language Resources and Evaluation 44(4), 315345.
Recasens, M., Hovy, E. and Martí, M.A. (2011). Identity, non-identity, and near-identity: Addressing the complexity of coreference. Lingua 121(6), 11381152.
Rodriguez, K. (2010). Resources for Linguistically Motivated Multilingual Anaphora Resolution. PhD Thesis, Universitá di Trento.
Rodriguez, K.-J., Delogu, F., Versley, Y., Stemle, E. and Poesio, M. (2010). Anaphoric annotation of Wikipedia and blogs in the live memories Corpus. In Proceedings of the Language Resources and Evaluation Conference.
Schuster, E. (1988). Pronominal Reference to Events and Actions: Evidence from Naturally-Occurring Data (LINC LAB 100). Philadelphia, PA: Department of Computer and Information Science, University of Pennsylvania.
Stede, M. (2004). The potsdam commentary corpus. In Proceedings of the ACL Workshop on Discourse Annotation.
Uryupina, O. and Poesio, M. (2012). Domain-specific vs. uniform modeling for coreference resolution. In Proceedings of the Language Resources and Evaluation Conference.
Uryupina, O. and Poesio, M. (2013). Evalita 2011: Anaphora resolution task. In Magnini, B., Cutugno, F., Falcone, M. and Pianta, E. (eds.), Evaluation of Natural Language and Speech Tools for Italian, vol. 7689 of Lecture Notes in Computer Science. Springer, pp. 146155.
Uryupina, O., Artstein, R., Bristot, A., Cavicchio, F., Rodriguez, K.J. and Poesio, M. (2016a). arrau: Linguistically-motivated annotation of anaphoric description. In Proceedings of the Language Resources and Evaluation Conference.
Uryupina, O. and Moschitti, A. (2013). Multilingual mention detection for coreference resolution. In Proceedings of the International Joint Conference on Natural Language Processing.
Uryupina, O., Kabadjov, M. and Poesio, M. (2016b). Detecting non-reference and non-anaphoricity. In Poesio, M., Stuckardt, R. and Versley, Y. (eds.), Anaphora Resolution: Algorithms, Resources and Applications, chapter 13. Berlin and Heidelberg: Springer.
van Deemter, K. and Kibble, R. (2000). On coreferring: Coreference in MUC and related annotation schemes. Computational Linguistics 26(4), 629637.
Versley, Y. (2005). Parser evaluation across text types. In Proceedings of the Fourth Workshop on Treebanks and Linguistic Theories.
Versley, Y. (2008). Vagueness and referential ambiguity in a large-scale annotated corpus. Research on Language and Computation 6, 333353.
Versley, Y., Ponzetto, S., Poesio, M., Eidelman, V., Jern, A., Smith, J., Yang, X. and Moschitti, A. (2008). BART: A modular toolkit for coreference resolution. In Proceedings of the Annual Meeting of The Association for Computational linguistics, demo session.
Vieira, R. (1998). Definite Description Resolution in Unrestricted Texts. PhD Thesis, University of Edinburgh, Centre for Cognitive Science.
Webber, B.L. (1991). Structure and ostension in the interpretation of discourse deixis. Language and Cognitive Processes 6(2), 107135.
Weischedel, R., Hovy, E., Marcus, M., Palmer, M., Belvin, R., Pradhan, S., Ramshaw, L. and Xue, N. (2011). OntoNotes: A large training corpus for enhanced processing. In Olive, J., Christianson, C. and McCary, J. (eds.), Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation. Berlin and Heidelberg: Springer.
Wiseman, S.J., Rush, A.M., Shieber, S.M. and Weston, J. (2015). Learning anaphoricity and antecedent ranking features for coreference resolution. In Proceedings of the Annual Meeting of The Association for Computational linguistics.
Zhekova, D. and Kübler, S. (2013). Machine learning for mention head detection in multilingual coreference resolution. In Proceedings of the Recent Advance in Natural Language Processing Conference, 747754.
Recommend this journal

Email your librarian or administrator to recommend adding this journal to your organisation's collection.

Natural Language Engineering
  • ISSN: 1351-3249
  • EISSN: 1469-8110
  • URL: /core/journals/natural-language-engineering
Please enter your name
Please enter a valid email address
Who would you like to send this to? *
×

Keywords

Metrics

Full text views

Total number of HTML views: 0
Total number of PDF views: 0 *
Loading metrics...

Abstract views

Total abstract views: 0 *
Loading metrics...

* Views captured on Cambridge Core between <date>. This data will be updated every 24 hours.

Usage data cannot currently be displayed