A framework for enriching lexical semantic resources with distributional semantics

CHRIS BIEMANN; STEFANO FARALLI; ALEXANDER PANCHENKO; SIMONE PAOLO PONZETTO

doi:10.1017/S135132491700047X

A framework for enriching lexical semantic resources with distributional semantics

Published online by Cambridge University Press: 15 January 2018

CHRIS BIEMANN ,

STEFANO FARALLI ,

ALEXANDER PANCHENKO and

SIMONE PAOLO PONZETTO

Show author details

CHRIS BIEMANN: Affiliation:
Language Technology Group, Department of Informatics, Faculty of Mathematics, Informatics, and Natural Sciences, Universität Hamburg, Germany e-mails: biemann@informatik.uni-hamburg.de, panchenko@informatik.uni-hamburg.de
STEFANO FARALLI: Affiliation:
Data and Web Science Group, School of Business Informatics and Mathematics, Universität Mannheim, Germany e-mails: stefano@informatik.uni-mannheim.de, simone@informatik.uni-mannheim.de
ALEXANDER PANCHENKO: Affiliation:
Language Technology Group, Department of Informatics, Faculty of Mathematics, Informatics, and Natural Sciences, Universität Hamburg, Germany e-mails: biemann@informatik.uni-hamburg.de, panchenko@informatik.uni-hamburg.de
SIMONE PAOLO PONZETTO: Affiliation:
Data and Web Science Group, School of Business Informatics and Mathematics, Universität Mannheim, Germany e-mails: stefano@informatik.uni-mannheim.de, simone@informatik.uni-mannheim.de

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

We present an approach to combining distributional semantic representations induced from text corpora with manually constructed lexical semantic networks. While both kinds of semantic resources are available with high lexical coverage, our aligned resource combines the domain specificity and availability of contextual information from distributional models with the conciseness and high quality of manually crafted lexical networks. We start with a distributional representation of induced senses of vocabulary terms, which are accompanied with rich context information given by related lexical items. We then automatically disambiguate such representations to obtain a full-fledged proto-conceptualization, i.e. a typed graph of induced word senses. In a final step, this proto-conceptualization is aligned to a lexical ontology, resulting in a hybrid aligned resource. Moreover, unmapped induced senses are associated with a semantic type in order to connect them to the core resource. Manual evaluations against ground-truth judgments for different stages of our method as well as an extrinsic evaluation on a knowledge-based Word Sense Disambiguation benchmark all indicate the high quality of the new hybrid resource. Additionally, we show the benefits of enriching top-down lexical knowledge resources with bottom-up distributional information from text for addressing high-end knowledge acquisition tasks such as cleaning hypernym graphs and learning taxonomies from scratch.

Information

Type: Articles
Information: Natural Language Engineering , Volume 24 , Issue 2 , March 2018 , pp. 265 - 312

DOI: https://doi.org/10.1017/S135132491700047X [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2018

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Agirre, E. and Soroa, A. 2007. Semeval-2007 Task 02: evaluating word sense induction and discrimination systems. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval'07), Prague, Czech Republic.Google Scholar

Aprosio, A. P., Giuliano, C. and Lavelli, A. 2013. Extending the coverage of DBpedia properties using distant supervision over Wikipedia. In Proceedings of the 2013 International Conference on NLP and DBpedia – Volume 1064 (NLP-DBPEDIA'13), Sydney, Australia.Google Scholar

Banko, M., Cafarella, M. J., Soderland, S., Broadhead, M. and Etzioni, O. 2007. Open information extraction from the web. In Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI'07), Hyderabad, India.Google Scholar

Baroni, M., Bernardi, R., Do, N.-Q. and Shan, C.-C. 2012. Entailment above the word level in distributional semantics. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL'12), Avignon, France.Google Scholar

Baroni, M., Dinu, G. and Kruszewski, G. 2014. Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14), Baltimore, MD, USA.CrossRef Google Scholar

Bartunov, S., Kondrashkin, D., Osokin, A. and Vetrov, D. P. 2016. Breaking sticks and ambiguities with adaptive skip-gram. In Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS'16), Cadiz, Spain.Google Scholar

Biemann, C. 2005. Ontology learning from text: a survey of methods. LDV Forum 20 (2): 75–93.Google Scholar

Biemann, C. 2006. Chinese whispers: an efficient graph clustering algorithm and its application to natural language processing problems. In Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing, TextGraphs-1, Brooklyn, NY, USA.Google Scholar

Biemann, C. and Riedl, M. 2013. Text: now in 2D! A framework for lexical expansion with contextual similarity. Journal of Language Modelling 1 (1): 55–95.Google Scholar

Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., and Hellmann, S. 2009. DBpedia – a crystallization point for the web of data. Journal of Web Semantics 7 (3): 154–65.CrossRef Google Scholar

Bordea, G., Buitelaar, P., Faralli, S. and Navigli, R. 2015. SemEval-2015 task 17: taxonomy extraction evaluation (TExEval). In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'15), Denver, CO, USA.Google Scholar

Bordea, G., Lefever, E. and Buitelaar, P. 2016. SemEval-2016 task 13: taxonomy extraction evaluation (TExEval-2). In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar

Bordes, A., Weston, J., Collobert, R. and Bengio, Y. 2011. Learning structured embeddings of knowledge bases. In Proceedings of the 25th AAAI Conference on Artificial Intelligence (AAAI'11), San Francisco, CA, USA.Google Scholar

Brugmann, K. 1904. Kurze vergleichende Grammatik der indogermanischen Sprachen. Strassburg, France: Karl J. Trubner.Google Scholar

Bryl, V. and Bizer, C. 2014. Learning conflict resolution strategies for cross-language Wikipedia data fusion. In Proceedings of the 23rd International Conference on World Wide Web (WWW'14 Companion), Seoul, South Korea.CrossRef Google Scholar

Camacho-Collados, J., Pilehvar, M. T., and Navigli, R. 2015a. A unified multilingual semantic representation of concepts. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Volume 1: Long Papers (ACL'15), Beijing, China.Google Scholar

Camacho-Collados, J., Pilehvar, M. T., and Navigli, R. 2015b. NASARI: a novel approach to a semantically aware representation of items. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.CrossRef Google Scholar

Carlson, A., Betteridge, J., Kisiel, R., Settles, B., Hruschka, E. R. Jr., and Mitchell, T. M. 2010. Toward an architecture for never-ending language learning. In Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI'10), Atlanta, GA, USA.Google Scholar

Cecchini, F. M., Riedl, M. and Biemann, C. 2017. Using pseudowords for algorithm comparison: an evaluation framework for graph-based word sense induction. In Proceedings of the 21st Nordic Conference on Computational Linguistics (NODALIDA'17), Gothenburg, Sweden.Google Scholar

Chen, X., Liu, Z. and Sun, M. 2014. A unified model for word sense representation and disambiguation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, A meeting of SIGDAT, a Special Interest Group of the ACL (EMNLP'14), Doha, Qatar.Google Scholar

Chiarcos, C., Hellmann, S. and Nordhoff, S. 2012. Linking linguistic resources: examples from the open linguistics working group. In Chiarcos, C., Nordhoff, S., and Hellmann, S. (eds.), Linked Data in Linguistics – Representing and Connecting Language Data and Language Metadata, pp. 201–16. Heidelberg, Germany: Springer.Google Scholar

Clark, S. 2015. Vector space models of lexical meaning. In Lappin, S., and Fox, C. (eds.), Handbook of Contemporary Semantics, 2nd edition, pp. 493–522. New York, NY, USA: Wiley-Blackwell.Google Scholar

Cleuziou, G. and Moreno, J. G. 2016. QASSIT at SemEval-2016 Task 13: on the integration of semantic vectors in pretopological spaces for lexical taxonomy acquisition. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar

Cuadros, M. and Rigau, G. 2007. SemEval-2007 task 16: evaluation of wide coverage knowledge resources. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval'07), Prague, Czech Republic.Google Scholar

Cuadros, M. and Rigau, G. 2008. KnowNet: building a large net of knowledge from the web. In Proceedings of the 22nd International Conference on Computational Linguistics – Volume 1 (COLING'08), Manchester, UK.Google Scholar

Curran, J. R. 2002. Ensemble methods for automatic thesaurus extraction. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing – Volume 10 (EMNLP'02), Philadelphia, PA, USA.Google Scholar

De Marneffe, M.-C., MacCartney, B., and Manning, C. D. 2006. Generating typed dependency parses from phrase structure parses. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'06), Genoa, Italy.Google Scholar

Di Marco, A., and Navigli, R. 2013. Clustering and diversifying web search results with graph-based word sense induction. Computational Linguistics 39 (3): 709–54.Google Scholar

Evert, S. 2005. The Statistics of Word Cooccurrences: Word Pairs and Collocations. Ph.D. thesis. Institut für maschinelle Sprachverarbeitung, University of Stuttgart, Germany.Google Scholar

Fader, A., Soderland, S. and Etzioni, O. 2011. Identifying relations for open information extraction. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'11), Edinburgh, UK.Google Scholar

Faralli, S. and Navigli, R. 2012. A new minimally supervised framework for domain word sense disambiguation. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL'12), Jeju Island, South Korea.Google Scholar

Faralli, S. and Navigli, R. 2013. Growing multi-domain glossaries from a few seeds using probabilistic topic models. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, A meeting of SIGDAT, a Special Interest Group of the ACL (EMNLP'13), Seattle, WA, USA.Google Scholar

Faralli, S., Panchenko, A., Biemann, C. and Paolo Ponzetto, S. 2017. The contrast medium algorithm: taxonomy induction from noisy knowledge graphs with just a few links. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Volume 1: Long Papers (EACL'17), Valencia, Spain.CrossRef Google Scholar

Faralli, S., Stilo, G. and Velardi, P. 2015. Large scale homophily analysis in twitter using a twixonomy. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI'15), Buenos Aires, Argentina.Google Scholar

Faruqui, M., Dodge, J., Jauhar, S. K., Dyer, C., Hovy, E. H., and Smith, N. A. 2015. Retrofitting word vectors to semantic lexicons. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.Google Scholar

Faruqui, M. and Kumar, S. 2015. Multilingual open relation extraction using cross-lingual projection. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.Google Scholar

Fellbaum, C. (ed.) 1998. WordNet: An Electronic Database. Cambridge, MA: MIT Press.Google Scholar

Fleiss, J. L. 1971. Measuring nominal scale agreement among many raters. Psychological Bulletin 76 (5): 378.CrossRef Google Scholar

Gangemi, A., Guarino, N., Masolo, C., Oltramari, A., and Schneider, L. 2002. Sweetening ontologies with DOLCE. In Knowledge Engineering and Knowledge Management: Ontologies and the Semantic Web: 13th International Conference (EKAW 2002), Sigüenza, Spain, Berlin, Heidelberg.Google Scholar

Ganitkevitch, J., Van Durme, B., and Callison-Burch, C. 2013. PPDB: the paraphrase database. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'13), Atlanta, GA, USA.Google Scholar

Gil-Lafuente, A. M., and Aluja, J. G. 2012. Towards an Advanced Modelling of Complex Economic Phenomena: Pretopological and Topological Uncertainty Research Tools. Berlin, Heidelberg: Springer.Google Scholar

Glavaš, G. and Ponzetto, S.-P. 2017. Dual tensor model for detecting asymmetric lexico-semantic relations. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP'17), Copenhagen, Denmark.Google Scholar

Goikoetxea, J., Soroa, A. and Agirre, E. 2015. Random walks and neural network language models on knowledge bases. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.Google Scholar

Gurevych, I., Eckle-Kohler, J., Hartmann, S., Matuschek, M., Meyer, C. M., and Wirth, C. 2012. UBY – a large-scale unified lexical-semantic resource based on LMF. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL'12), Avignon, France.Google Scholar

Hearst, M. A. 1992. Automatic acquisition of hyponyms from large text corpora. In Proceedings of the 14th Conference on Computational Linguistics – Volume 2 (COLING'92), Nantes, France.Google Scholar

Hoffart, J., Suchanek, F. M., Berberich, K. and Weikum, G. 2013. YAGO2: a spatially and temporally enhanced knowledge base from Wikipedia. Artificial Intelligence 194: 28–61.Google Scholar

Hovy, E., Navigli, R. and Ponzetto, S.-P. 2013. Collaboratively built semi-structured content and artificial intelligence: the story so far. Artificial Intelligence 194: 2–27.Google Scholar

Jauhar, S. K., Dyer, C. and Hovy, E. 2015. Ontologically grounded multi-sense representation learning for semantic vector space models. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.Google Scholar

Jenatton, R., Roux, N. L., Bordes, A. and Obozinski, G. 2012. A latent factor model for highly multi-relational data. In Proceedings of Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems (NIPS'12), Lake Tahoe, NV, USA.Google Scholar

Jurgens, D. and Pilehvar, M. T. 2016. SemEval-2016 task 14: semantic taxonomy enrichment. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar

Kapanipathi, P., Jain, P., Venkataramani, C. and Sheth, A. 2014. User interests identification on Twitter using a hierarchical knowledge base. In Proceedings of the Semantic Web: Trends and Challenges: 11th International Conference (ESWC'14), Cham.Google Scholar

Klein, D. and Manning, C. D. 2003. Accurate unlexicalized parsing. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics – Volume 1 (ACL'03), Sapporo, Japan.Google Scholar

Klein, P., Ponzetto, S. P. and Glavaš, G. 2017. Improving neural knowledge base completion with cross-lingual projections. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers (EACL'17), Valencia, Spain.Google Scholar

Kozareva, Z. and Hovy, E. 2010. A semi-supervised method to learn and construct taxonomies using the web. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP'10), Cambridge, MA, USA.Google Scholar

Krause, S., Hennig, L., Gabryszak, A., Xu, F., and Uszkoreit, H. 2015. Sar-graphs: a linked linguistic knowledge resource connecting facts with language. In Proceedings of the 4th Workshop on Linked Data in Linguistics: Resources and Applications, Beijing, China.CrossRef Google Scholar

Lesk, M. 1986. Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In Proceedings of the 5th Annual International Conference on Systems Documentation (SIGDOC'86), Toronto, Ontario, Canada.Google Scholar

Levy, O. and Goldberg, Y. 2014. Dependency-based word embeddings. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14), Baltimore, MD, USA.CrossRef Google Scholar

Li, J. and Jurafsky, D. 2015. Do multi-sense embeddings improve natural language understanding? In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP'15), Lisbon, Portugal.Google Scholar

Lin, D. 1998. Automatic retrieval and clustering of similar words. In Proceedings of the 17th International Conference on Computational Linguistics – Volume 2 (COLING'98), Montreal, Quebec, Canada.Google Scholar

Lin, T., Pantel, P., Gamon, M., Kannan, A., and Fuxman, A. 2012. Active objects: actions for entity-centric search. In Proceedings of the 21st International Conference on World Wide Web (WWW'12), Lyon, France.Google Scholar

Maitra, P. and Das, D. 2016. JUNLP at SemEval-2016 Task 13: a language independent approach for hypernym identification. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar

McCrae, J. P., Fellbaum, C., and Cimiano, P. 2014. Publishing and linking WordNet using lemon and RDF. In Proceedings of the 3rd Workshop on Linked Data in Linguistics, Reykjavik, Iceland.Google Scholar

Mihalcea, R., Chklovski, T. and Kilgarriff, A. 2004. The Senseval-3 English lexical sample task. In Proceedings of the 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (Senseval-3), Barcelona, Spain.Google Scholar

Mihalcea, R. and Csomai, A. 2007. Wikify!: linking documents to encyclopedic knowledge. In Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM'07), Lisbon, Portugal.Google Scholar

Mihalcea, R. and Moldovan, D. I. 2001. eXtended WordNet: progress report. In Proceedings of NAACL Workshop on WordNet and Other Lexical Resources, Pittsburgh, PA, USA.Google Scholar

Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., and Dean, J. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems (NIPS'13), Lake Tahoe, NV, USA.Google Scholar

Mikolov, T., Yih, W.-T. and Zweig, G. 2013. Linguistic regularities in continuous space word representations. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Atlanta, GA, USA.Google Scholar

Miller, G. A., Leacock, C., Tengi, R. and Bunker, R. T. 1993. A semantic concordance. In Proceedings of the Workshop on Human Language Technology, Princeton, NJ, USA.Google Scholar

Mintz, M., Bills, S., Snow, R. and Jurafsky, D. 2009. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Vol. 2 (ACL'09), Suntec, Singapore.Google Scholar

Nastase, V. and Strube, M. 2013. Transforming Wikipedia into a large scale multilingual concept network. Artificial Intelligence 194: 62–85.Google Scholar

Navigli, R. 2009. Word sense disambiguation: a survey. ACM CSUR 41 (2): 1–69.Google Scholar

Navigli, R. and Ponzetto, S. P. 2012a. BabelNet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artificial Intelligence 193: 217–50.Google Scholar

Navigli, R. and Ponzetto, S. P. 2012b. Joining forces pays off: multilingual joint Word Sense Disambiguation. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL'12), Jeju Island, South Korea.Google Scholar

Navigli, R. and Velardi, P. 2010. Learning Word-class lattices for definition and hypernym extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL'10), Uppsala, Sweden.Google Scholar

Neelakantan, A., Shankar, J., Passos, A. and McCallum, A. 2014. Efficient non-parametric estimation of multiple embeddings per word in vector space. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP'14), Doha, Qatar.Google Scholar

Nickel, M., Murphy, K., Tresp, V. and Gabrilovich, E. 2016. A review of relational machine learning for knowledge graphs. Proceedings of the IEEE 104 (1): 11–33.Google Scholar

Niemann, E. and Gurevych, I. 2011. The people's web meets linguistic knowledge: automatic sense alignment of Wikipedia and wordnet. In Proceedings of the Ninth International Conference on Computational Semantics (IWCS'11), Oxford, UK.Google Scholar

Nieto Piña, L., and Johansson, R. 2016. Embedding senses for efficient graph-based word sense disambiguation. In Proceedings of the 2016 Workshop on Graph-based Methods for Natural Language Processing (Textgraphs'16), San Diego, CA, USA.Google Scholar

Norvig, P. 2016. The semantic web and the semantics of the web: where does meaning come from? In Proceedings of the 25th International Conference on World Wide Web (WWW'16), Montreal, Quebec, Canada.Google Scholar

Padó, S. and Lapata, M. 2007. Dependency-based construction of semantic space models. Computational Linguistics 33 (2): 161–99.Google Scholar

Panchenko, A. 2016. Best of both worlds: making word sense embeddings interpretable. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC'16), Portoro

, Slovenia.Google Scholar

Panchenko, A., Faralli, S., Ponzetto, S. P. and Biemann, C. 2017a. Using linked disambiguated distributional networks for word sense disambiguation. In Proceedings of the 1st EACL Workshop on Sense, Concept and Entity Representations and their Applications, Valencia, Spain.Google Scholar

Panchenko, A., Faralli, S., Ruppert, E., Remus, S., Naets, H., Fairon, C., Ponzetto, S. P., and Biemann, C. 2016. TAXI at SemEval-2016 task 13: a taxonomy induction method based on lexico-syntactic patterns, substrings and focused crawling. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar

Panchenko, A., Marten, F., Ruppert, E., Faralli, S., Ustalov, D., Ponzetto, S. P., and Biemann, C. 2017b. Unsupervised, knowledge-free, and interpretable word sense disambiguation. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP'17), Copenhagen, Denmark.Google Scholar

Panchenko, A. and Morozova, O. 2012. A study of hybrid similarity measures for semantic relation extraction. In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, Avignon, France.Google Scholar

Panchenko, A., Romanov, P., Morozova, O., Naets, H., Philippovich, A., Romanov, A., and Fairon, C. 2013. Serelex: search and visualization of semantically related words. In Proceedings of the European Conference on Information Retrieval (ECIR'13), Moscow, Russia.Google Scholar

Panchenko, A., Ruppert, E., Faralli, S., Ponzetto, S. P., and Biemann, C. 2017c. Unsupervised does not mean uninterpretable: the case for word sense induction and disambiguation. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL'17), Valencia, Spain.Google Scholar

Panchenko, A., Simon, J., Riedl, M. and Biemann, C. 2016. Noun sense induction and disambiguation using graph-based distributional semantics. In Proceedings of the 13th Conference on Natural Language Processing (KONVENS'16), Bochum, Germany.Google Scholar

Parker, R., Graff, D., Kong, J., Chen, K., and Maeda, K. 2011. English Gigaword, 5th ed. Philadelphia: Linguistic Data Consortium.Google Scholar

Pavel, S. and Euzenat, J. 2013. Ontology matching: state of the art and future challenges. IEEE Transaction on Knowledge and Data Engineering 25 (1): 158–76.Google Scholar

Pedersen, T., Patwardhan, S. and Michelizzi, J. 2004. WordNet::similarity – measuring the relatedness of concepts. In Proceedings of the HLT-NAACL 2004: Demonstration Papers (HLT-NAACL'04 Demos), Boston, MA, USA.Google Scholar

Pelevina, M., Arefiev, N., Biemann, C. and Panchenko, A. 2016. Making sense of word embeddings. In Proceedings of the 1st Workshop on Representation Learning for NLP, Berlin, Germany.Google Scholar

Pennington, J., Socher, R. and Manning, C. 2014. GloVe: global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP'14), Doha, Qatar.Google Scholar

Pham, N. T., Lazaridou, A. and Baroni, M. 2015. A multitask objective to inject lexical contrast into distributional semantics. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Volume 2: Short Papers (ACL'15), Beijing, China.Google Scholar

Pocostales, J. 2016. NUIG-UNLP at SemEval-2016 task 13: a simple word embedding-based approach for taxonomy extraction. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar

Ponzetto, S. P. and Navigli, R. 2009. Large-scale taxonomy mapping for restructuring and integrating Wikipedia. In Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI'09), Pasadena, CA, USA.Google Scholar

Ponzetto, S. P. and Navigli, R. 2010. Knowledge-rich word sense disambiguation rivaling supervised systems. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL'10), Uppsala, Sweden.Google Scholar

Ponzetto, S. P. and Strube, M. 2011. Taxonomy induction based on a collaboratively built knowledge repository. Artificial Intelligence 175: 1737–56.Google Scholar

Pradhan, S. S., Loper, E., Dligach, D. and Palmer, M. 2007. SemEval-2007 task 17: english lexical sample, SRL and all words. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval'07), Prague, Czech Republic.Google Scholar

Reisinger, J. and Mooney, R. J. 2010. Multi-prototype vector-space models of word meaning. In Proceedings of the Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (HLT'10), Los Angeles, CA, USA.Google Scholar

Richter, M., Quasthoff, U., Hallsteinsdóttir, E., and Biemann, C. 2006. Exploiting the Leipzig corpora collection. In Proceedings of IS-LTC'06, Ljubljana, Slovenia.Google Scholar

Riedl, M. 2016. Unsupervised Methods for Learning and Using Semantics of Natural Language. Ph.D. thesis. Germany: TU Darmstadt. http://tuprints.ulb.tu-darmstadt.de/5435/.Google Scholar

Riedl, M. and Biemann, C. 2013. Scaling to large³ data: an efficient and effective method to compute distributional thesauri. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, a Meeting of SIGDAT, a Special Interest Group of the ACL (EMNLP'13), Seattle, WA, USA.Google Scholar

Riedl, M. and Biemann, C. 2015. A single word is not enough: ranking multiword expressions using distributional semantics. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP'15), Lisbon, Portugal.Google Scholar

Roller, S., Erk, K. and Boleda, G. 2014. Inclusive yet selective: supervised distributional hypernymy detection. In Proceedings of the 25th International Conference on Computational Linguistics: Technical Papers (COLING'14), Dublin, Ireland.Google Scholar

Rospocher, M., van Erp, M., Vossen, P., Fokkens, A., Aldabe, I., Rigau, G., Soroa, A., Ploeger, T., and Bogaard, T. 2016. Building event-centric knowledge graphs from news. Journal of Web Semantics 37–38: 132–51.CrossRef Google Scholar

Rothe, S. and Schütze, H. 2015. AutoExtend: extending word embeddings to embeddings for synsets and lexemes. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Volume 1: Long Papers (ACL'15), Beijing, China.Google Scholar

Ruppert, E., Kaufmann, M., Riedl, M. and Biemann, C. 2015. JoBimViz: a web-based visualization for graph-based distributional semantic models. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, System Demonstrations (ACL'15), Beijing, China.Google Scholar

Schuhmacher, M., Dietz, L. and Ponzetto, S. P. 2015. Ranking entities for web queries through text and knowledge. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM'15), Melbourne, Australia.Google Scholar

Shwartz, V., Goldberg, Y. and Dagan, I. 2016. Improving hypernymy detection with an integrated path-based and distributional method. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Volume 1: Long Papers (ACL'16), Berlin, Germany.Google Scholar

Snow, R., Jurafsky, D. and Ng, A. 2006. Semantic taxonomy induction from heterogenous evidence. In Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics (ACL'06), Sydney, Australia.Google Scholar

Snow, R., Jurafsky, D. and Ng, A. Y. 2004. Learning syntactic patterns for automatic hypernym discovery. In Proceedings of the Advances in Neural Information Processing Systems 17 (NIPS'04), Vancouver, BC, Canada.Google Scholar

Socher, R., Chen, D., Manning, C. D. and Ng, A. 2013. Reasoning with neural tensor networks for knowledge base completion. In Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems (NIPS'13), Lake Tahoe, NV, USA.Google Scholar

Suchanek, F. M., Kasneci, G. and Weikum, G. 2008. YAGO: a large ontology from Wikipedia and WordNet. Journal of Web Semantics 6 (3): 203–17.Google Scholar

Tan, L., Bond, F. and van Genabith, J. 2016. USAAR at SemEval-2016 task 13: hyponym endocentricity. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar

Tarjan, R. 1972. Depth-first search and linear graph algorithms. SIAM Journal on Computing 1 (2): 146–60.CrossRef Google Scholar

Turney, P. D. and Pantel, P. 2010. From frequency to meaning: vector space models of semantics. Journal of Artificial Intelligence Research 37: 141–88.Google Scholar

Van de Cruys, T. 2010. Mining for meaning: the extraction of lexico-semantic knowledge from text. Groningen Dissertations in Linguistics 82.Google Scholar

Van Dongen, S. 2008. Graph clustering via a discrete uncoupling process. SIAM Journal on Matrix Analysis and Applications 30 (1): 121–41.Google Scholar

Velardi, P., Faralli, S. and Navigli, R. 2013. OntoLearn reloaded: a graph-based algorithm for taxonomy induction. Computational Linguistics 39 (3): 665–707.Google Scholar

Velardi, P., Navigli, R., Faralli, S. and Ruiz-Martínez, J. M. 2012. A new method for evaluating automatically learned terminological taxonomies. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), Istanbul, Turkey.Google Scholar

Véronis, J. 2004. HyperLex: lexical cartography for information retrieval. Computer Speech and Language 18: 223–52.Google Scholar

Wang, Z., Li, J., Wang, Z. and Tang, J. 2012. Cross-lingual knowledge linking across Wiki knowledge bases. In Proceedings of the 21st International Conference on World Wide Web (WWW'12), Lyon, France.CrossRef Google Scholar

Weeds, J., Weir, D. and McCarthy, D. 2004. Characterising measures of lexical distributional similarity. In Proceedings of the 20th International Conference on Computational Linguistics (COLING'04), Geneva, Switzerland.Google Scholar

West, R., Gabrilovich, E., Murphy, K., Sun, S., Gupta, R., and Lin, D. 2014. Knowledge base completion via search-based question answering. In Proceedings of the 23rd International Conference on World Wide Web (WWW'14), Seoul, South Korea.Google Scholar

Wu, W., Li, H., Wang, H. and Zhu, K. Q. 2012. Probase: a probabilistic taxonomy for text understanding. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data (SIGMOD'12), Scottsdale, AZ, USA.CrossRef Google Scholar

Yu, M. and Dredze, M. 2014. Improving lexical embeddings with semantic knowledge. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Volume 2: Short Papers (ACL'14), Baltimore, MD, USA.Google Scholar

Article contents

A framework for enriching lexical semantic resources with distributional semantics

Abstract

Information

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests