Skip to main content
×
×
Home

A framework for enriching lexical semantic resources with distributional semantics

  • CHRIS BIEMANN (a1), STEFANO FARALLI (a2), ALEXANDER PANCHENKO (a1) and SIMONE PAOLO PONZETTO (a2)
Abstract

We present an approach to combining distributional semantic representations induced from text corpora with manually constructed lexical semantic networks. While both kinds of semantic resources are available with high lexical coverage, our aligned resource combines the domain specificity and availability of contextual information from distributional models with the conciseness and high quality of manually crafted lexical networks. We start with a distributional representation of induced senses of vocabulary terms, which are accompanied with rich context information given by related lexical items. We then automatically disambiguate such representations to obtain a full-fledged proto-conceptualization, i.e. a typed graph of induced word senses. In a final step, this proto-conceptualization is aligned to a lexical ontology, resulting in a hybrid aligned resource. Moreover, unmapped induced senses are associated with a semantic type in order to connect them to the core resource. Manual evaluations against ground-truth judgments for different stages of our method as well as an extrinsic evaluation on a knowledge-based Word Sense Disambiguation benchmark all indicate the high quality of the new hybrid resource. Additionally, we show the benefits of enriching top-down lexical knowledge resources with bottom-up distributional information from text for addressing high-end knowledge acquisition tasks such as cleaning hypernym graphs and learning taxonomies from scratch.

Copyright
References
Hide All
Agirre E. and Soroa A. 2007. Semeval-2007 Task 02: evaluating word sense induction and discrimination systems. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval'07), Prague, Czech Republic.
Aprosio A. P., Giuliano C. and Lavelli A. 2013. Extending the coverage of DBpedia properties using distant supervision over Wikipedia. In Proceedings of the 2013 International Conference on NLP and DBpedia – Volume 1064 (NLP-DBPEDIA'13), Sydney, Australia.
Banko M., Cafarella M. J., Soderland S., Broadhead M. and Etzioni O. 2007. Open information extraction from the web. In Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI'07), Hyderabad, India.
Baroni M., Bernardi R., Do N.-Q. and Shan C.-C. 2012. Entailment above the word level in distributional semantics. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL'12), Avignon, France.
Baroni M., Dinu G. and Kruszewski G. 2014. Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14), Baltimore, MD, USA.
Bartunov S., Kondrashkin D., Osokin A. and Vetrov D. P. 2016. Breaking sticks and ambiguities with adaptive skip-gram. In Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS'16), Cadiz, Spain.
Biemann C. 2005. Ontology learning from text: a survey of methods. LDV Forum 20 (2): 7593.
Biemann C. 2006. Chinese whispers: an efficient graph clustering algorithm and its application to natural language processing problems. In Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing, TextGraphs-1, Brooklyn, NY, USA.
Biemann C. and Riedl M. 2013. Text: now in 2D! A framework for lexical expansion with contextual similarity. Journal of Language Modelling 1 (1): 5595.
Bizer C., Lehmann J., Kobilarov G., Auer S., Becker C., Cyganiak R., and Hellmann S. 2009. DBpedia – a crystallization point for the web of data. Journal of Web Semantics 7 (3): 154–65.
Bordea G., Buitelaar P., Faralli S. and Navigli R. 2015. SemEval-2015 task 17: taxonomy extraction evaluation (TExEval). In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'15), Denver, CO, USA.
Bordea G., Lefever E. and Buitelaar P. 2016. SemEval-2016 task 13: taxonomy extraction evaluation (TExEval-2). In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.
Bordes A., Weston J., Collobert R. and Bengio Y. 2011. Learning structured embeddings of knowledge bases. In Proceedings of the 25th AAAI Conference on Artificial Intelligence (AAAI'11), San Francisco, CA, USA.
Brugmann K. 1904. Kurze vergleichende Grammatik der indogermanischen Sprachen. Strassburg, France: Karl J. Trubner.
Bryl V. and Bizer C. 2014. Learning conflict resolution strategies for cross-language Wikipedia data fusion. In Proceedings of the 23rd International Conference on World Wide Web (WWW'14 Companion), Seoul, South Korea.
Camacho-Collados J., Pilehvar M. T., and Navigli R. 2015a. A unified multilingual semantic representation of concepts. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Volume 1: Long Papers (ACL'15), Beijing, China.
Camacho-Collados J., Pilehvar M. T., and Navigli R. 2015b. NASARI: a novel approach to a semantically aware representation of items. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.
Carlson A., Betteridge J., Kisiel R., Settles B., Hruschka E. R. Jr., and Mitchell T. M. 2010. Toward an architecture for never-ending language learning. In Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI'10), Atlanta, GA, USA.
Cecchini F. M., Riedl M. and Biemann C. 2017. Using pseudowords for algorithm comparison: an evaluation framework for graph-based word sense induction. In Proceedings of the 21st Nordic Conference on Computational Linguistics (NODALIDA'17), Gothenburg, Sweden.
Chen X., Liu Z. and Sun M. 2014. A unified model for word sense representation and disambiguation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, A meeting of SIGDAT, a Special Interest Group of the ACL (EMNLP'14), Doha, Qatar.
Chiarcos C., Hellmann S. and Nordhoff S. 2012. Linking linguistic resources: examples from the open linguistics working group. In Chiarcos C., Nordhoff S., and Hellmann S. (eds.), Linked Data in Linguistics – Representing and Connecting Language Data and Language Metadata, pp. 201–16. Heidelberg, Germany: Springer.
Clark S. 2015. Vector space models of lexical meaning. In Lappin S., and Fox C. (eds.), Handbook of Contemporary Semantics, 2nd edition, pp. 493522. New York, NY, USA: Wiley-Blackwell.
Cleuziou G. and Moreno J. G. 2016. QASSIT at SemEval-2016 Task 13: on the integration of semantic vectors in pretopological spaces for lexical taxonomy acquisition. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.
Cuadros M. and Rigau G. 2007. SemEval-2007 task 16: evaluation of wide coverage knowledge resources. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval'07), Prague, Czech Republic.
Cuadros M. and Rigau G. 2008. KnowNet: building a large net of knowledge from the web. In Proceedings of the 22nd International Conference on Computational Linguistics – Volume 1 (COLING'08), Manchester, UK.
Curran J. R. 2002. Ensemble methods for automatic thesaurus extraction. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing – Volume 10 (EMNLP'02), Philadelphia, PA, USA.
De Marneffe M.-C., MacCartney B., and Manning C. D. 2006. Generating typed dependency parses from phrase structure parses. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'06), Genoa, Italy.
Di Marco A., and Navigli R. 2013. Clustering and diversifying web search results with graph-based word sense induction. Computational Linguistics 39 (3): 709–54.
Evert S. 2005. The Statistics of Word Cooccurrences: Word Pairs and Collocations. Ph.D. thesis. Institut für maschinelle Sprachverarbeitung, University of Stuttgart, Germany.
Fader A., Soderland S. and Etzioni O. 2011. Identifying relations for open information extraction. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'11), Edinburgh, UK.
Faralli S. and Navigli R. 2012. A new minimally supervised framework for domain word sense disambiguation. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL'12), Jeju Island, South Korea.
Faralli S. and Navigli R. 2013. Growing multi-domain glossaries from a few seeds using probabilistic topic models. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, A meeting of SIGDAT, a Special Interest Group of the ACL (EMNLP'13), Seattle, WA, USA.
Faralli S., Panchenko A., Biemann C. and Paolo Ponzetto S. 2017. The contrast medium algorithm: taxonomy induction from noisy knowledge graphs with just a few links. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Volume 1: Long Papers (EACL'17), Valencia, Spain.
Faralli S., Stilo G. and Velardi P. 2015. Large scale homophily analysis in twitter using a twixonomy. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI'15), Buenos Aires, Argentina.
Faruqui M., Dodge J., Jauhar S. K., Dyer C., Hovy E. H., and Smith N. A. 2015. Retrofitting word vectors to semantic lexicons. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.
Faruqui M. and Kumar S. 2015. Multilingual open relation extraction using cross-lingual projection. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.
Fellbaum C. (ed.) 1998. WordNet: An Electronic Database. Cambridge, MA: MIT Press.
Fleiss J. L. 1971. Measuring nominal scale agreement among many raters. Psychological Bulletin 76 (5): 378.
Gangemi A., Guarino N., Masolo C., Oltramari A., and Schneider L. 2002. Sweetening ontologies with DOLCE. In Knowledge Engineering and Knowledge Management: Ontologies and the Semantic Web: 13th International Conference (EKAW 2002), Sigüenza, Spain, Berlin, Heidelberg.
Ganitkevitch J., Van Durme B., and Callison-Burch C. 2013. PPDB: the paraphrase database. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'13), Atlanta, GA, USA.
Gil-Lafuente A. M., and Aluja J. G. 2012. Towards an Advanced Modelling of Complex Economic Phenomena: Pretopological and Topological Uncertainty Research Tools. Berlin, Heidelberg: Springer.
Glavaš G. and Ponzetto S.-P. 2017. Dual tensor model for detecting asymmetric lexico-semantic relations. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP'17), Copenhagen, Denmark.
Goikoetxea J., Soroa A. and Agirre E. 2015. Random walks and neural network language models on knowledge bases. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.
Gurevych I., Eckle-Kohler J., Hartmann S., Matuschek M., Meyer C. M., and Wirth C. 2012. UBY – a large-scale unified lexical-semantic resource based on LMF. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL'12), Avignon, France.
Hearst M. A. 1992. Automatic acquisition of hyponyms from large text corpora. In Proceedings of the 14th Conference on Computational Linguistics – Volume 2 (COLING'92), Nantes, France.
Hoffart J., Suchanek F. M., Berberich K. and Weikum G. 2013. YAGO2: a spatially and temporally enhanced knowledge base from Wikipedia. Artificial Intelligence 194: 2861.
Hovy E., Navigli R. and Ponzetto S.-P. 2013. Collaboratively built semi-structured content and artificial intelligence: the story so far. Artificial Intelligence 194: 227.
Jauhar S. K., Dyer C. and Hovy E. 2015. Ontologically grounded multi-sense representation learning for semantic vector space models. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.
Jenatton R., Roux N. L., Bordes A. and Obozinski G. 2012. A latent factor model for highly multi-relational data. In Proceedings of Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems (NIPS'12), Lake Tahoe, NV, USA.
Jurgens D. and Pilehvar M. T. 2016. SemEval-2016 task 14: semantic taxonomy enrichment. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.
Kapanipathi P., Jain P., Venkataramani C. and Sheth A. 2014. User interests identification on Twitter using a hierarchical knowledge base. In Proceedings of the Semantic Web: Trends and Challenges: 11th International Conference (ESWC'14), Cham.
Klein D. and Manning C. D. 2003. Accurate unlexicalized parsing. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics – Volume 1 (ACL'03), Sapporo, Japan.
Klein P., Ponzetto S. P. and Glavaš G. 2017. Improving neural knowledge base completion with cross-lingual projections. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers (EACL'17), Valencia, Spain.
Kozareva Z. and Hovy E. 2010. A semi-supervised method to learn and construct taxonomies using the web. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP'10), Cambridge, MA, USA.
Krause S., Hennig L., Gabryszak A., Xu F., and Uszkoreit H. 2015. Sar-graphs: a linked linguistic knowledge resource connecting facts with language. In Proceedings of the 4th Workshop on Linked Data in Linguistics: Resources and Applications, Beijing, China.
Lesk M. 1986. Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In Proceedings of the 5th Annual International Conference on Systems Documentation (SIGDOC'86), Toronto, Ontario, Canada.
Levy O. and Goldberg Y. 2014. Dependency-based word embeddings. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14), Baltimore, MD, USA.
Li J. and Jurafsky D. 2015. Do multi-sense embeddings improve natural language understanding? In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP'15), Lisbon, Portugal.
Lin D. 1998. Automatic retrieval and clustering of similar words. In Proceedings of the 17th International Conference on Computational Linguistics – Volume 2 (COLING'98), Montreal, Quebec, Canada.
Lin T., Pantel P., Gamon M., Kannan A., and Fuxman A. 2012. Active objects: actions for entity-centric search. In Proceedings of the 21st International Conference on World Wide Web (WWW'12), Lyon, France.
Maitra P. and Das D. 2016. JUNLP at SemEval-2016 Task 13: a language independent approach for hypernym identification. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.
McCrae J. P., Fellbaum C., and Cimiano P. 2014. Publishing and linking WordNet using lemon and RDF. In Proceedings of the 3rd Workshop on Linked Data in Linguistics, Reykjavik, Iceland.
Mihalcea R., Chklovski T. and Kilgarriff A. 2004. The Senseval-3 English lexical sample task. In Proceedings of the 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (Senseval-3), Barcelona, Spain.
Mihalcea R. and Csomai A. 2007. Wikify!: linking documents to encyclopedic knowledge. In Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM'07), Lisbon, Portugal.
Mihalcea R. and Moldovan D. I. 2001. eXtended WordNet: progress report. In Proceedings of NAACL Workshop on WordNet and Other Lexical Resources, Pittsburgh, PA, USA.
Mikolov T., Sutskever I., Chen K., Corrado G. S., and Dean J. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems (NIPS'13), Lake Tahoe, NV, USA.
Mikolov T., Yih W.-T. and Zweig G. 2013. Linguistic regularities in continuous space word representations. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Atlanta, GA, USA.
Miller G. A., Leacock C., Tengi R. and Bunker R. T. 1993. A semantic concordance. In Proceedings of the Workshop on Human Language Technology, Princeton, NJ, USA.
Mintz M., Bills S., Snow R. and Jurafsky D. 2009. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Vol. 2 (ACL'09), Suntec, Singapore.
Nastase V. and Strube M. 2013. Transforming Wikipedia into a large scale multilingual concept network. Artificial Intelligence 194: 6285.
Navigli R. 2009. Word sense disambiguation: a survey. ACM CSUR 41 (2): 169.
Navigli R. and Ponzetto S. P. 2012a. BabelNet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artificial Intelligence 193: 217–50.
Navigli R. and Ponzetto S. P. 2012b. Joining forces pays off: multilingual joint Word Sense Disambiguation. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL'12), Jeju Island, South Korea.
Navigli R. and Velardi P. 2010. Learning Word-class lattices for definition and hypernym extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL'10), Uppsala, Sweden.
Neelakantan A., Shankar J., Passos A. and McCallum A. 2014. Efficient non-parametric estimation of multiple embeddings per word in vector space. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP'14), Doha, Qatar.
Nickel M., Murphy K., Tresp V. and Gabrilovich E. 2016. A review of relational machine learning for knowledge graphs. Proceedings of the IEEE 104 (1): 1133.
Niemann E. and Gurevych I. 2011. The people's web meets linguistic knowledge: automatic sense alignment of Wikipedia and wordnet. In Proceedings of the Ninth International Conference on Computational Semantics (IWCS'11), Oxford, UK.
Nieto Piña L., and Johansson R. 2016. Embedding senses for efficient graph-based word sense disambiguation. In Proceedings of the 2016 Workshop on Graph-based Methods for Natural Language Processing (Textgraphs'16), San Diego, CA, USA.
Norvig P. 2016. The semantic web and the semantics of the web: where does meaning come from? In Proceedings of the 25th International Conference on World Wide Web (WWW'16), Montreal, Quebec, Canada.
Padó S. and Lapata M. 2007. Dependency-based construction of semantic space models. Computational Linguistics 33 (2): 161–99.
Panchenko A. 2016. Best of both worlds: making word sense embeddings interpretable. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC'16), Portoro , Slovenia.
Panchenko A., Faralli S., Ponzetto S. P. and Biemann C. 2017a. Using linked disambiguated distributional networks for word sense disambiguation. In Proceedings of the 1st EACL Workshop on Sense, Concept and Entity Representations and their Applications, Valencia, Spain.
Panchenko A., Faralli S., Ruppert E., Remus S., Naets H., Fairon C., Ponzetto S. P., and Biemann C. 2016. TAXI at SemEval-2016 task 13: a taxonomy induction method based on lexico-syntactic patterns, substrings and focused crawling. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.
Panchenko A., Marten F., Ruppert E., Faralli S., Ustalov D., Ponzetto S. P., and Biemann C. 2017b. Unsupervised, knowledge-free, and interpretable word sense disambiguation. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP'17), Copenhagen, Denmark.
Panchenko A. and Morozova O. 2012. A study of hybrid similarity measures for semantic relation extraction. In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, Avignon, France.
Panchenko A., Romanov P., Morozova O., Naets H., Philippovich A., Romanov A., and Fairon C. 2013. Serelex: search and visualization of semantically related words. In Proceedings of the European Conference on Information Retrieval (ECIR'13), Moscow, Russia.
Panchenko A., Ruppert E., Faralli S., Ponzetto S. P., and Biemann C. 2017c. Unsupervised does not mean uninterpretable: the case for word sense induction and disambiguation. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL'17), Valencia, Spain.
Panchenko A., Simon J., Riedl M. and Biemann C. 2016. Noun sense induction and disambiguation using graph-based distributional semantics. In Proceedings of the 13th Conference on Natural Language Processing (KONVENS'16), Bochum, Germany.
Parker R., Graff D., Kong J., Chen K., and Maeda K. 2011. English Gigaword, 5th ed. Philadelphia: Linguistic Data Consortium.
Pavel S. and Euzenat J. 2013. Ontology matching: state of the art and future challenges. IEEE Transaction on Knowledge and Data Engineering 25 (1): 158–76.
Pedersen T., Patwardhan S. and Michelizzi J. 2004. WordNet::similarity – measuring the relatedness of concepts. In Proceedings of the HLT-NAACL 2004: Demonstration Papers (HLT-NAACL'04 Demos), Boston, MA, USA.
Pelevina M., Arefiev N., Biemann C. and Panchenko A. 2016. Making sense of word embeddings. In Proceedings of the 1st Workshop on Representation Learning for NLP, Berlin, Germany.
Pennington J., Socher R. and Manning C. 2014. GloVe: global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP'14), Doha, Qatar.
Pham N. T., Lazaridou A. and Baroni M. 2015. A multitask objective to inject lexical contrast into distributional semantics. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Volume 2: Short Papers (ACL'15), Beijing, China.
Pocostales J. 2016. NUIG-UNLP at SemEval-2016 task 13: a simple word embedding-based approach for taxonomy extraction. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.
Ponzetto S. P. and Navigli R. 2009. Large-scale taxonomy mapping for restructuring and integrating Wikipedia. In Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI'09), Pasadena, CA, USA.
Ponzetto S. P. and Navigli R. 2010. Knowledge-rich word sense disambiguation rivaling supervised systems. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL'10), Uppsala, Sweden.
Ponzetto S. P. and Strube M. 2011. Taxonomy induction based on a collaboratively built knowledge repository. Artificial Intelligence 175: 1737–56.
Pradhan S. S., Loper E., Dligach D. and Palmer M. 2007. SemEval-2007 task 17: english lexical sample, SRL and all words. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval'07), Prague, Czech Republic.
Reisinger J. and Mooney R. J. 2010. Multi-prototype vector-space models of word meaning. In Proceedings of the Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (HLT'10), Los Angeles, CA, USA.
Richter M., Quasthoff U., Hallsteinsdóttir E., and Biemann C. 2006. Exploiting the Leipzig corpora collection. In Proceedings of IS-LTC'06, Ljubljana, Slovenia.
Riedl M. 2016. Unsupervised Methods for Learning and Using Semantics of Natural Language. Ph.D. thesis. Germany: TU Darmstadt. http://tuprints.ulb.tu-darmstadt.de/5435/.
Riedl M. and Biemann C. 2013. Scaling to large3 data: an efficient and effective method to compute distributional thesauri. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, a Meeting of SIGDAT, a Special Interest Group of the ACL (EMNLP'13), Seattle, WA, USA.
Riedl M. and Biemann C. 2015. A single word is not enough: ranking multiword expressions using distributional semantics. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP'15), Lisbon, Portugal.
Roller S., Erk K. and Boleda G. 2014. Inclusive yet selective: supervised distributional hypernymy detection. In Proceedings of the 25th International Conference on Computational Linguistics: Technical Papers (COLING'14), Dublin, Ireland.
Rospocher M., van Erp M., Vossen P., Fokkens A., Aldabe I., Rigau G., Soroa A., Ploeger T., and Bogaard T. 2016. Building event-centric knowledge graphs from news. Journal of Web Semantics 37–38: 132–51.
Rothe S. and Schütze H. 2015. AutoExtend: extending word embeddings to embeddings for synsets and lexemes. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Volume 1: Long Papers (ACL'15), Beijing, China.
Ruppert E., Kaufmann M., Riedl M. and Biemann C. 2015. JoBimViz: a web-based visualization for graph-based distributional semantic models. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, System Demonstrations (ACL'15), Beijing, China.
Schuhmacher M., Dietz L. and Ponzetto S. P. 2015. Ranking entities for web queries through text and knowledge. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM'15), Melbourne, Australia.
Shwartz V., Goldberg Y. and Dagan I. 2016. Improving hypernymy detection with an integrated path-based and distributional method. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Volume 1: Long Papers (ACL'16), Berlin, Germany.
Snow R., Jurafsky D. and Ng A. 2006. Semantic taxonomy induction from heterogenous evidence. In Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics (ACL'06), Sydney, Australia.
Snow R., Jurafsky D. and Ng A. Y. 2004. Learning syntactic patterns for automatic hypernym discovery. In Proceedings of the Advances in Neural Information Processing Systems 17 (NIPS'04), Vancouver, BC, Canada.
Socher R., Chen D., Manning C. D. and Ng A. 2013. Reasoning with neural tensor networks for knowledge base completion. In Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems (NIPS'13), Lake Tahoe, NV, USA.
Suchanek F. M., Kasneci G. and Weikum G. 2008. YAGO: a large ontology from Wikipedia and WordNet. Journal of Web Semantics 6 (3): 203–17.
Tan L., Bond F. and van Genabith J. 2016. USAAR at SemEval-2016 task 13: hyponym endocentricity. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.
Tarjan R. 1972. Depth-first search and linear graph algorithms. SIAM Journal on Computing 1 (2): 146–60.
Turney P. D. and Pantel P. 2010. From frequency to meaning: vector space models of semantics. Journal of Artificial Intelligence Research 37: 141–88.
Van de Cruys T. 2010. Mining for meaning: the extraction of lexico-semantic knowledge from text. Groningen Dissertations in Linguistics 82.
Van Dongen S. 2008. Graph clustering via a discrete uncoupling process. SIAM Journal on Matrix Analysis and Applications 30 (1): 121–41.
Velardi P., Faralli S. and Navigli R. 2013. OntoLearn reloaded: a graph-based algorithm for taxonomy induction. Computational Linguistics 39 (3): 665707.
Velardi P., Navigli R., Faralli S. and Ruiz-Martínez J. M. 2012. A new method for evaluating automatically learned terminological taxonomies. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), Istanbul, Turkey.
Véronis J. 2004. HyperLex: lexical cartography for information retrieval. Computer Speech and Language 18: 223–52.
Wang Z., Li J., Wang Z. and Tang J. 2012. Cross-lingual knowledge linking across Wiki knowledge bases. In Proceedings of the 21st International Conference on World Wide Web (WWW'12), Lyon, France.
Weeds J., Weir D. and McCarthy D. 2004. Characterising measures of lexical distributional similarity. In Proceedings of the 20th International Conference on Computational Linguistics (COLING'04), Geneva, Switzerland.
West R., Gabrilovich E., Murphy K., Sun S., Gupta R., and Lin D. 2014. Knowledge base completion via search-based question answering. In Proceedings of the 23rd International Conference on World Wide Web (WWW'14), Seoul, South Korea.
Wu W., Li H., Wang H. and Zhu K. Q. 2012. Probase: a probabilistic taxonomy for text understanding. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data (SIGMOD'12), Scottsdale, AZ, USA.
Yu M. and Dredze M. 2014. Improving lexical embeddings with semantic knowledge. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Volume 2: Short Papers (ACL'14), Baltimore, MD, USA.
Recommend this journal

Email your librarian or administrator to recommend adding this journal to your organisation's collection.

Natural Language Engineering
  • ISSN: 1351-3249
  • EISSN: 1469-8110
  • URL: /core/journals/natural-language-engineering
Please enter your name
Please enter a valid email address
Who would you like to send this to? *
×

Metrics

Full text views

Total number of HTML views: 28
Total number of PDF views: 31 *
Loading metrics...

Abstract views

Total abstract views: 239 *
Loading metrics...

* Views captured on Cambridge Core between 15th January 2018 - 22nd February 2018. This data will be updated every 24 hours.