This list contains references from the content that can be linked to their source. For a full set of references and notes please see the PDF or HTML where available.
M. Arévalo , M. Civit , and M. A. Martí 2004. MICE: a module for named entity recognition and classification. International Journal of Corpus Linguistics 9 (1): 53–68.
A. Barrón-Cedeño , M. Vila , M. A. Martí , and P. Rosso 2013. Plagiarism meets paraphrasing: insights for the next generation in automatic plagiarism detection. Computational Linguistics – vol. 39, no. 4, doi:10.1162/COLI_a_00153.
S. Brin 1999. Extracting patterns and relations from the World Wide Web. In P. Atzeni , A. Mendelzon , and G. Mecca (eds.), Proceedings of the 1st International Workshop on the World Wide Web and Databases (WebDB 1998), Lecture Notes in Computer Science, Vol. 1590. pp. 172–83. Berlin, Heidelberg: Springer-Verlag.
S. Burrows , M. Potthast , and B. Stein 2013. Paraphrase acquisition via crowdsourcing and machine learning. ACM Transactions on Intelligent Systems and Technology 4 (3), article no. 43.
R. C Carrasco , and J. Oncina 1994. Learning stochastic regular grammars by means of a state merging method. In R. C. Carrasco and J. Oncina (eds.), Grammatical Inference and Applications. Proceedings of the 2nd International Colloquium (ICGI 1994), Lecture Notes in Computer Science, Vol. 862. pp. 139–52. Berlin, Heidelberg: Springer-Verlag.
P. Clough , and M. Stevenson 2011. Developing a corpus of plagiarised short answers. Language Resources and Evaluation 45 (1): 5–24.
T. Cohn , Callison-C. Burch , and M. Lapata 2008. Constructing corpora for the development and evaluation of paraphrase systems. Computational Linguistics 34 (4): 597–614.
T. Cohn , and M. Lapata 2008. Sentence compression beyond word deletion. In Proceedings of the 22nd International Conference on Computational Linguistics (COLING 2008), pp. 137–44. Manchester: International Committee on Computational Linguistics.
B. Dolan , C. Quirk , and C. Brockett 2004. Unsupervised construction of large paraphrase corpora: exploiting massively parallel news sources. In Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), pp. 350–6. Geneva: International Committee on Computational Linguistics.
M. Hall , E. Frank , G. Holmes , B. Pfahringer , P. Reutemann , and I. H Witten 2009. The WEKA data mining software: an update. ACM SIGKDD Explorations Newsletter 11 (1): 10–18.
Z. Harris 1954. Distributional structure. Word 10 (2–3): 146–62.
K. Knight , and D. Marcu , 2002. Summarization beyond sentence extraction: a probabilistic approach to sentence compression. Artificial Intelligence 139: 91–107.
N. Madnani , and B. J. Dorr 2010. Generating phrasal and sentential paraphrases: a survey of data-driven methods. Computational Linguistics 36 (3): 341–87.
O. Medelyan , D. Milne , C. Legg , and I. H Witten 2009. Mining meaning from Wikipedia. International Journal of Human–Computer Studies 67 (9): 716–54.
N. J. Nilsson , 1982. Principles of Artificial Intelligence. Berlin/Heidelberg/New York: Springer-Verlag.