Caroline Sporleder's Publications

Journal Papers

  • Linlin Li, Ivan Titov and Caroline Sporleder. Improved Estimation of Entropy for Evaluation of Word Sense Induction. To appear in Computational Linguistics, 40:3, 2014.

  • Josef Ruppenhofer, Russell Lee-Goldman, Caroline Sporleder, and Roser Morante. Beyond sentence-level semantic role labeling: linking argument structures in discourse. Language Resources and Evaluation, November 2012.
    [online version on Springer Link] (limited access)

  • Roser Morante and Caroline Sporleder. Modality and Negation: An Introduction to the Special Issue, in Computational Linguistics, 38:2, 2012.
    [pdf] (preprint)

  • Ines Rehbein, Josef Ruppenhofer, and Caroline Sporleder. Is it worth the effort? Assessing the benefits of partial automatic pre-labeling for frame-semantic annotation, in Language Resources and Evaluation 46:1, pp. 1-23, March 2012 (published online 19 Nov. 2011).
    [LRE Online First]

  • Caroline Sporleder. Natural Language Processing for Cultural Heritage Domains, Language and Linguistics Compass, Vol 4, Issue 9, September 2010, pp. 750-768, Wiley-Blackwell.

  • Antal van den Bosch, Marieke van Erp, and Caroline Sporleder. Making a Clean Sweep of Cultural Heritage, IEEE Intelligent Systems, Special Issue on AI and Cultural Heritage, March/April 2009 (vol. 25 no. 2), pp. 54-63.

  • Caroline Sporleder and Alex Lascarides. Using Automatically Labelled Examples to Classify Rhetorical Relations: An Assessment, Natural Language Engineering, Volume 14, Issue 03, July 2008, pp. 369-416. (Note: this article was first published online by NLE on 19 December 2006.)
    [pdf]   (preprint)

  • Caroline Sporleder. Lexical Models to Identify Unmarked Discourse Relations: Does WordNet help?, Journal for Language Technology and Computational Linguistics, 24:2, December 2008, pp. 20-32.
    [pdf]

  • Caroline Sporleder. Manually vs. Automatically Labelled Data in Discourse Relation Classification. Effects of Example and Feature Selection, LDV Forum, 22:1, 1-20, May 2007.
    [pdf]   (preprint)

  • Caroline Sporleder and Mirella Lapata. Broad Coverage Paragraph Segmentation across Languages and Domains, ACM Transactions on Speech and Language Processing, 3:2, 1-35, July 2006.
    [pdf]   (preprint)     [data sets used in the experiments]

Conference & Workshop Papers (peer-reviewed except where indicated)

[2014]

    • Michael Fell and Caroline Sporleder.   Lyrics-Based Analysis and Classification of Music. In Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014), Dublin, Ireland, August 23-29, 2014.

    • Mariona Coll Ardanuy and Caroline Sporleder.   Structure-based Clustering of Novels. In Third Workshop on Computational Linguistics for Literature (CLfL@EACL 2014), Gothenburg, Sweden, April 27, 2014.

    • Caroline Sporleder, Susanne Fertmann, Tim Krones, Robert Kolatzek and Isolde Teufel.   Converting Medieval Documents into a Searchable Database. In Digital Humanities 2014, Lausanne, Switzerland, July 8-12, 2014. (reviewed abstract)

    [2013]

    • Angeliki Lazaridou, Ivan Titov and Caroline Sporleder.   A Bayesian Model for Joint Unsupervised Induction of Sentiment, Aspect and Discourse Representations. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), Sofia, Bulgaria, August 4-9, 2013, 1630-1639.
      [pdf]

    • Philip Gorinski, Josef Ruppenhofer and Caroline Sporleder.   Towards Weakly Supervised Resolution of Null Instantiations. In Proceedings of the 10th International Conference on Computational Semantics (IWCS 2013), Potsdam, March 19-22, 2013, 119-130.
      [pdf]

    [2012]

    • Ines Rehbein, Josef Ruppenhofer, Caroline Sporleder, Manfred Pinkal.   Adding nominal spice to SALSA - frame-semantic annotation of German nouns and verbs. In Proceedings of KONVENS 2012, Vienna, Austria, Sept. 19-21, 2012.

    • Peñas, A, Hovy, E, Forner, P, Rodrigo, A, Sutcliffe, R, Forascu, C, Sporleder, C.   Evaluating Machine Reading Systems through Comprehension Tests. In Proceedings of LREC 2012, Istanbul, Turkey, May 21-27, 2012.

    [2011]

    • Chenhua Chen, Alexis Palmer and Caroline Sporleder.   Enhancing Active Learning for Semantic Role Labeling via Compressed Dependency Trees. Proceedings of IJCNLP 2011, Chiang Mai, Thailand, November 8-13, 2011.
      [pdf]

    • Josef Ruppenhofer, Philip Gorinski and Caroline Sporleder.   In search of missing arguments: A linguistic approach. Proceedings of RANLP 2011, Hissar, Bulgaria, September 12-14, 2011.
      [pdf]

    • Alexis Palmer, Afra Alishahi and Caroline Sporleder.   Robust Semantic Analysis for Unseen Data in FrameNet. Proceedings of RANLP 2011, Hissar, Bulgaria, September 12-14, 2011.
      [pdf]

    • Peñas, A, Hovy, E, Forner, P, Rodrigo, Á, Sutcliffe, R, Forascu, C, and Sporleder, C.   Overview of QA4MRE at CLEF 2011: Question Answering for Machine Reading Evaluation. In CLEF 2011 Labs and Workshop Notebook Papers, Amsterdam, 19-22 September, 2011. Online Proceedings. ISBN 978-88-904810-1-7, ISSN 2038-4726. (not peer reviewed)
      [pdf]


    [2010]

    • Alexis Palmer and Caroline Sporleder.   "Evaluating FrameNet-style semantic parsing: the role of coverage gaps in FrameNet", Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), August 23-27, 2010, Beijing, China.
      [pdf]

    • Linlin Li and Caroline Sporleder.   "Linguistic Cues for Distinguishing Literal and Non-Literal Usage", Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), August 23-27, 2010, Beijing, China.
      [pdf]

    • Linlin Li, Benjamin Roth and Caroline Sporleder.   "Topic Models for Word Sense Disambiguation and Token-Based Idiom Detection", Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), July 11-16, 2010, Uppsala, Sweden.
      [pdf]

    • Linlin Li and Caroline Sporleder.   "Using Gaussian Mixture Models to Detect Figurative Language in Context", Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2010) Short Papers, June 1-6, 2010, Los Angeles.
      [pdf]

    • Michael Backes, Markus Dürmuth, Sebastian Gerling, Manfred Pinkal, and Caroline Sporleder.   "Acoustic side-channel attacks on printers", Proceedings of the 19th USENIX Security Symposium, August 11-13, 2010, Washington, DC, USA.

    • Caroline Sporleder, Linlin Li and Alexis Palmer.   "Cohesive Links with Literal and Idiomatic Expressions in Discourse: An Empirical and Computational Study", Multidisciplinary Approaches to Discourse 2010 (MAD 2010), March 17-20, 2010, Moissac, France.
      [pdf]

    • Josef Ruppenhofer, Caroline Sporleder, Roser Morante, Collin Baker, and Martha Palmer.   "SemEval-2010 Task 10: Linking Events and Their Participants in Discourse", The ACL Workshop SemEval-2010: 5th International Workshop on Semantic Evaluations, July 15-16, 2010, Uppsala, Sweden.
      [pdf]

    • Caroline Sporleder, Linlin Li, Philip Gorinski, and Xaver Koch.   "Idioms in Context: The IDIX Corpus", The seventh international conference on Language Resources and Evaluation (LREC), May 19-21, 2010, Valletta, Malta.
      [pdf]

    • Rui Wang and Caroline Sporleder.   "Constructing a Textual Semantic Relation Corpus Using a Discourse Treebank", The seventh international conference on Language Resources and Evaluation (LREC), May 19-21, 2010, Valletta, Malta.
      [pdf]

    • Josef Ruppenhofer, Caroline Sporleder, and Fabian Shirokov.   "Speaker Attribution in Cabinet Protocols", The seventh international conference on Language Resources and Evaluation (LREC), May 19-21, 2010, Valletta, Malta.
      [pdf]


    [2009]

    • Linlin Li and Caroline Sporleder.   "Classifier Combination for Contextual Idiom Detection Without Labelled Data". Proceedings of EMNLP 2009, Singapore, August 6-7, 2009.
      [pdf]

    • Ines Rehbein, Josef Ruppenhofer and Caroline Sporleder.   "Assessing the benefits of partial automatic pre-labeling for frame-semantic annotation". Proceedings of the ACL 2009 Linguistic Annotation Workshop (LAW III), Singapore, August 6-7, 2009.
      [pdf]

    • Linlin Li and Caroline Sporleder.   "A Cohesion Graph Based Approach for Unsupervised Recognition of Literal and Nonliteral Use of Multiword Expressions". Proceedings of the ACL 2009 Workshop on TextGraphs-4: Graph-based Methods for Natural Language Processing, Singapore, August 7, 2009.
      [pdf]

    • Caroline Sporleder and Linlin Li.   "Unsupervised Recognition of Literal and Non-Literal Use of Idiomatic Expressions". Proceedings of EACL 2009, Athens, Greece, March 30-April 3, 2009.
      [pdf]

    • Josef Ruppenhofer, Caroline Sporleder, Roser Morante, Collin Baker and Martha Palmer.   "SemEval-2010 Task 10: Linking Events and Their Participants in Discourse". The NAACL-HLT 2009 Workshop on Semantic Evaluations: Recent Achievements and Future Directions (SEW-09), Boulder, Colorado, USA, June 4, 2009.
      [pdf]

    • Caroline Sporleder.   "Semantic Argument Structure In DiscoursE: The SEASIDE Project", Proceedings of the Eighth Conference on Computational Semantics (IWCS-8), Tilburg, The Netherlands, January 7-9, 2009.

    [2008]

    • Sebastian Pado, Marco Pennacchiotti and Caroline Sporleder.   "Semantic role assignment for event nominalisations by leveraging verbal data", Proceedings of Coling 2008, Manchester, UK, August 18-22, 2008.
      [pdf]     [data sets used in the experiments]

    [2007]

    • Sander Canisius and Caroline Sporleder.   "Bootstrapping Information Extraction from Field Books", Proceedings of EMNLP-CoNLL 2007, Prague, Czech Republic, June 28-30, 2007.
      [pdf]

    • Iris Hendrickx, Roser Morante, Caroline Sporleder, and Antal van den Bosch.   "Machine learning of semantic relations with shallow features and almost no data", Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval 2007), Prague, Czech Republic, June 23-24, 2007.
      [pdf]

    • Sander Canisius and Caroline Sporleder.   "Learning to Segment and Label Semi-Structured Documents with Little or No Supervision", Proceedings of Benelearn 2007, Amsterdam, The Netherlands, May 14-15, 2007.
      [pdf]

    • Antal van den Bosch, Caroline Sporleder, Marieke van Erp, and Steve Hunt.  "Automatic Techniques for Generating and Correcting Cultural Heritage Collection Metadata", Digital Humanities 2007, Urbana-Champaign, USA, June 4-8, 2007.
      [html]   (reviewed abstract)

    [2006]

    • Caroline Sporleder, Marieke van Erp, Tijn Porcelijn and Antal van den Bosch.   "Correcting 'Wrong-Column' Errors in Text Databases.", Proceedings of the Annual Machine Learning Conference of Belgium and The Netherlands (Benelearn-06), Ghent, Belgium, 2006.
      [pdf]


    • Caroline Sporleder, Marieke van Erp, Tijn Porcelijn and Antal van den Bosch.   "Spotting the 'Odd-one-out': Data-Driven Error Detection and Correction in Textual Databases.", Proceedings of the EACL 2006 Workshop on Adaptive Text Extraction and Mining (ATEM-06), Trento, Italy, 2006.
      [pdf]

    • Caroline Sporleder, Marieke van Erp, Tijn Porcelijn, Antal van den Bosch and Pim Arntzen.   "Identifying Named Entities in Text Databases from the Natural History Domain", Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-06), Genoa, Italy, 2006.
      [pdf]

    [2005]

    • Caroline Sporleder and Mirella Lapata.   "Discourse Chunking and its Application to Sentence Compression", Proceedings of the 2005 Human Language Technology Conference and the Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP-05), Vancouver, Canada, 2005.
      [pdf]

    • Caroline Sporleder and Alex Lascarides.   "Exploiting Linguistic Cues to Classify Rhetorical Relations", Proceedings of Recent Advances in Natural Language Processing (RANLP-05), pp. 532-539, Borovets, Bulgaria, 2005. (RANLP-2005 Young Researcher Award)
      [pdf]

    [2004]

    • Caroline Sporleder and Mirella Lapata.   "Automatic Paragraph Identification: A Study across Languages and Domains", Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP-04), pp. 72-79, Barcelona, Spain, 2004.
      [pdf]     [data sets used in the experiments]

    • Caroline Sporleder and Alex Lascarides.   "Combining Hierarchical Clustering and Machine Learning to Predict High-Level Discourse Structure", Proceedings of the 20th International Conference on Computational Linguistics (COLING-04), pp. 43-49, Geneva, Switzerland, 2004.
      [ps]

    • Caroline Sporleder.   "Combining Machine Learning and Set-Theory to Infer Inheritance Hierarchies", Beiträge zur 7. Konferenz zur Verarbeitung Natürlicher Sprache (KONVENS-04), pp. 193-200, Vienna, Austria, 2004.
      [ps]

    [2002]

    • Caroline Sporleder.   "Learning Lexical Inheritance Hierarchies with Maximum Entropy Models", Workshop on Machine Learning Approaches in Computational Linguistics, ESSLLI 2002, Trento, Italy, 5-16 August 2002.
      [ps]

    • Caroline Sporleder.   "Machine Learning of Lexical Inheritance Hierarchies: Linguistic Plausibility vs. Minimal Redundancy", Proceedings of the Student Research Workshop at the 40th ACL, Philadelphia, USA, 6-12 July 2002.

    • Caroline Sporleder.   "A Galois Lattice based Approach to Lexical Inheritance Learning", ECAI 2002 Workshop on Machine Learning and Natural Language Processing for Ontology Engineering (OLT2002), Lyon, France, July 22-23 2002.

    • Caroline Sporleder.   "Some Experiments on Lexical Inheritance Hierarchy Learning", Proceedings of TaCoS 2002, Potsdam, Germany, 6-9 June 2002.

    [1999]

    • Harald Lüngen and Caroline Sporleder.   "Automatic Induction of Lexical Inheritance Hierarchies", Multilingual Corpora: Codierung, Struktur, Analyse. 11. Jahrestagung der Gesellschaft für Linguistische Datenverarbeitung (GLDV-99), pp. 42-52, Frankfurt a.M., Germany, 1999.
      [ps]

    Book Chapters

    • Caroline Sporleder, Antal van den Bosch and Kalliopi Zervanou.   Language Technology for Cultural Heritage, Social Sciences and Humanities: Chances and Challenges. In: Caroline Sporleder, Antal van den Bosch and Kalliopi Zervanou (eds.)   Language Technology for Cultural Heritage. Selected Papers from the LaTeCH Workshop Series. Theory and Applications of Natural Language Processing. Heidelberg: Springer, 2011.

    • Caroline Sporleder and Alex Lascarides.   Exploiting Linguistic Cues to Classify Rhetorical Relations. In: Recent Advances in Natural Language Processing IV: Selected Papers from RANLP 2005 (Current Issues in Linguistic Theory), John Benjamins, 2007.

    Edited Volumes


    Theses

    • Caroline Sporleder.   Discovering Lexical Generalisations. A Supervised Machine Learning Approach to Inheritance Hierarchy Construction. PhD Thesis, School of Informatics, University of Edinburgh, 2004.
      [ps]

    • Caroline Sporleder.   Learning Lexical Generalisations. An Operational Evaluation of Current Machine Learning Methods. MA Thesis, Universität Bielefeld 1999.
      [ps]

    • Caroline Sporleder.   Interfacing Natural Language Generation and Speech Synthesis: A Topic-Comment Mark-Up for ILEX 2.0. MSc Thesis, University of Edinburgh 1997.

    Technical Reports

    • Caroline Sporleder, Marieke van Erp, Tijn Porcelijn, Antal van den Bosch, Pim Arntzen and Erik van Nieukerken.   Cleaning and Enriching Research Data on Reptiles and Amphibians. The MITCH Pilot Project and "nulmeting". Technical Report, ILK 06-01, Tilburg University, 2006.
      [pdf]