PULS Project Publications

    Edited Collections

  1. The 4th Biennial International Workshop on Balto-Slavic Natural Language Processing   
    Jakub Piskorski, Lidia Pivovarova, Hristo Tanev, Roman Yangarber (eds.)
    Proceedings of the Workshop, ACL-2013
    (2013) Sofia, Bulgaria

  2. Cover: Multi-source, Multilingual Information
                      Extraction and Summarization Multi-source,
    Multilingual
    Information
    Extraction and
    Summarization
     
     
    Thierry Poibeau,
    Horacio Saggion,
    Jakub Piskorski,
    Roman Yangarber
    (Eds.)
     
    Theory and Applications of Natural Language Processing.
    Springer-Verlag (2012)
    Berlin, Heidelberg

    Conference Papers, Journal Articles, Book Chapters

  3. MDL-based Models for Transliteration Generation   (pdf)
    Javad Nouri, Lidia Pivovarova, Roman Yangarber.
    SLSP 2013: International Conference on Statistical Language and Speech Processing
    Springer Verlag, Lecture Notes in Artificial Intelligence (LNAI), LNCS Volume 7978 (2013) Tarragona, Spain

  4. Combined analysis of news and Twitter messages    (pdf)
    Mian Du, Ossi Mikael Karkulahti, Jussi Kangasharju, Lidia Pivovarova, Roman Yangarber.
    RANLP-2013 workshop on Semantic Web and Information Extraction
    (2013) Hissar, Bulgaria

  5. Adapting the PULS event extraction framework to analyze Russian text    (pdf)
    Lidia Pivovarova, Mian Du, Roman Yangarber.
    The 4th Biennial Workshop on Balto-Slavic Natural Language Processing
    At ACL-2013 (2013) Sofia, Bulgaria

  6. Automatic detection of stable grammatical features in N-grams   (pdf)
    Mikhail Kopotev, Lidia Pivovarova, Natalia Kochetkova, Roman Yangarber.
    The 9th Workshop on Multiword Expressions: MWE 2013 Co-located with NAACL/HLT: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2013) Atlanta, GA

  7. Event representation across genre   (pdf)
    Lidia Pivovarova, Silja Huttunen, Roman Yangarber.
    Workshop on EVENTS: Definition, Detection, Coreference, and Representation Co-located with NAACL/HLT: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2013) Atlanta, GA

  8. An Overview of Internet biosurveillance   (pdf)
    DM Hartley, NP Nelson, RR Arthur, P Barboza, N Collier, N Lightfoot, JP Linge, E van der Goot, A Mawudeku, LC Madoff, L Vaillant, R Walters, R Yangarber, J Mantero, CD Corley, JS Brownstein.
    (2013) Journal of Clinical Microbiology and Infection, 19(6), Wiley

  9. Evaluation of epidemic intelligence systems integrated in the early alerting and reporting project for the detection of A/H5N1 influenza events   (pdf)
    Barboza P, Vaillant L, Mawudeku A, Nelson NP, Hartley DM, Madoff LC, Linge JP, Collier N, Brownstein JS, Yangarber R, Astagneau P.
    (2013) In PLoS One Journal, 8(3)

  10. Improving performance quality and user experience in the PULS News Mining system   (pdf)
    Master's Thesis: Mian Du  
    (2012) University of Helsinki, Department of Computer Science

  11. Techniques for Multilingual Security-related Event Extraction from Online News   (abstract)
    Martin Atkinson, Mian Du, Jakub Piskorski, Hristo Tanev, Roman Yangarber, Vanni Zavarella.
    In Computational Linguistics—Applications (A. Przepiórkowski, M. Piasecki, K. Jassem, P. Fuglewicz, eds.) Studies in Computational Intelligence, Vol. 458
    (2012) Springer Verlag

  12. Information Extraction: Past, Present and Future   (pdf)
    Jakub Piskorski, Roman Yangarber.
    Survey Chapter in "Multi-source, Multilingual Information Extraction and Summarization", Theory and Applications of Natural Language Processing (T. Poibeau et al., eds.).
    Springer-Verlag (2012) Berlin, Heidelberg

  13. Predicting Relevance of Event Extraction for the End User   (abstract, pdf)
    Silja Huttunen, Arto Vihavainen, Mian Du, Roman Yangarber.
    In "Multi-source, Multilingual Information Extraction and Summarization", Theory and Applications of Natural Language Processing (T. Poibeau et al., eds.).
    Springer-Verlag (2012) Berlin, Heidelberg

  14. Tietojenkäsittelytiede: Tiedoneristäminen ("Information Extraction", in Finnish )   
    Silja Huttunen.
    Invited chapter in "Genreanalyysi—tekstilajitutkimuksen käsikirja"
    ("The Handbook of Genre Analysis and Text-Type Research")
    (Heikkinen, V., Voutilainen, E., Lauerma, P., Tiililä, U. & Lounela, M., eds.).
    Gaudeamus Helsinki University Press (2012) Helsinki
    (Kotimaisten kielten keskuksen julkaisuja 169).

  15. Building support tools for Russian-language information extraction   
    Mian Du, Peter von Etter, Mikhail Kopotev, Mikhail Novikov, Natalia Tarbeeva, Roman Yangarber.
    BSNLP-2011: Balto-Slavonic Natural Language Processing (2011) Plzeň, Czech Republic.
    Springer-Verlag, Lecture Notes in Computer Science, Volume 6836. Series: Text, Speech and Dialogue.

  16. User-Oriented Information Extraction   (pdf)   (HTML)
    Master's Thesis: Peter von Etter
    University of Helsinki, Department of Computer Science (2011)

  17. Event Relevance in Information Extraction   (pdf)
    Master's Thesis: Arto Vihavainen
    University of Helsinki, Department of Computer Science (2011)

  18. Multilingual real-time event extraction for border security intelligence gathering   
    Martin Atkinson, Jakub Piskorski, Erik Van der Goot, Roman Yangarber.
    Counterterrorism and Open Source Intelligence. Springer Lecture Notes in Social Networks, Vol. 2. (Uffe Kock Wiil, editor).
    (2011) pp. 355-390

  19. Relevance prediction in information extraction using discourse and lexical features
    Silja Huttunen, Arto Vihavainen, Peter von Etter, Roman Yangarber.
    Nodalida-2011: Nordic Conference on Computational Linguistics
    (2011) Riga, Latvia

  20. Assessment of utility in Web mining for the domain of Public Health    (pdf)
    Peter von Etter, Silja Huttunen, Arto Vihavainen, Matti Vuorinen, Roman Yangarber.
    In Proceedings of LOUHI-2010: the Second Louhi Workshop on Text and Data Mining of Health Documents, at the NAACL/HLT Conference,
    (2010) Los Angeles, California

  21. MedISys—Medical Information System   
    Jens P. Linge, Ralf Steinberger, Flavio Fuart, Stefano Bucci, Jenya Belyaeva, Monica Gemo, Delilah Al-Khudhairy, Roman Yangarber, Erik van der Goot.
    In Advanced ICTs for Disaster Management and Threat Detection: Collaborative and Distributed Frameworks. Eleana Asimakopoulou, Nik Bessis (eds.),
    (2010) IGI GLobal Press,

  22. Real-time Text Mining in Multilingual News for the Creation of a Pre-frontier Intelligence Picture    (pdf)
    Jakub Piskorski, Martin Atkinson, Jenya Belyaeva, Vanni Zavarella, Silja Huttunen, Roman Yangarber.
    In Proceedings of the 16th Conference on Knowledge Discovery and Data Mining (KDD-2010); ACM SIGKDD Workshop on Intelligence and Security Informatics.
    (2010) Washington, DC

  23. Filtering news for epidemic surveillance: towards processing more languages with fewer resources   
    Gael Lejeune, Antoine Doucet, Roman Yangarber, Nadine Lucas.
    CLIA: Fourth International Workshop On Cross Lingual Information Access, at COLING 2010
    (2010) Beijing, China

  24. Utility evaluation of tools for collaborative development and maintenance of ontologies   
    Alex Norta, Roman Yangarber, Lauri Carlson.
    VORTE-2010: Joint 5th International Workshop on Vocabularies, Ontologies and Rules for The Enterprise / International Workshop on Metamodels, Ontologies and Semantic Technologies (MOST) at EDOC-2010: the Fourteenth IEEE International Conference On Enterprise Computing
    (2010) Vitória, ES, Brazil

  25. News mining for border security intelligence    (pdf)
    Jakub Piskorski, Martin Atkinson, Jenya Belayeva, Vanni Zavarella, Silja Huttunen, Roman Yangarber.
    In IEEE ISI-2010: Intelligence and Security Informatics
    (2010) Vancouver, BC, Canada

  26. The landscape of international event-based biosurveillance    (pdf)
    D Hartley, N Nelson, R Walters, R Arthur, R Yangarber, L Madoff, J Linge, A Mawudeku, N Collier, J Brownstein, G Thinus, N Lightfoot.
    In Emerging Health Threats Journal, 3:e3 (2010)

  27. A proposal for a multilingual epidemic surveillance system.   
    Gael Lejeune, Mohammed Hatmi, Antoine Doucet, Silja Huttunen, Nadine Lucas.
    In Proceedings of MINUCS-2009: Workshop on Mining User-Generated Content for Security, at the UCMedia-2009: ICST Conference on User-Centric Media
    (2009) Venice, Italy

  28. Automated event extraction in the domain of Border Security    (pdf)
    Martin Atkinson, Jakub Piskorski, Hristo Tanev, Eric van der Goot, Roman Yangarber, Vanni Zavarella.
    In Proceedings of MINUCS-2009: Workshop on Mining User-Generated Content for Security, at the UCMedia-2009: ICST Conference on User-Centric Media
    (2009) Venice, Italy

  29. Automatic epidemiological surveillance from on-line news in MedISys and PULS    (pdf)
    Roman Yangarber, Peter von Etter, Ralf Steinberger.
    In Proceedings of IMED-2009: International Meeting on Emerging Diseases and Surveillance
    (2009) Vienna, Austria

  30. Internet surveillance systems for early alerting of health threats    (pdf)
    Jens P. Linge, Ralf Steinberger, Thomas P. Weber, Roman Yangarber, Erik van der Goot, Delilah H. Al-Khudhairy, Nikolaos I. Stilianakis.
    In Eurosurveillance Journal, 14(13)
    (2009) Stockholm, Sweden

  31. Text mining from the Web for Medical Intelligence   (pdf)   (abstract)
    Ralf Steinberger, Flavio Fuart, Erik van der Groot, Clive Best,
    Peter von Etter, Roman Yangarber.
    In: Mining Massive Data Sets for Security, D. Perrotta, J. Piskorski, F. Soulié-Fogelman & R. Steinberger (eds.): OIS Press
    (2008) Amsterdam, The Netherlands

  32. Content Collection and Analysis in the Domain of Epidemiology   (pdf)
    Roman Yangarber, Peter von Etter, Ralf Steinberger.
    In Proceedings of DrMED-2008: International Workshop on Describing Medical Web Resources, at MIE-2008: the 21st International Congress of the European Federation for Medical Informatics
    (2008) Göteborg, Sweden

  33. Combining information retrieval and information extraction for medical intelligence    (pdf)
    Roman Yangarber, Ralf Steinberger, Clive Best, Peter von Etter, Flavio Fuart, David Horby.
    Mining Massive Data Sets for Security, NATO Advanced Study Institute
    (2007) Gazzada, Italy

  34. Combining Information about Epidemic Threats from Multiple Sources    (pdf)
    Roman Yangarber, Clive Best, Peter von Etter, Flavio Fuart, David Horby, Ralf Steinberger.
    In Proceedings Multi-source, Multilingual Information Extraction and Summarization at RANLP-2007.
    (2007) Borovets, Bulgaria

  35. Verification of Facts across Document Boundaries    (pdf)
    Roman Yangarber.
    In Proceedings IIIA-2006: International Workshop on Intelligent Information Access
    (2006) Helsinki, Finland

  36. Confidence measuring and data improvement of extracted information from disease outbreak reports   (pdf)
    Master's Thesis: Lauri Jokipii   (html)
    University of Helsinki, Department of Computer Science (2006)