Text2Story 2024
Seventh International Workshop on Narrative Extraction from Texts
held in conjunction with the 46th European Conference on Information Retrieval
Seventh International Workshop on Narrative Extraction from Texts
held in conjunction with the 46th European Conference on Information Retrieval
Over these past years, significant breakthroughs, led by Transformers and Large Language Models (LLMs), have been made in understanding natural language text. However, the ability to capture, represent, and analyze contextual nuances in longer texts is still an elusive goal, let alone the understanding of consistent fine-grained narrative structures in text. In the seventh edition of the Text2Story workshop, we aim to bring to the forefront the challenges involved in understanding the structure of narratives and in incorporating their representation in well-established frameworks, as well as in modern architectures (e.g., transformers) and AI-powered language models (e.g, chatGPT) which are now common and form the backbone of almost every IR and NLP application. It is hoped that the workshop will provide a common forum to consolidate the multi-disciplinary efforts and foster discussions to identify the wide-ranging issues related to the narrative extraction task.
Research works submitted to the workshop should foster the scientific advance on all aspects of storyline generation and understanding from texts including but not limited to narrative information extraction aspects, narratives representation, knowledge extraction, ethics and bias in narratives, datasets and evaluation protocols and narrative applications such as visualization of narratives, multi-modal aspects, Q&A, etc. To this regard, we encourage the submission of high-quality and original submissions covering the following topics:
We challenge the interested researchers to consider submitting a paper that makes use of the tls-covid19 dataset - published at ECIR'21 - under the scope and purposes of the text2story workshop. tls-covid19 consists of a number of curated topics related to the Covid-19 outbreak, with associated news articles from Portuguese and English news outlets and their respective reference timelines as gold-standard. While it was designed to support timeline summarization research tasks it can also be used for other tasks (e.g., Q&A), especially when combined with Large Language Models (LLMs) like ChatGPT.
We solicit the following types of contributions:
Original and high-quality unpublished contributions to the theory and practical aspects of the narrative extraction task. Full papers should introduce existing approaches, describe the methodology and the experiments conducted in detail. Negative result papers to highlight tested hypotheses that did not get the expected outcome are also welcomed.
Unpublished short papers describing work in progress; position papers introducing a new point of view, a research vision or a reasoned opinion on the workshop topics; and dissemination papers describing project ideas, ongoing research lines, case studies or summarized versions of previously published papers in high-quality conferences/journals that is worthwhile sharing with the Text2Story community, but where novelty is not a fundamental issue.
Unpublished papers presenting research/industrial demos; papers describing important resources (datasets or software packages) to the Text2Story community;
Papers must be submitted electronically in PDF format through Easy Chair . All submissions must be in English and formatted according to the one-column CEUR-ART style with no page numbers. Templates, either in Word or LaTeX, can be found in the following zip folder . There is also an Overleaf page for LaTeX users.
IMPORTANT: Please include between brackets the type of submission (full; negative results; work in progress; demo and resource; position; dissemination) in the paper title.
Papers submitted to Text2Story 2024 should be original work and different from papers that have been previously published, accepted for publication, or that are under review at other venues. Exceptions to this rule are "dissemination papers". Pre-prints submitted to ArXiv are eligible.
All papers will be refereed through a double-blind peer-review process by at least two members of the programme committee. The accepted papers will appear in the proceedings published at CEUR workshop proceedings (indexed in Scopus and DBLP) as long as they don't conflict with previous publication rights.
Abstract: Humans are curious creatures, equipped with a sense of (and desire for) finding meaning in their environment. They are predisposed to identify patterns, real and spurious, in the world they live in, and above anything else, they understand the world in terms of narratives. In this talk, we will explore a set of questions about narratives: what is a narrative made up of? What signals from textual prose tell us what the narrative is? What about signals from structured data that imply a particular narrative? What is the essence of a story? How can narrative information be extracted and presented? Open source intelligence analysts and investigative reporters alike are hunting for the story, the narrative, behind the petabyte intercepts or terabyte leaks. The more data we gather or have available, the stronger will be our thirst to distill meaningful stories from it.
Bio: Professor Jochen L. Leidner MA MPhil PhD FRGS is the Research Professor for Explainable and Responsible Artificial Intelligence in Insurance at Coburg University of Applied Sciences and Arts, Germany, where he leads the Information Access Research Group, a Visiting Professor of Data Analytics in the Department of Computer Science, University of Sheffield and founder and CEO of the consultancy KnowledgeSpaces. He is also a Fellow of the Royal Geographical Society. Dr. Leidner's experience includes positions as Director of Research at Thomson Reuters and Refinitiv in London, where he headed its R&D team (2013-2022). He has built up research and innovation teams. He was also the Royal Academy of Engineering Visiting Professor of Data Analytics at the Department of Computer Science. His background includes a Master's in computational linguistics, English and computer science (University of Erlangen-Nuremberg), a Master's in Computer Speech, Text and Internet Technology (University of Cambridge) and a PhD in Informatics (University of Edinburgh), which won the first ACM SIGIR Doctoral Consortium Award. He is a scientific expert for the European Commission (FP7, H2020, Horizon Europe) and other funding bodies in Germany, Austria, the UK and the USA. He also is a past chair of the Microsoft-BCS/BCS IRSG Karen Sparck Jones award. Professor Leidner is an author or co-author of several dozen peer-reviewed publications (including one best paper award), has authored or co-edited two books and holds several patents in the areas of information retrieval, natural language processing, and mobile computing. He has been twice winner of the Thomson Reuters inventor of the year award for the best patent application, and is the past received of a Royal Society of Edinburgh Enterprise Fellowship in Electronic Markets.
Text2Story 2024 will be held at the 46th European Conference on Information Retrieval (ECIR'24) in Dublin, Ireland
Be aware that power plug sockets in Scotland are of type G (of British origin). You may need to consider bringing or buying an adaptor.
Registration at ECIR 2024 is required to attend the workshop (don't forget to select the Text2Story workshop).
This work is financed by National Funds through the FCT - Fundação para a Ciência e a Tecnologia, I.P. (Portuguese Foundation for Science and Technology) within the project StorySense, with reference 2022.09312.PTDC (DOI 10.54499/2022.09312.PTDC).