WOSP 2014 : 3rd International Workshop on Mining Scientific Publications (associated with DL 2014)

Country: United Kingdom

City: London

Abstr. due: 13.07.2014

Dates: 12.09.14 — 12.09.14

Area Of Sciences: Computer science;

Organizing committee e-mail: https://www.easychair.org/account/signin.cgi?key=13066417.vjZzZrXHl06sRyiw

Organizers: The Open University, UK

 

1. INTRODUCTION

Digital libraries that store scientific publications are becoming increasingly central to the research process. They are not only used for traditional tasks, such as finding and storing research outputs, but also as a source for discovering new research trends or evaluating research excellence. With the current growth of scientific publications deposited in digital libraries, it is no longer sufficient to provide only access to content. To aid research it is especially important to improve the process of how research is being done.

Recent developments in natural language processing, information retrieval and the semantic web make it possible to transform the way we work with scientific publications. However, to improve these technologies and carry out experiments, researchers need easy access to large databases of scientific publications.

This workshop aims to bring together people from different backgrounds who: (a) are interested in analysing and mining databases of scientific publications, (b) develop systems that enable such analysis and mining of scientific databases, or (c) develop novel technologies that improve the way research is being done.

2. TOPICS

The topics of the workshop will be organised around the following themes:

    1. The whole ecosystem of infrastructures including repositories, aggregators, text- and data-mining facilities, impact monitoring tools, datasets, services and APIs that enable analysis of large volumes of scientific publications.
    2. Semantic enrichment of scientific publications by means of text-mining, crowdsourcing or other methods.
    3. Analysis of large databases of scientific publications to identify research trends, high impact, cross-fertilisation between disciplines, research excellence etc.

Topics of interest relevant to theme 1 include, but are not limited to:

    Infrastructures including repositories, aggregators, text- and data-mining facilities, impact monitoring tools, datasets, services and APIs for accessing scientific publications and/or research data. The existence of datasets, services, systems and APIs (in particular those that are open) providing access to large volumes of scientific publications and research data is an essential prerequisite for researching and developing new technologies that can transform the way people do research. We invite papers presenting innovative approaches to the development of these systems that enable people to access databases and carry out their analysis. Papers addressing Open Access are of special interest. We also welcome submissions discussing the technical aspects of supporting Open Science, in particular reproducibility of research, sharing of scientific workflows and linking research data with publications. Finally, we invite papers discussing issues and current challenges in the design of these systems.

Topics of interest relevant to theme 2 include, but are not limited to:

    Novel information extraction and text-mining approaches to semantic enrichment of publications. This might range from mining publication structure, such as title, abstract, authors, citation information etc. to more challenging tasks, such as extracting names of applied methods, research questions (or scientific gaps), identifying parts of the scholarly discourse structure etc.
    Automatic categorization and clustering of scientific publications. Methods that can automatically categorize publications according to an established subject-based classification/taxonomy (such as Library of Congress classification, UNESCO thesaurus, DOAJ subject classification, Library of Congress Subject Headings) are of particular interest. Other approaches might involve automatic clustering or classification of research publications according to various criteria.
    New methods and models for connecting and interlinking scientific publications. Scientific publications in digital libraries are not isolated islands. Connecting publications using explicitly defined citations is very restrictive and has many disadvantages. We are interested in innovative technologies that can automatically connect and interlink publications or parts of publications according to various criteria, such as semantic similarity, contradiction, argument support or other relationship types.
    Models for semantically representing and annotating publications. This topic concerns the semantic modelling of publications and scholarly discourse. Models that are practical with respect to the state of the art in Natural Language Processing (NLP) technologies are of special interest.
    Semantically enriching/annotating publications by crowdsourcing. Crowdsourcing can be used in innovative ways to annotate publications with richer metadata or to approve/disapprove annotations created using text-mining or other approaches. We welcome papers that address the following questions: (a) what incentives should be provided to motivate users to contribute, (b) how to apply crowdsourcing in the specialized domains of scientific publications, and (c) which tasks in the domain of organising scientific publications crowdsourcing is suitable for and where it might fail. We also welcome papers on other crowdsourcing topics relevant to the domain of scientific publications.

Topics of interest relevant to theme 3 include, but are not limited to:

    New methods, models and innovative approaches for measuring impact of publications. The most widely used metrics for measuring impact are based on citations. However, counting citations does not take into account the publication content or the qualitative nature of the citation. In addition, there is a delay between the publication and the measurable impact in citations. We particularly encourage papers addressing new ways of evaluating publications’ impact beyond standard citation measures.
    New methods for measuring performance of researchers. Methods for assessing the impact of a publication can often be extended to methods that assess the impact of individual researchers. However, there are also other criteria for measuring impact in addition to publications, such as the development and publication of research data and economic and market impact, that should also be taken into account. We welcome papers addressing these aspects.
    Evaluating impact of research groups. What holds for the impact of individuals also holds for research groups and communities.
    Methods for identifying research trends and cross-fertilization between research disciplines. Identifying research trends should allow the discovery of newly emerging disciplines and help to explain why certain fields are attracting the attention of a wider research community. Such monitoring is important for research funders and governments so that they can respond quickly to new developments. We invite papers discussing new methods for identifying trends and cross-fertilization between research disciplines using methods ranging from social network analysis and text- and data-mining to innovative visualization approaches.
    Application and case studies of mining from scientific databases and publications. New methods and models developed for mining from scientific publications can be applied in many different scenarios, such as improving access to scientific publications, providing exploratory search in digital collections, and identifying experts. We encourage papers describing innovative approaches that use scientific publications and data to solve real-world problems.
    Improving the infrastructure of repositories to support the development and integration of new impact and performance metrics. New ways of improving the repository infrastructure can include, for example, tracking accesses and downloads, researcher profiling and the interlinking of repository data with external services. These can in turn be used for developing new impact metrics. We welcome papers addressing these issues.

3. SPECIAL OPEN PUBLICATIONS DATASET TRACK

This year we would like to invite the workshop participants to make use of the CORE publications dataset, which contains a large volume of research publications from a wide variety of research areas. The dataset contains not only full texts, but also an enriched version of the publications’ metadata. The aim is to provide a framework for developing and testing methods and tools addressing the workshop topics. The use of this dataset is not mandatory, but it is encouraged. The dataset will be made available within a week.

4. EXPECTED AUDIENCE

The workshop on Mining Scientific Publications aims to bring together researchers, digital library developers and practitioners from government and industry to address the current challenges in the domain of mining scientific publications.

5. PREVIOUS ORGANISATION

The 1st International Workshop on Mining Scientific Publications was held in conjunction with JCDL 2012. The 2nd run of this workshop was held in conjunction with JCDL 2013. Both runs of the workshop were extremely successful in attracting submissions and participants from leading institutions in the area, such as the British Library, Elsevier Labs, the National Library of Medicine, the Library of Congress, Pennsylvania State University (CiteSeerX) and Mendeley. The submissions from both of these workshops have been published as a special issue in D-Lib.

Conference Web-Site: http://core-project.kmi.open.ac.uk/dl2014/