September event will bring together cross-disciplinary researchers, practitioners from industry and government, open access enthusiasts, and digital library developers
The vast amount of content hosted by digital libraries nowadays poses new challenges to the research community. It’s no longer enough to make publications available for human readers; enormous value can be derived from opening these catalogues of data to machine reading.
Data mining tools can be used to read scholarly articles, transforming them from lists of words and images into real-world information such as facts, opinions and hypothesis. This can be a great time-saver for researchers, helping them to find important information in huge corpora quickly and easily.
The 3rd International Workshop on Mining Scientific Publications, which is collocated with the ACM/IEEE Joint Conference on Digital Libraries (JCDL 2014) September 8 to 12 in London, will explore these possibilities by bringing together people from different backgrounds who are interested in analysing and mining databases of scientific publications. In addition, the conference is open to all those involved in developing systems to enable such analysis, designing new technologies to improve the way research is being done, and supporting the openness and free availability of publications and research data.
As Chief Data Scientist at Mendeley, I am always interested in exploring how we can make our Crowdsourced Research Catalogue even more open and useful to researchers. That is why I’ve been involved with events such as these for a while, and delivered a keynote at last year’s event. The collaborations that come from these initiatives always yield really interesting results.
We expect to draw researchers from across academia and industry to work through the amazing possibilities and challenges around mining scientific content. Previous workshops have attracted contributions and participants from leading institutions such as the British Library, the Library of Congress, Elsevier Labs and the National Library of Medicine. We have already received some great proposals and are looking forward to seeing more ground-breaking work come through before the closing date on 13 July 2014.
The audience for the workshop will be a mixture of cross-disciplinary researchers, practitioners from industry and government, open access enthusiasts, and digital library developers.
Keynote speakers will include Dr. C. Lee Giles (David Reese Professor of Information Sciences and Technology at the Pennsylvania State University) and Dr. Birger Larsen (Professor in Information Analysis and Information Retrieval at the Department of Communication at Aalborg University Copenhagen).
Together, we will discuss different approaches to addressing the challenges of mining scientific publications, and we’re now inviting people tosubmit their papers online, for inclusion in the programme. Both long papers (up to eight pages in the ACM style) and short papers (not exceeding four pages) are welcome, as are practical demonstrations and presentation of systems and methods (demonstration submissions should consist of a two-page description of the system, method or tool).
The workshop will be structured around three main themes:
- The whole ecosystem of infrastructures, including repositories, aggregators, text-and data-mining facilities, impact monitoring tools, datasets, services and APIs that enable analysis of large volumes of scientific publications.
- Semantic enrichment of scientific publications by means of text-mining, crowdsourcing or other methods.
- Analysis of large databases of scientific publications to identify research trends, high impact, cross-fertilisation between disciplines, research excellence etc.
This year, we have also put together a CORE publications dataset containing a large array of publications from various research areas. This includes full-text as well as enriched versions of metadata, with the aim of providing workshop participants with a framework for developing and testing methods and tools around the workshop topics. You can access this data through the CORE portal.