Recommender Systems

Mr. DLib’s Living Lab for Scholarly Recommendations (preprint)

We published a manuscript on arXiv about the first living lab for scholarly recommender systems. This lab allows recommender-system researchers to conduct online evaluations of their novel algorithms for scholarly recommendations, i.e., research papers, citations, conferences, research grants etc. Recommendations are delivered through the living lab´s API in platforms such as reference management software and digital libraries. The living lab is built on top of the recommender system as-a-service Mr. DLib. Current partners are the reference management software JabRef and the CORE research team. We present the architecture of Mr. DLib’s living lab as well as usage statistics on the first ten months of operating it. During this time, 970,517 recommendations were delivered with a mean click-through rate of 0.22%. Read more…

By Joeran Beel, ago
Recommender Systems

RARD II: The 2nd Related-Article Recommendation Dataset (preprint)

We released a new version of RARD, i.e. RARD II and describe the new release in a preprint published on arXiv. The dataset is available at http://data.mr-dlib.org and the new manuscript is available on arXiv and here in our Blog. The main contribution of this paper is to introduce and describe a new recommender-systems dataset (RARD II). It is based on data from a recommender-system in the digital library and reference management software domain. As such, it complements datasets from other domains such as books, movies, and music. The RARD II dataset encompasses 89m recommendations, covering an item-space of 24m unique items. RARD II provides a range of rich recommendation data, beyond conventional ratings. For example, in addition to the usual rating Read more…

By Joeran Beel, ago
Partnerships

MoU between the National Institute of Informatics (NII) and Trinity College Dublin (TCD) / SCSS / ADAPT

After being appointed as a visiting professor at the National Institute of Informatics (NII) a few weeks ago, there is more good news. The NII has signed a memorandum of understanding (MoU) with the School of Computer Science and Statistics (SCSS) of Trinity College Dublin (TCD) and the ADAPT Research Centre. The MoU originates from our long-term collaboration with the NII and outlines a partnership between TCD/SCSS/ADAPT and the NII. The goal of the partnership is to: Engage in joint efforts in research Work together in the area of institute management Exchange administrative and managerial staff Collaborate with industry and agencies for public administration and public services Exchange of research and academic staff and students We begin the collaboration with two projects. First, PhD Read more…

By Joeran Beel, ago
Mr. DLib

Mr. DLib v1.2.1: Improved keyphrase recommendations and Apache Lucene query handling

The new version of our recommender system completes 104 issues and significantly improves the recommendations. The most notable improvements are: We improved the keyphrase extraction process in the recommender system, i.e. keyphrases are not stored differently in Lucene. We expect better recommendation effectiveness and are currently running an A/B test. More robust path encoding for search queries (special characters in a URL caused errors) Lucene’s eDismax function is A/B tested (together with Lucene’s standard query parser) Improved queries for CORE recommender system (their system needs queries to be of a certain length; Mr. DLib now just multiplies the queries until they are at least 50 characters) Abstracts and keywords in the XML response of Mr. DLib are enclosed in <![CDATA[ HTML Snippet is improved Read more…

By Joeran Beel, ago
Mr. DLib

RARD: The Related-Article Recommendation Dataset

We are proud to announce the release of ‘RARD’, the related-article recommendation dataset from the digital library Sowiport and the recommendation-as-a-service provider Mr. DLib. The dataset contains information about 57.4 million recommendations that were displayed to the users of Sowiport. Information includes details on which recommendation approaches were used (e.g. content-based filtering, stereotype, most popular), what types of features were used in content based filtering (simple terms vs. keyphrases), where the features were extracted from (title or abstract), and the time when recommendations were delivered and clicked. In addition, the dataset contains an implicit item-item rating matrix that was created based on the recommendation click logs. RARD enables researchers to train machine learning algorithms for research-paper recommendations, perform offline evaluations, and Read more…

By Joeran Beel, ago
Mr. DLib

Several new publications: Mr. DLib, Lessons Learned, Choice Overload, Bibliometrics (Mendeley Readership Statistics), Apache Lucene, CC-IDF, TF-IDuF

In the past few weeks, we published (or received acceptance notices for) a number of papers related to Mr. DLib, research-paper recommender systems, and recommendations-as-a-service. Many of them were written during our time at the NII or in collaboration with the NII. Here is the list of publications: Beel, Joeran, Bela Gipp, and Akiko Aizawa. “Mr. DLib: Recommendations-as-a-Service (RaaS) for Academia.” In Proceedings of the ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL), 2017. Beel, Joeran. “Real-World Recommender Systems for Academia: The Gain and Pain in Developing, Operating, and Researching them.” In 5th International Workshop on Bibliometric-enhanced Information Retrieval (BIR) at the 39th European Conference on Information Retrieval (ECIR), 2017. [short version, official], [long version, arxiv] Beierle, Felix, Akiko Aizawa, and Joeran Beel. Read more…

By Joeran Beel, ago
Recommendations as-a-Service (RaaS)

Mr. DLib v1.1 released: JavaScript Client, 15 million CORE documents, new URL for recommendations-as-a-service via title search

We are proud to announce version 1.1 of Mr. DLib’s Recommender-System as-a-Service. The major new features are: A JavaScript Client to request recommendations from Mr. DLib. The JavaScript offers many advantages compared to a server-side processing of our recommendations. Among others, the main page will load faster while recommendations are requested in the background and a loading animation is shown. Using the JavaScript also means that the logging will be more reliable because web spiders are not logged any more. Our partner Sowiport uses the JavaScript already. We indexed 15 million documents from CORE and recommend them through our API. Another 5 million will follow soon. So far, recommendations could only be requested by specifying a particular document ID such as https://api-beta.mr-dlib.org/v1/documents/<ID>/related_documents/. Now, recommendations can Read more…

By Joeran Beel, ago
Help Wanted

Students & PostDocs: We have open positions in Tokyo, Copenhagen, and Konstanz (2-24 months)

Update 2016-01-12: The salary in Tokyo would be around 1.600 US$ per month, not 1.400. 2015 has been a rather quiet year for Docear, but 2016 will be different. We have lots of ideas for new projects, and even better –  we have funding to pay at least 1 Master or PhD student, to help us implementing the ideas. There is also a good chance that we get more funding, maybe also for Bachelor students and postdoctoral researchers. The positions will be located in Tokyo, Copenhagen or Konstanz (Germany). In the following, there is a list of potential projects. If you are interested, please apply, and if you have own ideas, do not hesitate to discuss them with us. What exactly you Read more…

By Joeran Beel, ago