September 9 – 12, 2025 | NIH Main Bethesda, Campus


NIH Research Festival

September 13 – 15, 2017

New Relevance Search Algorithm for PubMed

Friday, September 15, 2017 – Poster Session IV
1:00 – 2:30 p.m.

FAES Terrace

NLM

COMPBIO-19

Authors

  • N Fiorini
  • Z Lu

Abstract

With more than 27 million articles in MEDLINE, retrieving and ranking the most relevant papers for a given query is increasingly challenging. Since the 2000s, the machine learning community has focused on document ranking and created learning-to-rank (L2R) algorithms, demonstrating that robust and accurate relevance models can be built from varied relevance signals and large training datasets. Recently, this technology has matured enough to scale up to real-world applications. For L2R to learn a ranking model, it needs a gold standard to target. We built one from actual PubMed queries, using the anonymized queries stored in the logs together with any actions users subsequently took. We consider two main user actions to indicate that a document is relevant. One is the abstract click, when a user clicks on a document in the list of results matching their query. The other is the full-text click, which occurs when a user requests the full text after having clicked on an abstract. We collected about a year and a half's worth of logs and assigned relevance scores to documents for each query, based on their numbers of abstract and full-text clicks. The gold standard consists of the queries and the corresponding documents, ordered by descending relevance. Finally, we designed a set of more than 150 features that capture the relatedness between the query and the document (e.g., the number of matches), document characteristics (e.g., publication type), and query characteristics (e.g., query length). The objective for L2R is to correctly predict the relevance score of each document in the gold standard from this feature set alone. We evaluated the output of our system manually by submitting it to experts in various domains. Their encouraging conclusions motivated us to implement it in production. We optimized the pipeline, as it needed to comply with PubMed’s load requirements.
The pipeline is now able to process about a thousand queries per second at an average of 100 ms per query. We compared our approach with the then-current PubMed by calculating the click-through rate for each, that is, the proportion of queries for which users click at least once on the first page of results. Solr-L2R showed a 10.8% improvement in click-through rate over PubMed. This new PubMed relevance search algorithm has been deployed in the PubMed production system and is used when the ‘Best Match’ sort order is selected.
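
The gold standard and the click-through rate described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how abstract clicks and full-text clicks are combined into a relevance score, so the two weights below are assumptions, as are the function names and the shape of the log rows.

```python
from collections import defaultdict

# Hypothetical weights: the abstract only says that both click types
# contribute to relevance, not how they are weighted.
ABSTRACT_CLICK_WEIGHT = 1.0
FULL_TEXT_CLICK_WEIGHT = 2.0


def build_gold_standard(log_events):
    """Turn (query, doc_id, action) log rows into ranked lists per query.

    action is "abstract" or "full_text". Returns a dict mapping each query
    to a list of (doc_id, score) pairs ordered by descending score, with
    ties broken by doc_id so the order is deterministic.
    """
    weights = {
        "abstract": ABSTRACT_CLICK_WEIGHT,
        "full_text": FULL_TEXT_CLICK_WEIGHT,
    }
    scores = defaultdict(lambda: defaultdict(float))
    for query, doc_id, action in log_events:
        scores[query][doc_id] += weights[action]
    return {
        query: sorted(docs.items(), key=lambda item: (-item[1], item[0]))
        for query, docs in scores.items()
    }


def click_through_rate(first_page_clicks):
    """Proportion of queries with at least one click on the first page.

    first_page_clicks holds one click count per query.
    """
    if not first_page_clicks:
        return 0.0
    clicked = sum(1 for clicks in first_page_clicks if clicks > 0)
    return clicked / len(first_page_clicks)
```

Under these assumed weights, a document with one abstract click and one full-text click ranks above a document with two abstract clicks. A learning-to-rank model would then be trained to reproduce this ordering from query and document features alone.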

Scientific Focus Area: Computational Biology

This page was last updated on Friday, March 26, 2021
