• News item 152

Preprint 266 published

25 January 2021
News picture

Preprint 266, "A Question Answering System for Chemistry", has been published!

Abstract

This paper describes the implementation and evaluation of a proof-of-concept Question Answering system for accessing chemical data from knowledge graphs which offer data from chemical kinetics to chemical and physical properties of species. We trained a question type classification model and an entity extraction model to interpret chemistry questions of interest. The system has a novel design which applies a topic model to identify the question-to-ontology affiliation. The topic model helps the system to provide more accurate answers. A new method that automatically generates training questions from ontologies is also implemented. The question set generated for training contains 80085 questions under 8 types. Such a training set has been proven to be effective for training both the question type classification model and the entity extraction model. We evaluated the system using the Google search engine as the baseline. We found that it can answer 114 questions of interest that Google or Wolfram alpha can not give a direct answer to. Moreover, the application of the topic model was found to increase the accuracy of constructing the correct queries.