Towards Idea Mining: Problem-Solution Phrase Extraction from Text

Haixia Liu, Tim Brailsford, James Goulding, Tomas Maul, Tao Tan, Debanjan Chaudhuri

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

This paper investigates the feasibility of extracting problem-solution phrases from scientific publications using neural network approaches. Bidirectional Long Short-Term Memory with Conditional Random Fields (Bi-LSTM-CRFs) and Bidirectional Encoder Representations from Transformers (BERT) were evaluated on two datasets. The first, created by the University of Cambridge Computer Laboratory, contains 1000 positive examples of problems and solutions (UCCL1000) with the corresponding phrases annotated. The F1-scores computed on the UCCL1000 dataset indicate that BERT is an effective approach for extracting solution phrases (F1-score of 97%) and problem phrases (F1-score of 83%). To test the model’s robustness on a different corpus with a different annotation scheme, a dataset consisting of 488 problem-solution samples from the Conference on Neural Information Processing Systems (NIPS488) was collected and annotated by human readers. The performance of both Bi-LSTM-CRFs and BERT was dramatically lower on NIPS488 than on UCCL1000.
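The abstract reports F1-scores for phrase extraction but does not say how they were computed. As a rough illustration only, the sketch below scores predicted phrases against annotated ones using exact-match spans over BIO tags. The tag names (`B-PROBLEM`, `I-SOLUTION`, and so on), the exact-match criterion, and the function names are all assumptions made for this example and are not taken from the paper.

```python
from typing import List, Set, Tuple

# A phrase span: (label, start index, end index exclusive).
Span = Tuple[str, int, int]


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Convert a BIO tag sequence into a set of labelled spans."""
    spans: Set[Span] = set()
    label, start = None, 0
    # A trailing "O" sentinel closes any span still open at the end.
    for i, tag in enumerate(tags + ["O"]):
        continues = tag.startswith("I-") and label == tag[2:]
        if label is not None and not continues:
            spans.add((label, start, i))
            label = None
        if tag.startswith("B-"):
            label, start = tag[2:], i
    return spans


def span_f1(gold: List[str], pred: List[str], label: str) -> float:
    """Exact-match span F1 for one label (hypothetical scoring scheme)."""
    g = {s for s in bio_to_spans(gold) if s[0] == label}
    p = {s for s in bio_to_spans(pred) if s[0] == label}
    tp = len(g & p)  # spans with identical label and boundaries
    if tp == 0:
        return 0.0
    precision = tp / len(p)
    recall = tp / len(g)
    return 2 * precision * recall / (precision + recall)


# Toy example: the problem phrase is recovered exactly,
# the solution phrase is cut one token short.
gold = ["B-PROBLEM", "I-PROBLEM", "O", "B-SOLUTION", "I-SOLUTION"]
pred = ["B-PROBLEM", "I-PROBLEM", "O", "B-SOLUTION", "O"]

print(span_f1(gold, pred, "PROBLEM"))
print(span_f1(gold, pred, "SOLUTION"))
```

Under exact matching, a prediction that misses a single boundary token counts as a full error, so scores of this kind can be sensitive to differences in annotation scheme between corpora. A token-level or partial-overlap variant would be more lenient.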

Original language: English
Title of host publication: Advanced Data Mining and Applications - 18th International Conference, ADMA 2022, Proceedings
Editors: Weitong Chen, Lina Yao, Taotao Cai, Shirui Pan, Tao Shen, Xue Li
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 3-14
Number of pages: 12
ISBN (Print): 9783031221361
DOIs
Publication status: Published - 2022
Event: 18th International Conference on Advanced Data Mining and Applications, ADMA 2022 - Brisbane, Australia
Duration: 28 Nov 2022 to 30 Nov 2022

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13726 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 18th International Conference on Advanced Data Mining and Applications, ADMA 2022
Country/Territory: Australia
City: Brisbane
Period: 28/11/22 to 30/11/22

Keywords

  • NLP
  • Problem-solution extraction
  • Text mining
