Full metadata record
DC FieldValueLanguage
dc.contributor.authorTrappey, Amy J. C.en_US
dc.contributor.authorTrappey, Charles V.en_US
dc.contributor.authorWu, Jheng-Longen_US
dc.contributor.authorTsai, Kevin T. -Cen_US
dc.date.accessioned2020-10-05T02:00:32Z-
dc.date.available2020-10-05T02:00:32Z-
dc.date.issued2019-01-01en_US
dc.identifier.isbn978-1-64368-021-7; 978-1-64368-020-0en_US
dc.identifier.issn2352-7528en_US
dc.identifier.urihttp://dx.doi.org/10.3233/ATDE190158en_US
dc.identifier.urihttp://hdl.handle.net/11536/155069-
dc.description.abstractTechnical or knowledge documents, such as research papers, patents, and technical documents, e.g., request for quotations (RFQ), are important knowledge references for multiple purposes. For example, enterprises and R&D institutions often need to conduct literature and patent searches and analyses before, during, and after R&D and commercialization. These knowledge discovery processes help them identify prior arts related to the current R&D efforts to avoid duplicating research efforts or infringing upon existing intellectual property rights (IPRs). It is common to have many synonyms (i.e., words and phrases with near-identical meanings) appeared in documents, which may hinder search results, if queries do not consider these synonyms. For instance, conducting "freedom-to-operate" (FTO) patent search may not find all related patents if synonyms were not taking into consideration. This research develops methodologies of generating domain specific "word" and "phrase" synonym dictionaries using machine learning. The generation and validation of both domain-specific "word" and "phrase" synonym dictionaries are conducted using more than 2000 solar power related patents as testing document set. The testing result shows that, in the solar power domain, both word level and phrase level dictionaries identify synonyms effectively and, thus, significantly improve the patent search results.en_US
dc.language.isoen_USen_US
dc.subjectSynonym Extractionen_US
dc.subjectMachine Learningen_US
dc.subjectPattern-based Extractionen_US
dc.subjectSelf-supervised Learningen_US
dc.titleUsing Machine Learning Approach to Identify Synonyms for Document Miningen_US
dc.typeProceedings Paperen_US
dc.identifier.doi10.3233/ATDE190158en_US
dc.identifier.journalTRANSDISCIPLINARY ENGINEERING FOR COMPLEX SOCIO-TECHNICAL SYSTEMSen_US
dc.citation.volume10en_US
dc.citation.spage509en_US
dc.citation.epage518en_US
dc.contributor.department管理科學系zh_TW
dc.contributor.departmentDepartment of Management Scienceen_US
dc.identifier.wosnumberWOS:000544261400052en_US
dc.citation.woscount0en_US
Appears in Collections:Conferences Paper